Published December 4, 2021
| Version v1.0.0
Software
Open
pyDRESCALk: Python Distributed Non Negative RESCAL Decomposition with Determination of Latent Features
Creators
- 1. Los Alamos National Labs
- 2. Coloarado State University
Description
pyDRESCALk: Python Distributed Non Negative RESCAL with determination of hidden features is a software package for applying non-negative RESCAL decomposition in a distributed fashion to large datasets. It can be utilized for decomposing relational datasets. It can minimize the difference between reconstructed data and the original data through Frobenius norm. Additionally, the Custom Clustering algorithm allows for automated determination for the number of Latent features.
Features:
- Ability to decompose relational datasets.
- Utilization of MPI4py for distributed operation.
- Distributed random initializations.
- Distributed Custom Clustering algorithm for estimating automated latent feature number (k) determination.
- Objective of minimization of Frobenius norm.
- Support for distributed CPUs/GPUs.
- Support for Dense/Sparse data.
- Demonstrated scaling performance upto 10TB of dense and 9Exabytes of Sparse data.
Files
lanl/pyDRESCALk-v1.0.0.zip
Files
(6.2 MB)
Name | Size | Download all |
---|---|---|
md5:95a1d881a383ab0c0621a8279e1e4932
|
6.2 MB | Preview Download |
Additional details
Related works
- Is supplement to
- https://github.com/lanl/pyDRESCALk/tree/v1.0.0 (URL)