Published December 4, 2021 | Version v1.0.0
Software Open

pyDRESCALk: Python Distributed Non Negative RESCAL Decomposition with Determination of Latent Features

  • 1. Los Alamos National Labs
  • 2. Coloarado State University

Description

pyDRESCALk: Python Distributed Non Negative RESCAL with determination of hidden features is a software package for applying non-negative RESCAL decomposition in a distributed fashion to large datasets. It can be utilized for decomposing relational datasets. It can minimize the difference between reconstructed data and the original data through Frobenius norm. Additionally, the Custom Clustering algorithm allows for automated determination for the number of Latent features.

Features:

  • Ability to decompose relational datasets.
  • Utilization of MPI4py for distributed operation.
  • Distributed random initializations.
  • Distributed Custom Clustering algorithm for estimating automated latent feature number (k) determination.
  • Objective of minimization of Frobenius norm.
  • Support for distributed CPUs/GPUs.
  • Support for Dense/Sparse data.
  • Demonstrated scaling performance upto 10TB of dense and 9Exabytes of Sparse data.

 

Files

lanl/pyDRESCALk-v1.0.0.zip

Files (6.2 MB)

Name Size Download all
md5:95a1d881a383ab0c0621a8279e1e4932
6.2 MB Preview Download

Additional details

Related works