Published November 30, 2021 | Version v1
Journal article Open

Evaluation of Various DR Techniques in Massive Patient Datasets using HDFS

  • 1. Ph.D, Department of Computer Science and Engineering, Adikavi Nannaya University, Rajamahendravaram (A. P), India.
  • 2. Professor & Dean of Academics Department of Computer Science & Engineering of Adikavi Nannaya University, Rajamahendravaram (A. P), India
  • 1. Publisher


The objective of comparing various dimensionality techniques is to reduce feature sets in order to group attributes effectively with less computational processing time and utilization of memory. The various reduction algorithms can decrease the dimensionality of dataset consisting of a huge number of interrelated variables, while retaining the dissimilarity present in the dataset as much as possible. In this paper we use, Standard Deviation, Variance, Principal Component Analysis, Linear Discriminant Analysis, Factor Analysis, Positive Region, Information Entropy and Independent Component Analysis reduction algorithms using Hadoop Distributed File System for massive patient datasets to achieve lossless data reduction and to acquire required knowledge. The experimental results demonstrate that the ICA technique can efficiently operate on massive datasets eliminates irrelevant data without loss of accuracy, reduces storage space for the data and also the computation time compared to other techniques.



Files (345.3 kB)

Name Size Download all
345.3 kB Preview Download

Additional details

Related works

Is cited by
Journal article: 2277-3878 (ISSN)


Retrieval Number