Evaluation of Various DR Techniques in Massive Patient Datasets using HDFS
- 1. Ph.D, Department of Computer Science and Engineering, Adikavi Nannaya University, Rajamahendravaram (A. P), India.
- 2. Professor & Dean of Academics, Department of Computer Science & Engineering, Adikavi Nannaya University, Rajamahendravaram (A. P), India.
Contributors
- 1. Publisher
Description
The objective of comparing various dimensionality reduction (DR) techniques is to reduce feature sets so that attributes can be grouped effectively with less processing time and memory use. Reduction algorithms can decrease the dimensionality of a dataset consisting of a huge number of interrelated variables while retaining as much of the dissimilarity present in the dataset as possible. In this paper we apply the Standard Deviation, Variance, Principal Component Analysis, Linear Discriminant Analysis, Factor Analysis, Positive Region, Information Entropy, and Independent Component Analysis reduction algorithms to massive patient datasets using the Hadoop Distributed File System, to achieve lossless data reduction and to acquire the required knowledge. The experimental results demonstrate that the ICA technique operates efficiently on massive datasets, eliminates irrelevant data without loss of accuracy, and reduces both the storage space for the data and the computation time compared to the other techniques.
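As a rough illustration of the two simplest criteria named in the description, the sketch below drops attributes with zero variance and computes the Shannon entropy of a discrete attribute. It is a single-machine, standard-library sketch and not the authors' HDFS-based implementation; the function names, the threshold, and the toy patient records are all hypothetical.

```python
import math
from collections import Counter
from statistics import pvariance


def variance_filter(rows, names, threshold):
    """Keep only the columns whose population variance exceeds `threshold`."""
    columns = list(zip(*rows))  # transpose rows into columns
    keep = [i for i, col in enumerate(columns) if pvariance(col) > threshold]
    reduced = [[row[i] for i in keep] for row in rows]
    return [names[i] for i in keep], reduced


def shannon_entropy(values):
    """Shannon entropy, in bits, of a discrete attribute."""
    counts = Counter(values)
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Hypothetical toy records (not taken from the paper's datasets).
names = ["age", "ward_code", "systolic_bp"]
rows = [
    [34, 1, 118],
    [51, 1, 135],
    [47, 1, 127],
    [62, 1, 142],
]

# ward_code is constant across records, so it carries no information
# and is removed by the variance criterion.
kept_names, reduced_rows = variance_filter(rows, names, threshold=0.0)
print(kept_names)
print(shannon_entropy([row[1] for row in rows]))
```

A constant attribute has both zero variance and zero entropy, which is why either criterion can flag it as removable; the remaining techniques in the list (PCA, LDA, ICA, and so on) transform attributes rather than simply filtering them.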
Files
D65081110421.pdf
Name | Size |
---|---|
D65081110421.pdf (md5:534ce0b5c8939b6424e95a2e913589f7) | 345.3 kB |
Additional details
Related works
- Is cited by: Journal article: 2277-3878 (ISSN)
Subjects
- ISSN: 2277-3878
- Retrieval Number: 100.1/ijrte.D65081110421