Published June 13, 2022 | Version v1
Journal article Open

Selecting representative samples from complex biological datasets using k-medoids clustering

  • 1. University of Chicago

Description

This method quantifies the relationships/similarities among samples using their Euclidian distances by vectorizing all given properties, and then determines an appropriate sample size by evaluating the coverage of key proprieties from multiple candidate sizes, following by a k-medoids clustering to group samples into several clusters, and selects centers from each cluster as the most representatives.

Files

Cookie-master.zip

Files (8.5 MB)

Name Size Download all
md5:23b1d1bb0f77d2be7aaeba28b17347d7
8.5 MB Preview Download