Published February 27, 2023 | Version v3
Dataset Open

UCI datasets

Description

A collection of two datasets from the UCI Machine Learning Repository that can be used for structure learning tasks. The datasets cover:

  • Air Quality
  • US census 1990

Size: Two datasets of sizes 9471 × 17 and 2458285 × 68 (rows × columns), respectively
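After extracting the archive, the stated sizes can be compared against the files themselves. The following is a minimal sketch, assuming the data are stored as delimited text and that the figures above are row and column counts. The file name and delimiter in the usage comment are hypothetical, and the page does not say whether a header row is included in the row count.

```python
import csv

# Sizes as stated on this page (assumed here to be rows, columns).
STATED_SIZES = {
    "Air Quality": (9471, 17),
    "US census 1990": (2458285, 68),
}


def csv_shape(path, delimiter=",", encoding="utf-8"):
    """Return (rows, columns) for a delimited text file.

    rows counts every record the csv module yields, including a
    header row and blank lines; columns is the widest record seen.
    """
    rows = 0
    columns = 0
    with open(path, newline="", encoding=encoding) as handle:
        for record in csv.reader(handle, delimiter=delimiter):
            rows += 1
            columns = max(columns, len(record))
    return rows, columns


# Example usage (hypothetical file name and delimiter):
# shape = csv_shape("air_quality.csv", delimiter=";")
# print(shape, "stated:", STATED_SIZES["Air Quality"])
```

Reading the file record by record keeps memory use flat, which matters for the larger of the two datasets. A row count that differs by one from the stated figure may simply reflect the header row.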

Number of features: 15-68

Ground truth: No

Type of Graph: No ground truth

 

More information about the datasets is contained in the dataset_description.html files.

Files

UCI_datasets.zip (55.1 MB)
md5:7cf9ec72fe833400fbc936a3d673df9e
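The md5 value listed for UCI_datasets.zip can be used to confirm that a download is intact. The sketch below uses only the Python standard library. The local path is an assumption, and the helper names are illustrative rather than part of any tool provided with the dataset.

```python
import hashlib
from pathlib import Path

# Checksum published on this page for UCI_datasets.zip.
EXPECTED_MD5 = "7cf9ec72fe833400fbc936a3d673df9e"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("UCI_datasets.zip")  # assumed download location
    if archive.exists():
        ok = matches_published_md5(archive)
        print("checksum OK" if ok else "checksum mismatch")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually points to an incomplete or corrupted download, in which case fetching the archive again is the simplest remedy.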

Additional details

References

  • Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.
  • Meek, Thiesson, and Heckerman (2001), "The Learning Curve Method Applied to Clustering", to appear in The Journal of Machine Learning Research.
  • De Vito, S., Massera, E., Piga, M., Martinotto, L., and Di Francia, G. (2008), "On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario", Sensors and Actuators B: Chemical, Volume 129, Issue 2, Pages 750-757, ISSN 0925-4005. https://doi.org/10.1016/j.snb.2007.09.060