Real world networks for network classification method evaluation
Authors/Creators
- 1. Computer Engineering Department, Polytechnic School of the University of São Paulo, São Paulo, SP, Brazil
- 2. São Carlos Institute of Physics, University of São Paulo
- 3. Institute of Biosciences, Humanities and Exact Sciences, São Paulo State University, São José do Rio Preto, SP, Brazil
Description
This research presents a set of network databases based on real-world data, aimed at evaluating the effectiveness of network classification methods. The first database, called the "Social database," consists of networks from the Stanford Network Analysis Project (SNAP) platform, including two classes, namely, Google+ and Twitter, we have made some preprocessment and also reduced the original dataset from the SNAP platform such that each class contains 50 networks. The other databases presented in this study are collectively known as the "Metabolic database," as they are constructed using the substrate-product network model and are based on biochemical reactions of organisms obtained from the Kyoto Encyclopedia of Genes and Genomes database (KEGG). The networks were generated using a model that considers metabolites as vertices and chemical reactions as edges. The Metabolic database comprises six classification schemes, which include the "kingdom-database" with 160 network samples, where each of the four classes contains 40 networks representing animal, plant, fungi, and protist kingdoms. The remaining databases in the Metabolic database are the "Animal-database," "Fungi-database," "Plant-database," "Firmicutes-Bacilis-database," and "Actinobacteria-database," each containing a varying number of network samples.
To ensure consistency and comparability of our results, we implemented a standardization procedure whereby all networks were converted to adjacency list format. This allowed us to more efficiently process and analyze the data.
Files
real.zip
Files
(8.1 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:b4fe75bfa43a423cef46e83992c32e7d
|
8.1 MB | Preview Download |
Additional details
Related works
- Cites
- 10.1145/2898361 (DOI)
- 10.1093/nar/gkv1070 (DOI)