Published February 24, 2025 | Version v1

100 samples of 5000 instances of categorical BNs from bnlearn's Bayesian Network Repository

  • 1. Universidad de Castilla-La Mancha

Description

We provide 100 samples, each containing 5000 instances, of discrete Bayesian Networks from bnlearn's Bayesian Network Repository. Specifically, the BNs, along with their characteristics, are:

NETWORK #NODES #EDGES #PARAMETERS MAX. PARENTS MEAN DEGREE
Cancer 5 4 10 2 2.00
Earthquake 5 4 10 2 2.00
Survey 6 6 21 2 2.00
Asia 8 8 18 2 2.00
Sachs 11 17 178 3 3.09
Child 20 25 230 4 2.50
Insurance 27 52 230 4 2.50
Water 32 66 10083 5 4.12
Mildew 35 46 540150 3 2.63
Alarm 37 46 509 5 2.49
Barley 48 84 114005 4 3.50
Hailfinder 56 66 2656 4 2.36
Hepar2 70 123 1453 6 3.51
Win95pts 76 112 574 7 2.95
Pathfinder 109 195 72079 5 3.58
Munin1 186 273 15622 3 2.94
Andes 223 338 1157 6 3.03
Diabetes 413 602 429409 2 2.92
Pigs 441 592 5618 2 2.68
Link 724 1125 14211 3 3.11
Munin2 1003 1244  69431 3 2.48
Munin4 1038 1388 80352 3 2.67
Munin3 1041 1306 71059 3 2.51
Munin 1041 1397 80592 3 2.68

 

Each dataset is sampled using Python and the bnlearn package. The BN structure is loaded from the .bif file using bif = bnlearn.import_DAG(path), and samples are generated with bnlearn.sampling(bif, n=5000, methodtype='bayes'). Post-processing is then applied to replace the generated numerical values with those categorical specified in the .bif structure file.

Additionally, ten extra old database samples of most BNs can be found on OpenML.

Files

alarm.zip

Files (1.2 GB)

Name Size
md5:fd529771339e60d707287ac0ca230f8e
6.0 MB Preview Download
md5:727da70b89ead3869d6898a6a7b25ad9
30.0 MB Preview Download
md5:692c0bfb7d9e60c54b3a111de71f4003
671.8 kB Preview Download
md5:1345c01df353178c43b6bf4b918f9a32
23.6 MB Preview Download
md5:b41ae2399273886ddbc9273c8975135a
555.5 kB Preview Download
md5:98863262d85515845340cd1d6ce06ebc
3.8 MB Preview Download
md5:6f064569d7274f03c2aa5722197037d2
183.2 MB Preview Download
md5:7c5a69db42cabc08e48077b0b4b039d1
173.7 kB Preview Download
md5:abe6ded94828012539cca4c5ce804db8
22.0 MB Preview Download
md5:d78077f44c1e88c2cb84e8e31f2f268a
13.3 MB Preview Download
md5:3ae0399292198dc1878f4cee198bc14a
5.3 MB Preview Download
md5:a1c6ebd8f99bc9166a0be44e0dd47ffc
162.2 MB Preview Download
md5:26765fa7c2009a2a044b29a74abf2612
19.4 MB Preview Download
md5:a89265c099d62d1e315f6f0e20172249
199.1 MB Preview Download
md5:f71c719936d325ce3e2882a47edfacc5
29.2 MB Preview Download
md5:5318559fc9712e8307cfff72e3a18e4f
115.9 MB Preview Download
md5:72d6aa30a8dbe552241841e117103774
128.1 MB Preview Download
md5:7179443815bb381a85bb9e3874339a77
133.6 MB Preview Download
md5:a09414cb0aae736326ddd05d18519e27
21.7 MB Preview Download
md5:7cb3892512348b856077d816761cf463
65.0 MB Preview Download
md5:58aa942a2752cba651b040c2385a0fd3
2.0 MB Preview Download
md5:4682aacc44ab544fe834c560b984668c
866.8 kB Preview Download
md5:0c55ac7f5419850d0ef8e39bf8e5fe50
4.9 MB Preview Download
md5:777e52b3b7b052ec6bd27896cadbfded
7.4 MB Preview Download

Additional details

References

  • Scutari M (2010). "Learning Bayesian Networks with the bnlearn R Package." Journal of Statistical Software, 35(3), 1–22. doi:10.18637/jss.v035.i03