Published December 7, 2022 | Version v9
Dataset
Open
Wiki-based Communities of Interest: Demographics and Outliers
Creators
- 1. Max Planck Institute for Informatics
- 2. The University of Edinburgh
Description
These datasets contain statements about the demographics and outliers of Wiki-based Communities of Interest.
Group-centric dataset (sample):
{
  "title": "winners of Priestley Medal",
  "recorded_members": 83,
  "topics": ["STEM.Chemistry"],
  "demographics": [
    "occupation-chemist",
    "gender-male",
    "citizen-U.S."
  ],
  "outliers": [
    {
      "reason": "NOT(chemist) unlike 82 recorded members",
      "members": [
        "Francis Garvan (lawyer, art collector)"
      ]
    },
    {
      "reason": "NOT(male) unlike 80 recorded members",
      "members": [
        "Mary L. Good (female)",
        "Darleane Hoffman (female)",
        "Jacqueline Barton (female)"
      ]
    }
  ]
}
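As an illustration of how a group-centric record might be read, the following minimal sketch parses the sample above and counts the members listed under each outlier reason. It works on the sample text only; how records are laid out inside the downloadable files is not described here, and the helper name `summarize_outliers` is hypothetical.

```python
import json

# Sample record copied from the description; the structure of the
# downloadable files themselves is an unknown and is not assumed here.
group_text = """
{
  "title": "winners of Priestley Medal",
  "recorded_members": 83,
  "topics": ["STEM.Chemistry"],
  "demographics": ["occupation-chemist", "gender-male", "citizen-U.S."],
  "outliers": [
    {
      "reason": "NOT(chemist) unlike 82 recorded members",
      "members": ["Francis Garvan (lawyer, art collector)"]
    },
    {
      "reason": "NOT(male) unlike 80 recorded members",
      "members": [
        "Mary L. Good (female)",
        "Darleane Hoffman (female)",
        "Jacqueline Barton (female)"
      ]
    }
  ]
}
"""


def summarize_outliers(record):
    """Return (reason, number of listed members) for each outlier entry."""
    return [
        (entry["reason"], len(entry["members"]))
        for entry in record.get("outliers", [])
    ]


group = json.loads(group_text)
for reason, count in summarize_outliers(group):
    print(f"{group['title']}: {reason} -> {count} listed member(s)")
```

Each outlier entry pairs one `reason` with the members it applies to, so a per-reason count is enough to see which demographic a group most often departs from.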
Subject-centric dataset (sample):
{
  "subject": "Serena Williams",
  "statements": [
    {
      "statement": "NOT(sport-basketball) but (tennis) unlike 4 recorded winners of Best Female Athlete ESPY Award.",
      "score": 0.36
    },
    {
      "statement": "NOT(occupation-politician) but (tennis player, businessperson, autobiographer) unlike 20 recorded winners of Michigan Women's Hall of Fame.",
      "score": 0.17
    }
  ]
}
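A subject-centric record can be handled in a similar way. The sketch below ranks the sample's statements by `score` and optionally filters them with a threshold. It assumes that a higher score marks a more salient statement, which the description does not state explicitly; the helper name `top_statements` and the `min_score` parameter are hypothetical.

```python
import json

# Sample record copied from the description above.
subject_text = """
{
  "subject": "Serena Williams",
  "statements": [
    {
      "statement": "NOT(sport-basketball) but (tennis) unlike 4 recorded winners of Best Female Athlete ESPY Award.",
      "score": 0.36
    },
    {
      "statement": "NOT(occupation-politician) but (tennis player, businessperson, autobiographer) unlike 20 recorded winners of Michigan Women's Hall of Fame.",
      "score": 0.17
    }
  ]
}
"""


def top_statements(record, min_score=0.0):
    """Return statements with score >= min_score, highest score first.

    Assumption: a larger score means a more noteworthy statement.
    """
    kept = [s for s in record["statements"] if s["score"] >= min_score]
    return sorted(kept, key=lambda s: s["score"], reverse=True)


subject = json.loads(subject_text)
for item in top_statements(subject, min_score=0.2):
    print(f"{subject['subject']} [{item['score']:.2f}]: {item['statement']}")
```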
The data can also be browsed at: https://wikiknowledge.onrender.com/demographics/
Files
Total size: 235.8 MB
File (MD5 checksum) | Size |
---|---|
md5:7143c37a9190cdc2628ca64266b0fc99 | 63.7 MB |
md5:578265b26d600f6ecb1bccf7dcfe2438 | 172.0 MB |
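The `md5:` values listed with each file can be used to check a download for corruption. The following is a minimal sketch using Python's standard `hashlib` module; the function names and the file path in the usage comment are hypothetical, since the file names are not shown in this listing.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected):
    """Compare a file's MD5 with a value such as 'md5:7143c3...' or a bare hex digest."""
    expected_hex = expected.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected_hex


# Usage (the path is a placeholder; substitute the downloaded file's location):
# ok = matches_checksum("path/to/downloaded_file", "md5:7143c37a9190cdc2628ca64266b0fc99")
```

MD5 is adequate here for detecting accidental corruption during transfer; it is not a guarantee against deliberate tampering.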