Dataset Open Access

Wiki-based Communities of Interest: Demographics and Outliers

Hiba Arnaout; Simon Razniewski; Jeff Z. Pan


Dublin Core Export

<?xml version='1.0' encoding='utf-8'?>
<oai_dc:dc xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
  <dc:creator>Hiba Arnaout</dc:creator>
  <dc:creator>Simon Razniewski</dc:creator>
  <dc:creator>Jeff Z. Pan</dc:creator>
  <dc:date>2022-12-07</dc:date>
  <dc:description>These datasets contains statements about demographics and outliers of Wiki-based Communities of Interest. 

Group-centric dataset (sample):

{
	"title": "winners of Priestley Medal", 
	"recorded_members": 83, 
	"topics": ["STEM.Chemistry"], 
	"demographics": [
            "occupation-chemist",
            "gender-male", 
            "citizen-U.S."
	], 
	"outliers": [
		{
			"reason": "NOT(chemist) unlike 82 recorded members", 
			"members": [
            "Francis Garvan (lawyer, art collector)"
            ]
		}, 
		{
			"reason": "NOT(male) unlike 80 recorded members", 
			"members": [
            "Mary L. Good (female)",
            "Darleane Hoffman (female)", 
            "Jacqueline Barton (female)"
            ]
		}
	]
}

Subject-centric dataset (sample):

{
	"subject": "Serena Williams", 
	"statements": [
		{
			"statement": "NOT(sport-basketball) but (tennis) unlike 4 recorded winners of Best Female Athlete ESPY Award.", 
			"score": 0.36
		},
  	{
			"statement": "NOT(occupation-politician) but (tennis player, businessperson, autobiographer) unlike 20 recorded  winners of Michigan Women's Hall of Fame.",
			"score": 0.17
		}
	]
}

This data can be also browsed at: https://wikiknowledge.onrender.com/demographics/</dc:description>
  <dc:identifier>https://zenodo.org/record/7537200</dc:identifier>
  <dc:identifier>10.5281/zenodo.7537200</dc:identifier>
  <dc:identifier>oai:zenodo.org:7537200</dc:identifier>
  <dc:language>eng</dc:language>
  <dc:relation>doi:10.5281/zenodo.7410436</dc:relation>
  <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
  <dc:rights>https://creativecommons.org/licenses/by/4.0/legalcode</dc:rights>
  <dc:subject>wikipedia</dc:subject>
  <dc:subject>wikimedia</dc:subject>
  <dc:subject>wikidata</dc:subject>
  <dc:subject>demography</dc:subject>
  <dc:subject>trivia</dc:subject>
  <dc:title>Wiki-based Communities of Interest: Demographics and Outliers</dc:title>
  <dc:type>info:eu-repo/semantics/other</dc:type>
  <dc:type>dataset</dc:type>
</oai_dc:dc>
116
15
views
downloads
All versions This version
Views 11659
Downloads 158
Data volume 1.2 GB834.7 MB
Unique views 8552
Unique downloads 115

Share

Cite as