Dataset Open Access

Crunchbase in RDF: A Large Data Set About Jobs, Websites, Organizations, News, People, Products, and Acquisitions

Färber, Michael; Menne, Carsten; Harth, Andreas

CrunchBase in an online platform providing information about startups and technology companies, including related entities such as the products they sell, key people they employ, and investments they made and received.

We provide here an RDF data set of Crunchbase as of October 2015. The data set contains information about

  • 1,946,435 jobs
  • 1,348,449 websites
  • 567,937 organizations
  • 519,763 news
  • 430,093 people
  • 60,076 products, and
  • 33,127 acquisitions.

The data set has been used, among other things, for data integration with financial data sources to evaluate the performance of particular companies and for monitoring news to find statements that are not in Crunchbase as an RDF knowledge graph yet.

Note that the provided data set was created in October 2015 when all Crunchbase data was licensed under Creative Commons Attribution-NonCommercial License 4.0 (CC-BY-NC) and partly under Creative Commons Attribution License 4.0 (CC-BY). Also the provied data set is licensed under these licenses. Concerning licensing of current Crunchbase data, we can refer to https://about.crunchbase.com/terms-of-service/.

For more information about the data set, see our paper A Linked Data Wrapper for CrunchBase.

When you use the data set, please cite us as follows:

Michael Färber, Carsten Menne, Andreas Harth. “A Linked Data Wrapper for CrunchBase”. In: Semantic Web Journal 9(4). IOS Press, 2018, pp. 505–5015. (BibTeX entry at DBLP)

Files (1.2 GB)
Name Size
crunchbase-dump-2015-10.nt.gz
md5:5947ed4e597e316292b24e6e68c8d9aa
1.2 GB Download
96
25
views
downloads
All versions This version
Views 9696
Downloads 2525
Data volume 29.8 GB29.8 GB
Unique views 9292
Unique downloads 1919

Share

Cite as