Published August 3, 2016 | Version 2015-10
Dataset Open

Crunchbase in RDF: A Large Data Set About Jobs, Websites, Organizations, News, People, Products, and Acquisitions

  • 1. Karlsruhe Institute of Technology

Description

CrunchBase in an online platform providing information about startups and technology companies, including related entities such as the products they sell, key people they employ, and investments they made and received.

We provide here an RDF data set of Crunchbase as of October 2015. The data set contains information about

  • 1,946,435 jobs
  • 1,348,449 websites
  • 567,937 organizations
  • 519,763 news
  • 430,093 people
  • 60,076 products, and
  • 33,127 acquisitions.

The data set has been used, among other things, for data integration with financial data sources to evaluate the performance of particular companies and for monitoring news to find statements that are not in Crunchbase as an RDF knowledge graph yet.

Note that the provided data set was created in October 2015 when all Crunchbase data was licensed under Creative Commons Attribution-NonCommercial License 4.0 (CC-BY-NC) and partly under Creative Commons Attribution License 4.0 (CC-BY). Also the provied data set is licensed under these licenses. Concerning licensing of current Crunchbase data, we can refer to https://about.crunchbase.com/terms-of-service/.

For more information about the data set, see our paper A Linked Data Wrapper for CrunchBase.

When you use the data set, please cite us as follows:

Michael Färber, Carsten Menne, Andreas Harth. “A Linked Data Wrapper for CrunchBase”. In: Semantic Web Journal 9(4). IOS Press, 2018, pp. 505–5015. (BibTeX entry at DBLP)

Files

Files (1.2 GB)

Name Size Download all
md5:5947ed4e597e316292b24e6e68c8d9aa
1.2 GB Download

Additional details

Related works

Is documented by
10.3233/SW-170278 (DOI)