There is a newer version of this record available.

Dataset Open Access

Big Data to Knowledge (BD2K) Training Coordinating Center (TCC) Educational Resource Discovery Index (ERuDIte) as Linked Data

Ambite, Jose Luis; Gordon, Jonathan; Fierro, Lily; Burns, Gully; Kamdar, Jeana; Abe, Sumiko; Stewart, Crystal; Bhattrai, Avnish; Lei, Xiaoxiao; Van Horn, John D.

This is a release of the Big Data to Knowledge (BD2K) Training Coordinating Center (TCC) Educational Resource Discovery Index (ERuDIte) as Linked Data.

ERuDIte contains over 10,000 training resources on data science, including courses (MOOCs), video tutorials, conference talks, and other materials. The metadata of these resources is described uniformly. In addition, we use machine learning techniques to tag each resource with concepts from the Data Science Education Ontology (DSEO), which we developed to further describe the contents of the training resources. Resource relevance and tags are curated by experts to ensure high quality. Finally, we map the references to people and organizations in the learning resource metadata to entities in DBpedia, DBLP, and ORCID, thus embedding our collection in the web of linked data. Our collection is continually growing. We hope that ERuDIte will provide a framework to foster open linked educational resources on the web.

Distributed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

This work is supported by the National Institutes of Health under Grant 1U24ES026465-01.
Files (31.5 MB)
Size
26.8 MB
493.6 kB
4.2 MB
                  All versions   This version
Views             1,222          222
Downloads         394            62
Data volume       3.2 GB         706.7 MB
Unique views      980            181
Unique downloads  191            35

