Published April 15, 2022 | Version v1
Lesson Open

Supplemental Notebook for Unsupervised Machine Learning Using Linked SED and UMETRICS Data

  • 1. Coleridge Initiative
  • 2. University of Maryland

Description

This Jupyter notebook introduces unsupervised machine learning through clustering. It demonstrates how k-means clustering can be used to identify types of PhD students based on their funding history, using linked data from the Survey of Earned Doctorates (SED) and Universities: Measuring the Impacts of Research on Innovation, Competitiveness, and Science (UMETRICS). This supplemental notebook was developed for the Fall 2021 Applied Data Analytics training facilitated by the National Center for Science and Engineering Statistics (NCSES) and the Coleridge Initiative.
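
To give a sense of the technique, the following is a minimal sketch of k-means (Lloyd's algorithm) in plain Python. It is not taken from the notebook, and the notebook may well use a library implementation instead. The data are synthetic, and the two features (semesters supported by research grants, semesters supported by fellowships) are hypothetical stand-ins for funding-history variables; no SED or UMETRICS records are used.

```python
import random


def kmeans(points, k, n_iter=100, seed=0):
    """Cluster `points` (a list of equal-length tuples) into k groups.

    Returns (centroids, labels), where labels[i] is the cluster index
    assigned to points[i].
    """
    rng = random.Random(seed)
    # Initialise centroids as k distinct observations chosen at random.
    centroids = rng.sample(points, k)
    labels = [0] * len(points)

    for _ in range(n_iter):
        # Assignment step: attach each point to its nearest centroid
        # (squared Euclidean distance).
        labels = [
            min(
                range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])),
            )
            for p in points
        ]

        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members))
                )
            else:
                # An empty cluster keeps its previous centroid.
                new_centroids.append(centroids[j])

        # Stop once the centroids no longer move.
        if new_centroids == centroids:
            break
        centroids = new_centroids

    return centroids, labels


# Synthetic, hypothetical students:
# (semesters on research grants, semesters on fellowships)
students = [
    (8, 1), (9, 0), (7, 2), (8, 0),   # mostly grant-supported
    (1, 6), (0, 7), (2, 5), (1, 8),   # mostly fellowship-supported
]

centroids, labels = kmeans(students, k=2)
for student, label in zip(students, labels):
    print(student, "-> cluster", label)
```

In practice the features would usually be standardized before clustering, since k-means relies on Euclidean distance and is sensitive to differences in scale; the number of clusters k is also a choice the analyst has to justify.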

Files

Supplemental_Machine_Learning.ipynb (37.9 kB)
md5:e4fc8bc22d52bf9c33b755cf447d1c02