How to Feed Your Robot: Building and Maintaining Open Machine Learning Datasets

doi:10.5281/zenodo.3233117

csv,conf,v4

Published May 28, 2019 | Version 1

Presentation Open

How to Feed Your Robot: Building and Maintaining Open Machine Learning Datasets

Evan Tachovsky¹

1. Rockefeller Foundation

While algorithms and computing power get all the press, the special sauce behind many recent machine learning breakthroughs are meticulously labeled training data. Developing and maintaining these data sets as public goods is both an art and a science. In this talk I'll present a new set of best practices gleaned from interview with ~20 data set builders, maintainers, and funders. Topics include: encouraging collaboration between rival data teams; finding and addressing ethical issues with crowd labeling; launching competitions to spur data set use; and revenue generation models for sustainability.

Files

2019-05-08 How to Feed Your Robot.pdf

Files (856.8 kB)

Name	Size	Download all
2019-05-08 How to Feed Your Robot.pdf md5:df657eaa028196dca4899bd5949fef6d	856.8 kB	Preview Download

200

Views

Downloads

Show more details

	All versions	This version
Views	200	200
Downloads	77	77
Data volume	69.4 MB	69.4 MB

More info on how stats are collected....

DOI

Resource type

Presentation

Publisher

Zenodo

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: May 28, 2019
Modified: January 20, 2020

How to Feed Your Robot: Building and Maintaining Open Machine Learning Datasets

Creators

Description

Files

2019-05-08 How to Feed Your Robot.pdf

Files (856.8 kB)