Published July 30, 2024 | Version 1.0
Dataset Open

Embedded HuffPost News Category Dataset

  • 1. Twitter (United States)
  • 2. Massachusetts Institute of Technology

Description

This dataset was created by embedding the concatenation of the title and short description of each entry in the HuffPost News Category Dataset, with entries ordered by timestamp, using OpenAI's text-embedding-3-small embedding model. Usage is subject to Huffington Post's user agreement and OpenAI's terms of use.
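The preprocessing described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' actual pipeline: the field names `headline`, `short_description`, and `date` follow the public HuffPost News Category Dataset JSON, while the single-space separator, the batch size, and the helper names `build_inputs` and `embed_texts` are hypothetical. The embedding step needs the `openai` package and an API key, so it is defined but not called here.

```python
from datetime import date


def build_inputs(records, sep=" "):
    """Order records by date, then join each headline with its short description.

    Assumption: dates are ISO strings (YYYY-MM-DD) and the separator is a
    single space; the original record does not state either detail.
    """
    ordered = sorted(records, key=lambda r: date.fromisoformat(r["date"]))
    return [f'{r["headline"]}{sep}{r["short_description"]}' for r in ordered]


def embed_texts(texts, model="text-embedding-3-small", batch_size=256):
    """Embed texts in batches with the OpenAI embeddings endpoint.

    Requires the `openai` package and the OPENAI_API_KEY environment variable.
    The batch size is an arbitrary illustrative choice.
    """
    from openai import OpenAI  # imported lazily so the rest runs without it

    client = OpenAI()
    vectors = []
    for start in range(0, len(texts), batch_size):
        response = client.embeddings.create(
            model=model, input=texts[start:start + batch_size]
        )
        vectors.extend(item.embedding for item in response.data)
    return vectors


# Tiny made-up sample, deliberately out of chronological order.
sample = [
    {"headline": "Newer story", "short_description": "Second summary.", "date": "2022-09-23"},
    {"headline": "Older story", "short_description": "First summary.", "date": "2012-01-28"},
]
texts = build_inputs(sample)
```

Calling `embed_texts(texts)` would then return one vector per entry, in the same timestamp order as the input texts.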

Files

Files (429.1 MB)

Name Size
md5:a9a27d213fc3ecf14791f6d3b1c357f2
429.1 MB
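After downloading, the file can be checked against the MD5 digest listed above. The following is a minimal sketch; the helper name `md5_of_file` and the local path in the usage note are hypothetical, since the record does not show the file name.

```python
import hashlib

# Digest copied from the file listing in this record.
EXPECTED_MD5 = "a9a27d213fc3ecf14791f6d3b1c357f2"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

For example, `md5_of_file("downloaded_file") == EXPECTED_MD5` should evaluate to `True` for an intact download, where `downloaded_file` stands in for wherever the file was saved.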

Additional details

Related works

Is published in
arXiv:2209.11429 (arXiv)