Published August 8, 2023 | Version 2023-08-08
Dataset Restricted

A collection of text embeddings of the arXiv corpus by title and abstract

  • 1. Yale School of Medicine

Description

arXiv is a popular online repository that hosts numerous preprints across many scientific domains. Beyond its role in disseminating up-to-date knowledge in those domains, arXiv is an interesting complex system in its own right from a text-analytics point of view. This repository contains a collection of text embeddings for (almost) all papers in the arXiv corpus, computed from their titles and abstracts, intended to provide multi-faceted characteristics of scientific knowledge.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.