Published July 30, 2024 | Version 1.0
Dataset · Open

CLIP-Embedded RedCaps Text-Image Dataset

Description

This dataset was created by applying CLIP embeddings to the RedCaps dataset. Queries were generated by OpenAI's GPT model to simulate textual searches over multimodal content, and were embedded with the same CLIP model. The underlying data was curated by Desai, Kaul, Aysola, and Johnson from data collected from Reddit, and further processed into vector data by Engels for this work.
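
The embedding step can be reproduced along these lines. This is a minimal sketch, assuming the standard Hugging Face CLIP checkpoint openai/clip-vit-base-patch32; the record does not state which CLIP variant was used, and the image path and query text below are hypothetical.

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Assumed checkpoint; the actual CLIP variant used for this dataset is not stated here.
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

# Embed a RedCaps-style image (hypothetical local path).
image = Image.open("example_redcaps_image.jpg")
image_inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    image_vec = model.get_image_features(**image_inputs)

# Embed a GPT-style textual query the same way (hypothetical query text).
text_inputs = processor(text=["a golden retriever puppy playing in the snow"],
                        return_tensors="pt", padding=True)
with torch.no_grad():
    query_vec = model.get_text_features(**text_inputs)

# Cosine similarity between the query vector and the image vector,
# the typical way such text-image embeddings are compared at search time.
sim = torch.nn.functional.cosine_similarity(query_vec, image_vec)
print(sim.item())
```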

Usage of the dataset itself is subject to Reddit's terms: the Reddit User Agreement, Content Policy, and Privacy Policy (quoted from Desai et al.'s paper accompanying the image-and-text dataset). Usage of the queries is subject to OpenAI's terms; among other restrictions, these prohibit using the query component of this dataset to develop models that compete with OpenAI.

Files

Files (23.8 GB)

md5:a6221cc0a4103af7e0f06f87bd989a0a
23.8 GB
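
After downloading, the file can be checked against the published md5 checksum. A minimal sketch, with a hypothetical local filename:

```python
import hashlib

# Published checksum from the file listing above.
EXPECTED_MD5 = "a6221cc0a4103af7e0f06f87bd989a0a"

def md5_of(path, chunk_size=1 << 20):
    """Compute the md5 digest of a file, reading in 1 MiB chunks."""
    h = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

digest = md5_of("embedded-redcaps.bin")  # hypothetical local filename
assert digest == EXPECTED_MD5, f"checksum mismatch: {digest}"
print("checksum OK")
```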

Additional details

Related works

Is published in
arXiv:2111.11431 (arXiv)
Is supplemented by
arXiv:2402.00943 (arXiv)

Dates

Available
2024-07-30
