Quilt-1M: One Million Image-Text Pairs for Histopathology

doi:10.5281/zenodo.8239942

Published August 11, 2023 | Version v1

Dataset Restricted

Quilt-1M: One Million Image-Text Pairs for Histopathology

1. University of Washington

Recent accelerations in multi-modal applications have been made possible with the plethora of image and text data available online. However, the scarcity of similar data in the medical field, specifically in histopathology, has slowed similar progress. To enable similar representation learning for histopathology, we turn to YouTube, an untapped resource of videos, offering 1,087 hours of valuable educational histopathology videos from expert clinicians. From YouTube, we curate Quilt: a large-scale vision-language dataset consisting of 802,148 image and text pairs. Quilt was automatically curated using a mixture of models, including large language models), handcrafted algorithms, human knowledge databases, and automatic speech recognition. In comparison, the most comprehensive datasets curated for histopathology amass only around 200K samples. We combine Quilt with datasets, from other sources, including Twitter, research papers, and the internet in general, to create an even larger dataset: Quilt-1M, with 1M paired image-text samples, marking it as the largest vision-language histopathology dataset to date. We demonstrate the value of Quilt-1M by fine-tuning a pre-trained CLIP model. Our model outperforms state-of-the-art models on both zero-shot and linear probing tasks for classifying new pathology images across 13 diverse patch-level datasets of 8 different sub-pathologies and cross-modal retrieval tasks.

Notes

This version of the dataset has images resized to 512px by 512px, for the original variable-sized images between 1920px -320px please use the Google form to request restricted and timed access to the dataset, see https://quilt1m.github.io/.

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

Data use

I hereby commit myself

1. to inform the data provider in written form about the research question(s) and the intended analyses (brief description) no later than at the time of data transfer.

2. to comply with current laws and regulations on data protection when working with the data.

3. to use the data carefully, considering the associated documentation and specifications.

4. not to make the data available to a third party. In a research team, all users have to sign the contract individually.

5. not to link the data with other individual data and not to use the data for commercial purposes.

6. not to re-distribute, publish, copy, or further disseminated in any way or form whatsoever, whether for profit or not.

Publications and methodological principals

I hereby commit myself

7. to cite all used data conforming with accepted scientific standards.

Citation:

@misc{ikezogwo2023quilt1m,
title={Quilt-1M: One Million Image-Text Pairs for Histopathology},
author={Wisdom Oluchi Ikezogwo and Mehmet Saygin Seyfioglu and Fatemeh Ghezloo and Dylan Stefan Chan Geva and Fatwir Sheikh Mohammed and Pavan Kumar Anand and Ranjay Krishna and Linda Shapiro},
year={2023},
eprint={2306.11207},
archivePrefix={arXiv},
primaryClass={cs.CV},
}

You are currently not logged in. Do you have an account? Log in here

Additional details

Is derived from: Dataset: https://arxiv.org/abs/2306.11207 (URL)

	All versions	This version
Views	3,218	3,145
Downloads	4,383	4,320
Data volume	54.2 TB	53.6 TB

Quilt-1M: One Million Image-Text Pairs for Histopathology

Creators

Description

Notes

Files

Restricted

Request access

Additional details

Related works