Published September 19, 2024 | Version v1.0.2
Software Open

BioCLIP

Description

Images of the natural world are an abundant source of biological information. There are many computational methods and tools, particularly computer vision, for extracting information from images. However, existing methods consist of bespoke models, not adaptable or extendable from their targeted task to new questions, contexts, and datasets. We thus develop the first large-scale multimodal model, BioCLIP, for general biology questions on images. We leverage the unique properties of biology (abundance and variety of images and availability of rich structured biological knowledge) as the the application domain for computer vision.

Notes

If you use this software, please cite both the article from preferred-citation and the software itself.

 

Article Citation: Stevens, S., Wu, J., Thompson, M. J., Campolongo, E. G., Song, C. H., Carlyn, D. E., Dong, L., Dahdul, W. M., Stewart, C., Berger-Wolf, T., Chao, W., & Su, Y. (2024). BioCLIP: A Vision Foundation Model for the Tree of Life [Conference paper]. Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 19412-19424.

Files

Imageomics/bioclip-v1.0.2.zip

Files (9.4 MB)

Name Size Download all
md5:88b3f5e8292b5b5ea3415a94d5665735
9.4 MB Preview Download

Additional details

Related works

Is source of
Model: 10.57967/hf/1511 (DOI)
Is supplement to
Dataset: 10.57967/hf/1972 (DOI)
Dataset: 10.57967/hf/1981 (DOI)
Is version of
Software: https://github.com/Imageomics/bioclip/tree/v1.0.2 (URL)

Funding

U.S. National Science Foundation
HDR Institute: Imageomics: A New Frontier of Biological Information Powered by Knowledge-Guided Machine Learning 2118240

References

  • Ilharco, G., Wortsman, M., Carlini, N., Taori, R., Dave, A., Shankar, V., Namkoong, H., Miller, J., Hajishirzi, H., Farhadi, A., & Schmidt, L. (2021). OpenCLIP (0.1). Zenodo. https://doi.org/10.5281/zenodo.5143773