Antibody Watch: Text Mining Antibody Specificity from the Literature

Hsu, Chun-Nan; Chang, Chia-Hui; Poopradubsil, Thamolwan; Lo, Amanda; William, Karen A.; Lin, Ko-Wei; Bandrowski, Anita; Ozyurt, Ibrahim Burak; Grethe, Jeffery S.; Martone, Maryann E.

doi:10.5281/zenodo.3943128

Published July 12, 2020 | Version v3

Dataset Open

Antibody Watch: Text Mining Antibody Specificity from the Literature

Abstract

Motivation: Antibodies are widely used reagents to test for expression of proteins. However, they might not always reliably produce results when they do not specifically bind to the target proteins that their providers designed them for, leading to unreliable research results.

Results: We developed a deep neural network system and tested its performance with a corpus of more than two thousand articles that reported uses of antibodies. We divided the problem into two tasks. Given an input article, the first task is to identify snippets about antibody specificity and classify if the snippets report any antibody that is nonspecific, and thus problematic. The second task is to link each of these snippets to one or more antibodies that the snippet referred to. We leveraged the Research Resource Identifiers (RRID) to precisely identify antibodies linked to the extracted specificity snippets. The result shows that it is feasible to construct a reliable knowledge base about problematic antibodies by text mining.

Files

antibody_watch_dataset.zip

Files (1.7 MB)

Name	Size	Download all
antibody_watch_dataset.zip md5:509ff5756801e631d981520b45b97118	1.7 MB	Preview Download

	All versions	This version
Views	1,067	426
Downloads	141	82
Data volume	303.2 MB	146.4 MB

Antibody Watch: Text Mining Antibody Specificity from the Literature

Authors/Creators

Description

Files

antibody_watch_dataset.zip

Files (1.7 MB)