Exploring Biomedical Relation Extraction through ChatGPT Augmentation and Dual Training
Creators
- 1. Miin Wu School of Computing, National Cheng Kung University, Tainan, Taiwan
- 2. Dept. of Statistics, National Cheng Kung University, Tainan, Taiwan
- 3. Cross College Elite Program, National Cheng Kung University, Tainan, Taiwan
- 4. Dept. of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan
Description
Abstract
Relation extraction in biomedical text mining faces challenges due to complex terminology and rapidly growing literature. Our research focuses on improving relation extraction through data augmentation and targeted dual training. We fine-tuned a PubMedBERT model and enriched it with GPT-4 generated examples, iteratively refining the process. A dual training approach focusing on chemical entities significantly improved F1 scores by 2.04% on average. Our strategies demonstrate the effectiveness of ChatGPT-based augmentation and selective dual training for advancing biomedical text mining.
This article is part of the Proceedings of the BioCreative VIII Challenge and Workshop: Curation and Evaluation in the era of Generative Models.
Files
Exploring Biomedical Relation Extraction through ChatGPT Augmentation and Dual Training.pdf
Files
(186.4 kB)
Name | Size | Download all |
---|---|---|
md5:6eafa6534791f8b4710bb486e35cd674
|
186.4 kB | Preview Download |
Additional details
Related works
- Is published in
- Conference proceeding: 10.5281/zenodo.10103190 (DOI)