Assisted Text Annotation Using Active Learning toAchieve High Quality with Little Effort
- 1. University of Konstanz
- 2. Heidelberg Academy of Sciences and Humanities
- 3. University of Zurich and Heidelberg Academy of Sciences and Humanities
- 4. University of Wuppertal
Description
arge amounts of annotated data have become moreimportant than ever, especially since the rise of deep learningtechniques. However, manual annotations are costly. We proposea tool that enables researchers to create large, high-quality,annotated datasets with only a few manual annotations, thusstrongly reducing annotation cost and effort. For this purpose,we combine an active learning (AL) approach with a pre-trained language model to semi-automatically identify annotationcategories in the given text documents. To highlight our researchdirection’s potential, we evaluate the approach on the task ofidentifying frames in news articles. Our preliminary results showthat employing AL strongly reduces the number of annotationsfor correct classification of even these complex and subtle frames.On the framing dataset, the AL approach needs only 16.3% of theannotations to reach the same performance as a model trainedon the full dataset.
Files
jcdl_poster.pdf
Files
(109.1 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:f2fa428e0c994dd50f83e5f5c1e86bee
|
109.1 kB | Preview Download |