Understanding the Challenges of Training NLP Models on Multilingual Data
Authors/Creators
Description
This publication, authored by Nandhavardhan Battula, delves into the complexities and challenges of training multilingual Natural Language Processing (NLP) models. As multilingual NLP becomes increasingly crucial for global communication, this paper identifies the key obstacles that hinder effective model training, such as data scarcity, linguistic diversity, computational demands, and biases in evaluation. It examines the shortcomings of current solutions like pre-trained multilingual models and transfer learning, and highlights the disparity in performance between high-resource and low-resource languages. Additionally, the paper proposes future research directions focused on improving data collection for underrepresented languages, creating fairer evaluation benchmarks, and developing more efficient model architectures. The goal is to advance the inclusivity and scalability of multilingual NLP systems, ensuring that these technologies are more equitable and accessible to a wider range of languages and cultures.
Files
Understanding the Challenges of Training NLP Models on Multilingual Data.pdf
Files
(199.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:1df227f7698999f9e6129844ba36f58c
|
199.0 kB | Preview Download |
Additional details
Software
- Development Status
- Active
References
- Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv preprint arXiv:1810.04805.
- Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzmán, F., ... & Stoyanov, V. (2020). Unsupervised Cross-lingual Representation Learning at Scale. arXiv preprint arXiv:1911.02116.
- Artetxe, M., & Schwenk, H. (2019). Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond. Transactions of the Association for Computational Linguistics, 7, 597-610.
- Ruder, S., Peters, M. E., Swayamdipta, S., & Wolf, T. (2019). Transfer Learning in Natural Language Processing. arXiv preprint arXiv:1903.11260.
- Tiedemann, J. (2012). Parallel Data, Tools and Interfaces in OPUS. Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12).