Does increasing the diversity of pseudo-parallel synthetic data improve cross-domain generalization accuracy f
Description
In the past five years, research has shifted from traditional Machine Learning (ML) and Deep Learning (DL) approaches to leveraging Large Language Models (LLMs) , including multimodality, for data augmentation to enhance generalization, and combat overfitting in training deep convolutional neural networks. However, while existing surveys predominantly focus on ML and DL techniques or limited modalities (text or images), a gap remains in addressing the latest advancements and multi-modal applications of LLM-based methods. This survey fills that gap by exploring recent literature utilizing multi
Research goal: Does increasing the diversity of pseudo-parallel synthetic data improve cross-domain generalization accuracy for low-resource language pairs compared to standard duplication methods?
Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 9.0/10.
Notes
Files
paper.pdf
Files
(73.8 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:d3d10f18ac8f2e3cadad559902b6a78d
|
73.8 kB | Preview Download |