Correlation of EchoMind Multi-Level Empathetic Evaluation with Human Judgments and Model Differentiation in Spoken Dialogue
Description
Driven by the rapid advancement of Large Language Models (LLMs), particularly Audio-LLMs and Omni-models, spoken dialogue systems have evolved significantly, progressively narrowing the gap between human-machine and human-human interactions. Achieving truly ``human-like'' communication necessitates a dual capability: emotional intelligence to perceive and resonate with users' emotional states, and robust interaction mechanisms to navigate the dynamic, natural flow of conversation, such as real-time turn-taking. Therefore, we launched the first Human-like Spoken Dialogue Systems Challenge (HumD
Research goal: To what extent does multi-level empathetic evaluation in EchoMind correlate with human judgments on empathy in spoken dialogue systems, and can this benchmark identify nuanced differences between models like OpenPangu-7B-MLA and Llama2-70B?
Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.1/10.
Notes
Files
paper.pdf
Files
(86.0 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:403b926cccdcf888aaf947984a250758
|
86.0 kB | Preview Download |