Published March 8, 2024 | Version v1
Dataset Open

Mutation Testing for Task-Oriented Chatbots: Dataset

Description

Conversational agents, or chatbots, are increasingly used to access all sorts of services using natural language. While open-domain chatbots - like ChatGPT - can converse on any topic, task-oriented chatbots - the focus of this paper - are designed for specific tasks, like booking a flight, obtaining customer support, or setting an appointment. Like any other software, task-oriented chatbots need to be properly tested, usually by defining and executing test scenarios (i.e., sequences of user-chatbot interactions). However, there is currently a lack of methods to quantify the completeness and strength of such test scenarios, which can lead to low-quality tests, and hence to buggy chatbots.

To fill this gap, we propose adapting mutation testing (MuT) for task-oriented chatbots. To this end, we introduce a set of mutation operators that emulate faults in chatbot designs, an architecture that enables MuT on chatbots built using heterogeneous technologies, and a practical realisation as an Eclipse plugin. Moreover, we evaluate the applicability, effectiveness and efficiency of our approach on open-source chatbots, with promising results.

Files

MuTChatbots.zip

Files (26.8 MB)

Name Size Download all
md5:52e6b702f465d1643d3cde1fcdd91c7e
26.8 MB Preview Download

Additional details

Software

Repository URL
https://github.com/gomezabajo/MuTChatbots
Programming language
Java
Development Status
Active