Published October 26, 2024 | Version v1
Journal article Open

INTENT RECOGNITION USING DISTILBERT AND LANGUAGE MODELS

Authors/Creators

Contributors

Researcher (2):

Description

Intent classification in Natural Language Processing involves identifying the intention of the user based on their input/interaction with an interface. This can be in a natural usage setting (voice assistants) or an interaction between users, customer service personnel, or agents (in a large organization). This paper aims to study the problem of intent classification using language models (like DistilBERT) and Large Language Models (Phi2 and LLAMA3) on open-source datasets available in banking, travel, small talk, office, etc. This is a challenging problem as there is a need for more available data that spans various domains. Identifying the user’s intention is sometimes tricky as the classification changes based on the context (and ambiguity in natural language). Apart from the given intents, the data poses a challenge when a user presents the interface with out-of-scope queries (for which the models aren’t trained). We test this on transformer-based approaches like DistilBERT and LLMs (like Phi2 and LLAMA3). We find that DistilBERT, with fewer parameters, trains faster and runs inference faster than LLAMA3 and Phi2 (with PEFT-LoRA). We also find that the results from DistilBERT are much better than those of the language model-based approaches for the Banking-77 and CLINC-150 datasets. These two datasets cover various domains (as mentioned above).

Files

JAIML_03_02_002.pdf

Files (278.0 kB)

Name Size Download all
md5:dc42523eedafdd68c31f304f35e6b242
278.0 kB Preview Download

Additional details