Published February 11, 2026 | Version v1
Journal article Open

Natural Language Processing Pipeline for Co-Designing Culturally Aware Health Chatbots From User Stories to System Specifications

Description

Designing culturally aware health chatbots for underserved communities is essential to advancing global health equity. A major challenge, however, is the conversion of contextual, often qualitative user requirements into technical specifications. In this paper, we present a natural language processing (NLP) pipeline for the co-design of culturally aware health chatbots. The pipeline systematically converts user stories into system specifications. By employing Large Language Models (LLMs), the pipeline automates the extraction of cultural and contextual requirements from user stories, which inherently counteracts biases from general AI models. By using the requirements engineering phase to direct attention to domainspecific and culturally relevant data, this approach ensures that the resulting specifications for chatbots are grounded in the realities of the communities they aim to serve. This approach does not only improve the efficiency of the design process, it also proactively embeds cultural sensitivity and inclusivity at the heart of AI-driven health solutions

Files

15625ijcsea01.pdf

Files (464.9 kB)

Name Size Download all
md5:b1cef9cb6bdf20fdc737cf19fe0ca862
464.9 kB Preview Download

Additional details

Related works

Dates

Copyrighted
2025

References

  • [1] Pillai, M., Griffin, A. C., Kronk, C., & McCall, T. (2023). Toward Community-Based Natural Language Processing (CBNLP): Cocreating With Communities. Journal of Medical Internet Research, 25, e48498. https://doi.org/10.2196/48498 [2] Peters, D., Espinoza, F., da Re, M., Ivetta, G., Benotti, L., & Calvo, R. A. (2025). Towards culturally-appropriate conversational AI for health in the majority world: An exploratory study with citizens and professionals in Latin America. arXiv. https://arxiv.org/abs/2507.01719 [3] Dev, R., Ramani, D., Divate, T., Jalota, S., & Ismail, A. (2025). "Kya family planning after marriage hoti hai?": Integrating Cultural Sensitivity in an LLM Chatbot for Reproductive Health. Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '25). https://doi.org/10.1145/3706598.3713362 [4] Nadarzynski, T., Knights, N., Husbands, D., Graham, C. A., Llewellyn, C., Buchanan, T., Montgomery, I., & Ridge, D. (2024). Achieving health equity through conversational AI: A roadmap for design and implementation of inclusive chatbots in healthcare. PLOS Digital Health, 3(5), e0000492. https://doi.org/10.1371/journal.pdig.0000492 [5] Denecke, K. (2025). Bridging the Gap: Inclusive Artificial Intelligence for Patient-Oriented Language Processing in Conversational Agents in Healthcare. Applied Clinical Informatics. https://doi.org/10.18653/v1/2025.cl4health-1.42 [6] Bamurange, D., & Jonathan, K. N. (2025). Designing a Hybrid AI Chatbot Framework for Student Support: Integrating NLP and Human Oversight in African Universities. Journal of Information and Technology, 5(4), 41-52. https://doi.org/10.70619/vol5iss4pp41-52 [7] Sami, M., Wasim, M., Zhang, Z., Rasheed, Z., & Systä, K. (2024). AI based Multiagent Approach for Requirements Elicitation and Analysis. arXiv. https://arxiv.org/abs/2409.00038 [8] Sami, M., Wasim, M., Zhang, Z., Rasheed, Z., Systä, K., & Abrahamsson, P. (2024). Early Results of an AI Multiagent System for Requirements Elicitation and Analysis. In Lecture Notes in Computer Science: Vol. 15488. Springer, Cham. https://doi.org/10.1007/978-3-031-78386-9_20 [9] Bano, M., Zowghi, D., Gervasi, V., & Shams, R. A. (2023). AI for All: Operationalising Diversity and Inclusion Requirements for AI Systems. arXiv. https://arxiv.org/abs/2311.14695 [10] Askari, A., Sharbaf, M., Kolahdouz-Rahimi, S., Lano, K., & Tehrani, S. Y. (2025). Research directions for using LLM in software requirement engineering: a systematic review. Frontiers in Computer Science, 7. https://doi.org/10.3389/fcomp.2025.1519437 [11] Yamani, A., Baslyman, M., & Ahmed, M. (2025). Leveraging LLMs for User Stories in AI Systems: UStAI Dataset. Proceedings of the 32nd IEEE International Requirements Engineering Conference (RE'25). https://doi.org/10.1145/3727582.3728689 [12] Ronanki, K., Berger, C., & Horkoff, J. (2023). Investigating ChatGPT's Potential to Assist in Requirements Elicitation Processes. arXiv. https://arxiv.org/abs/2307.07381 [13] Hashemian, S. A. R., Jamal, F., Azal, M. U., Khan, A. H., Abbas, S., Cheema, H. N., Shabbir, M. U., Hameed, I., Ayub, M., Masood, H., Jafar, A., Khan, A. M., Nazir, M. A., Jamil, M. A., Sultan, F., & Khalid, S. (2024). Assessing equitable use of large language models for clinical decision support in real-world settings: fine-tuning and internal-external validation using electronic health records from South Asia. medRxiv. https://doi.org/10.1101/2024.06.05.24308365 [14] Malik, A., Rasheed, Z., Wasim, M., Zhang, Z., Tomas, H., & Abrahamsson, P. (2024). A Tool for Test Case Scenarios Generation Using Large Language Models. arXiv. https://arxiv.org/abs/2406.07021 [15] Ali, J., Qayyum, S., & Dar, H. S. (2025). Large Language Model for Requirements Engineering: A Systematic Literature Review. Requirements Engineering. https://doi.org/10.21203/rs.3.rs5589929/v1