Engineering a Topic Modeling System: Architecture, Process, and UML Modeling
Authors/Creators
- 1. Department of College of Business Informatics, University of Information Technology and Communications (UOITC), Baghdad, Iraq.
Contributors
Contact person:
- 1. Department of College of Business Informatics, University of Information Technology and Communications (UOITC), Baghdad, Iraq.
Description
Abstract: Software engineering is a discipline that uses Unified Modelling Language diagrams, which are among the most widely accepted standards for visualising object-oriented design models. Using UML diagrams, system and application requirements are specified by providing understandable models of the objects involved. Thus, the goal of this study is to describe how software engineering is applied to our topic modelling system. This study describes the software engineering process from story to system testing, and then supports it with a UML diagram for clarity. This paper demonstrated empirically how software engineering principles are integrated into a system using Latent Dirichlet Allocation (LDA). Several engineering operations were performed on data from a group of websites, yielding measurable, verifiable results that support the system's reliability.
Files
A902406010126.pdf
Files
(618.7 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:a9d3fbd756655b381858f834ac07e096
|
618.7 kB | Preview Download |
Additional details
Identifiers
- DOI
- 10.54105/ijsepm.A9024.06010126
- EISSN
- 2582-8339
Dates
- Accepted
-
2026-01-15Manuscript received on 08 November 2025 | First Revised Manuscript received on 15 November 2025 | Second Revised Manuscript received on 17 December 2025 | Manuscript Accepted on 15 January 2026 | Manuscript published on 30 January 2026.
References
- G. Casale, C. Chesta, P. Deussen, E. Di Nitto, P. Gouvas, S. Koussouris, et al., "Current and future challenges of software engineering for services and applications," Procedia Computer Science, vol. 97, pp. 34–42, 2016. DOI: https://doi.org/10.1016/j.procs.2016.08.278
- D. Akdur, V. Garousi, and O. Demirörs, "A survey on modelling and model-driven engineering practices in the embedded software industry," Journal of Systems Architecture, vol. 91, pp. 62–82, 2018. DOI: https://doi.org/10.1016/j.sysarc.2018.09.007
- I. Sommerville, Software Engineering, 10th ed. Boston, MA: Pearson, 2016. ISBN 1-292-09613-6. https://www.pearson.com/en-us/subject-catalog/p/software-engineeri ng/P200000003258/9780137503148?srsltid=AfmBOoqFRLIwoOrG UnrE0sKf7CyXooXVstFEIolQ4r2jK_yr_zCgfKqY
- A. Tsertsvadze, Y. F. Chen, D. Moher, P. Sutcliffe, and N. McCarthy, "How to conduct systematic reviews more expeditiously?" Systematic Reviews, vol. 4, no. 1, p. 160, 2015. DOI: https://doi.org/10.1186/s13643-015-0147-7
- Allahyari, M., Pouriyeh, S., Assefi, M., Safaei, S., Trippe, E. D., Gutierrez, J. B., & Kochut, K. "A brief survey of text mining: Classification, clustering and extraction techniques". arXiv preprint arXiv:1707.02919. 2017. DOI: https://doi.org/10.48550/arXiv.1707.02919
- C. Treude and M. Wagner, "Predicting good configurations for GitHub and Stack Overflow topic models," in Proc. IEEE/ACM 16th Int. Conf. Mining Software Repositories (MSR), Montreal, QC, Canada, May 2019, pp. 84–95. DOI: https://doi.org/10.1109/MSR.2019.00022
- X. Sun, X. Liu, B. Li, Y. Duan, H. Yang, and J. Hu, "Exploring topic models in software engineering data analysis: A survey," in Proc. 17th IEEE/ACIS Int. Conf. Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), Shanghai, China, May 2016, pp. 357–362. DOI: https://doi.org/10.1109/SNPD.2016.7515925
- T. H. Chen, S. W. Thomas, and A. E. Hassan, "A survey on the use of topic models when mining software repositories," Empirical Software Engineering, vol. 21, no. 5, pp. 1843–1919, 2016. DOI: https://doi.org/10.1007/s10664-015-9402-8
- Kurbatova, Z., Veselov, I., Golubev, Y., & Bryksin, T. "Recommendation of move method refactoring using path-based representation of code". In Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops (pp. 315-322). 2020, June. DOI: https://doi.org/10.1145/3387940.3392191
- Mehta, P., Aggarwal, S., & Tandon, A. "The Effect of Topic Modelling on Prediction of Criticality Levels of Software Vulnerabilities". Informatica, 47(6). 2023. DOI: https://doi.org/10.31449/inf.v47i6.3712
- Zhang, M., Li, P., & Wang, W. "An index-based algorithm for fast online query processing of latent semantic analysis". Plos one, 12(5), e0177523. 2017. DOI: https://doi.org/10.1371/journal.pone.0177523
- Jelodar, H., Wang, Y., Yuan, C., Feng, X., Jiang, X., Li, Y., & Zhao, L. "Latent Dirichlet allocation (LDA) and topic modelling: models, applications, a survey". Multimedia tools and applications, 78(11), 15169-15211. 2019. DOI: https://doi.org/10.1007/s11042-018-6894-4
- Feng, J., Zhang, Z., Ding, C., Rao, Y., Xie, H., & Wang, F. L. "Context reinforced neural Topic modelling over short texts". Information Sciences, 607, 79-91. 2022. DOI: https://doi.org/10.1016/j.ins.2022.05.098
- A. Agrawal, W. Fu, and T. Menzies, "What is wrong with topic modelling? And how to fix it using search-based software engineering," Inf. Softw. Technol., vol. 98, pp. 74–88, 2018. DOI: https://doi.org/10.1016/j.infsof.2018.02.005
- M. A. Mohammed, R. M. Mohammed, and H. A. Abbood, "Topic modelling for web page using LDA algorithm and web content mining," J. Educ. Pure Sci., vol. 15, no. 3, 2025. DOI: https://doi.org/10.32792/jeps.v15i3.686
- M. A. Mohammed, H. A. Abbood, and R. M. Mohammed, "MRH: A large-scale text dataset for web content mining," J. Port Sci. Res., vol. 8, no. 4, pp. 321–326, 2025. DOI: https://doi.org/10.36371/port.2025.4.2
- G. Team et al., "Gemini: A family of competent multimodal models," arXiv preprint arXiv:2312.11805, 2023. DOI: https://doi.org/10.48550/arXiv.2312.11805
- N. M. Abbas, R. M. Mohammed, H. A. Abbood, and M. A. Mohammed, "Topic modelling for web page using LDA algorithm and web content mining: Testing and evaluation," Int. J. Comput. (IJC), vol. 55, no. 1, pp. 117–129, 2025. Available: https://ijcjournal.org/InternationalJournalOfComputer/article/view/2 405
- Bass, L., Clements, P., & Kazman, R. "Software architecture in practice (4th ed.)". Addison-Wesley Professional. 2021. ISBN 978-0136886020. https://www.oreilly.com/library/view/software-architecture-in/97801 36885979/
- B. Unhelkar, Software Engineering with UML. Boca Raton, FL: Auerbach Publications, 2017. DOI: https://doi.org/10.1201/9781351235181
- A. Z. Umar, M. M. Gumel, and H. S. Tuge, "Comparing flowchart and swim lane activity diagram for aiding transitioning to object-oriented implementation," Amer. J. Educ. Technol., vol. 1, no. 2, pp. 99–106, 2022. DOI: https://doi.org/10.54536/ajet.v1i2.612
- R. G. Tiwari, A. P. Srivastava, G. Bhardwaj, and V. Kumar, "Exploiting UML diagrams for test case generation: A review," in Proc. 2nd Int. Conf. Intelligent Engineering and Management (ICIEM), London, U.K., Apr. 2021, pp. 457–460. DOI: https://doi.org/10.1109/ICIEM51511.2021.9445383
- A. Salim, J. C. A. Sujanto, F. W. Putra, L. Angelina, and C. O. Doaly, "UML modelling for web-based book lending library business systems," in AIP Conf. Proc., vol. 2680, no. 1, Dec. 2023. DOI: https://doi.org/10.1063/5.0176378
- S. Sundaramoorthy, UML Diagramming: A Case Study Approach. Boca Raton, FL: Auerbach Publications, 2022. DOI: https://doi.org/10.1201/9781003287124
- K. C. S. Murti, UML for Embedded Systems, Singapore: Springer, 2022, pp. 119–153. DOI: https://doi.org/10.1007/978-981-16-3293-8_5