Published March 14, 2026 | Version v1
Journal article Open

Applying Natural Language Processing (NLP) For Automated Code Review

  • 1. Department of Computer Science, Dr. D. Y. Patil, Arts, Commerce & Science College, Pimpri, Pune, Maharashtra, India

Description

Automated code review aims to reduce manual effort, catch defects early, and improve code quality by using machine learning and Natural Language Processing (NLP) to analyze source changes, generate review comments, and prioritize reviewer attention. This paper surveys prior work and presents a practical methodology combining transformer-based code models (e.g., CodeBERT/CodeT5-style encoders), structural program representations (ASTs/graphs), and comment-generation components to build an automated code review assistant. We describe dataset collection from open-source pull requests, preprocessing, model design, evaluation metrics, and an implementation plan. Finally, we discuss expected benefits, limitations, and directions for future work. (arXiv)

Files

1217-Article Text-3217-1-10-20260314.pdf

Files (220.5 kB)

Name Size Download all
md5:d112991a7c49e45a5fb6c5994f4666ec
220.5 kB Preview Download