Published May 23, 2023 | Version v1
Conference paper Open

Programming Language Prediction using Machine Learning

  • 1. Amal Jyothi College of Engineering

Description

Abstract: Programming languages are the primary tool of the software development industry. Hundreds of them have been developed since the 1940s, and every day a large volume of new code is written in a variety of programming languages and pushed to active repositories. A source code classifier that can identify the programming language used to write a given piece of code would be a highly valuable tool for automatic syntax highlighting and label suggestion in systems such as code editors. This motivated us to use cutting-edge AI methods for text classification to build a model that categorizes code snippets according to their language. For our empirical investigation, we developed a new dataset using the GitHub Repos Dataset, which includes 131,450 code snippets spread over 34 programming languages.

Files

Programming Language Prediction using Machine Learning.pdf (360.4 kB)