Source Code Classification (SCC) is a tool that automatically classifies the programming language of a code snippet. It was built using a Stack Overflow dataset and machine learning algorithms. SCC was published at the International Conference on Source Code Analysis & Manipulation (SCAM) in September 2018. It currently supports 21 programming languages (Bash, C, C#, C++, CSS, Haskell, HTML, Java, JavaScript, Lua, Objective-C, Perl, PHP, Python, R, Ruby, Scala, SQL, Swift, VB, Markdown).
- Python (>= 3.4)
- Scikit-learn: http://scikit-learn.org
- Dataset: https://drive.google.com/open?id=1leMs0rdKAfX1UYEhSEe9a1npIWrIvqr6
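
The sketch below illustrates the general idea of classifying a snippet's language with scikit-learn. It is not the SCC implementation: the tokenizer pattern, the choice of TF-IDF features with Multinomial Naive Bayes, the `classify` helper, and the six inline training snippets are all assumptions made for illustration. A real model would be trained on the Stack Overflow dataset linked above.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Tiny hypothetical training set, for illustration only.
snippets = [
    "def add(a, b):\n    return a + b",
    "import os\nprint(os.getcwd())",
    "public static void main(String[] args) { System.out.println(1); }",
    "private int count = 0; public int getCount() { return count; }",
    "SELECT name FROM users WHERE id = 1;",
    "INSERT INTO users (name) VALUES ('a');",
]
labels = ["Python", "Python", "Java", "Java", "SQL", "SQL"]

# Assumed feature extraction: identifiers/keywords and single punctuation
# characters are both kept as tokens, and case is preserved.
model = make_pipeline(
    TfidfVectorizer(token_pattern=r"[A-Za-z_]+|[^\sA-Za-z_]", lowercase=False),
    MultinomialNB(),
)
model.fit(snippets, labels)


def classify(snippet):
    """Return the predicted language label for one code snippet (hypothetical helper)."""
    return model.predict([snippet])[0]


if __name__ == "__main__":
    print(classify("SELECT id FROM orders WHERE total > 10;"))
```

With so few training examples the predictions are only indicative; the point is the shape of the pipeline (tokenize, vectorize, fit, predict), not its accuracy.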
Kamel Alrashedy (Kamel@uvic.ca)