We sought to address limitations of two popular machine learning algorithms for classifying quantitative data: K-Nearest Neighbors (KNN) and K-Means. Our approach combines aspects of both, pairing the clustering step of K-Means with KNN's nearest-neighbor classification. The method creates k clusters within a dataset and assigns each cluster's centroid the majority class label of that cluster; to classify a test instance, it then runs KNN with k=1, treating each centroid as a neighbor. We call this algorithm K-Closest Clusters (KCC). Our results show that KCC achieves slightly better accuracy, precision, and recall than the existing KNN algorithm, and that KCC classifies test instances significantly faster than KNN. This suggests that KCC is more practical than KNN as a classifier, especially on large datasets.
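The two stages described above can be sketched as follows. This is a minimal NumPy illustration, not the project's actual implementation (which lives in ml_q2_project.py); the function names `fit_kcc` and `predict_kcc`, the Lloyd's-style K-Means loop, and all parameters are our own assumptions for the sketch.

```python
import numpy as np

def fit_kcc(X, y, k, n_iters=100, seed=0):
    """Stage 1 (sketch): run K-Means with k clusters, then label each
    centroid with the majority class of the points assigned to it."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Assign each point to its nearest centroid.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        assign = dists.argmin(axis=1)
        # Move each centroid to the mean of its assigned points.
        new_centroids = np.array([
            X[assign == j].mean(axis=0) if np.any(assign == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    # Majority class label per cluster (-1 marks an empty cluster).
    labels = np.array([
        np.bincount(y[assign == j]).argmax() if np.any(assign == j) else -1
        for j in range(k)
    ])
    return centroids, labels

def predict_kcc(X, centroids, labels):
    """Stage 2 (sketch): classify each test point via KNN with k=1,
    treating the labeled centroids as the only neighbors."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return labels[dists.argmin(axis=1)]
```

Because prediction compares each test point against only k centroids rather than every training point, the classification step scales with k instead of the training-set size, which is where the efficiency gain over plain KNN comes from.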
Download and run ml_q2_project.py. This file contains our implementation of the K-Closest Clusters algorithm described in our report. Running it displays a graph of the validation accuracies, a visualization of the clusters, the test accuracy, and the classification time.