G-Means Clustering Algorithm Implementation

Overview

This project implements G-Means, an adaptive variant of K-Means that estimates the number of clusters from the data instead of requiring it as input. The implementation uses PCA for dimensionality reduction, Anderson–Darling tests to check whether each cluster is Gaussian-distributed, and silhouette scores to evaluate candidate splits.

Two versions are implemented:

Basic G-Means with silhouette constraint
Enhanced G-Means with the child-centroid initialization described in the original G-Means paper ("Learning the k in k-means", NIPS 2003)
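
In the original G-Means paper, the two child centers of a cluster with center c are placed at c + m and c - m, where m = s * sqrt(2 * lambda / pi), s is the cluster's main principal component, and lambda is its eigenvalue. The snippet below is a minimal sketch of that rule, not the notebook's own code; the function name `init_child_centroids` is hypothetical.

```python
import numpy as np


def init_child_centroids(points, center):
    """Place two child centers along the cluster's main principal axis.

    Hypothetical helper: follows the c +/- m rule with
    m = s * sqrt(2 * lambda / pi) from the G-Means paper.
    """
    centered = points - center
    # Covariance of the cluster; atleast_2d covers the 1-D feature case.
    cov = np.atleast_2d(np.cov(centered, rowvar=False))
    # eigh returns eigenvalues in ascending order, so the last one is largest.
    eigvals, eigvecs = np.linalg.eigh(cov)
    lam = eigvals[-1]
    s = eigvecs[:, -1]
    m = s * np.sqrt(2.0 * lam / np.pi)
    return center + m, center - m
```

The two returned centers can then be passed as the `init` array of a 2-cluster K-Means run on the points of that cluster, which makes the split deterministic.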

Algorithm Steps

Reduce dimensionality with PCA
Initialize with a single cluster centered at the global mean
Iteratively split each cluster that is not Gaussian-distributed
Validate each split using the silhouette score and a minimum distance between centroids
Stop when no valid splits remain or the maximum number of clusters is reached
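
The steps above can be sketched roughly as follows. This is an illustrative outline under stated assumptions, not the notebook's implementation: the names `looks_gaussian` and `g_means_sketch`, the parameters `max_clusters`, `n_components`, `min_size`, and `seed`, and the choice of the 1% critical value are all assumptions. For brevity the sketch omits the silhouette and minimum-centroid-distance validation of step 4.

```python
import numpy as np
from scipy.stats import anderson
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA


def looks_gaussian(points, child_centers):
    """Anderson-Darling check on points projected onto the split direction."""
    v = child_centers[0] - child_centers[1]
    norm_sq = v @ v
    if norm_sq == 0:
        return True  # degenerate split: nothing to test
    projected = points @ v / norm_sq
    result = anderson(projected, dist="norm")
    # Assumption: use the strictest tabulated level (1%) as the threshold.
    return result.statistic <= result.critical_values[-1]


def g_means_sketch(X, max_clusters=10, n_components=2, min_size=8, seed=0):
    """Return (labels, centers) in PCA space; hypothetical simplified G-Means."""
    n_components = min(n_components, X.shape[1])
    Z = PCA(n_components=n_components, random_state=seed).fit_transform(X)
    centers = [Z.mean(axis=0)]  # step 2: one cluster at the global mean
    while True:
        km = KMeans(n_clusters=len(centers), init=np.array(centers), n_init=1).fit(Z)
        current = km.cluster_centers_
        remaining = max_clusters - len(current)  # extra clusters still allowed
        next_centers = []
        for j, c in enumerate(current):
            pts = Z[km.labels_ == j]
            if remaining > 0 and len(pts) >= min_size:
                children = KMeans(n_clusters=2, n_init=5, random_state=seed).fit(pts).cluster_centers_
                if not looks_gaussian(pts, children):
                    next_centers.extend(children)  # step 3: split this cluster
                    remaining -= 1
                    continue
            next_centers.append(c)
        if len(next_centers) == len(current):
            return km.labels_, current  # step 5: no cluster was split
        centers = next_centers
```

Each pass either adds at least one cluster or returns, so the loop ends after at most `max_clusters` passes.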

Datasets Used

Iris
Digits
Wine
Breast Cancer
Synthetic Blobs, Moons, and Circles
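
As a rough illustration of how these datasets can be assembled, the sketch below uses the loaders and generators that ship with scikit-learn and takes the number of distinct labels as the true k. This is an assumption about the setup: the repository also includes an `Iris.csv` file, and the sample sizes, noise levels, and the helper name `load_benchmarks` are hypothetical choices.

```python
import numpy as np
from sklearn.datasets import (
    load_breast_cancer,
    load_digits,
    load_iris,
    load_wine,
    make_blobs,
    make_circles,
    make_moons,
)
from sklearn.preprocessing import StandardScaler


def load_benchmarks(seed=0):
    """Return {name: (standardized features, true number of clusters)}."""
    raw = {
        "Iris": load_iris(return_X_y=True),
        "Digits": load_digits(return_X_y=True),
        "Wine": load_wine(return_X_y=True),
        "Breast Cancer": load_breast_cancer(return_X_y=True),
        # Synthetic sets: sizes and noise levels are illustrative only.
        "Blobs": make_blobs(n_samples=300, centers=4, random_state=seed),
        "Moons": make_moons(n_samples=300, noise=0.05, random_state=seed),
        "Circles": make_circles(n_samples=300, noise=0.05, factor=0.5, random_state=seed),
    }
    return {
        name: (StandardScaler().fit_transform(X), len(np.unique(y)))
        for name, (X, y) in raw.items()
    }
```

Moons and Circles are not Gaussian-shaped, so a Gaussianity-based splitting rule may well report more clusters than the two labelled groups on them.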

Evaluation

Similarity Score: Compares predicted vs. true cluster count
Visualization: Bar charts comparing true vs. predicted k
Iteration Tracking: Measures convergence speed
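
The README does not give the formula behind the similarity score. One plausible definition, shown here purely as an assumption, is the relative agreement between the predicted and true cluster counts, which equals 1 for an exact match and approaches 0 as the counts diverge. The function name `similarity_score` is hypothetical.

```python
def similarity_score(k_pred, k_true):
    """Assumed score: 1 - |k_pred - k_true| / max(k_pred, k_true)."""
    if k_pred < 1 or k_true < 1:
        raise ValueError("cluster counts must be positive")
    return 1.0 - abs(k_pred - k_true) / max(k_pred, k_true)


# Example: predicting 2 clusters when the data has 4 scores 0.5.
score = similarity_score(2, 4)
```

A score of this form only compares cluster counts; it says nothing about whether individual points were assigned to the right cluster.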

Technologies

Python 3
NumPy, scikit-learn, SciPy
Matplotlib for visualization
Jupyter Notebook for interactive analysis

