Silhouette coefficient for finding optimal number of clusters

Silhouette coefficient is another method to determine the optimal number of clusters. Here I introduced c-index earlier. The silhouette coefficient of a data measures how well data are assigned to its own cluster and how far they are from other clusters. A silhouette close to 1 means the data points are in an appropriate cluster and a silhouette coefficient close to −1 implies out data is in the wrong cluster. The following is python code for computing the coefficient and plotting number fo clusters vs Silhouette coefficient.

 

1
Leave a Reply

avatar
1 Comment threads
0 Thread replies
0 Followers
 
Most reacted comment
Hottest comment thread
0 Comment authors
Finding optimal number of Clusters by using Cluster validation - Tutorials on Robotics for ROS developers Recent comment authors
  Subscribe  
newest oldest most voted
Notify of
trackback

[…] Here there is also another method called “Silhouette coefficient” for finding the optimal number of components for clustering. […]