2019-01-09[Stay Sharp]Clustering
Validation Approaches
Popular validation approaches involve Internal evaluation, External evaluation, Manual evaluation and Indirect evaluation.
Internal evaluation
Internal evaluation means the clustering result is evaluated based on the data that was clustered itself. With this evaluation, we get the algorithm, which produces clusters with high similarity within a cluster and low similarity between clusters. Also, this evaluation is biased towards algorithms that use the same cluster model.
the following methods are some popular internal evaluation methods:
- Davies-Bouldin index (DB index or DBI).
where is the number of clusters, is the centroid of cluster , is the average distance of all elements in cluster to centroid , and is the distance between centroids and .
- Dunn index (DI)
External evaluation
is the number of true positives, is the number of true negatives, is the number of false positives, and is the number of false negatives.
- Jaccard index
- Fowlkes–Mallows index