2019-01-11[Stay Sharp]DBSCAN

2019-01-12  本文已影响10人  三千雨点

what is DBSCAN ?

DBSCAN stands for Density-based spatial clustering of applications with noise, it's a density-based clustering algorithm. It groups together points in high-density regions, and marks points in low-density regions as outliers. DBSCAN is one of the most common clustering algorithms and also most cited in scientific literature.

how does DBSCAN work?

dbscan is an algorithm with parameters of ( \epsilon , MinPts ).
Before we describe how dbscan works, there are some terms we need to know at first.

image.png

In the above picture, the MinPts is 4, all red points such as A are core points because each point has 4 points around(including itself) within the distance of \epsilon which is the radius of the red circle. B and C are not core points, but they are reachable to A. N is not reachable.

so, let's take a step back, what kind of points can be clustered as a cluster: any point is reachable from any other point of the cluster.
All points in the cluster are mutaully density-connected.

References

https://en.wikipedia.org/wiki/DBSCAN

上一篇 下一篇

猜你喜欢

热点阅读