Optimization of cluster one-dimensional data?

Does anyone have a document explaining how Ckmeans.1d.dp works?

Or: what is the most optimal way k-means clustering in one-dimensional?

+26
r cluster-analysis k-means cran
Oct 23 '11 at 10:12
source share
3 answers
+20
Jul 09 2018-12-12T00:
source share

This is a very old Bellman technique: A Note on Cluster Analysis and Dynamic Programming http://www.sciencedirect.com/science/article/pii/0025556473900072

www.informationgeometry.org

+2
Feb 23 '14 at 7:18
source share

One-dimensional clustering of k-values โ€‹โ€‹can be solved in O (kn) time (at an already sorted input) based on theoretical results on Monge matrices, but this approach was not popular, most likely due to numerical instability, and also, possibly, for encoding tasks.

The best option is the O (knlgn) method, which is now implemented in Ckmeans.1d.dp version 3.4.6. This implementation is as fast as the heuristic k-tool, but provides guaranteed optimality, an order of magnitude better than the heuristic k-tool, especially for large k.

The general dynamic programming solution by Richard Bellman (Richard Bellman, 1973) does not affect the specifics of the k-means problem, but the implied runtime is O (kn ^ 3).

+1
Jun 03 '16 at 1:03
source share



All Articles