Cluster of thousands of text documents in java

Question

Cluster of thousands of text documents in java

Is there an effective way to clustering text documents? I was thinking about K-Means, but it seems like it takes too much time. Can someone provide me with an efficient method?

+3

performance cluster-analysis k-means

Knsiva Dec 24 '10 at 10:32

source share

2 answers

, java ?, weka , .

+1

Radi 24 . '10 11:01

Mike dunlavey · Accepted Answer · 2010-12-24T16:26:08+0000

If K-Means really does the job and just seems slow, why not try to do it faster? The method I use is random-pause .

It usually happens that there are many opportunities for speeding up, in code that you would not consider a problem, without changing the underlying algorithm. Here is an example.

Cluster of thousands of text documents in java

More articles: