Is there a seqFileDir option for "clusterdump" in the latest Apache Mahout library?

Question


I am trying to run "clusterdump" on the output of the Mahout k-means clustering example (the synthetic control example), but I get the following error:

> ~/MAHOUT/trunk/bin/mahout clusterdump --seqFileDir clusters-10-final --pointsDir clusteredPoints --output a1.txt
> MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
> Running on hadoop, using /usr/lib/hadoop/bin/hadoop and HADOOP_CONF_DIR=/usr/lib/hadoop/conf/
> MAHOUT-JOB: /home/<username>/MAHOUT/trunk/examples/target/mahout-examples-0.8-SNAPSHOT-job.jar
> 12/06/21 22:43:18 WARN conf.Configuration: DEPRECATED: hadoop-site.xml found in the classpath. Usage of hadoop-site.xml is deprecated. Instead use core-site.xml, mapred-site.xml and hdfs-site.xml to override properties of core-default.xml, mapred-default.xml and hdfs-default.xml respectively
> 12/06/21 22:43:25 ERROR common.AbstractJob: Unexpected --seqFileDir while processing Job-Specific Options:
> usage: <command> [Generic Options] [Job-Specific Options] .....

So it seems there is no seqFileDir option for clusterdump, yet all the online tutorials (e.g. https://cwiki.apache.org/MAHOUT/cluster-dumper.html ) refer to this option. Is there a workaround, or am I missing something?


amazon-ec2 hadoop cluster-analysis mahout k-means

Aniruddha basak Jun 21 '12 at 23:16


1 answer

Alex ott · Accepted Answer · 2012-06-22T08:56:42+0000

Have you tried to specify it as the --input parameter?
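As a rough sketch of that suggestion, the command from the question could be retried with `--input` in place of `--seqFileDir`, keeping the other arguments unchanged. This assumes your build accepts `--input` for the clusters directory, which I have not verified against the 0.8-SNAPSHOT build in the question. The variable names (`MAHOUT_BIN`, `CLUSTERS_DIR`, `POINTS_DIR`, `OUTPUT_FILE`, `CLUSTERDUMP_ARGS`) are only illustrative, and the launcher path is copied from the question.

```shell
# Assumption: the launcher lives where the question shows it; set MAHOUT_BIN to override.
MAHOUT_BIN="${MAHOUT_BIN:-$HOME/MAHOUT/trunk/bin/mahout}"

# Same directories and output file as in the question.
CLUSTERS_DIR="clusters-10-final"
POINTS_DIR="clusteredPoints"
OUTPUT_FILE="a1.txt"

# Print the command for inspection, with --input replacing --seqFileDir.
CLUSTERDUMP_ARGS="clusterdump --input $CLUSTERS_DIR --pointsDir $POINTS_DIR --output $OUTPUT_FILE"
echo "$MAHOUT_BIN $CLUSTERDUMP_ARGS"

# Run it only when the launcher is actually present and executable.
if [ -x "$MAHOUT_BIN" ]; then
  "$MAHOUT_BIN" clusterdump --input "$CLUSTERS_DIR" --pointsDir "$POINTS_DIR" --output "$OUTPUT_FILE"
fi
```

If `--input` is rejected as well, the usage text that clusterdump prints after the error should list the option names your build actually accepts.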
