I want to classify really long lines of texts. Strings.
Here is the error I get:
Exception in thread "main" java.lang.IllegalStateException: No input instance format defined at weka.filters.unsupervised.attribute.StringToWordVector.input(StringToWordVector.java:681) at org.berlin.weka.test.TestWeka.main(TestWeka.java:61)
Here is the code, but I keep getting exceptions, maybe this is not configured correctly.
package org.berlin.weka.test; import weka.classifiers.Classifier; import weka.classifiers.functions.SMO; import weka.core.Attribute; import weka.core.FastVector; import weka.core.Instance; import weka.core.Instances; import weka.filters.Filter; import weka.filters.unsupervised.attribute.StringToWordVector; public class TestWeka { public static void main(final String [] args) throws Exception { System.out.println("Running"); final StringToWordVector filter = new StringToWordVector(); final Classifier classifier = new SMO();
source share