I use Cassandra with Hadoop for input and output. While working with job reduction, I received an error message:
2011-08-10 03:54:04,326 WARN org.apache.hadoop.mapred.Child: Error running child java.io.IOException: InvalidRequestException(why:[MyKeyspace][MyColumnFamily][66756c6c74657874] = [6c696e6bb66e68656974207a756d.................65697465726520536f6e67746578746520] failed validation (String didn't validate.)) at org.apache.cassandra.thrift.Cassandra$batch_mutate_result.read(Cassandra.java:19045) at org.apache.cassandra.thrift.Cassandra$Client.recv_batch_mutate(Cassandra.java:1035) at org.apache.cassandra.thrift.Cassandra$Client.batch_mutate(Cassandra.java:1009) at org.apache.cassandra.hadoop.ColumnFamilyRecordWriter$RangeClient.run(ColumnFamilyRecordWriter.java:285) 2011-08-10 03:54:04,339 INFO org.apache.hadoop.mapred.Task: Runnning cleanup for the task 2011-08-10 03:54:04,340 WARN org.apache.hadoop.io.UTF8: truncating long string: 267364 chars, starting with java.io.IOException:
According to the magazine, this does not happen at the beginning, but after the successful merging, sorting and processing of 8981 keys. In 8982, this fails.
Searched in google and in stackoverflow, but did not find.
The column family looks like this:
create column family MyColumnFamily with comparator = UTF8Type and key_validation_class=UTF8Type and column_metadata = [ {column_name: column1, validation_class: UTF8Type, index_type: 0}, {column_name: column2, validation_class: UTF8Type, index_type: 0}, {column_name: column3, validation_class: UTF8Type, index_type: 0} ];
Thank you in advance!
source share