I have a simple application that reads a CSV file, splits each line on "," and counts occurrences of the first field.
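For example, with a hypothetical input like

A,x,y
A,z,w
B,q,r

the expected output is one count per distinct first field:

A	2
B	1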
Below is the code.
package com.bluedolphin;
import java.io.IOException;
import java.util.Iterator;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;
public class MyJob extends Configured implements Tool {
    private final static LongWritable one = new LongWritable(1);

    public static class MapClass extends Mapper<Object, Text, Text, LongWritable> {
        private Text word = new Text();

        public void map(Object key,
                        Text value,
                        OutputCollector<Text, LongWritable> output,
                        Reporter reporter) throws IOException, InterruptedException {
            String[] citation = value.toString().split(",");
            word.set(citation[0]);
            output.collect(word, one);
        }
    }
    public static class Reduce extends Reducer<Text, LongWritable, Text, LongWritable> {
        public void reduce(
                Text key,
                Iterator<LongWritable> values,
                OutputCollector<Text, LongWritable> output,
                Reporter reporter) throws IOException, InterruptedException {
            int sum = 0;
            while (values.hasNext()) {
                sum += values.next().get();
            }
            output.collect(key, new LongWritable(sum));
        }
    }
    public static class Combiner extends Reducer<Text, IntWritable, Text, LongWritable> {
        public void reduce(
                Text key,
                Iterator<LongWritable> values,
                OutputCollector<Text, LongWritable> output,
                Reporter reporter) throws IOException, InterruptedException {
            int sum = 0;
            while (values.hasNext()) {
                sum += values.next().get();
            }
            output.collect(key, new LongWritable(sum));
        }
    }
    public int run(String[] args) throws Exception {
        Configuration conf = getConf();
        Job job = new Job(conf, "MyJob");
        job.setJarByClass(MyJob.class);
        Path in = new Path(args[0]);
        Path out = new Path(args[1]);
        FileInputFormat.setInputPaths(job, in);
        FileOutputFormat.setOutputPath(job, out);
        job.setMapperClass(MapClass.class);
        // job.setCombinerClass(Combiner.class);
        job.setReducerClass(Reduce.class);
        // job.setInputFormatClass(KeyValueInputFormat.class);
        job.setInputFormatClass(TextInputFormat.class);
        // job.setOutputFormatClass(KeyValueOutputFormat.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(LongWritable.class);
        System.exit(job.waitForCompletion(true) ? 0 : 1);
        return 0;
    }

    public static void main(String[] args) throws Exception {
        int res = ToolRunner.run(new Configuration(), new MyJob(), args);
        System.exit(res);
    }
}
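The job is launched with the standard hadoop jar command (the jar name and HDFS paths here are placeholders):

hadoop jar myjob.jar com.bluedolphin.MyJob /user/me/input /user/me/output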
This is the error:
11/12/16 22:16:58 INFO mapred.JobClient: Task Id: attempt_201112161948_0005_m_000000_0, Status: FAILED
java.io.IOException: Type mismatch in key from map: expected org.apache.hadoop.io.Text, recieved org.apache.hadoop.io.LongWritable
    at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:1013)
    at org.apache.hadoop.mapred.MapTask$NewOutputCollector.write(MapTask.java:690)
    at org.apache.hadoop.mapreduce.TaskInputOutputContext.write(TaskInputOutputContext.java:80)
    at org.apache.hadoop.mapreduce.Mapper.map(Mapper.java:124)
    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:763)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:369)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:259)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:416)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
    at org.apache.hadoop.mapred.Child.main(Child.java:253)