I used LZO for compression, reducing output. I tried this: Kevin Weil's Hadoop-LZO project , and then used the LzoCodec class with my work:
TextOutputFormat.setOutputCompressorClass(job, LzoCodec.class);
Now compression works just fine.
My problem is that the result of the compression is a .lzo_deflate file, which I just cannot unzip.
The Lzop utility does not seem to support this file type.
LzopCodec should provide the .lzo file, but it doesn’t work, however it is in the same package as LzoCodec ( org.apache.hadoop.io.compress ), which may refer to a compatibility issue, since I used the old API (0.19) ) to do the compression.
The answers to this question offer Python solutions, however I need it in Java.
I am using Hadoop 1.1.2 and Java 6.
source share