I have a fairly large text file that I would like to convert to a SequenceFile. Unfortunately, the file consists of Python code with logical lines running on several physical lines. For example, photographs print "Blah Blah \" ... blah blah "
Each logical line ends with NEWLINE. Can anyone clarify how I could generate Key, Value pairs in Map-Reduce, where each value is an entire logical line?
source share