You need to know whether the records are actually delimited by some known sequence of characters.
If you know the delimiter, you can set the textinputformat.record.delimiter configuration property to that sequence so that the input is split into records on it.
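As a rough illustration of setting that property, here is a minimal sketch that assumes the job runs through PySpark, which the question may or may not be using. The helper names `delimiter_conf` and `read_records`, and the `sc` and `path` arguments, are hypothetical; only the property name comes from the answer above. In a plain MapReduce driver the same key would be set on the job's Configuration object instead.

```python
# Hadoop property named in the answer above.
DELIMITER_KEY = "textinputformat.record.delimiter"


def delimiter_conf(delimiter):
    """Build the Hadoop configuration dict for a custom record delimiter."""
    return {DELIMITER_KEY: delimiter}


def read_records(sc, path, delimiter):
    """Read `path` as records separated by `delimiter`.

    Assumption: `sc` is an already created pyspark.SparkContext.
    """
    pairs = sc.newAPIHadoopFile(
        path,
        "org.apache.hadoop.mapreduce.lib.input.TextInputFormat",
        "org.apache.hadoop.io.LongWritable",
        "org.apache.hadoop.io.Text",
        conf=delimiter_conf(delimiter),
    )
    # Each element is a (byte offset, record text) pair; keep only the text.
    return pairs.map(lambda kv: kv[1])
```

For example, `read_records(sc, "input.txt", "\n\n")` would treat blank-line-separated blocks as records, assuming that is how the file is laid out.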
If the records are not delimited by a character sequence, you will need additional logic that, for example, counts off a known number of fields (if the number of fields is fixed) and emits them as one record. This usually makes the job more complex, more error-prone, and slower, because it adds a lot of text processing.
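The field-counting idea can be sketched in plain Python. This is only an illustration under stated assumptions: the input is a flat stream of fields with a known field separator, every record has exactly the same number of fields, and the function name `records_by_field_count` is hypothetical.

```python
def records_by_field_count(fields, fields_per_record):
    """Group a flat stream of fields into fixed-size records.

    Assumes every record has exactly `fields_per_record` fields.
    """
    if fields_per_record < 1:
        raise ValueError("fields_per_record must be at least 1")
    record = []
    for field in fields:
        record.append(field)
        if len(record) == fields_per_record:
            yield tuple(record)
            record = []
    if record:
        # Leftover fields mean the input did not match the assumed layout.
        raise ValueError("incomplete trailing record: %r" % (record,))


# Hypothetical sample: two records of three fields each, no record separator.
sample = "alice,30,NY,bob,25,LA"
records = list(records_by_field_count(sample.split(","), 3))
```

A single missing or extra field shifts every record that follows it, which is one reason this approach tends to be more fragile than relying on a real delimiter.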
Try to determine whether the records are delimited. Posting a short sample of several records may help.