Requirements: I have a python project that parses data feeds from several sources in different formats (Atom, valid XML, invalid XML, csv, near-garbage, etc.) and inserts the received data into the database. A trap is the information necessary for parsing each channel, and must also be stored in a database.
Current solution: My previous solution was to store small python scripts that are calculated from raw data and return a data object for the analyzed data. I would really like to get away from this method, since it obviously opens an unpleasant security hole.
Ideal solution: What I am looking for is what I would describe as a syntax syntax agent for python, so that I can write a template file for each of the feed formats, and this template file will be used to understand various data formats.
I had limited success in finding something similar in the past, and I was hoping someone might have a good suggestion.
Thanks everyone!
source
share