I have about 500 GB of text files, spread over several months. In each text file, the first 43 lines are just connection information (not needed). The next 75 lines are the descriptors for one observation. These are followed by 4 lines (not needed), then the next observation, which is again 75 lines.
All I want is these 75 lines (the descriptors are in the same place for every observation), which look like this:
ID: 5523 Date: 20052012 Mixed: <Null> . .
I want to convert each observation to CSV format, like this:
5523;20052012;;..
That way I end up with much smaller text files. Since the descriptors are always the same, I know that the first position, for example, is the ID.
Once I finish one text file, I will open the next one and append its output (or would creating a new file be faster?).
What I have done so far is pretty inefficient: I open a file, load it, and delete the unwanted lines one by one. If that already takes a fair while on a test sample, it is clearly not the best method.
Any suggestions would be great.
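To make the layout concrete, here is a rough Python sketch of a streaming approach that reads each file once, line by line, instead of loading it and deleting lines. It is only an illustration under several assumptions: each descriptor sits on its own line as `Name: value`, `<Null>` should become an empty field, the files are UTF-8, and the function names (`parse_value`, `iter_records`, `convert_files`) are made up for this example.

```python
import csv
from itertools import islice
from typing import Iterable, Iterator, List

HEADER_LINES = 43  # connection information at the top of each file
OBS_LINES = 75     # descriptor lines per observation
GAP_LINES = 4      # unneeded lines after each observation


def parse_value(line: str) -> str:
    """Return the text after the first ':'; '<Null>' becomes an empty field.

    Assumption: a line without ':' yields an empty field.
    """
    _, _, value = line.partition(":")
    value = value.strip()
    return "" if value == "<Null>" else value


def skip(lines: Iterator[str], count: int) -> None:
    """Consume and discard up to `count` lines."""
    for _ in islice(lines, count):
        pass


def iter_records(lines: Iterable[str]) -> Iterator[List[str]]:
    """Yield one list of 75 values per observation, reading lazily."""
    it = iter(lines)
    skip(it, HEADER_LINES)
    while True:
        block = list(islice(it, OBS_LINES))
        if len(block) < OBS_LINES:  # end of input or truncated observation
            return
        yield [parse_value(line) for line in block]
        skip(it, GAP_LINES)


def convert_files(in_paths: Iterable[str], out_path: str) -> int:
    """Write all observations from `in_paths` to one ';'-separated file.

    The output is opened once, so nothing is reopened in append mode.
    Returns the number of observations written.
    """
    count = 0
    with open(out_path, "w", newline="", encoding="utf-8") as dst:
        writer = csv.writer(dst, delimiter=";", lineterminator="\n")
        for path in in_paths:
            # Assumption: UTF-8 input; undecodable bytes are replaced.
            with open(path, encoding="utf-8", errors="replace") as src:
                for record in iter_records(src):
                    writer.writerow(record)
                    count += 1
    return count
```

It could be called as `convert_files(["a.txt", "b.txt"], "out.csv")`, where the file names are placeholders. Only one observation (75 lines) is held in memory at a time, and `csv.writer` quotes any value that happens to contain a `;`.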