I had an interesting problem:
file1.csv has several hundred lines, for example:
Code,DTime 1,2010-12-26 17:01 2,2010-12-26 17:07 2,2010-12-26 17:15
file2.csv has about 11 million lines, for example:
id,D,Sym,DateTime,Bid,Ask 1375022797,D,USD,2010-12-26 17:00:15,1.311400,1.311700 1375022965,D,USD,2010-12-26 17:00:56,1.311200,1.311500 1375022984,D,USD,2010-12-26 17:00:56,1.311300,1.311600 1375023013,D,USD,2010-12-26 17:01:01,1.311200,1.311500 1375023039,D,USD,2010-12-26 17:01:02,1.311100,1.311400 1375023055,D,USD,2010-12-26 17:01:03,1.311200,1.311500 1375023063,D,USD,2010-12-26 17:01:03,1.311300,1.311600
What I'm trying to do is write a script that takes each DTime value in file1.csv and finds the first partial match instance in the DateTime column of file2.csv and displays DateTime, Bid. Ask for this line. Partial match - first 16 characters.
Both files are sorted from oldest to newest, so if "2010-12-26 17:01" from file1.csv matches 4 files in2.csv, I only need to extract the first file: "2010-12-26 17: 01:01 "
I donโt know how to continue .. I tried the dictionary, but the order of the values โโis important, so I'm not sure if this will work. Maybe bring the column file1 DTime to the list and for each record in this DateTime search list in file2?
Thanks guys,