I have a folder with 3,000 CSV files, ranging in size from 1 KB to 100 KB, for a total of 171 MB. Each line in these files is 43 characters long.
I am trying to write a program to parse these files as quickly as possible.
At first I tried my own implementation, but I was not happy with the results. Then I found LumenWorks.Framework.IO.Csv on Stack Overflow. It makes some bold claims:
To give more approximate figures: with a 45 MB CSV file containing 145 fields and 50,000 records, the reader processed about 30 MB/s, so all in all it took 1.5 seconds. Machine specifications were a P4 3.0 GHz with 1024 MB of RAM.
I am nowhere near those numbers: my process takes 10 minutes. Is this because it is not one large stream but many small files, so there is per-file overhead? Is there anything else I could do?
It does not feel like the LumenWorks implementation was any faster than mine (I did not benchmark it), not to mention that it handles quotes, escaping, comments, and multi-line fields, none of which I need. My files are just plain comma-separated integers; the sketch below shows roughly the kind of simple parser I mean.
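To illustrate, here is a minimal sketch of the kind of naive per-file parser I am describing (the folder path, the *.csv pattern, and the summing of values are just placeholders, not my actual code):

    using System;
    using System.IO;

    class CsvFolderParser
    {
        static void Main(string[] args)
        {
            // Hypothetical folder path -- substitute the real directory.
            string folder = args.Length > 0 ? args[0] : @"C:\data\csv";
            long total = 0;

            foreach (string path in Directory.GetFiles(folder, "*.csv"))
            {
                // Read each small file line by line; every line holds
                // comma-separated integers and nothing else.
                foreach (string line in File.ReadLines(path))
                {
                    string[] fields = line.Split(',');
                    foreach (string field in fields)
                    {
                        total += int.Parse(field);
                    }
                }
            }

            Console.WriteLine("Parsed sum: " + total);
        }
    }

The point is that the per-line parsing is trivial; my question is whether opening and reading 3,000 small files is what actually dominates the 10 minutes.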
Regards