I have about 200,000 text files that fit in a bz2 file. The problem is that when scanning a bz2 file to extract the necessary data, this happens very slowly. It should look at the whole bz2 file to clear the single file that I am looking for. Is there any way to speed this up?
Also, I was thinking about the possibility of organizing files in tar.bz2 so that I could instead know where to look. Is there a way to organize files placed in bz2?
Additional Information / Edit: I need to request a compressed file for each text file. Is there a better compression method that supports such a large number of files and is so carefully compressed?
Do you need to use bzip2? Reading his documentation , he is very clear that he is not intended to support random access. Perhaps you should use a compression format that more closely matches your requirements. The good old Zip format supports random access, but it can certainly compress worse.
Bzip2 ( , , 900 KiB). , tar , , . , , Zip- ( zlib, bzip2). tar (), .
, tar , ( , Python tar, ). , , , .
tar
Source: https://habr.com/ru/post/1759896/More articles:Mercurial share expansion: how to parse? - mercurialMEF: how to manually configure export to implement a contract - .netrun perl -e from within perl script in windows - windowsjQuery fullcalendar: Fadeout deleted Event - jqueryIE popup menu error - htmlASP.NET main page - reboot problem in IE - internet-explorerAggregating and displaying facebook and twitter status updates on the Coldfusion page - coldfusionWhat is wrong with this linq query? - vb.netCombining multiple SQL fields into 1 line of output - sqlполучение "Объект E2197 Constant не может быть передан как параметр var" при передаче параметра var - delphiAll Articles