I have a hash table that I want to save to disk. The list is as follows:
<16-byte key > <1-byte result>
a7b4903def8764941bac7485d97e4f76 04
b859de04f2f2ff76496879bda875aecf 03
etc...
There are 1-5 million records. Currently, I just store them in one file, 17 bytes per record, number of records. This file has tens of megabytes. My goal is to save them in such a way as to optimize disk space first, and then look for the search time. The insertion time does not matter.
What is the best way to do this? I would like the file to be as small as possible. Multiple files will be fine too. Patricia three? Radix trie?
No matter what good offers I receive, I will implement and test. I will post the results here for everyone to see.
source
share