I have a large data set (200 GB uncompressed, 9 GB compressed in bz2 -9) stock data.
I want to start the analysis of the main time series.
My machine has 16 GB of RAM.
I would prefer:
save all data compressed in memory
unzip this data on the fly and sink it [so that nothing ever gets to disk]
do all the analysis in memory
Now I think that there are nice interactions with Clojure laziness and future objects (i.e. I can define st objects, when I try to access them, I will unpack them on the fly.)
Question: What things should I keep in mind when analyzing high performance time series in Clojure?
I am particularly interested in tricks involving:
Suggestions of books / articles / research articles are welcome. (I am a PhD student).
Thanks.
user1647794
source share