In my experience, we used Hbase as the data store for spark streaming data (we also have the same scenario as many records and several reads), since we use hasoop, hbase has built-in integration with hadoop, and everything went well.
Above, we used tostore hight-speed messages coming from consolation.
HBase . ...
, , rawdata hdfs ( + avro) (SaveMode.Append)), rawdata
Ex hdf:
completion ofbusinessdate/environment/businesssubtype/message type .....
Hbase .
, , , repartion(1) colelese FileUtils.copymerge . , .
Corm , .

Casandra AP.
Hbase CP.