With traditional DBMSs, we usually use RAID10 in most cases, but if we use cassandra RF = 2, then we definitely have one copy as a backup, then in this case why and why use RAID10.
I think this will reduce the overhead of cassandra for replication.
In addition, in RAID10, if the hard drive fails, then the entire node will continue to work, but if replication is used, then one hard drive failure will cause the whole node to go down?
Although I think that using RAID10 there will be overhead per record, cleaning is done when SSTABLE is full so that it is not felt all the time.
source share