These two systems implement two concepts. HDFS is a distributed file system optimized for massive serial I / O.
Oracle Coherence is a caching solution that can use the disk for part of its data. It is optimized for random IO
Technologically:
HDFS does not cache relay to the base OS for this. Its services (Datanode) should be considered as a block device.
Conventional cache solutions provide an abstraction of KeyValue, some expiration functions and primarily use memory as storage.
Distributed cache is usually created using consistent hashing, while HDFS supports a centralized directory (NameNode).
At the same time, the File System and Cache have much in common (if you look at the file path as a key).
From the above: these systems are somewhat interchangeable, but are built for different scenarios and are fully integrated inside.
source share