What is the difference between Oracle and Hadoop HDFS connectivity

What is the difference between Oracle Coherence and Hadoop HDFS read about HDFS and consistency (but nothing was clear) sounds the same as the same. Are these different technologies solutions to different problems or the same technology, but different products? It is necessary to know the difference and similarities between the technologies, i.e. Prospects for implementation, in a broader way, where do these two fit in?

Note. I do not ask for product comparison (so there is no gorilla and shark).

+4
source share
1 answer

These two systems implement two concepts. HDFS is a distributed file system optimized for massive serial I / O.
Oracle Coherence is a caching solution that can use the disk for part of its data. It is optimized for random IO

Technologically:
HDFS does not cache relay to the base OS for this. Its services (Datanode) should be considered as a block device.
Conventional cache solutions provide an abstraction of KeyValue, some expiration functions and primarily use memory as storage.
Distributed cache is usually created using consistent hashing, while HDFS supports a centralized directory (NameNode).
At the same time, the File System and Cache have much in common (if you look at the file path as a key).

From the above: these systems are somewhat interchangeable, but are built for different scenarios and are fully integrated inside.

+6
source

Source: https://habr.com/ru/post/1436970/


All Articles