In my project we use Hadoop 2, Spark, Scala. Scalais a programming language and is Sparkused here for analysis. we use Hiveand HBasetwo. I can access all the details like file etc. HDFSusing Hive. But my confusion is
- When I can complete all tasks with
Hive, then why do I HBaseneed to store data. Is this not overhead? - What are the functionality
Hiveand HBase? - If we used only Hive, then what is the problem?
Can someone please let me know.
source
share