I'm starting to draw. Everywhere I see that only mapreduce uses key-value pairs. But I see no clear reason to use key-value pairs.
Thanks in advance!
Hadoop is mainly used for data analysis. In data analysis, we look at statistical and / or logical methods for describing and illustrating, condensing and repeating and evaluating data. We do not modify the data in any way.
Hadoop , . , . , .
, , . , - , , (Key) ().
, group by , , , , - .
, ,
file1.txt 1 2 3 file2.txt 4 5 6
file1.txt 1 ( ) 1 () 2 ( ) 2 () 3 ( ) 3 ()
(, ) :
file1 as key 3 as value
file2 as key 6 as value
file1 3 file2 6
, :
MAX 6
. compper . (, 1) mapper.Now , . , , hasoop , - , - . , , , .
Source: https://habr.com/ru/post/1569197/More articles:Cordoba / OpenEars Phone Book Speech Recognition - javascriptCan a Java object be a prototype of a JavaScript object in Nashorn? - javaSearch for maximum subscriptions in Prolog - prologA function that takes a 2D array and returns a tuple of array columns? - arraysHow to get the found substring when using the Regex.Replace method? - c #Clojure, Quil: creating / debugging common functions - debuggingWhy does the Akka application fail with an out-of-memory error when performing an NLP task? - scalaHow to disable timeout in Node.js http servers? - javascriptИспользование EntityManagerFactory и ApplicationContext - javaAndroid SlidingPaneLayout скрывает неправильные представления - androidAll Articles