I’m afraid I don’t understand the results of synchronizing the Map-Reduce job. For example, the work that I start gives me the following results from the job tracker.
Finished in: 1mins, 39sec
CPU Time (ms) 150,460 152,030 302,490
Records in elapsed processor time (ms) are for Map, Reduce, and Total, respectively. But then how is “processor time” measured and what does it mean? Is this the total total time spent on each of the cards and gears assigned to work? Is it possible to measure other times from frames, such as time for shuffling, sorting, partitioning, etc.? If so, how?
The second question that bothers me. I saw here a few posts ( Link1 , Link2 ) that suggest using getTime () in the driver class:
long start = new Date().getTime(); boolean status = job.waitForCompletion(true); long end = new Date().getTime(); System.out.println("Job took "+(end-start) + "milliseconds");
Doesn't that mean the first entry in the Job Tracker report provides? It's necessary? What is the best time method for working with haup, especially when I want to use I / O time, to calculate the time per node for each step?
source share