You cannot do this with the current implementation. However, people have "hacked" the Hadoop code to do what you want.
In the MapReduce model, you need to wait for all mappers to finish, since the keys need to be grouped and sorted. In addition, some speculative mappers may be running, and you don't yet know which of the duplicate mappers will finish first.
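To see why the grouping step forces the wait, here is a minimal toy sketch in plain Java (not Hadoop code; the class BarrierSketch and its reduce method are hypothetical names made up for illustration). It sums values per key over whatever map outputs are available, and shows that reducing before the last mapper has finished gives a wrong total for any key that mapper emits.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class BarrierSketch {
    // Hypothetical toy "reduce": sums the values for each key across the
    // map outputs it is given. A TreeMap keeps the keys sorted, loosely
    // mimicking the sort that precedes the reduce phase.
    static Map<String, Integer> reduce(List<Map<String, Integer>> mapOutputs) {
        Map<String, Integer> totals = new TreeMap<>();
        for (Map<String, Integer> output : mapOutputs) {
            for (Map.Entry<String, Integer> entry : output.entrySet()) {
                totals.merge(entry.getKey(), entry.getValue(), Integer::sum);
            }
        }
        return totals;
    }

    public static void main(String[] args) {
        Map<String, Integer> mapper1 = Map.of("a", 1, "b", 2);
        Map<String, Integer> mapper2 = Map.of("a", 3); // finishes later

        // Reducing early, with only mapper1 done: the total for "a" is incomplete.
        System.out.println(reduce(List.of(mapper1)));          // {a=1, b=2}
        // Reducing after both mappers are done: the total for "a" is correct.
        System.out.println(reduce(List.of(mapper1, mapper2))); // {a=4, b=2}
    }
}
```

An application can only skip the wait if it can tolerate or later correct such partial results.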
However, as the paper "Breaking the MapReduce Stage Barrier" indicates, for some applications it may make sense not to wait for all mappers to finish. If you want to implement this behavior (most likely for research purposes), take a look at the class org.apache.hadoop.mapred.ReduceTask.ReduceCopier, which implements ShuffleConsumerPlugin.
EDIT: Finally, as @teo points out in this related SO question, the ReduceCopier.fetchOutputs() method is the one that holds the reduce task back from starting until all map outputs are copied (through a loop at line 2026 of the Hadoop 1.0.4 release).
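The general shape of such a barrier can be sketched as follows. This is not the code from ReduceCopier.fetchOutputs(); it is a simplified illustration in plain Java, and the class FetchBarrierSketch, the method eventsUntilAllCopied, and the idea of feeding it a stream of map IDs are all assumptions made for the example. It loops until an output from every map has been copied, and counts a map only once even if a speculative duplicate also reports completion.

```java
import java.util.HashSet;
import java.util.Iterator;
import java.util.List;
import java.util.Set;

public class FetchBarrierSketch {
    // Hypothetical helper: consumes "map finished" events (map IDs) until an
    // output from every one of the numMaps maps has been copied, and returns
    // how many events had to be consumed before the reduce could start.
    static int eventsUntilAllCopied(int numMaps, Iterator<Integer> finishedMapIds) {
        Set<Integer> copied = new HashSet<>();
        int events = 0;
        // The barrier: keep looping while any map output is still missing.
        while (copied.size() < numMaps) {
            if (!finishedMapIds.hasNext()) {
                throw new IllegalStateException("ran out of events before all maps were copied");
            }
            // A duplicate ID (e.g. from a speculative attempt) does not grow the set.
            copied.add(finishedMapIds.next());
            events++;
        }
        return events;
    }

    public static void main(String[] args) {
        // Map 1 reports twice (original and speculative attempt); 3 maps in total.
        List<Integer> events = List.of(0, 1, 1, 2);
        System.out.println(eventsUntilAllCopied(3, events.iterator())); // 4
    }
}
```

Relaxing the loop condition in a sketch like this (for example, stopping at some fraction of numMaps) is the kind of change you would be experimenting with, at the cost of the partial-result problem described above.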