Hadoop reduce phase stalls

I have a frustrating problem. When I start a job in Hadoop, the map phase runs fine and reaches 100% without incident. However, once the reduce phase reaches 67%, it stalls. It is very strange. I am new to Hadoop and have searched through a lot of material online, but I am still puzzled. The output follows.

13/10/25 21:40:00 INFO input.FileInputFormat: Total input paths to process : 2
13/10/25 21:40:01 INFO mapred.JobClient: Running job: job_201310252001_0003
13/10/25 21:40:02 INFO mapred.JobClient: map 0% reduce 0%
13/10/25 21:40:30 INFO mapred.JobClient: map 1% reduce 0%
13/10/25 21:40:37 INFO mapred.JobClient: map 2% reduce 0%
13/10/25 21:40:39 INFO mapred.JobClient: map 3% reduce 0%
13/10/25 21:40:40 INFO mapred.JobClient: map 4% reduce 0%
13/10/25 21:40:42 INFO mapred.JobClient: map 5% reduce 0%
13/10/25 21:40:43 INFO mapred.JobClient: map 6% reduce 0%
13/10/25 21:40:45 INFO mapred.JobClient: map 7% reduce 0%
13/10/25 21:40:46 INFO mapred.JobClient: map 9% reduce 0%
13/10/25 21:40:48 INFO mapred.JobClient: map 10% reduce 0%
13/10/25 21:40:49 INFO mapred.JobClient: map 11% reduce 0%
13/10/25 21:40:52 INFO mapred.JobClient: map 14% reduce 0%
13/10/25 21:40:55 INFO mapred.JobClient: map 17% reduce 0%
13/10/25 21:40:58 INFO mapred.JobClient: map 19% reduce 0%
13/10/25 21:41:01 INFO mapred.JobClient: map 22% reduce 0%
13/10/25 21:41:04 INFO mapred.JobClient: map 23% reduce 0%
13/10/25 21:41:05 INFO mapred.JobClient: map 24% reduce 0%
13/10/25 21:41:07 INFO mapred.JobClient: map 26% reduce 0%
13/10/25 21:41:08 INFO mapred.JobClient: map 27% reduce 0%
13/10/25 21:41:10 INFO mapred.JobClient: map 28% reduce 0%
13/10/25 21:41:11 INFO mapred.JobClient: map 29% reduce 0%
13/10/25 21:41:13 INFO mapred.JobClient: map 30% reduce 0%
13/10/25 21:41:14 INFO mapred.JobClient: map 31% reduce 0%
13/10/25 21:41:16 INFO mapred.JobClient: map 32% reduce 0%
13/10/25 21:41:20 INFO mapred.JobClient: map 34% reduce 0%
13/10/25 21:41:23 INFO mapred.JobClient: map 35% reduce 0%
13/10/25 21:41:26 INFO mapred.JobClient: map 36% reduce 0%
13/10/25 21:41:34 INFO mapred.JobClient: map 37% reduce 0%
13/10/25 21:41:39 INFO mapred.JobClient: map 38% reduce 0%
13/10/25 21:41:43 INFO mapred.JobClient: map 40% reduce 0%
13/10/25 21:41:44 INFO mapred.JobClient: map 40% reduce 6%
13/10/25 21:41:46 INFO mapred.JobClient: map 42% reduce 6%
13/10/25 21:41:49 INFO mapred.JobClient: map 43% reduce 6%
13/10/25 21:41:51 INFO mapred.JobClient: map 44% reduce 6%
13/10/25 21:41:52 INFO mapred.JobClient: map 45% reduce 6%
13/10/25 21:41:55 INFO mapred.JobClient: map 46% reduce 6%
13/10/25 21:41:57 INFO mapred.JobClient: map 47% reduce 6%
13/10/25 21:41:58 INFO mapred.JobClient: map 48% reduce 9%
13/10/25 21:42:01 INFO mapred.JobClient: map 51% reduce 12%
13/10/25 21:42:04 INFO mapred.JobClient: map 54% reduce 12%
13/10/25 21:42:07 INFO mapred.JobClient: map 56% reduce 12%
13/10/25 21:42:10 INFO mapred.JobClient: map 58% reduce 12%
13/10/25 21:42:13 INFO mapred.JobClient: map 60% reduce 12%
13/10/25 21:42:16 INFO mapred.JobClient: map 61% reduce 12%
13/10/25 21:42:19 INFO mapred.JobClient: map 62% reduce 15%
13/10/25 21:42:22 INFO mapred.JobClient: map 63% reduce 15%
13/10/25 21:42:23 INFO mapred.JobClient: map 65% reduce 15%
13/10/25 21:42:26 INFO mapred.JobClient: map 66% reduce 15%
13/10/25 21:42:28 INFO mapred.JobClient: map 67% reduce 15%
13/10/25 21:42:29 INFO mapred.JobClient: map 68% reduce 15%
13/10/25 21:42:32 INFO mapred.JobClient: map 69% reduce 15%
13/10/25 21:42:34 INFO mapred.JobClient: map 70% reduce 18%
13/10/25 21:42:35 INFO mapred.JobClient: map 72% reduce 18%
13/10/25 21:42:38 INFO mapred.JobClient: map 75% reduce 18%
13/10/25 21:42:41 INFO mapred.JobClient: map 77% reduce 18%
13/10/25 21:42:44 INFO mapred.JobClient: map 80% reduce 18%
13/10/25 21:42:47 INFO mapred.JobClient: map 82% reduce 18%
13/10/25 21:42:50 INFO mapred.JobClient: map 85% reduce 18%
13/10/25 21:42:53 INFO mapred.JobClient: map 87% reduce 18%
13/10/25 21:42:56 INFO mapred.JobClient: map 88% reduce 18%
13/10/25 21:42:59 INFO mapred.JobClient: map 89% reduce 18%
13/10/25 21:43:02 INFO mapred.JobClient: map 90% reduce 18%
13/10/25 21:43:05 INFO mapred.JobClient: map 91% reduce 18%
13/10/25 21:43:18 INFO mapred.JobClient: map 94% reduce 21%
13/10/25 21:43:21 INFO mapred.JobClient: map 97% reduce 21%
13/10/25 21:43:24 INFO mapred.JobClient: map 99% reduce 27%
13/10/25 21:43:27 INFO mapred.JobClient: map 100% reduce 30%
13/10/25 21:43:30 INFO mapred.JobClient: map 100% reduce 67%
1 answer

The symptom here is that your code is getting stuck in the reduce phase, either because of an infinite loop, or just a ridiculous amount of data being received, or something else (maybe a hang in your reduce code?).

Here's how the percentages in the reducer work:

  • 0-33% is the shuffle. This is data moving from the mappers to the reducers (see how it starts before the map phase finishes).
  • 33%-67% is the sort. This can only begin once the mappers have finished (see how it jumps from 30% to 67% once map reaches 100%).
  • 67%-100% is the actual reduce code that you are running. This percentage goes up each time a reduce task completes. None of your reduce tasks are completing.
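As a rough illustration of the ranges above, here is a minimal Python sketch that pulls the percentages out of a JobClient log line and names the phase the reduce figure most likely reflects. The function names `parse_progress` and `reduce_phase` are hypothetical helpers, not part of Hadoop, and the reported figure is an aggregate across reduce tasks, so treat the result as a hint rather than a precise diagnosis.

```python
import re

# Matches the "map N% reduce M%" part of a JobClient progress line.
PROGRESS_RE = re.compile(r"map (\d+)% reduce (\d+)%")


def parse_progress(line):
    """Return (map_percent, reduce_percent), or None if the line has no progress."""
    match = PROGRESS_RE.search(line)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def reduce_phase(reduce_percent):
    """Name the phase suggested by a reduce percentage (hypothetical helper)."""
    if not 0 <= reduce_percent <= 100:
        raise ValueError("percentage must be between 0 and 100")
    if reduce_percent < 33:
        return "shuffle"  # data moving from mappers to reducers
    if reduce_percent < 67:
        return "sort"     # can only begin once the mappers have finished
    return "reduce"       # the user's reduce code is running


last_line = "13/10/25 21:43:30 INFO mapred.JobClient: map 100% reduce 67%"
map_pct, reduce_pct = parse_progress(last_line)
print(map_pct, reduce_pct, reduce_phase(reduce_pct))
```

For the last line of the log in the question, this reports the reduce phase: shuffle and sort are done, and the job is sitting in the user's reduce code.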

In the JobTracker interface, go to your job and look at how much data the reducers are receiving. If the number of records going into the reducers keeps increasing, you probably just have too much data going to the reducers. If that number stays fixed, you may have an infinite loop.
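The check described above can be sketched as a small Python function. It assumes you have noted down the reduce input record count from the JobTracker page a few times, some minutes apart; `diagnose` and the sample numbers are hypothetical and only illustrate the rule of thumb.

```python
def diagnose(record_counts):
    """Classify readings of the reduce input record count taken over time.

    record_counts: readings in chronological order, copied by hand
    from the JobTracker page (hypothetical input format).
    """
    if len(record_counts) < 2:
        raise ValueError("need at least two readings to compare")
    if record_counts[-1] > record_counts[0]:
        # The reducers are still consuming records.
        return "growing: probably too much data going to the reducers"
    # No new records are being consumed between readings.
    return "fixed: possibly an infinite loop in the reduce code"


print(diagnose([120000, 450000, 910000]))  # made-up readings that keep growing
print(diagnose([910000, 910000, 910000]))  # made-up readings that stay flat
```

Neither outcome is proof on its own; a flat count could also come from a single very slow call inside the reduce code, so the task logs are worth checking as well.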


Source: https://habr.com/ru/post/956721/

