Hi
  I'm using Tez 0.9.1 and Hive 2.3.3. Running TPC-DS query15 with 1TB.
In Reduce5, i see long pause during fetch occasionally so it's hard to make a 
Jstack.
Reduce5 depends on Map4 and Reduce2,these two vertexes finished before 16:33:00.
So the fetch task should not be in wait state for upstream vertexes.

How do i find out what makes this long pause? 

  thanks.

2020-05-30 16:36:29,531 [INFO] [Fetcher_B {Map_4} #12] |ShuffleManager.fetch|: 
Completed fetch for attempt: {5, 0, 
attempt_1590728138875_0202_1_01_000005_0_10003} to MEMORY, csize=24509, 
dsize=66456, EndTime=1590827789531, TimeTaken=1, Rate=23.37 MB/s 2020-05-30 
16:37:41,368 [INFO] [Fetcher_B {Map_4} #2] |HttpConnection.url|: for 
url=http://node-ana-coreLKpD0001:13562/mapOutput?job=job_1590728138875_0202&dag=1&reduce=245&map=attempt_1590728138875_0202_1_01_000027_0_10002,attempt_1590728138875_0202_1_01_000023_0_10003,attempt_1590728138875_0202_1_01_000030_0_10002,attempt_1590728138875_0202_1_01_000025_0_10003,attempt_1590728138875_0202_1_01_000029_0_10003
 sent hash and receievd reply 0 ms

Reply via email to