I'm on HDP 2.1 build running a Hive job that has created 3 stages. The first stage has 1045 maps, the second has 2 reducers the 3rd has 1 reducer. The job churns through the first stage and never starts the second.
I can see from the log file syslog_dag_.... that the job releases the containers and gets down to heldContainers=3 (which makes sense to me). How can I diagnose this further? How can I run the job in a safer mode, low gear, something to get through this stall? I should note, that I turned off the timeline service because of numerous errors by modifying yarn-site.xml: via Ambari <name>yarn.timeline-service.enabled</name> <value>false</value> Thanks in advance, Douglas
