Anandha,
If you are seeing incorrect results from Hive, I would suggest sending a
mail to user@hive.
As Ravi pointed out, containers can continue to run after a query
completes, due to how container re-use is configured.

On Sun, May 21, 2017 at 11:37 PM, Ravi Teja Chilukuri <raviort...@gmail.com>
wrote:

> HI Anandha,
>
> | reducers are still running but " DAG finished successfully" and
> inserted  zero records.
>
> The "containers" used by reducers can be still running even after the DAG
> has completed , due to the container re-use, they will be waiting for
> another "task" to be assigned to the same container.
> Can you please verify the task logs of the container where you see the
> reducer is still running, I think the reduce attempt would have already
> have completed and the container is waiting for a new task.
>
> If that is the case, can you verify the hive query which might be causing
> 0 records being inserted.
>
> Thanks,
> Ravi
>
> On Mon, May 22, 2017 at 11:51 AM, Anandha L Ranganathan <
> analog.s...@gmail.com> wrote:
>
>> Thanks Ravi for the information. We will udpate
>> tez.session.am.dag.submit.timeout.secs to lower value in our config.
>>
>> Our main concern is  when   reducers are still running but " DAG
>> finished successfully" and inserted  zero records.
>>
>> [2017-04-21 18:29:38,597] {hive_hooks.py:130} INFO - INFO  : Table 
>> dw_insight.d_experiment stats: [numFiles=0, numRows=0, totalSize=0, 
>> rawDataSize=0]
>>
>>
>> This resulted incorrect result set and data got corrupted.
>>
>> Thanks
>> Anand
>>
>> On Sun, May 21, 2017 at 7:18 AM, Ravi Teja Chilukuri <
>> raviort...@gmail.com> wrote:
>>
>>> Hi Anandha,
>>>
>>> If your hive query has completed and still the containers are running,
>>> you might want to set the param: tez.session.am.dag.submit.timeout.secs
>>> to a lower value , for the yarn application to be completely shutdown.
>>> In session mode, by default the AM container waits till 5 minutes for
>>> any other queries to be fired.
>>>
>>>
>>> Thanks,
>>> Ravi
>>>
>>> On Sat, May 20, 2017 at 4:44 AM, Anandha L Ranganathan <
>>> analog.s...@gmail.com> wrote:
>>>
>>>> We hit with unique problem. Still the tez containers are running but Hive 
>>>> moved the file from staging location to actual location.  Is this bug in 
>>>> the Tez engine?
>>>>
>>>> I have attached logs and  you can see still Map1, Reducer3 , Reducer 4, 
>>>> Reducer 7 are running but DAG is SUCCEEDED.
>>>>
>>>>
>>>>
>>>> [2017-04-21 18:08:57,108] {hive_hooks.py:130} INFO - INFO  : Map 1: 
>>>> 0(+2)/8        Map 12: 1/1     Map 5: 1/1      Map 8: 1/1      Map 9: 1/1  
>>>>     Reducer 11: 0/1 Reducer 3: 0/308        Reducer 4: 0/2  Reducer 7: 0/1
>>>> [2017-04-21 18:09:57,116] {hive_hooks.py:130} INFO - INFO  : Map 1: 
>>>> 0(+2)/8        Map 12: 1/1     Map 5: 1/1      Map 8: 1/1      Map 9: 1/1  
>>>>     Reducer 11: 0/1 Reducer 3: 0/308        Reducer 4: 0/2  Reducer 7: 0/1
>>>> [2017-04-21 18:09:57,116] {hive_hooks.py:130} INFO - INFO  : Map 1: 
>>>> 0(+2)/8        Map 12: 1/1     Map 5: 1/1      Map 8: 1/1      Map 9: 1/1  
>>>>     Reducer 11: 0/1 Reducer 3: 0/308        Reducer 4: 0/2  Reducer 7: 0/1
>>>> [2017-04-21 18:29:38,595] {hive_hooks.py:130} INFO - INFO  : Map 1: 
>>>> 0(+2)/8        Map 12: 1/1     Map 5: 1/1      Map 8: 1/1      Map 9: 1/1  
>>>>     Reducer 11: 0/1 Reducer 3: 0/308        Reducer 4: 0/2  Reducer 7: 0/1
>>>> [2017-04-21 18:29:38,595] {hive_hooks.py:130} INFO - INFO  :
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO - INFO  : Status: DAG 
>>>> finished successfully in 1560.07 seconds
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO - INFO  :
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO -
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO - INFO  : METHOD        
>>>>                  DURATION(ms)
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO - INFO  : parse         
>>>>                            0
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO - INFO  : 
>>>> semanticAnalyze                          0
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO - INFO  : TezBuildDag   
>>>>                            0
>>>> [2017-04-21 18:29:38,596] {hive_hooks.py:130} INFO - INFO  : 
>>>> TezSubmitToRunningDag               12,957
>>>> [2017-04-21 18:29:38,597] {hive_hooks.py:130} INFO - INFO  : TotalPrepTime 
>>>>                       53,372
>>>> [2017-04-21 18:29:38,597] {hive_hooks.py:130} INFO -
>>>> [2017-04-21 18:29:38,597] {hive_hooks.py:130} INFO - INFO  : Loading data 
>>>> to table dw_insight.d_experiment from 
>>>> hdfs://dfs-nameservices/data/insight/dwuser/d_experiment/.hive-staging_hive_2017-04-21_18-02-31_741_4202396962843353301-4404/-ext-10000
>>>> [2017-04-21 18:29:38,597] {hive_hooks.py:130} INFO - INFO  : Table 
>>>> dw_insight.d_experiment stats: [numFiles=0, numRows=0, totalSize=0, 
>>>> rawDataSize=0]
>>>> [2017-04-21 18:29:38,597] {hive_hooks.py:130} INFO - No rows affected 
>>>> (1626.877 seconds)
>>>> [2017-04-21 18:29:38,604] {hive_hooks.py:130} INFO - 0: 
>>>> jdbc:hive2://usw2prdpmn01:10000/dw_insight>
>>>> [2017-04-21 18:29:38,611] {hive_hooks.py:130} INFO - 0: 
>>>> jdbc:hive2://usw2prdpmn01:10000/dw_insight> ANALYZE TABLE 
>>>> ${hiveconf:target_t
>>>>
>>>>
>>>
>>
>

Reply via email to