Makoto Yui created HIVEMALL-73:
----------------------------------

             Summary: Reduce memory usage of each_top_k
                 Key: HIVEMALL-73
                 URL: https://issues.apache.org/jira/browse/HIVEMALL-73
             Project: Hivemall
          Issue Type: Improvement
            Reporter: Makoto Yui
            Assignee: Makoto Yui
            Priority: Minor


Better to reduce memory usage in drainQueue().

{code}
2017-02-16 05:56:22,378 FATAL [Thread-4] org.apache.hadoop.mapred.YarnChild: 
Error running child : java.lang.OutOfMemoryError: Java heap space
    at hivemall.tools.EachTopKUDTF.drainQueue(EachTopKUDTF.java:182)
    at hivemall.tools.EachTopKUDTF.close(EachTopKUDTF.java:215)
    at 
org.apache.hadoop.hive.ql.exec.UDTFOperator.closeOp(UDTFOperator.java:143)
    at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:577)
    at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:588)
    at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:588)
    at org.apache.hadoop.hive.ql.exec.mr.ExecReducer.close(ExecReducer.java:318)
    at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237)
    at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:459)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:392)
    at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to