Hyoungjun Kim created TAJO-992:
----------------------------------

             Summary: Reduce number of hash shuffle output file.
                 Key: TAJO-992
                 URL: https://issues.apache.org/jira/browse/TAJO-992
             Project: Tajo
          Issue Type: Sub-task
            Reporter: Hyoungjun Kim


Currently Tajo creates too many intermediate files in the case of hash shuffle. 
A execution block(SubQuery) on a TajoWorker creates intermediate files  as 
following rule:

  # intermediate files  in a worker = # tasks / # workers * # partitions 

This may cause 'too many file opens' error and makes it difficult to scale out. 
To solve this problem, We should reduce number of hash shuffle output file.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to