[ 
https://issues.apache.org/jira/browse/TAJO-992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14104213#comment-14104213
 ] 

ASF GitHub Bot commented on TAJO-992:
-------------------------------------

Github user hyunsik commented on the pull request:

    https://github.com/apache/tajo/pull/115#issuecomment-52815070
  
    +1
    
    Even though Travis CI shows failure, it seems to be not related to the 
patch. I manually verified 'mvn clean install'. It works well and pass all unit 
tests.
    
    Also, the patch looks nice to me. Ship it



> Reduce number of hash shuffle output file.
> ------------------------------------------
>
>                 Key: TAJO-992
>                 URL: https://issues.apache.org/jira/browse/TAJO-992
>             Project: Tajo
>          Issue Type: Sub-task
>          Components: data shuffle
>            Reporter: Hyoungjun Kim
>            Assignee: Hyoungjun Kim
>
> Currently Tajo creates too many intermediate files in the case of hash 
> shuffle. A execution block(SubQuery) on a TajoWorker creates intermediate 
> files  as following rule:
>   # intermediate files  in a worker = # tasks / # workers * # partitions 
> This may cause 'too many file opens' error and makes it difficult to scale 
> out. To solve this problem, We should reduce number of hash shuffle output 
> file.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to