[ 
https://issues.apache.org/jira/browse/TAJO-983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14104163#comment-14104163
 ] 

Hyunsik Choi commented on TAJO-983:
-----------------------------------

Hi [~mhthanh],

Your approach looks good to me. BTW, I have some better approach for it. I'm 
sorry for not mentioning it earlier.

Please see Task::localizeFetchedData() method. This method makes FileFragments 
for input data from fetched files. The FileFragments will be ultimately used 
for a sequential scanner or merge scanner. I think that we can directly make 
FileFragment with a original local path without unnecessary copying the files 
stored in local file system.

How do you think about that?

Best regards,
Hyunsik



> Worker should directly read Intermediate data stored in localhost rather than 
> fetching
> --------------------------------------------------------------------------------------
>
>                 Key: TAJO-983
>                 URL: https://issues.apache.org/jira/browse/TAJO-983
>             Project: Tajo
>          Issue Type: Bug
>          Components: data shuffle
>            Reporter: Hyunsik Choi
>            Assignee: Mai Hai Thanh
>         Attachments: TAJO-983.140820.0.patch.txt
>
>
> Currently, worker always fetches all intermediate via Fetcher and than store 
> them in local file system even though some intermediate data already are  
> stored in local file system. It is inefficient and causes unnecessary I/O and 
> extra storage occupation. We should improve it.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to