[ 
https://issues.apache.org/jira/browse/PIG-4246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14182463#comment-14182463
 ] 

Daniel Dai commented on PIG-4246:
---------------------------------

+1

> HBaseStorage should implement getShipFiles
> ------------------------------------------
>
>                 Key: PIG-4246
>                 URL: https://issues.apache.org/jira/browse/PIG-4246
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.14.0
>
>         Attachments: PIG-4246-1.patch
>
>
> HBaseStorage.initializeHBaseClassLoaderResources() uses TableMapReduceUtil 
> APIs to add dependency jars. That sets the tmpjars setting which makes 
> JobClient ship the jars to hdfs and use that path in distributed cache. That 
> bypasses the optimizations in PIG-2672 and PIG-3861 which avoid shipping the 
> jars to hdfs. Instead it should implement the getShipFiles() API introduced 
> in PIG-4141 so that PIG-2672 or PIG-3861 avoid shipping the same jar multiple 
> times to hdfs for a job.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to