[ 
https://issues.apache.org/jira/browse/PHOENIX-1561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14259371#comment-14259371
 ] 

Brian Johnson commented on PHOENIX-1561:
----------------------------------------

Implementing all 3 interfaces allows pig to do map-side joins with other data 
sources, something that is obviously impossible with Phoenix alone. It also 
allows pig to do map side joins on relations that don't fit in memory which 
Phoenix is not yet able to do.

https://pig.apache.org/docs/r0.14.0/perf.html#specialized-joins

> Pig optimized joins
> -------------------
>
>                 Key: PHOENIX-1561
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1561
>             Project: Phoenix
>          Issue Type: Bug
>    Affects Versions: 4.2
>            Reporter: Brian Johnson
>            Assignee: Brian Johnson
>         Attachments: patch
>
>
> PhoenixHBaseLoader should implement both OrderedLoadFunc and 
> CollectableLoadFunc just like HBaseStorage. There is nothing special that 
> needs to be done other than implementing a single method. As in HBaseStorage, 
> it is up to the user to ensure that the required constraints are not 
> violated. 
> {code:java}
>     public void ensureAllKeyInstancesInSameSplit() throws IOException {
>         /** 
>          * no-op because hbase keys are unique 
>          * This will also work with things like 
> DelimitedKeyPrefixRegionSplitPolicy
>          * if you need a partial key match to be included in the split
>          */
>         LOG.debug("ensureAllKeyInstancesInSameSplit");
>     }
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to