[jira] [Commented] (PHOENIX-2216) Support single mapper pass to CSV bulk load table and indexes

James Taylor (JIRA) Tue, 13 Oct 2015 14:15:19 -0700

    [ 
https://issues.apache.org/jira/browse/PHOENIX-2216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14955690#comment-14955690
 ]


James Taylor commented on PHOENIX-2216:
---------------------------------------

I like the approach that works. :-) 

Ravi and I talked offline about him putting together an HBase patch that makes 
public or protected the necessary methods such that the copy/paste code is not 
necessary. [[email protected]] - if you could annotate the code a bit 
and indicate *what* you copy/pasted and *where* it diverged, that would be 
helpful.

[~gabriel.reid] - would you have a few spare cycles to review? If you feel 
strongly about one approach versus the other (or even a new one), then that's 
ok too. Not sure if you know off the top of your head why the second approach 
isn't working. Would be great if we could get this into our 4.6.0 release, 
though. 

> Support single mapper pass to CSV bulk load table and indexes
> -------------------------------------------------------------
>
>                 Key: PHOENIX-2216
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-2216
>             Project: Phoenix
>          Issue Type: Bug
>            Reporter: James Taylor
>            Assignee: maghamravikiran
>         Attachments: phoenix-custom-hfileoutputformat.patch, 
> phoenix-multipleoutputs.patch
>
>
> Instead of running separate MR jobs for CSV bulk load: once for the table and 
> then once for each secondary index, generate both the data table HFiles and 
> the index table(s) HFiles in one mapper phase.
> Not sure if we need HBASE-3727 to be implemented for this or if we can do it 
> with existing HBase APIs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (PHOENIX-2216) Support single mapper pass to CSV bulk load table and indexes

Reply via email to