[ 
https://issues.apache.org/jira/browse/PHOENIX-1056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14053505#comment-14053505
 ] 

jay wong commented on PHOENIX-1056:
-----------------------------------

Normal HBase ImportTsv usage is:

bin/hbase org.apache.hadoop.hbase.mapreduce.ImportTsv 
-Dimporttsv.columns=HBASE_ROW_KEY,cf:a,cf:b,cf:c  
-Dimporttsv.bulk.output=hdfs://storefile-outputdir <tablename> <hdfs-inputdir>

Phoenix ImportTsv usage is as follows; it also supports Phoenix data types:

bin/hbase org.apache.hadoop.hbase.mapreduce.PhoneixImportTsv 
-Dimporttsv.columns=HBASE_ROW_KEY,CF:A:PH_INT,CF:B:PH:BIGINT,cf:c 
-Dimporttsv.index.all=true  -Dimporttsv.bulk.output=hdfs://storefile-outputdir 
<tablename> <hdfs-inputdir>

If the primary key has multiple columns, the following rule is supported:
replace HBASE_ROW_KEY with HBASE_ROW_KEY^CF1:Q1:PH_INT^CF2:Q2^0^CF1:Q3:PH_INT
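To make the column syntax above concrete, here is a minimal Python sketch of how such a spec could be split into family, qualifier and optional Phoenix type. It is only an illustration, not the code from the patch: the names parse_column and parse_column_spec are hypothetical, and treating a missing type as "untyped" is an assumption. The ^-separated row key form is only split into its raw parts, without interpreting them.

```python
from typing import List, NamedTuple, Optional, Tuple

ROW_KEY = "HBASE_ROW_KEY"


class Column(NamedTuple):
    family: str
    qualifier: str
    ph_type: Optional[str]  # e.g. "PH_INT"; None when no type suffix is given


def parse_column(token: str) -> Column:
    """Split 'CF:Q' or 'CF:Q:TYPE' into its parts (hypothetical helper)."""
    parts = token.split(":", 2)
    if len(parts) < 2:
        raise ValueError("expected CF:QUALIFIER[:TYPE], got %r" % token)
    ph_type = parts[2] if len(parts) == 3 else None
    return Column(parts[0], parts[1], ph_type)


def parse_column_spec(spec: str) -> Tuple[List[str], List[Column]]:
    """Return (raw row key parts, data columns) for an importtsv.columns value.

    Assumption: the row key entry is either plain HBASE_ROW_KEY or
    HBASE_ROW_KEY followed by ^-separated parts, which are kept as raw strings.
    """
    row_key_parts: List[str] = []
    columns: List[Column] = []
    for entry in spec.split(","):
        entry = entry.strip()
        if entry == ROW_KEY or entry.startswith(ROW_KEY + "^"):
            row_key_parts = entry.split("^")[1:]
        else:
            columns.append(parse_column(entry))
    return row_key_parts, columns
```

For example, parse_column_spec("HBASE_ROW_KEY,CF:A:PH_INT,cf:c") yields an empty list of row key parts and two columns, the second one without a type.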

Parameters:
-Dimporttsv.index.all=true  Build the data for all index tables. Default is false.
-Dimporttsv.build.table=true  Build the data table. Default is true.
-Dimporttsv.index.names=INDEX1,INDEX2  The index tables to build. Requires
-Dimporttsv.index.all=false.
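
How these three parameters could combine is sketched below in Python. This is an assumption-laden illustration rather than the patch's logic: the function name resolve_targets and its arguments are hypothetical, and it assumes that importtsv.index.names is ignored whenever importtsv.index.all is true.

```python
from typing import Dict, List


def resolve_targets(props: Dict[str, str], table: str,
                    all_indexes: List[str]) -> List[str]:
    """Decide which tables get HFiles, based on the -D properties.

    Defaults follow the parameter list: build.table=true, index.all=false.
    Assumption: index.names only applies when index.all is false.
    """
    def flag(name: str, default: str) -> bool:
        return props.get(name, default).strip().lower() == "true"

    targets: List[str] = []
    if flag("importtsv.build.table", "true"):
        targets.append(table)
    if flag("importtsv.index.all", "false"):
        targets.extend(all_indexes)
    else:
        names = props.get("importtsv.index.names", "")
        targets.extend(n.strip() for n in names.split(",") if n.strip())
    return targets
```

With no properties set, only the data table is built; with importtsv.build.table=false and importtsv.index.names=INDEX1, only INDEX1 is built.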






> A ImportTsv tool for phoenix to build table data and all index data.
> --------------------------------------------------------------------
>
>                 Key: PHOENIX-1056
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-1056
>             Project: Phoenix
>          Issue Type: Task
>    Affects Versions: 3.0.0
>            Reporter: jay wong
>             Fix For: 3.1
>
>         Attachments: PHOENIX-1056.patch
>
>
> I have just built a tool that builds table data and index table data, much like 
> the ImportTsv job.
> http://hbase.apache.org/book/ops_mgt.html#importtsv
> When ImportTsv runs, it writes HFiles to a path named after each CF.
> For example, a table has two CFs, A and B.
> The output is 
> ...../outputpath/A
> ...../outputpath/B
> In my job, we have a table, TableOne, and two indexes, IdxOne and IdxTwo.
> The output will be
> ...../outputpath/TableOne/A
> ...../outputpath/TableOne/B
> ...../outputpath/IdxOne
> ...../outputpath/IdxTwo
> If anyone needs it, I will build a clean tool.
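
The output layout described in the quoted issue can be written down as a small Python sketch. The function name output_dirs and its arguments are hypothetical and only mirror the TableOne/IdxOne/IdxTwo example; nothing here is taken from the patch itself.

```python
import posixpath
from typing import List


def output_dirs(output_path: str, table: str, families: List[str],
                indexes: List[str]) -> List[str]:
    """List the HFile directories for one data table and its index tables.

    Data table HFiles go under <output>/<table>/<cf>; per the example in the
    issue, each index table gets a single directory <output>/<index>.
    """
    dirs = [posixpath.join(output_path, table, cf) for cf in families]
    dirs += [posixpath.join(output_path, idx) for idx in indexes]
    return dirs
```

Calling output_dirs("/outputpath", "TableOne", ["A", "B"], ["IdxOne", "IdxTwo"]) gives the four directories listed in the issue, with /outputpath standing in for the elided prefix.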



--
This message was sent by Atlassian JIRA
(v6.2#6252)
