[ 
https://issues.apache.org/jira/browse/PHOENIX-2238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gabriel Reid resolved PHOENIX-2238.
-----------------------------------
    Resolution: Invalid

The separator character needs to be supplied as a single character, but you're 
supplying a four-character string ("\001") as the delimiter.

To supply a non-printable character like that on the command line, you'd have 
to supply it as follows:
{code}$'\001'{code}

This is touched upon in the docs at 
http://phoenix.apache.org/bulk_dataload.html, in the section discussing the 
usage of a tab character as separator.

For next time, this kind of thing is generally something you can ask about on 
the user mailing list (http://phoenix.apache.org/mailing_list.html) instead of 
logging as a bug in Jira.

> Bulk dataload Hive data with \001 delimiter
> -------------------------------------------
>
>                 Key: PHOENIX-2238
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-2238
>             Project: Phoenix
>          Issue Type: Bug
>    Affects Versions: 4.3.0
>            Reporter: q79969786
>
> Bulk dataload :
> hadoop jar 
> /opt/cloudera/parcels/CLABS_PHOENIX-4.3.0-1.clabs_phoenix1.0.0.p0.78/lib/phoenix/phoenix-4.3.0-clabs-phoenix-1.0.0-client.jar
>  org.apache.phoenix.mapreduce.CsvBulkLoadTool --table test_phoenix_import  
> --zookeeper n1,n2,n3 --delimiter \001 --input  /user/wym/test_phoenix_import
> The path ‘/user/wym/test_phoenix_import’ is a directory contains Hive data, 
> so the default delimiter is \001.
> Exception:
> 15/09/08 15:22:17 INFO zookeeper.ClientCnxn: EventThread shut down
> 15/09/08 15:22:17 INFO zookeeper.ZooKeeper: Session: 0x34f1c9be8ba5667 closed
> Exception in thread "main" java.lang.IllegalArgumentException: Illegal 
> delimiter character: 001
>         at 
> org.apache.phoenix.mapreduce.CsvBulkLoadTool.configureOptions(CsvBulkLoadTool.java:327)
>         at 
> org.apache.phoenix.mapreduce.CsvBulkLoadTool.loadData(CsvBulkLoadTool.java:201)
>         at 
> org.apache.phoenix.mapreduce.CsvBulkLoadTool.run(CsvBulkLoadTool.java:186)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
>         at 
> org.apache.phoenix.mapreduce.CsvBulkLoadTool.main(CsvBulkLoadTool.java:97)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
>         at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
> 15/09/08 15:22:18 INFO client.ConnectionManager$HConnectionImplementation: 
> Closing zookeeper sessionid=0x34f1c9be8ba5665
> 15/09/08 15:22:18 INFO zookeeper.ClientCnxn: EventThread shut down
> 15/09/08 15:22:18 INFO zookeeper.ZooKeeper: Session: 0x34f1c9be8ba5665 closed



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to