[
https://issues.apache.org/jira/browse/OOZIE-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Attila Sasvari updated OOZIE-2819:
----------------------------------
Attachment: OOZIE-2819-01.patch
> Make Oozie REST API accept multibyte characters via client side xml
> -------------------------------------------------------------------
>
> Key: OOZIE-2819
> URL: https://issues.apache.org/jira/browse/OOZIE-2819
> Project: Oozie
> Issue Type: Bug
> Reporter: Attila Sasvari
> Assignee: Attila Sasvari
> Attachments: OOZIE-2819-00.patch, OOZIE-2819-01.patch
>
>
> Submitted Pig action with client side xml failed via proxy submission when it
> contained multibyte characters.
> {code}
> curl -i -X POST -d @/tmp/pig.xml -H 'Content-Type: application/XML;
> charset=UTF-8'
> 'http://'localhost':11000/oozie/v1/jobs?jobtype=pig&action=start'
> {code}
> Where
> {code}
> $ hdfs dfs -cat /tmp/encoding/input.txt
> 松
> 林檎
> 松
> {code}
> {code}
> $ cat /tmp/pig.xml
> <configuration>
> <property>
> <name>fs.default.name</name>
> <value>hdfs://localhost:8020/</value>
> </property>
> <property>
> <name>mapred.job.tracker</name>
> <value>localhost:8032</value>
> </property>
> <property>
> <name>user.name</name>
> <value>hdfs</value>
> </property>
> <property>
> <name>oozie.pig.script</name>
> <value><![CDATA[
> lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS line;
> test = FILTER lines BY line == '松';
> STORE test INTO 'hdfs:///tmp/encoding/output' USING PigStorage('\n');
> ]]></value>
> </property>
> <property>
> <name>oozie.pig.script.params.size</name>
> <value>0</value>
> </property>
> <property>
> <name>oozie.pig.script.options.size</name>
> <value>0</value>
> </property>
> <property>
> <name>oozie.libpath</name>
> <value>hdfs:///user/oozie/share/lib</value>
> </property>
> <property>
> <name>oozie.use.system.libpath</name>
> <value>true</value>
> </property>
> <property>
> <name>oozie.proxysubmission</name>
> <value>true</value>
> </property>
> </configuration>
> {code}
> In the Oozie launcher log, I could see
> {code}
> lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS
> line;test = FILTER lines BY line == '~';STORE test INTO
> 'hdfs:///tmp/encoding/output' USING PigStorage('\n');
> {code}
> was used instead of the intended 松
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)