[
https://issues.apache.org/jira/browse/OOZIE-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15924867#comment-15924867
]
Attila Sasvari commented on OOZIE-2819:
---------------------------------------
[~rkanter] I can create a minimal Oozie test that validates that write() is
used instead of writeBytes() in addScriptToCache().
Writing an actual unit test would require massive refactoring, starting with the
Services class (which needs to be initialised when creating an
ActionExecutor object).
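
For context, here is a minimal standalone sketch of why the write() vs
writeBytes() distinction matters. It does not use any Oozie code: the class and
method names (WriteBytesDemo, viaWriteBytes, viaWrite) are made up for
illustration, and UTF-8 is assumed as the intended encoding of the script.
DataOutputStream.writeBytes() writes only the low-order byte of each char, so
松 (U+677E) is reduced to 0x7E, which is '~'.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of Oozie.
public class WriteBytesDemo {

    // Lossy: writeBytes() discards the high eight bits of every char.
    public static byte[] viaWriteBytes(String script) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            out.writeBytes(script);
        }
        return buffer.toByteArray();
    }

    // Lossless: encode explicitly (UTF-8 assumed here), then write the raw bytes.
    public static byte[] viaWrite(String script) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            out.write(script.getBytes(StandardCharsets.UTF_8));
        }
        return buffer.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // \u677e is the character 松 from the issue description.
        String script = "test = FILTER lines BY line == '\u677e';";
        // Decoding the lossy bytes shows '~' where the multibyte character was.
        System.out.println(new String(viaWriteBytes(script), StandardCharsets.UTF_8));
        // Decoding the UTF-8 bytes restores the original script.
        System.out.println(new String(viaWrite(script), StandardCharsets.UTF_8));
    }
}
```

A test along these lines would only need to check that the bytes written for a
multibyte script decode back to the same string, which is much narrower than a
full ActionExecutor unit test.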
> Make Oozie REST API accept multibyte characters via client side xml
> -------------------------------------------------------------------
>
> Key: OOZIE-2819
> URL: https://issues.apache.org/jira/browse/OOZIE-2819
> Project: Oozie
> Issue Type: Bug
> Reporter: Attila Sasvari
> Assignee: Attila Sasvari
> Attachments: OOZIE-2819-00.patch
>
>
> A Pig action submitted with client-side XML via proxy submission failed when
> the script contained multibyte characters.
> {code}
> curl -i -X POST -d @/tmp/pig.xml \
>   -H 'Content-Type: application/XML; charset=UTF-8' \
>   'http://localhost:11000/oozie/v1/jobs?jobtype=pig&action=start'
> {code}
> Where
> {code}
> $ hdfs dfs -cat /tmp/encoding/input.txt
> 松
> 林檎
> 松
> {code}
> {code}
> $ cat /tmp/pig.xml
> <configuration>
> <property>
> <name>fs.default.name</name>
> <value>hdfs://localhost:8020/</value>
> </property>
> <property>
> <name>mapred.job.tracker</name>
> <value>localhost:8032</value>
> </property>
> <property>
> <name>user.name</name>
> <value>hdfs</value>
> </property>
> <property>
> <name>oozie.pig.script</name>
> <value><![CDATA[
> lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS line;
> test = FILTER lines BY line == '松';
> STORE test INTO 'hdfs:///tmp/encoding/output' USING PigStorage('\n');
> ]]></value>
> </property>
> <property>
> <name>oozie.pig.script.params.size</name>
> <value>0</value>
> </property>
> <property>
> <name>oozie.pig.script.options.size</name>
> <value>0</value>
> </property>
> <property>
> <name>oozie.libpath</name>
> <value>hdfs:///user/oozie/share/lib</value>
> </property>
> <property>
> <name>oozie.use.system.libpath</name>
> <value>true</value>
> </property>
> <property>
> <name>oozie.proxysubmission</name>
> <value>true</value>
> </property>
> </configuration>
> {code}
> In the Oozie launcher log, I could see
> {code}
> lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS
> line;test = FILTER lines BY line == '~';STORE test INTO
> 'hdfs:///tmp/encoding/output' USING PigStorage('\n');
> {code}
> Note that '~' was used in the FILTER expression instead of the intended 松.