Attila Sasvari created OOZIE-2819:
-------------------------------------

             Summary: Make Oozie REST API accept multibyte characters via 
client side xml
                 Key: OOZIE-2819
                 URL: https://issues.apache.org/jira/browse/OOZIE-2819
             Project: Oozie
          Issue Type: Bug
            Reporter: Attila Sasvari


Submitted Pig action with client side xml failed via proxy submission when it 
contained multibyte characters.

{code}
curl -i  -X POST -d @/tmp/pig.xml -H 'Content-Type: application/XML; 
charset=UTF-8' 'http://'localhost':11000/oozie/v1/jobs?jobtype=pig&action=start'
{code}

Where
{code}
$ hdfs dfs -cat /tmp/encoding/input.txt
松
林檎
松
{code}

{code}
$ cat /tmp/pig.xml 
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:8020/</value>
</property>
<property>
<name>mapred.job.tracker</name>
<value>localhost:8032</value>
</property>
<property>
<name>user.name</name>
<value>hdfs</value>
</property>
<property>
<name>oozie.pig.script</name>
<value><![CDATA[
lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS line;
test = FILTER lines BY line == '松';
STORE test INTO 'hdfs:///tmp/encoding/output' USING PigStorage('\n');
]]></value>
</property>
<property>
<name>oozie.pig.script.params.size</name>
<value>0</value>
</property>
<property>
<name>oozie.pig.script.options.size</name>
<value>0</value>
</property>
<property>
<name>oozie.libpath</name>
<value>hdfs:///user/oozie/share/lib</value>
</property>
<property>
<name>oozie.use.system.libpath</name>
<value>true</value>
</property>
<property>
<name>oozie.proxysubmission</name>
<value>true</value>
</property>
</configuration>
{code}

In the Oozie launcher log, I could see
{code}
lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS 
line;test = FILTER lines BY line == '~';STORE test INTO 
'hdfs:///tmp/encoding/output' USING PigStorage('\n');
{code}
was used instead of the intended 松



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to