Attila Sasvari created OOZIE-2819: ------------------------------------- Summary: Make Oozie REST API accept multibyte characters via client side xml Key: OOZIE-2819 URL: https://issues.apache.org/jira/browse/OOZIE-2819 Project: Oozie Issue Type: Bug Reporter: Attila Sasvari
Submitted Pig action with client side xml failed via proxy submission when it contained multibyte characters. {code} curl -i -X POST -d @/tmp/pig.xml -H 'Content-Type: application/XML; charset=UTF-8' 'http://'localhost':11000/oozie/v1/jobs?jobtype=pig&action=start' {code} Where {code} $ hdfs dfs -cat /tmp/encoding/input.txt 松 林檎 松 {code} {code} $ cat /tmp/pig.xml <configuration> <property> <name>fs.default.name</name> <value>hdfs://localhost:8020/</value> </property> <property> <name>mapred.job.tracker</name> <value>localhost:8032</value> </property> <property> <name>user.name</name> <value>hdfs</value> </property> <property> <name>oozie.pig.script</name> <value><![CDATA[ lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS line; test = FILTER lines BY line == '松'; STORE test INTO 'hdfs:///tmp/encoding/output' USING PigStorage('\n'); ]]></value> </property> <property> <name>oozie.pig.script.params.size</name> <value>0</value> </property> <property> <name>oozie.pig.script.options.size</name> <value>0</value> </property> <property> <name>oozie.libpath</name> <value>hdfs:///user/oozie/share/lib</value> </property> <property> <name>oozie.use.system.libpath</name> <value>true</value> </property> <property> <name>oozie.proxysubmission</name> <value>true</value> </property> </configuration> {code} In the Oozie launcher log, I could see {code} lines = LOAD 'hdfs:///tmp/encoding/input.txt' USING PigStorage('\n') AS line;test = FILTER lines BY line == '~';STORE test INTO 'hdfs:///tmp/encoding/output' USING PigStorage('\n'); {code} was used instead of the intended 松 -- This message was sent by Atlassian JIRA (v6.3.15#6346)