So, it seems it is a JVM and DB configuration issue, right? It would be great to verify this is the case and add the necessary config steps to the documentation.
thx On Mon, Aug 11, 2014 at 2:03 PM, bowen zhang < [email protected]> wrote: > I think I figured out this issue. For oozie to support unicode in path > name, the JAVA_TOOL_OPTIONS in the JVM needs to use an encoding method that > supports unicode. And the oozie DB needs to use an encoding method that's > consistent with the JVM. By default, mysql, for example, uses "latin1" if > running "create database oozie;". This won't work since "latin1" only > supports Western characters. > > > > On Friday, August 8, 2014 7:06 PM, bowen zhang <[email protected]> > wrote: > > > > here is an example: > cat job.properties: > nameNode=hdfs://localhost:9000 > jobTracker=localhost:8032 > queueName=default > examplesRoot=examples > > oozie.wf.application.path=${nameNode}/user/${user.name > }/${examplesRoot}/apps/map-reduceć–‡ > outputDir=map-reduce > > > > On Friday, August 8, 2014 6:55 PM, Alejandro Abdelnur <[email protected]> > wrote: > > > > hey bowen, it would help to know exactly where do you see issues, a simple > wf showing the issue would be great > > Alejandro > (phone typing) > > > > On Aug 8, 2014, at 18:12, bowen zhang <[email protected]> > wrote: > > > > Hi guys, > > I recently discovered a hole in oozie implementation where oozie breaks > when having hdfs app_path with unicode. If my app_path contains non-ascii > characters like Chinese words, the DB and server side will not retrieve it > correctly. Do we have strict utf-8 encoding throughout oozie? Any thoughts > on this? > > Bowen > -- Alejandro
