[ https://issues.apache.org/jira/browse/PIG-771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703165#action_12703165 ]
Daniel Dai commented on PIG-771:
--------------------------------

It seems to work fine on my computer, whether I dump it to the screen or store it into a file. My OS encoding is UTF8. Here is a log:

grunt> dan...@ubuntu-daniel1:~/pig$ cat chinese.txt
中文测试
dan...@ubuntu-daniel1:~/pig$ java -jar pig.jar
2009-04-27 11:14:10,889 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: file:///
2009-04-27 11:14:11,046 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics - Initializing JVM Metrics with processName=JobTracker, sessionId=
grunt> A = load '/home/daniel/pig/chinese.txt' using PigStorage() as (str: chararray);
grunt> dump A;
2009-04-27 11:14:18,767 [main] INFO org.apache.hadoop.metrics.jvm.JvmMetrics - Cannot initialize JVM Metrics with processName=JobTracker, sessionId= - already initialized
2009-04-27 11:14:18,774 [Thread-4] WARN org.apache.hadoop.mapred.JobClient - Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
2009-04-27 11:14:19,289 [Thread-10] INFO org.apache.hadoop.mapred.MapTask - numReduceTasks: 0
2009-04-27 11:14:19,364 [Thread-10] INFO org.apache.hadoop.mapred.LocalJobRunner -
2009-04-27 11:14:19,368 [Thread-10] INFO org.apache.hadoop.mapred.TaskRunner - Task 'attempt_local_0001_m_000000_0' done.
2009-04-27 11:14:19,372 [Thread-10] INFO org.apache.hadoop.mapred.TaskRunner - Saved output of task 'attempt_local_0001_m_000000_0' to file:/tmp/temp-480716160/tmp2138119751
2009-04-27 11:14:24,220 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete
2009-04-27 11:14:24,222 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
2009-04-27 11:14:24,222 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Success!
(中文测试)
grunt>

> PigDump does not properly output Chinese UTF8 characters - they are displayed as question marks ??
> --------------------------------------------------------------------------------------------------
>
>                 Key: PIG-771
>                 URL: https://issues.apache.org/jira/browse/PIG-771
>             Project: Pig
>          Issue Type: Bug
>            Reporter: David Ciemiewicz
>
> PigDump does not properly output Chinese UTF8 characters.
> The reason for this is that the function Tuple.toString() is called.
> DefaultTuple implements Tuple.toString() and it calls Object.toString() on the opaque object d.
> Instead, I think that the code should be changed to call the new DataType.toString() function.
> {code}
>     @Override
>     public String toString() {
>         StringBuilder sb = new StringBuilder();
>         sb.append('(');
>         for (Iterator<Object> it = mFields.iterator(); it.hasNext();) {
>             Object d = it.next();
>             if(d != null) {
>                 if(d instanceof Map) {
>                     sb.append(DataType.mapToString((Map<Object, Object>)d));
>                 } else {
>                     sb.append(DataType.toString(d)); // <<< Change this one line
>                     if(d instanceof Long) {
>                         sb.append("L");
>                     } else if(d instanceof Float) {
>                         sb.append("F");
>                     }
>                 }
>             } else {
>                 sb.append("");
>             }
>             if (it.hasNext())
>                 sb.append(",");
>         }
>         sb.append(')');
>         return sb.toString();
>     }
> {code}

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
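The issue reports question marks, while the log in the comment shows the text intact on a machine whose OS encoding is UTF8. One plausible factor, which this thread does not confirm, is the charset used when the string is converted to bytes for output. The sketch below is standalone Java and not Pig code; `EncodingDemo` and `roundTrip` are hypothetical names chosen for illustration. It shows that encoding the sample string with a charset that cannot represent Chinese characters (US-ASCII here, as a stand-in) replaces each character with `?`, while UTF-8 preserves it.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class EncodingDemo {
    // Encode the text to bytes in the given charset and decode it back,
    // mimicking a write to (and read from) a stream that uses that charset.
    // String.getBytes(Charset) substitutes the charset's replacement byte
    // for characters it cannot map.
    static String roundTrip(String text, Charset charset) {
        return new String(text.getBytes(charset), charset);
    }

    public static void main(String[] args) {
        // The sample string from chinese.txt, written with Unicode escapes
        // so the source file itself is encoding-independent.
        String text = "\u4e2d\u6587\u6d4b\u8bd5";

        System.out.println("JVM default charset: " + Charset.defaultCharset());
        System.out.println("UTF-8 round trip intact:    "
                + roundTrip(text, StandardCharsets.UTF_8).equals(text));
        System.out.println("US-ASCII round trip intact: "
                + roundTrip(text, StandardCharsets.US_ASCII).equals(text));
        System.out.println("US-ASCII result: "
                + roundTrip(text, StandardCharsets.US_ASCII));
    }
}
```

Running this on the reporter's machine and comparing the printed default charset with the one on a machine where the dump works would be one way to check whether the environment, rather than Tuple.toString(), accounts for the difference.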