[ https://issues.apache.org/jira/browse/SQOOP-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16266090#comment-16266090 ]
Yulei Yang commented on SQOOP-3261:
-----------------------------------

Usage: sqoop import -D charset.from= -D charset.to=

In the regular case there is no need to set these two parameters. To convert WE8MSWIN1252 or US7ASCII data to human-readable Chinese, set charset.from='ISO-8859-1' and charset.to='GBK'.

> Enable charset convert when importing
> -------------------------------------
>
>                 Key: SQOOP-3261
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3261
>             Project: Sqoop
>          Issue Type: New Feature
>          Components: codegen
>    Affects Versions: 1.4.6
>            Reporter: Yulei Yang
>         Attachments: sqoop-3261.patch
>
>
> Hi,
> I think someone may need to convert the charset of data when importing it
> from an RDBMS. In my case, if I do nothing, a table that stores Chinese
> content in Oracle with charset WE8MSWIN1252 is unreadable in Hive. I know
> some databases can achieve this by setting the charset in the connection
> URL, while others lack this option or make it inconvenient to use. I have
> a generic way to do this, and we have used this solution for several
> months in our company. Could someone please add me to the contributors
> list?

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
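
For readers wondering what a charset.from / charset.to pair such as 'ISO-8859-1' and 'GBK' would amount to, here is a minimal Java sketch of that kind of conversion. It assumes the imported string was decoded one byte per character as ISO-8859-1 although the stored bytes are really GBK. The class and method names are hypothetical and are not taken from sqoop-3261.patch, which may work differently.

```java
import java.nio.charset.Charset;

// Hypothetical illustration only; not the code from sqoop-3261.patch.
class CharsetConvertSketch {

    // Re-encode the string with the charset it was wrongly decoded in (from),
    // then decode the recovered bytes with the charset they really use (to).
    static String convert(String value, String from, String to) {
        if (value == null) {
            return null;
        }
        byte[] raw = value.getBytes(Charset.forName(from));
        return new String(raw, Charset.forName(to));
    }

    public static void main(String[] args) {
        String original = "\u4e2d\u6587"; // the two characters of "Chinese" in Chinese

        // Simulate the problem: GBK bytes decoded as ISO-8859-1 by the driver.
        String garbled = new String(
                original.getBytes(Charset.forName("GBK")),
                Charset.forName("ISO-8859-1"));

        // Equivalent of charset.from='ISO-8859-1', charset.to='GBK'.
        String fixed = convert(garbled, "ISO-8859-1", "GBK");
        System.out.println(fixed.equals(original));
    }
}
```

The round trip is lossless here because ISO-8859-1 maps every byte value to exactly one character, so the original bytes can always be recovered before decoding them as GBK. This probably explains why the comment suggests 'ISO-8859-1' as the source charset.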