[ 
https://issues.apache.org/jira/browse/SQOOP-3261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16266090#comment-16266090
 ] 

Yulei Yang commented on SQOOP-3261:
-----------------------------------

Usage: sqoop import -D charset.from= -D charset.to=.  As for regular case, no 
need to set these two parameters. If you want to convert WE8MSWIN1252 or 
US7ASCII to human readable Chinese, set charset.from='ISO-8859-1', 
charset.to='GBK'

> Enable charset convert when importing
> -------------------------------------
>
>                 Key: SQOOP-3261
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3261
>             Project: Sqoop
>          Issue Type: New Feature
>          Components: codegen
>    Affects Versions: 1.4.6
>            Reporter: Yulei Yang
>         Attachments: sqoop-3261.patch
>
>
> Hi,
> I think someone may have the requirement to convert charset of data when 
> importing them from RMDBS。In my case, if I do nothing, a table which store 
> some Chinese content in oracle with charset WE8MSWIN1252 will be unreadable 
> in hive. Yes I know some databases have the function to archive this by 
> setting charset in connection url, while some others don't have this function 
> or it's inconvenient to use.  I have a common way to do this, and we have use 
> this solution for several months in our company. Could someone please add me 
> to the contributors list?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to