[ 
https://issues.apache.org/jira/browse/SQOOP-3087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15787890#comment-15787890
 ] 

Steve Senior commented on SQOOP-3087:
-------------------------------------

I discovered that there is a workaround to this and that is to use the 
{{--class-name}} Sqoop parameter. I'm not sure that this is the intended use 
case for that parameter, but it produces the desired result.

If I set for e.g. {{--class-name=SH.TEST}} then the Avro schema fields are set 
to {{name=TEST}} and {{namespace=SH}} which means we can overcome the exception.

> Dollar ($) in Oracle schema name causes sqoop to avro to throw exception
> ------------------------------------------------------------------------
>
>                 Key: SQOOP-3087
>                 URL: https://issues.apache.org/jira/browse/SQOOP-3087
>             Project: Sqoop
>          Issue Type: Bug
>    Affects Versions: 1.4.6
>            Reporter: Steve Senior
>
> Created an Oracle table called {{SH_$.TEST}} and attempt to Sqoop import as 
> Avro ({{--as-avrodatafile}}) throws:
> {code}
> 16/12/21 16:54:58 ERROR sqoop.Sqoop: Got exception running Sqoop: 
> org.apache.avro.SchemaParseException: Illegal character in: SH_$_TEST
> org.apache.avro.SchemaParseException: Illegal character in: SH_$_TEST
>       at org.apache.avro.Schema.validateName(Schema.java:1142)
>       at org.apache.avro.Schema.access$200(Schema.java:80)
>       at org.apache.avro.Schema$Name.<init>(Schema.java:483)
>       at org.apache.avro.Schema.createRecord(Schema.java:160)
>       at 
> org.apache.sqoop.orm.AvroSchemaGenerator.generate(AvroSchemaGenerator.java:97)
>       at 
> org.apache.sqoop.mapreduce.DataDrivenImportJob.generateAvroSchema(DataDrivenImportJob.java:154)
>       at 
> org.apache.sqoop.mapreduce.DataDrivenImportJob.configureMapper(DataDrivenImportJob.java:92)
>       at 
> org.apache.sqoop.mapreduce.ImportJobBase.runImport(ImportJobBase.java:260)
>       at org.apache.sqoop.manager.SqlManager.importTable(SqlManager.java:673)
>       at 
> org.apache.sqoop.manager.oracle.OraOopConnManager.importTable(OraOopConnManager.java:284)
>       at org.apache.sqoop.tool.ImportTool.importTable(ImportTool.java:507)
>       at org.apache.sqoop.tool.ImportTool.run(ImportTool.java:615)
>       at org.apache.sqoop.Sqoop.run(Sqoop.java:143)
>       at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>       at org.apache.sqoop.Sqoop.runSqoop(Sqoop.java:179)
>       at org.apache.sqoop.Sqoop.runTool(Sqoop.java:218)
>       at org.apache.sqoop.Sqoop.runTool(Sqoop.java:227)
>       at org.apache.sqoop.Sqoop.main(Sqoop.java:236)
> {code}
> I believe this is because it is trying to use the value {{SH_$_TEST}} for the 
> {{"name"}} field in the Avro schema definition in the Avro data files. It 
> seems that this is not allowed as per the Avro specification on names: 
> https://avro.apache.org/docs/1.7.7/spec.html#Names
> Command line is:
> {code}
> sqoop import --connect jdbc:oracle:thin:@localhost:1521/ORA11204 --username 
> xxx --password xxx --table SH_$.TEST --target-dir=/user/oracle/sh.db/test 
> --delete-target-dir -m2 --direct --fetch-size=5000 --as-avrodatafile 
> --outdir=/tmp
> {code}
> We know we can work around this by using the {{--query}} option for Sqoop, 
> but this means we are not able to benefit from the OraOop optimisations with 
> direct mode.
> Can functionality (such as a schema conversion parameter) be added to Sqoop 
> to remove the {{$}} (and any other unsupported characters) from the {{name}} 
> field stored in the Avro schema in the Avro data file?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to