[
https://issues.apache.org/jira/browse/SQOOP-2561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15187546#comment-15187546
]
Jarek Jarcec Cecho edited comment on SQOOP-2561 at 3/9/16 6:08 PM:
-------------------------------------------------------------------
Thanks for the reminder [~vishnusn].
I think that this solution wont't solve all the cases. For example if the table
have columns {{first~column}} and {{first_column}}, then we again create
duplicates. I've was looking into the
[ClassWriter|https://github.com/apache/sqoop/blob/trunk/src/java/org/apache/sqoop/orm/ClassWriter.java]
and it seems to me that it will hit the same problem, so I guess that we can
leave it be for now and create follow up JIRA to solve that problem. What do
you think [~vishnusn]?
was (Author: jarcec):
Thanks for the reminder [~vishnusn].
I think that this solution wont't solve all the cases. For example if the table
have columns {{first~column]] and {{first_column}}, then we again create
duplicates. I've was looking into the
[ClassWriter|https://github.com/apache/sqoop/blob/trunk/src/java/org/apache/sqoop/orm/ClassWriter.java]
and it seems to me that it will hit the same problem, so I guess that we can
leave it be for now and create follow up JIRA to solve that problem. What do
you think [~vishnusn]?
> Special Character removal from Column name as avro data results in duplicate
> column and fails the import
> --------------------------------------------------------------------------------------------------------
>
> Key: SQOOP-2561
> URL: https://issues.apache.org/jira/browse/SQOOP-2561
> Project: Sqoop
> Issue Type: Bug
> Affects Versions: 1.4.6
> Environment: cdh5.3.2
> Reporter: Suresh
> Assignee: VISHNU S NAIR
> Labels: AVRO, SQOOP
> Fix For: 1.4.7
>
> Attachments: 0001-SQOOP-2561.patch
>
>
> When a Special character like '$' or '#' are present in column name,
> sqoop/avro removes those special character. In some cases it leads to
> duplicate column.
> e.g. If we have COL$1 and COL1$ in the schema, it removes both of them and
> creates the duplicate column as COL1 and it results in failure of the SQOOP
> import job as a avro data. The same table can be loaded without
> --as-avarodata flag.
> The similar issue is raised in,
> https://issues.apache.org/jira/browse/SQOOP-1361 - which i suppose is fixed
> and the fix is creating this new issue.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)