[
https://issues.apache.org/jira/browse/SQOOP-788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13535560#comment-13535560
]
Hudson commented on SQOOP-788:
------------------------------
Integrated in Sqoop2-hadoop100 #14 (See
[https://builds.apache.org/job/Sqoop2-hadoop100/14/])
SQOOP-788 Import sometimes duplicate some data (Revision
d9465bba216372f053ba9c652b8758f5941b3ead)
Result = UNSTABLE
blee :
https://git-wip-us.apache.org/repos/asf?p=sqoop.git&a=commit&h=d9465bba216372f053ba9c652b8758f5941b3ead
Files :
*
execution/mapreduce/src/main/java/org/apache/sqoop/job/etl/HdfsTextExportExtractor.java
*
connector/connector-generic-jdbc/src/main/java/org/apache/sqoop/connector/jdbc/GenericJdbcImportExtractor.java
*
execution/mapreduce/src/main/java/org/apache/sqoop/job/etl/HdfsSequenceExportExtractor.java
> Sqoop2: Import sometimes duplicate some data
> --------------------------------------------
>
> Key: SQOOP-788
> URL: https://issues.apache.org/jira/browse/SQOOP-788
> Project: Sqoop
> Issue Type: Bug
> Reporter: Jarek Jarcec Cecho
> Assignee: Jarek Jarcec Cecho
> Priority: Blocker
> Fix For: 1.99.1
>
> Attachments: bugSQOOP-788.patch
>
>
> I've noticed that on my unique testing set of 408957 rows, import always
> imports 408957 rows. However when I check number of unique lines I usually
> got smaller number - for example 408056 (less by 901).
> Because total number of rows fits, I'm expecting that we sometimes read one
> value twice somehow. I'm not quite sure why.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira