[ 
https://issues.apache.org/jira/browse/SQOOP-788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13535560#comment-13535560
 ] 

Hudson commented on SQOOP-788:
------------------------------

Integrated in Sqoop2-hadoop100 #14 (See 
[https://builds.apache.org/job/Sqoop2-hadoop100/14/])
    SQOOP-788 Import sometimes duplicate some data (Revision 
d9465bba216372f053ba9c652b8758f5941b3ead)

     Result = UNSTABLE
blee : 
https://git-wip-us.apache.org/repos/asf?p=sqoop.git&a=commit&h=d9465bba216372f053ba9c652b8758f5941b3ead
Files : 
* 
execution/mapreduce/src/main/java/org/apache/sqoop/job/etl/HdfsTextExportExtractor.java
* 
connector/connector-generic-jdbc/src/main/java/org/apache/sqoop/connector/jdbc/GenericJdbcImportExtractor.java
* 
execution/mapreduce/src/main/java/org/apache/sqoop/job/etl/HdfsSequenceExportExtractor.java

                
> Sqoop2: Import sometimes duplicate some data
> --------------------------------------------
>
>                 Key: SQOOP-788
>                 URL: https://issues.apache.org/jira/browse/SQOOP-788
>             Project: Sqoop
>          Issue Type: Bug
>            Reporter: Jarek Jarcec Cecho
>            Assignee: Jarek Jarcec Cecho
>            Priority: Blocker
>             Fix For: 1.99.1
>
>         Attachments: bugSQOOP-788.patch
>
>
> I've noticed that on my unique testing set of 408957 rows, import always 
> imports 408957 rows. However when I check number of unique lines I usually 
> got smaller number - for example 408056 (less by 901). 
> Because total number of rows fits, I'm expecting that we sometimes read one 
> value twice somehow. I'm not quite sure why.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to