Jarek Jarcec Cecho created SQOOP-1154:
-----------------------------------------

             Summary: Sqoop2: Text partitioner might miss or include edge values
                 Key: SQOOP-1154
                 URL: https://issues.apache.org/jira/browse/SQOOP-1154
             Project: Sqoop
          Issue Type: Bug
    Affects Versions: 1.99.2
            Reporter: Jarek Jarcec Cecho
             Fix For: 1.99.3


Attached is test case that will use Varchar partitioner to generate 3, 5, 10 
and 13 partitions for interval "Breezy Badger" to "Warty Warthog". For all 
cases generated partitions looks like the following:

{code}
'Bree' <= VCCOL AND VCCOL < SOME_VALUE
SOME_VALUE <= VCCOL AND VCCOL <= 'Wart'
{code}

As {{'Warty Warthog' > 'Wart'}}, the last value will be never imported. Also 
similarly as {{'Bree' < 'Breezy Badger'}}, additional values might be imported 
as well (for example 'Breedy Budget'). I think that the varchar partitioner 
must have the interval boundaries there without any truncation, for example:

{code}
'Breezy Badget' <= VCCOL AND VCCOL < SOME_VALUE
SOME_VALUE <= VCCOL AND VCCOL <= 'Warty Warthog'
{code}


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to