Jarek Jarcec Cecho created SQOOP-1154:
-----------------------------------------
Summary: Sqoop2: Text partitioner might miss or include edge values
Key: SQOOP-1154
URL: https://issues.apache.org/jira/browse/SQOOP-1154
Project: Sqoop
Issue Type: Bug
Affects Versions: 1.99.2
Reporter: Jarek Jarcec Cecho
Fix For: 1.99.3
Attached is test case that will use Varchar partitioner to generate 3, 5, 10
and 13 partitions for interval "Breezy Badger" to "Warty Warthog". For all
cases generated partitions looks like the following:
{code}
'Bree' <= VCCOL AND VCCOL < SOME_VALUE
SOME_VALUE <= VCCOL AND VCCOL <= 'Wart'
{code}
As {{'Warty Warthog' > 'Wart'}}, the last value will be never imported. Also
similarly as {{'Bree' < 'Breezy Badger'}}, additional values might be imported
as well (for example 'Breedy Budget'). I think that the varchar partitioner
must have the interval boundaries there without any truncation, for example:
{code}
'Breezy Badget' <= VCCOL AND VCCOL < SOME_VALUE
SOME_VALUE <= VCCOL AND VCCOL <= 'Warty Warthog'
{code}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira