Hello Selvam,

You can use a single map task (-m 1), so no splits are generated.  If you are 
ingesting from Oracle, you can use --direct, which does not use column keys to 
generate splits.
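
For context, the numeric split arithmetic raised in the original question 
further down (roughly (max - min) / number of tasks) can be sketched as 
below.  This is only an illustration in Python, not Sqoop's actual splitter 
code; the function name split_ranges and the column name "id" are made up for 
the example.  An alphanumeric key such as a UUID has no such simple range 
arithmetic, which is why a single map task is the safe fallback.

```python
def split_ranges(lo, hi, num_mappers):
    """Divide the inclusive integer key range [lo, hi] into contiguous,
    non-overlapping chunks, one per mapper (hypothetical helper)."""
    if num_mappers < 1:
        raise ValueError("num_mappers must be at least 1")
    if hi < lo:
        raise ValueError("hi must be >= lo")

    total = hi - lo + 1
    # Never create more chunks than there are key values.
    chunks = min(num_mappers, total)
    base, extra = divmod(total, chunks)

    ranges = []
    start = lo
    for i in range(chunks):
        # The first `extra` chunks take one additional key each.
        size = base + (1 if i < extra else 0)
        end = start + size - 1
        ranges.append((start, end))
        start = end + 1
    return ranges


if __name__ == "__main__":
    # Each tuple corresponds to the key range one mapper would query.
    for start, end in split_ranges(1, 1000, 4):
        print(f"WHERE id >= {start} AND id <= {end}")
```

With one mapper the whole range comes back as a single chunk, so no split 
boundaries have to be computed at all.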

Thanks, Markus

> On Sep 25, 2016, at 10:14, Selvam Raman <[email protected]> wrote:
> 
> I have 1 TB of data in a database. The primary keys are alphanumeric.
> How can I use Sqoop in this case?
> 
> Is it possible to use Sqoop for the import?
> 
> Thanks,
> Selvam R
> +91-97877-87724
> 
>> On Sep 23, 2016 3:17 PM, "Markus Kemper" <[email protected]> wrote:
>> As Ravi noted, splitting on non-numeric keys is not reliable and can result 
>> in both duplicate and missing rows.  When using a non-numeric key for 
>> --split-by you should see a warning in the debug console output.
>> 
>> 
>> Markus Kemper
>> Customer Operations Engineer
>> 
>> 
>> 
>>> On Fri, Sep 23, 2016 at 10:11 AM, Ravi, Chandramouli 
>>> <[email protected]> wrote:
>>> It won't work well when the primary key is alphanumeric. I think the data 
>>> will be skewed or won't come back as expected, creating unbalanced split 
>>> files.
>>> 
>>> Specify a different numeric index as the split key if a numeric primary 
>>> key is not present.
>>> 
>>>  
>>> 
>>> From: Selvam Raman [mailto:[email protected]] 
>>> Sent: Friday, September 23, 2016 10:09 AM
>>> To: [email protected]
>>> Subject: sqoop import for UUID(primary key)
>>> 
>>>  
>>> 
>>> Hi,
>>> 
>>>  
>>> 
>>> In Sqoop, if I have a numeric primary key and a number of parallel tasks, 
>>> the import works by dividing the key range ((max - min) / number of 
>>> tasks) to pull the data from the table into HDFS.
>>> 
>>>  
>>> 
>>> Suppose I have a UUID (alphanumeric value) as the primary key. How will 
>>> the load be distributed?
>>> 
>>>  
>>> 
>>> Thank you for your help.
>>> 
>>>  
>>> 
>>> --
>>> 
>>> Selvam Raman
>>> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து"
>>> 
>>> 
>>> 
>> 
