Hi Anil -

Good question, and sorry for any confusion earlier. To clarify: because
HBase permits atomic operations across a single column family only, Sqoop
cannot support multiple column families.
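
For reference, the two-import workaround mentioned earlier in this thread
could be scripted roughly as follows. This is only a sketch under stated
assumptions: the JDBC URL, the table names, the column lists and the column
family names are made-up placeholders, and the script only builds and prints
the Sqoop command lines rather than running them. The options used
(--connect, --table, --columns, --hbase-table, --hbase-row-key,
--column-family) are standard Sqoop import arguments.

```python
import shlex

# Hypothetical values for illustration only; none of these come from the thread.
JDBC_URL = "jdbc:mysql://db.example.com/sales"
SOURCE_TABLE = "customers"
HBASE_TABLE = "customers"
ROW_KEY = "id"

# Map each target column family to the source columns it should hold.
# The row-key column is listed for every family so both passes address
# the same HBase rows.
FAMILY_COLUMNS = {
    "info": ["id", "name", "email"],
    "stats": ["id", "visits", "last_login"],
}


def build_import_command(family, columns):
    """Return the argument list for one Sqoop import into a single column family."""
    return [
        "sqoop", "import",
        "--connect", JDBC_URL,
        "--table", SOURCE_TABLE,
        "--columns", ",".join(columns),
        "--hbase-table", HBASE_TABLE,
        "--hbase-row-key", ROW_KEY,
        "--column-family", family,
    ]


# One command per column family, in the order the families are declared.
commands = [build_import_command(f, cols) for f, cols in FAMILY_COLUMNS.items()]

if __name__ == "__main__":
    for argv in commands:
        # Print a shell-safe command line instead of executing it.
        print(" ".join(shlex.quote(arg) for arg in argv))
```

Each printed line is one import pass. Running them one after the other fills
both families of the same table, at the cost of reading the source table
twice, which is the overhead Anil raises below.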

Regards, Kathleen

On Wed, Feb 22, 2012 at 12:43 PM, anil gupta <anilg...@buffalo.edu> wrote:

> Hi Kathleen,
>
> Yes, that is always an option. Thanks for the suggestion.
>
> I am a beginner at HBase. However, I was hoping to cut down the time it
> takes to dump the data from the database. If I run the import twice
> (assuming I have 2 column families), it increases the time to load the
> entire HBase table. AFAIK, Sqoop generates Put statements to import data
> into HBase. If we generated Put statements for more than one column family,
> would that violate the atomicity principle of HBase? I went through the
> atomicity section of http://hbase.apache.org/acid-semantics.html and I
> can't find anything that would stop Sqoop from loading more than one column
> family. HBase bulk load also allows more than one column family, although
> its approach might be different from Sqoop's. Could you provide more
> insight? Sorry if my question is dumb.
>
> Thanks,
> Anil Gupta
>
>
> On Wed, Feb 22, 2012 at 11:51 AM, Kathleen Ting <kathl...@cloudera.com> wrote:
>
>> Hi Anil,
>>
>> Sqoop does not support multiple column families because HBase only
>> permits atomic operations.
>>
>> One workaround is to run two imports, specifying a different column
>> family each time.
>>
>> Regards,
>> Kathleen
>>
>> On Wed, Feb 22, 2012 at 11:31 AM, anil gupta <anilgupt...@gmail.com> wrote:
>>
>>> Hi All,
>>>
>>> I went through the Sqoop User Guide but I could not find anything about
>>> importing into more than one column family in HBase. Am I missing
>>> something? Is this planned for a future release?
>>>
>>> --
>>> Thanks & Regards,
>>> Anil Gupta
>>>
>>
>>
>
>
> --
> Thanks & Regards,
> Anil Gupta
>
