Hi Anil - Good question and sorry for any confusion earlier. To be sure, because HBase permits atomic operations across a single column family only, Sqoop can not support multiple column families.
Regards, Kathleen On Wed, Feb 22, 2012 at 12:43 PM, anil gupta <anilg...@buffalo.edu> wrote: > Hi Kathleen, > > Yes, that is always an option. Thanks for suggestion. > > I am a beginner at HBase. However, I was thinking of cutting down the time > to dump the data from Database. If i do it twice(assuming i have 2 column > families) then it increases the time of load the entire HBase table. > AFAIK, Sqoop generates put statements to import data into HBase. If we can > generate put statements for more than one column family. Would it violate > the atomicity principle of HBase? I went through the atomicity section of > http://hbase.apache.org/acid-semantics.html and I cant find anything > which would stop sqoop loading more than one column family and Hbase bulk > load also allows more than one column family although the approach of > HBase bulk loading might be different from Sqoop. Could you provide me more > insight? Sorry, if my question is dumb. > > Thanks, > Anil Gupta > > > On Wed, Feb 22, 2012 at 11:51 AM, Kathleen Ting <kathl...@cloudera.com>wrote: > >> Hi Anil, >> >> Sqoop does not support multiple column families because HBase only >> permits atomic operations. >> >> One workaround is to run two imports, specifying a different column >> family each time. >> >> Regards, >> Kathleen >> >> On Wed, Feb 22, 2012 at 11:31 AM, anil gupta <anilgupt...@gmail.com>wrote: >> >>> Hi All, >>> >>> I went through the User guide of Sqoop but i could not find anything for >>> importing more than one columnfamily in HBase. Am i missing something? Is >>> it planned for future release? >>> >>> -- >>> Thanks & Regards, >>> Anil Gupta >>> >> >> > > > -- > Thanks & Regards, > Anil Gupta >