Mohammad, 
I'm tight on time... Short answer...
Strip out the xml in to some object and then consider using Avro to write the 
object to HBase. 
This could probably shrink your footprint per record/row.

Note: I don't know anything about your data so you really have to take what I 
say with a large grain of salt. 

Based on what you've said, the best answer I can give to your original question 
is that it sounds like you're going down the right path.

HTH

-Mike


> From: [email protected]
> Date: Fri, 16 Dec 2011 01:23:21 +0530
> Subject: Re: Creating columns within columns
> To: [email protected]
> 
> Hi Lars,
> 
>       Files are not really big..Might go upto 20kB..Initially we were
> thinking about the HDFS as storage, but due to the lack of random data
> access we are now planning to use Hbase..Please guide me if you think
> there is some way that can help us, as we are new to the hadoop world.
> 
> Regards,
>     Mohammad Tariq
> 
> 
> 
> On Fri, Dec 16, 2011 at 1:13 AM, Lars <[email protected]> wrote:
> > How big are these XML files?
> >
> > You might want to consider storing them in HDFS directly and only Meta 
> > information in HBase.
> >
> > -- Lars
> >
> >
> > Mohammad Tariq <[email protected]> schrieb:
> >
> >>Hello list,
> >>
> >>       I want to store xml files in Hbase and these files may have
> >>tags within tags..And for that I have to create several columns within
> >>a column family..How can I do that..Sorry if my question is
> >>childish..And if that is the case please provide me some link where I
> >>can get the proper info..Many thanks.
> >>
> >>Regards,
> >>    Mohammad Tariq
                                          

Reply via email to