The only other thing I'd add is, by default HBase caps size of the data per
column at 10 MB (I think). You can change that by changing this setting:

hbase.client.keyvalue.maxsize
in hbase-site.xml

-1 means no cap. You can put other numbers for appropriate cap for your use
case.

Ameya


On Tue, Feb 25, 2014 at 12:12 AM, shashwat shriparv <
[email protected]> wrote:

> Yes for sure you can use hbase for this, you can have
> 1. different fields of mail in different column of a column family and
> attachment as a binary array also in a column.
> 2. you can keep whole message in columns in hbase and the attachments are
> large enoug on the hdfs and some reference to it in hbase table.
> 3. schema you can decide, you can have a matrix how you store values to
> that you can decide.
>
>
> *Warm Regards_**∞_*
> * Shashwat Shriparv*
>  [image: http://www.linkedin.com/pub/shashwat-shriparv/19/214/2a9]<
> http://www.linkedin.com/pub/shashwat-shriparv/19/214/2a9>[image:
> https://twitter.com/shriparv] <https://twitter.com/shriparv>[image:
> https://www.facebook.com/shriparv] <https://www.facebook.com/shriparv
> >[image:
> http://google.com/+ShashwatShriparv]
> <http://google.com/+ShashwatShriparv>[image:
> http://www.youtube.com/user/sShriparv/videos]<
> http://www.youtube.com/user/sShriparv/videos>[image:
> http://profile.yahoo.com/SWXSTW3DVSDTF2HHSRM47AV6DI/] <[email protected]>
>
>
>
> On Tue, Feb 25, 2014 at 12:55 PM, Upendra Yadav <[email protected]
> >wrote:
>
> > I have to use hbase and have mix type of data
> >
> > Some of them have size 1-4K(Mail- Header....) and others
> > >5MB(Attachments...)
> >
> > And also we need only random access: any data
> >
> > Is HBase is feasible for storing this type of data
> >
> > What will be my schema design -
> > will have to go with 2 different Table -> 1st one for  1-4K and 2nd for
> big
> > file
> > (because of memstore flush will flush other CF, and huge random access)
> >
> > Or there is other way:;
> >
> > Thanks
> >
>

Reply via email to