Clearly there will be an impact on performance, but frankly it depends on what you are trying to achieve with the dataset.
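As a rough illustration of why the use case matters, here is a minimal sketch, assuming a table named "my_table" (hypothetical) and an HBase client configuration available on the classpath. A single pass over the table streams through it partition by partition, so the whole table never has to fit in RAM. If the RDD is reused across several actions, persisting it with StorageLevel.MEMORY_AND_DISK lets Spark spill partitions that do not fit in memory to local disk rather than recompute them from HBase. The object name and the choice to keep only row keys are illustrative assumptions, not something taken from this thread.

```scala
import org.apache.hadoop.hbase.HBaseConfiguration
import org.apache.hadoop.hbase.client.Result
import org.apache.hadoop.hbase.io.ImmutableBytesWritable
import org.apache.hadoop.hbase.mapreduce.TableInputFormat
import org.apache.hadoop.hbase.util.Bytes
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.storage.StorageLevel

// Hypothetical object name; a sketch, not a tuned job.
object HBaseScanSketch {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("hbase-scan-sketch"))

    val hbaseConf = HBaseConfiguration.create()
    // "my_table" is an assumed table name for illustration.
    hbaseConf.set(TableInputFormat.INPUT_TABLE, "my_table")

    // Each HBase region becomes one RDD partition.
    val rows = sc.newAPIHadoopRDD(
      hbaseConf,
      classOf[TableInputFormat],
      classOf[ImmutableBytesWritable],
      classOf[Result])

    // Case 1: a single pass. Nothing is cached, so rows are read,
    // processed, and discarded one partition at a time.
    println(s"row count: ${rows.count()}")

    // Case 2: the data is reused. Copy out plain serializable values first
    // (the Hadoop key/value objects are reused by the record reader),
    // then persist with spill-to-disk enabled.
    val rowKeys = rows.map { case (key, _) => Bytes.toString(key.copyBytes()) }
    rowKeys.persist(StorageLevel.MEMORY_AND_DISK)

    println(s"distinct keys: ${rowKeys.distinct().count()}")
    println(s"longest key length: ${rowKeys.map(_.length).max()}")

    sc.stop()
  }
}
```

With the default cache() (MEMORY_ONLY), partitions that do not fit are simply not stored and get re-read from HBase on each use, which is slower but does not by itself make the job fail.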

Mayur Rustagi
Ph: +1 (760) 203 3257
http://www.sigmoidanalytics.com
@mayur_rustagi <https://twitter.com/mayur_rustagi>

On Sat, May 31, 2014 at 11:45 AM, Vibhor Banga <vibhorba...@gmail.com> wrote:

> Some inputs will be really helpful.
>
> Thanks,
> -Vibhor
>
> On Fri, May 30, 2014 at 7:51 PM, Vibhor Banga <vibhorba...@gmail.com> wrote:
>
>> Hi all,
>>
>> I am planning to use spark with HBase, where I generate RDD by reading
>> data from HBase Table.
>>
>> I want to know that in the case when the size of HBase Table grows larger
>> than the size of RAM available in the cluster, will the application fail,
>> or will there be an impact in performance ?
>>
>> Any thoughts in this direction will be helpful and are welcome.
>>
>> Thanks,
>> -Vibhor
>
> --
> Vibhor Banga
> Software Development Engineer
> Flipkart Internet Pvt. Ltd., Bangalore