Hi Ted, I have. So the book says there are two types of distributed modes. One is pseudo distributed, which is used when we want to test HBase's distributed capabilities using a single machine. As far as I understood, this is just to verify the use cases and the requirements. Then we have the fully distributed mode in which HBase can be installed over multiple machines.
I understand both the scenarios. But what if my application is not large enough to leverage the distributed mode and the pseudo distributed mode is pretty much for a PoC. Since the pseudo distributed mode won't be able to provide any fault tolerance, can one use the standalone mode in production. I hope my question is clear even if it does not make much sense. Thanks, Arun On Jul 7, 2014 5:17 PM, "Ted Yu" <[email protected]> wrote: > Have you read http://hbase.apache.org/book.html#standalone_dist ? > > Cheers > > > On Mon, Jul 7, 2014 at 3:55 PM, Arun Allamsetty <[email protected] > > > wrote: > > > Hi all, > > > > So this question might be stupid, retarded even, but it has been bugging > me > > for a while and I cannot think of a better place to ask this. I am really > > impressed with the way HBase works (as a key-value store). Since it > stores > > everything as a byte array, I find it really convenient to store > serialized > > objects. Also, I understand that HBase is supposed to be used when you > have > > too much data to be handled by a single machine, so we can scale our > > application by running it in distributed mode. > > > > But what if I want to use it because its HashMap kind of capabilities > with > > an added feature to track versions. Is it recommended that I use it for a > > small application (in standalone mode) with maybe 100K users and storage > > needs which probably won't exceed 100G. > > > > I know it is never recommended to be used as a transactional database (I > > have read that in a million places) but I would like to know more about > it. > > > > Thanks, > > Arun > > >
