Hi,

I have a single Linux (Ubuntu) server in my development environment, and I
plan to use a single server for the production environment as well.

I am using HBase 0.90.6 as the backend datastore for Nutch 2.2.1. I have
tried HBase in standalone mode, and it works fine, but only for smaller
crawls. I now plan to crawl on a somewhat larger scale, aiming for 300K
URLs, so I would like to migrate from standalone to a distributed mode.
However, I don't intend to use multiple machines. All I have is a single
server, so which HBase mode is best suited to a production environment in
my case: pseudo-distributed or fully distributed?

Thanks for your help!
