I'm not clear on the 2TB of disk: is it 1x2TB, 2x1TB or 4x500GB?
I hope it's the last one, since you want as many spindles as
possible. I would also prefer 24GB of RAM to 16GB; that is what we run
on and it works like a charm, and it leaves more room for those
memory-hungry jobs.
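
To make the spindle point concrete, here is a rough sketch comparing the
three layouts. The per-spindle throughput figure is an assumption picked
for illustration, not something measured on any cluster, and the names
(LAYOUTS, describe) are made up for this example. All three layouts hold
the same 2TB; only the number of disks that can work in parallel changes.

```python
# Rough comparison of the three possible 2TB disk layouts.
# ASSUMPTION: each spindle sustains about 100 MB/s of sequential I/O.
# This number is illustrative only; substitute a measured value.
ASSUMED_MB_PER_SEC_PER_SPINDLE = 100

# layout name -> (number of spindles, capacity of each disk in GB)
LAYOUTS = {
    "1x2TB": (1, 2000),
    "2x1TB": (2, 1000),
    "4x500GB": (4, 500),
}


def describe(layouts, mb_per_sec=ASSUMED_MB_PER_SEC_PER_SPINDLE):
    """Return one summary dict per layout, in insertion order."""
    rows = []
    for name, (spindles, gb_each) in layouts.items():
        rows.append({
            "layout": name,
            "spindles": spindles,
            "capacity_gb": spindles * gb_each,
            # Idealized upper bound: assumes I/O spreads evenly over disks.
            "aggregate_mb_per_sec": spindles * mb_per_sec,
        })
    return rows


if __name__ == "__main__":
    for row in describe(LAYOUTS):
        print(row)
```

Under that assumption the 4x500GB layout has four times the aggregate
bandwidth of the single 2TB disk for the same capacity, which is why the
disk count matters more than the total size here.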

What kind of stability issues are you having?

J-D

On Sun, Nov 21, 2010 at 5:53 AM, Oleg Ruchovets <[email protected]> wrote:
> Hi all,
> After testing HBase for a few months with a very light configuration (5
> machines, 2 TB disk, 8 GB RAM), we are now planning for production.
> Our Load -
> 1) 50GB log files to process per day by Map/Reduce jobs.
> 2) Insert 4-5GB into 3 tables in HBase.
> 3) Run 10-20 scans per day (scanning about 20 regions in a table).
> All this should run in parallel.
> Our current configuration can't cope with this load and we are having many
> stability issues.
>
> This is what we have in mind :
> 1. Master machine - 32 GB, 4 TB, Two quad core CPUs.
> 2. Name node - 16 GB, 2TB, Two quad core CPUs.
> We plan to have up to 20 name servers (starting with 5).
>
> We already read
> http://www.cloudera.com/blog/2010/03/clouderas-support-team-shares-some-basic-hardware-recommendations/
> .
>
> We would appreciate your feedback on our proposed configuration.
>
>
> Regards Oleg & Lior
>
