Hi I'm doing some evaluations with HBase. The workload I'm facing is mainly insert-only. Currently I'm inserting 1KB rows, where 100Bytes go into one column.
I have the following cluster machines at disposal: Intel Xeon L5520 2.26 Ghz (Nehalem, with HT enabled) 24 GiB Memory 1 GigE 2x 15k RPM Sas 73 GB (RAID1) I have 10 Nodes. The first node runs: Namenode, SecondaryNamenode, Datanode, HMaster, Zookeeper, and a RegionServer The other nodes run: Datanode and RegionServer Now running my test client and inserting rows, the throughput goes up to 150'000 inserts/sec. But then after some time the throughput drops down to 0 inserts/sec for quite some time, before it goes up again. My assumption is, that it happens when the RegionServers start to write the data from memory to the disks. I know, that the recommended hardware for HBase should contain multiple disks using JBOD or RAID 0. But at that point I am limited right now. I am just asking if in my hardware setup, the blocking periods are really caused by the non-optimal disk configuration. Thank you in advance for any suggestions. Martin
