Re: Advice for smaller clusters in write-heavy environments

Bryan Duxbury Wed, 07 May 2008 21:42:18 -0700

This is super useful information! Thanks for taking the time to writeit up. Maybe this should be transplanted onto the wiki?


-Bryan


On May 7, 2008, at 9:29 PM, Daniel Leffel wrote:

I thought I might share back with the users my experience ingetting HBaserunning on a small, 4 node cluster. I ran into a lot of trouble ingettingstarted, some because of bugs and some specific to my use case. Mylearnings
I think will hopefully be valuable to new users.
First of all, let me compliment the amazing group of folksdeveloping HBase.Also, I'd like to say that we owe a lot to the amazing strategyPowerset has
taken as a company to propel the development of their product, both
leveraging and contributing to open source - what you guys aredoing is
nothing short than amazing!
My basic use case is to persist a large (and growing) sparcedataset andenable constant incremental re-computation. In order to testperformance forthis use case, it was important to load a test initial dataset -roughly 220million rows and 6 columns (for now, I'll say columns generically -I'll get
it to strategy of column families).

Some of my learnings
- "Commodity Hardware" is relative. When I first heard the term,I (andmany others I know) considered this to be on the order ofdesktop-grademachines - the machines I'd purchased were Dual Core 2+ Ghz DellDesktops(purchased on eBay for $350 a piece). Well, you can definitelydo certaintasks within the framework with these types of machines, but anidealconfiguration consists of something much stronger - server-grade, quad core,8Gigs of RAM, etc. HBase (particularly if you are going to do alot ofwrites), needs really good Machine IO . If you are going to tryto usemachines with slow drives and controllers, it might be possibleif you have
   a ton of datanodes, but not as advisable on smaller clusters.
- Ideally, you should always insure that there is one processoravailablefor the region server daemon and at least 2 processors fortasktracker (or 1if you limit the map and reduce tasks to one each), if you aregoing to runheavy map/reduce jobs. The trouble with not doing so is thatuntil 0.2, whenthere will be better load balancing on regionservers, it'salways possiblethat a single region server can be called on to shoulder thefull load ofall tasktrackers. If you have a large write operationshappening, you couldotherwise cause splits and/or compaction to take too long(expensiveoperations) and cause your job to crawl to near halt if yourlucky, or diecompletely. This means, if you're only using dual core machines,I'd suggestthat at least during heavy data-writing periods, you considerrunning eitherregionservers or tasktracker, but probably not both on the samemachine.- All machines should run the datanode - this helps theregionservers todistribute the IO load better. That way, when an expensiveoperation likecompaction starts, it's spread over more machines. Also, Hadoopcan localize
   frequently used files, to some degree.
- Running bin/stop-hbase.sh can sometimes take a long time.Sometimes,regionservers are waiting for a lease to expire. There are a fewtimes when
   there are dead processes (especially if you didn't take the earlier
suggestions) so check the logs (.out), but often you just needto wait
   longer and it's worth it.
- If you are writing from a MR job, it's most beneficial to findtheright balance of number of tasks. Too many tasks means too manysplits,startup and commits. Too few, and your region servers don't getthe benefitof a break (the time it takes to commit and initialize a newtask) - not too
   mention less to repeat after a failure.
- Use the new release candidate 0.1.2 #1. It has a number offixes in itthat help for issues related to small clusters - I don't regardprior
   releases as usable for those of us being cheap about hardware.
- Don't be afraid to adjust which daemons you run on whichmachines. Forexample, for my first large (initial) load, I shutdown all but acouple of
   tasktrackers and started up more region servers, whereas in normal
   operation, that ratio will probably be flipped.
- Watch the number of regions you have on any particularregionserver.I'm in the process at the moment of testing how far you can pushthis, butthe big concern is OOME - and unless you're running the latestrelease
   candidate, you're going to have big problems after an OOME.
Hope this is helpful, St^ack, please feel free to point out whereI'm wrong.
:-)

Danny
PS. Thanks again to St^ack. He went over and above the call of dutyto help
me and it's bought a ton of confidence I now have in this project.

Re: Advice for smaller clusters in write-heavy environments

Reply via email to