very slow scan performance on just one region

Daniel Iancu Wed, 21 Dec 2011 08:56:56 -0800

Hi there

I'm investigating a problem we have with an MR job, and I discovered that the tasks that fail (scan lease expired while fetching next row) were processing one particular region. I've written a small app that scans that region and counts its rows, and ran it on the same machine where the region is hosted. The result is very, very poor: scan speed is on average 7 rows/sec, and sometimes, when scan caching is increased, it gets a lease expired exception. By contrast, scanning the other regions of the same table on the same machine with the same caching value gets ~3800 rows/sec. Any idea what can cause such disastrous scan performance on one particular region?
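For reference, a rough back-of-the-envelope check of how the observed scan rates interact with the 4-minute lease timeout. This is only a sketch: it assumes, as a simplification, that one batch of `caching` rows has to be returned before the scanner lease expires, and the class and method names are made up for illustration (they are not part of HBase).

```java
public class LeaseBudget {
    /**
     * Largest number of rows that can be fetched in one batch within the
     * lease period at the given scan rate. Assumption: the whole batch
     * must be served before the lease runs out.
     */
    public static long maxRowsPerBatch(double rowsPerSec, long leaseMillis) {
        return (long) Math.floor(rowsPerSec * leaseMillis / 1000.0);
    }

    public static void main(String[] args) {
        long leaseMillis = 4 * 60 * 1000L; // 4-minute lease timeout

        // Slow region: 7 rows/sec
        System.out.println(maxRowsPerBatch(7.0, leaseMillis));    // prints 1680
        // Healthy regions: ~3800 rows/sec
        System.out.println(maxRowsPerBatch(3800.0, leaseMillis)); // prints 912000
    }
}
```

Under that assumption, a caching value above roughly 1680 rows would be enough to run past the lease on the slow region, while the same value is harmless on the other regions.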


Some extra info

HBase is 0.90.4
lease timeout is 4 minutes

table has 1 family, cell values are empty, row keys and qualifiers are small strings, biggest row has 146 columns
row sizes are almost identical since the table was created by a load tool and each row has almost the same number of columns with the same kind of values...

all regions have 1 store file of ~655MB
cluster has no activity except the test app
GC activity looks normal

regions might have many deleted KVs (we were testing data cleanup with MR jobs)

major compaction is deactivated and we haven't run it for some time

Can this problem be caused by the last 2 points above: many deleted KVs concentrated in that region, so that they need to be skipped by the StoreScanners?
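To make the question concrete, here is a toy model of the suspected effect. It is not HBase's StoreScanner, just a sketch under the assumption that a scan has to read every stored cell, deleted or not, and only returns rows that still have live cells. `Cell`, `ScanStats`, `buildRegion` and `scan` are hypothetical names, and the row counts are invented (only the 146 columns per row comes from the table description).

```java
import java.util.ArrayList;
import java.util.List;

public class TombstoneScanSketch {
    /** Hypothetical stand-in for a stored KeyValue: a row key plus a "deleted" flag. */
    public static final class Cell {
        final String row;
        final boolean deleted;
        Cell(String row, boolean deleted) { this.row = row; this.deleted = deleted; }
    }

    /** Rows handed back to the client vs. cells the scan had to read. */
    public static final class ScanStats {
        public final long liveRows;
        public final long cellsExamined;
        ScanStats(long liveRows, long cellsExamined) {
            this.liveRows = liveRows;
            this.cellsExamined = cellsExamined;
        }
    }

    /** Builds cells in row order; the first deletedRows rows are fully deleted. */
    public static List<Cell> buildRegion(int rows, int colsPerRow, int deletedRows) {
        List<Cell> cells = new ArrayList<Cell>();
        for (int r = 0; r < rows; r++) {
            boolean deleted = r < deletedRows;
            for (int c = 0; c < colsPerRow; c++) {
                cells.add(new Cell("row" + r, deleted));
            }
        }
        return cells;
    }

    /** Reads every cell, skips deleted ones, and counts distinct live rows. */
    public static ScanStats scan(List<Cell> sortedCells) {
        long live = 0;
        long examined = 0;
        String lastLiveRow = null;
        for (Cell cell : sortedCells) {
            examined++;                 // deleted cells still cost a read
            if (cell.deleted) {
                continue;               // skipped, nothing returned for it
            }
            if (!cell.row.equals(lastLiveRow)) {
                live++;
                lastLiveRow = cell.row;
            }
        }
        return new ScanStats(live, examined);
    }

    public static void main(String[] args) {
        // Invented numbers: 1000 rows of 146 columns, 990 rows deleted.
        ScanStats s = scan(buildRegion(1000, 146, 990));
        System.out.println(s.liveRows + " rows returned, "
                + s.cellsExamined + " cells examined");
    }
}
```

In this model the work per returned row grows with the share of deleted cells, so a region where most KVs are deleted but not yet compacted away would show a much lower rows/sec figure than its neighbours even with an identical store file size.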

Any other thoughts?

Thanks
Daniel
