Re: HBase scan performance decreases over time.

2012-11-05 Thread Leonid Fedotov
There is property dfs.balance.bandwidthPerSec in hdfs-site.xml dfs.balance.bandwidthPerSec 625 Specifies the maximum amount of bandwidth that each datanode can utilize for the balancing purpose in term of the number of bytes per second. Thank you

Re: HBase scan performance decreases over time.

2012-11-05 Thread Michael Segel
hdfs-site.xml Its an HDFS setting that may impact the balancing of HBase as well. (I'm sure someone can give a better response by looking at the code. ) On Nov 5, 2012, at 12:14 PM, Asaf Mesika wrote: > Where is this settings located? > > Sent from my iPhone > > On 5 בנוב 2012, at 15:05, M

Re: HBase scan performance decreases over time.

2012-11-05 Thread Asaf Mesika
Where is this settings located? Sent from my iPhone On 5 בנוב 2012, at 15:05, Michael Segel wrote: > There's an HDFS bandwidth setting which is set to 10MB/s. > > Way too low for even 1GBe. > > Have you modified this setting yet? > > -Mike > > On Nov 3, 2012, at 2:50 PM, David Koch wrote: > >>

Re: HBase scan performance decreases over time.

2012-11-05 Thread Michael Segel
There's an HDFS bandwidth setting which is set to 10MB/s. Way too low for even 1GBe. Have you modified this setting yet? -Mike On Nov 3, 2012, at 2:50 PM, David Koch wrote: > Hello Ted, > > We never initiate major compaction manually. I have not looked at I/O > balance between nodes in det

Re: HBase scan performance decreases over time.

2012-11-03 Thread Ted Yu
Have you looked at http://hbase.apache.org/book.html#performance ? Thanks On Sat, Nov 3, 2012 at 12:50 PM, David Koch wrote: > Hello Ted, > > We never initiate major compaction manually. I have not looked at I/O > balance between nodes in detail. We have noticed that after running for a > coupl

Re: HBase scan performance decreases over time.

2012-11-03 Thread David Koch
Hello Ted, We never initiate major compaction manually. I have not looked at I/O balance between nodes in detail. We have noticed that after running for a couple of weeks HBase seems to spend hours pushing blocks between nodes in order to optimize things. We add data daily in one ~30gb push to sev

Re: HBase scan performance decreases over time.

2012-11-03 Thread Ted Yu
Can you tell us how often you run major compaction after the import ? Have you noticed imbalanced read / write requests in the cluster ? Meaning subset of region servers receive bulk of the writes. We do some manual movement of regions when the above happens. Cheers On Sat, Nov 3, 2012 at 8:12 A

HBase scan performance decreases over time.

2012-11-03 Thread David Koch
Hello, Every now and then we need to flatten our cluster and re-import all data from log files (changes in data format, etc.) Afterwards we notice a significant increase in scan performance. As data is added and shuffled around between region servers, performance goes down again over time (say a c