benchmarking

2018-08-25 Thread guy sharon
hi, I've just started working with Accumulo and I think I'm experiencing slow reads/writes. I'm aware of the recommended configuration. Does anyone know of any standard benchmarks and benchmarking tools I can use to tell if the performance I'm getting is reasonable?

Re: benchmarking

2018-08-28 Thread guy sharon
loads/6.3m_entries.avro > > -rwxrwxrwx 1 busbey staff 186M Aug 28 00:31 > /Users/busbey/Downloads/6.3m_entries.avro > > busbey$ time java -jar avro-tools-1.7.7.jar tojson > ~/Downloads/6.3m_entries.avro | wc -l > > 630 > > > > real 0m4.239s > > user 0

Re: benchmarking

2018-08-28 Thread guy sharon
nloads/6.3m_entries.avro | wc -l > 630 > > real0m4.239s > user0m6.026s > sys 0m0.721s > > I'd recommend that you start at >= 5 nodes if you want to look at rough > per-node throughput capabilities. > > > On 2018/08/28 06:59:38, guy sharo

Re: benchmarking

2018-08-28 Thread guy sharon
links I found. Can you tell us more about your setup > and what you are seeing? > > https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf > https://www.youtube.com/watch?v=Ae9THpmpFpM > > Mike > > > On Sat, Aug 25, 2018 at 5:09 PM guy sharon > wrote: &

Accumulo performance on various hardware configurations

2018-08-29 Thread guy sharon
hi, Continuing my performance benchmarks, I'm still trying to figure out if the results I'm getting are reasonable and why throwing more hardware at the problem doesn't help. What I'm doing is a full table scan on a table with 6M entries. This is Accumulo 1.7.4 with Zookeeper 3.4.12 and Hadoop

Re: Accumulo performance on various hardware configurations

2018-08-29 Thread guy sharon
; faster client side access to the data? Certainly improvements could be > > made -- of that I have no doubt -- but the time to bring 6M entries to > > the client is a cost you will incur if you use the ReadData example. > > > > [1] If you have four tablets it's reasonable to s

Re: Accumulo performance on various hardware configurations

2018-08-29 Thread guy sharon
might not help since you only have 2 cores so added parallism > can't > be exploited. > > Why do you think 500K/sec is slow? > > To determine slowness one would have to compare with other database > technology on the same platform. > > > On Wed, Aug 29, 2018 at 03:04:51

Re: Accumulo performance on various hardware configurations

2018-08-29 Thread guy sharon
er wrote: > >> Does Muchos actually change the Accumulo configuration when you are >> changing the underlying hardware? >> >> On 8/29/18 8:04 AM, guy sharon wrote: >> > hi, >> > >> > Continuing my performance benchmarks, I'm still trying to figure

Re: Accumulo performance on various hardware configurations

2018-08-29 Thread guy sharon
make). I’m curious what the end goal is here. Is > this a real world use case? If you are using this type of benchmark to > evaluate the speed of Accumulo, then you will likely not get the same > performance when you apply your data and your real use cases. > > > > *From:* gu

Re: Accumulo performance on various hardware configurations

2018-08-29 Thread guy sharon
-np" | > wc -l" as you mentioned earlier in the thread? As Mike Miller suggested, > serializing that back to the display and then counting 6M entries is going > to take some time. Try using a Batch Scanner directly. > > Mike > > On Wed, Aug 29, 2018 at 2:56 PM guy sh