hi,
I've just started working with Accumulo and I think I'm experiencing slow
reads/writes. I'm aware of the recommended configuration. Does anyone know
of any standard benchmarks and benchmarking tools I can use to tell if the
performance I'm getting is reasonable?
loads/6.3m_entries.avro
> > -rwxrwxrwx 1 busbey staff 186M Aug 28 00:31
> /Users/busbey/Downloads/6.3m_entries.avro
> > busbey$ time java -jar avro-tools-1.7.7.jar tojson
> ~/Downloads/6.3m_entries.avro | wc -l
> > 630
> >
> > real 0m4.239s
> > user 0
nloads/6.3m_entries.avro | wc -l
> 630
>
> real0m4.239s
> user0m6.026s
> sys 0m0.721s
>
> I'd recommend that you start at >= 5 nodes if you want to look at rough
> per-node throughput capabilities.
>
>
> On 2018/08/28 06:59:38, guy sharo
links I found. Can you tell us more about your setup
> and what you are seeing?
>
> https://accumulo.apache.org/papers/accumulo-benchmarking-2.1.pdf
> https://www.youtube.com/watch?v=Ae9THpmpFpM
>
> Mike
>
>
> On Sat, Aug 25, 2018 at 5:09 PM guy sharon
> wrote:
&
hi,
Continuing my performance benchmarks, I'm still trying to figure out if the
results I'm getting are reasonable and why throwing more hardware at the
problem doesn't help. What I'm doing is a full table scan on a table with
6M entries. This is Accumulo 1.7.4 with Zookeeper 3.4.12 and Hadoop
; faster client side access to the data? Certainly improvements could be
> > made -- of that I have no doubt -- but the time to bring 6M entries to
> > the client is a cost you will incur if you use the ReadData example.
> >
> > [1] If you have four tablets it's reasonable to s
might not help since you only have 2 cores so added parallism
> can't
> be exploited.
>
> Why do you think 500K/sec is slow?
>
> To determine slowness one would have to compare with other database
> technology on the same platform.
>
>
> On Wed, Aug 29, 2018 at 03:04:51
er wrote:
>
>> Does Muchos actually change the Accumulo configuration when you are
>> changing the underlying hardware?
>>
>> On 8/29/18 8:04 AM, guy sharon wrote:
>> > hi,
>> >
>> > Continuing my performance benchmarks, I'm still trying to figure
make). I’m curious what the end goal is here. Is
> this a real world use case? If you are using this type of benchmark to
> evaluate the speed of Accumulo, then you will likely not get the same
> performance when you apply your data and your real use cases.
>
>
>
> *From:* gu
-np" |
> wc -l" as you mentioned earlier in the thread? As Mike Miller suggested,
> serializing that back to the display and then counting 6M entries is going
> to take some time. Try using a Batch Scanner directly.
>
> Mike
>
> On Wed, Aug 29, 2018 at 2:56 PM guy sh
10 matches
Mail list logo