On Sat, Mar 11, 2017 at 2:08 PM, Bhuvan Rawal <bhu1ra...@gmail.com> wrote:

> "Lastly, why don't you test Scylla yourself?  It's pretty easy to set up,
> there's nothing to tune."
>  - The details are indeed compelling to have a go ahead and test it for
> specific use case.
>
> If it works out good it can lead to good cost cut in infra costs as well
> as having to manage less servers plus probably less time to bootstrap &
> decommission nodes!
>
> It will also be interesting to have a benchmark with Cassandra 3 version
> as well, as the new storage engine is said to have better performance:
> https://www.datastax.com/2015/12/storage-engine-30
>
> Regards,
> Bhuvan
>
> On Sat, Mar 11, 2017 at 2:59 PM, Avi Kivity <a...@scylladb.com> wrote:
>
>> There is no magic 10X bullet.  It's a mix of multiple factors, which can
>> come up to less than 10X in some circumstances and more than 10X in others,
>> as has been reported on this thread by others.
>>
>> TPC doesn't give _any_ advantage when you have just one core, and can
>> give more than 10X on a machine with a large number of cores.  These are
>> becoming more and more common, think of the recent AMD Naples announcement;
>> with 32 cores per socket you can have 128 logical cores in a two-socket
>> server; or the AWS i3.16xlarge instance with 32 cores / 64 vcpus.
>>
>> You're welcome to browse our site to learn more about the architecture,
>> or watch this technical talk [1] I gave in QConSF that highlights some of
>> the techniques we use.
>>
>> Of course it's possible to mistune Cassandra to give bad results, that is
>> why we spent a lot more time tuning Cassandra and documenting everything
>> than we spent on Scylla.  You can read the report in [2], it is very
>> detailed, and provides a wealth of metrics like you'd expect.
>>
>> I'm not going to comment about the Aerospike numbers, I haven't studied
>> them in detail.  And no, you can't multiply results like that unless they
>> were done with very similar configurations and test harnesses.
>>
>> Lastly, why don't you test Scylla yourself?  It's pretty easy to set up,
>> there's nothing to tune.
>>
>> Avi
>>
>> [1] https://www.infoq.com/presentations/scylladb
>> [2] http://www.scylladb.com/technology/cassandra-vs-scylla-bench
>> mark-cluster-1/
>>
>>
>> On 03/10/2017 06:58 PM, Bhuvan Rawal wrote:
>>
>> Agreed C++ gives an added advantage to talk to underlying hardware with
>> better efficiency, it sound good but can a pice of code written in C++ give
>> 1000% throughput than a Java app? Is TPC design 10X more performant than
>> SEDA arch?
>>
>> And if C/C++ is indeed that fast how can Aerospike (which is itself
>> written in C) claim to be 10X faster than Scylla here
>> http://www.aerospike.com/benchmarks/scylladb-initial/ ? (Combining
>> your's and aerospike's benchmarks it appears that Aerospike is 100X
>> performant than C* - I highly doubt that!! )
>>
>> For a moment lets forget about evaluating 2 different databases, one can
>> observe 10X performance difference between a mistuned cassandra cluster and
>> one thats tuned as per data model - there are so many Tunables in yaml as
>> well as table configs.
>>
>> Idea is - in order to strengthen your claim, you need to provide complete
>> system metrics (Disk, CPU, Network), the OPS increase starts to decay along
>> with the configs used. Having plain ops per second and 99p latency is
>> blackbox.
>>
>> Regards,
>> Bhuvan
>>
>> On Fri, Mar 10, 2017 at 12:47 PM, Avi Kivity <a...@scylladb.com> wrote:
>>
>>> ScyllaDB engineer here.
>>>
>>> C++ is really an enabling technology here. It is directly responsible
>>> for a small fraction of the gain by executing faster than Java.  But it is
>>> indirectly responsible for the gain by allowing us direct control over
>>> memory and threading.  Just as an example, Scylla starts by taking over
>>> almost all of the machine's memory, and dynamically assigning it to
>>> memtables, cache, and working memory needed to handle requests in flight.
>>> Memory is statically partitioned across cores, allowing us to exploit NUMA
>>> fully.  You can't do these things in Java.
>>>
>>> I would say the major contributors to Scylla performance are:
>>>  - thread-per-core design
>>>  - replacement of the page cache with a row cache
>>>  - careful attention to many small details, each contributing a little,
>>> but with a large overall impact
>>>
>>> While I'm here I can say that performance is not the only goal here, it
>>> is stable and predictable performance over varying loads and during
>>> maintenance operations like repair, without any special tuning.  We measure
>>> the amount of CPU and I/O spent on foreground (user) and background
>>> (maintenance) tasks and divide them fairly.  This work is not complete but
>>> already makes operating Scylla a lot simpler.
>>>
>>>
>>> On 03/10/2017 01:42 AM, Kant Kodali wrote:
>>>
>>> I dont think ScyllaDB performance is because of C++. The design
>>> decisions in scylladb are indeed different from Cassandra such as getting
>>> rid of SEDA and moving to TPC and so on.
>>>
>>> If someone thinks it is because of C++ then just show the benchmarks
>>> that proves it is indeed the C++ which gave 10X performance boost as
>>> ScyllaDB claims instead of stating it.
>>>
>>>
>>> On Thu, Mar 9, 2017 at 3:22 PM, Richard L. Burton III <
>>> mrbur...@gmail.com> wrote:
>>>
>>>> They spend an enormous amount of time focusing on performance. You can
>>>> expect them to continue on with their optimization and keep crushing it.
>>>>
>>>> P.S., I don't work for ScyllaDB.
>>>>
>>>> On Thu, Mar 9, 2017 at 6:02 PM, Rakesh Kumar <
>>>> rakeshkumar...@outlook.com> wrote:
>>>>
>>>>> In all of their presentation they keep harping on the fact that
>>>>> scylladb is written in C++ and does not carry the overhead of Java.  Still
>>>>> the difference looks staggering.
>>>>> ________________________________________
>>>>> From: daemeon reiydelle <daeme...@gmail.com>
>>>>> Sent: Thursday, March 9, 2017 14:21
>>>>> To: user@cassandra.apache.org
>>>>> Subject: Re: scylladb
>>>>>
>>>>> The comparison is fair, and conservative. Did substantial performance
>>>>> comparisons for two clients, both results returned throughputs that were
>>>>> faster than the published comparisons (15x as I recall). At that time the
>>>>> client preferred to utilize a Cass COTS solution and use a caching 
>>>>> solution
>>>>> for OLA compliance.
>>>>>
>>>>>
>>>>> .......
>>>>>
>>>>> Daemeon C.M. Reiydelle
>>>>> USA (+1) 415.501.0198 <%28%2B1%29%20415.501.0198>
>>>>> London (+44) (0) 20 8144 9872
>>>>> <%28%2B44%29%20%280%29%2020%208144%209872>
>>>>>
>>>>> On Thu, Mar 9, 2017 at 11:04 AM, Robin Verlangen <ro...@us2.nl<mailto:
>>>>> ro...@us2.nl>> wrote:
>>>>> I was wondering how people feel about the comparison that's made here
>>>>> between Cassandra and ScyllaDB : http://www.scylladb.com/techno
>>>>> logy/ycsb-cassandra-scylla/#results-of-3-scylla-nodes-vs-30-
>>>>> cassandra-nodes
>>>>>
>>>>> They are claiming a 10x improvement, is that a fair comparison or
>>>>> maybe a somewhat coloured view of a (micro)benchmark in a specific setup?
>>>>> Any pros/cons known?
>>>>>
>>>>> Best regards,
>>>>>
>>>>> Robin Verlangen
>>>>> Chief Data Architect
>>>>>
>>>>> Disclaimer: The information contained in this message and attachments
>>>>> is intended solely for the attention and use of the named addressee and 
>>>>> may
>>>>> be confidential. If you are not the intended recipient, you are reminded
>>>>> that the information remains the property of the sender. You must not use,
>>>>> disclose, distribute, copy, print or rely on this e-mail. If you have
>>>>> received this message in error, please contact the sender immediately and
>>>>> irrevocably delete this message and any copies.
>>>>>
>>>>> On Wed, Dec 16, 2015 at 11:52 AM, Carlos Rolo <r...@pythian.com
>>>>> <mailto:r...@pythian.com>> wrote:
>>>>> No rain at all! But I almost had it running last weekend, but stopped
>>>>> short of installing it. Let's see if this one is for real!
>>>>>
>>>>> Regards,
>>>>>
>>>>> Carlos Juzarte Rolo
>>>>> Cassandra Consultant
>>>>>
>>>>> Pythian - Love your data
>>>>>
>>>>> rolo@pythian | Twitter: @cjrolo | Linkedin:
>>>>> linkedin.com/in/carlosjuzarterolo<http://linkedin.com/in/car
>>>>> losjuzarterolo>
>>>>> Mobile: +351 91 891 81 00<tel:+351%20918%20918%20100> | Tel: +1 613
>>>>> 565 8696 x1649 <%2B1%20613%20565%208696%20x1649>
>>>>> <tel:+1%20613-565-8696>
>>>>> www.pythian.com<http://www.pythian.com/>
>>>>>
>>>>> On Wed, Dec 16, 2015 at 12:38 AM, Dani Traphagen <
>>>>> dani.trapha...@datastax.com<mailto:dani.trapha...@datastax.com>>
>>>>> wrote:
>>>>> You'll be the first Carlos.
>>>>>
>>>>> [Inline image 1]
>>>>>
>>>>> Had any rain lately? Curious how this went, if so.
>>>>>
>>>>> On Thu, Nov 12, 2015 at 4:36 AM, Jack Krupansky <
>>>>> jack.krupan...@gmail.com<mailto:jack.krupan...@gmail.com>> wrote:
>>>>> I just did a Twitter search on scylladb and did not see any tweets
>>>>> about actual use, so far.
>>>>>
>>>>>
>>>>> -- Jack Krupansky
>>>>>
>>>>> On Wed, Nov 11, 2015 at 10:54 AM, Carlos Alonso <i...@mrcalonso.com
>>>>> <mailto:i...@mrcalonso.com>> wrote:
>>>>> Any update about this?
>>>>>
>>>>> @Carlos Rolo, did you tried it? Thoughts?
>>>>>
>>>>> Carlos Alonso | Software Engineer | @calonso<https://twitter.com/c
>>>>> alonso>
>>>>>
>>>>> On 5 November 2015 at 14:07, Carlos Rolo <r...@pythian.com<mailto:
>>>>> r...@pythian.com>> wrote:
>>>>> Something to do on a expected rainy weekend. Thanks for the
>>>>> information.
>>>>>
>>>>> Regards,
>>>>>
>>>>> Carlos Juzarte Rolo
>>>>> Cassandra Consultant
>>>>>
>>>>> Pythian - Love your data
>>>>>
>>>>> rolo@pythian | Twitter: @cjrolo | Linkedin:
>>>>> linkedin.com/in/carlosjuzarterolo<http://linkedin.com/in/car
>>>>> losjuzarterolo>
>>>>> Mobile: +351 91 891 81 00<tel:%2B351%2091%20891%2081%2000> | Tel: +1
>>>>> 613 565 8696 x1649 <%2B1%20613%20565%208696%20x1649>
>>>>> <tel:%2B1%20613%20565%208696%20x1649>
>>>>> www.pythian.com<http://www.pythian.com/>
>>>>>
>>>>> On Thu, Nov 5, 2015 at 12:07 PM, Dani Traphagen <
>>>>> dani.trapha...@datastax.com<mailto:dani.trapha...@datastax.com>>
>>>>> wrote:
>>>>> As of two days ago, they say they've got it @cjrolo.
>>>>>
>>>>> https://github.com/scylladb/scylla/wiki/RELEASE-Scylla-0.11-Beta
>>>>>
>>>>>
>>>>> On Thursday, November 5, 2015, Carlos Rolo <r...@pythian.com<mailto:
>>>>> r...@pythian.com>> wrote:
>>>>> I will not try until multi-DC is implemented. More than an month has
>>>>> passed since I looked for it, so it could possibly be in place, if so I 
>>>>> may
>>>>> take some time to test it.
>>>>>
>>>>> Regards,
>>>>>
>>>>> Carlos Juzarte Rolo
>>>>> Cassandra Consultant
>>>>>
>>>>> Pythian - Love your data
>>>>>
>>>>> rolo@pythian | Twitter: @cjrolo | Linkedin:
>>>>> linkedin.com/in/carlosjuzarterolo<http://linkedin.com/in/car
>>>>> losjuzarterolo>
>>>>> Mobile: +351 91 891 81 00<tel:%2B351%2091%20891%2081%2000> | Tel: +1
>>>>> 613 565 8696 x1649 <%2B1%20613%20565%208696%20x1649>
>>>>> <tel:%2B1%20613%20565%208696%20x1649>
>>>>> www.pythian.com<http://www.pythian.com/>
>>>>>
>>>>> On Thu, Nov 5, 2015 at 9:37 AM, Jon Haddad <jonathan.had...@gmail.com>
>>>>> wrote:
>>>>> Nope, no one I know.  Let me know if you try it I'd love to hear your
>>>>> feedback.
>>>>>
>>>>> > On Nov 5, 2015, at 9:22 AM, tommaso barbugli <tbarbu...@gmail.com>
>>>>> wrote:
>>>>> >
>>>>> > Hi guys,
>>>>> >
>>>>> > did anyone already try Scylladb (yet another fastest NoSQL database
>>>>> in town) and has some thoughts/hands-on experience to share?
>>>>> >
>>>>> > Cheers,
>>>>> > Tommaso
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Sent from mobile -- apologizes for brevity or errors.
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> [datastax_logo.png]<http://www.datastax.com/>
>>>>>
>>>>> DANI TRAPHAGEN
>>>>>
>>>>> Technical Enablement Lead | dani.trapha...@datastax.com<mailto:
>>>>> dani.trapha...@datastax.com>
>>>>>
>>>>> [twitter.png]<https://twitter.com/dtrapezoid> [linkedin.png] <
>>>>> https://www.linkedin.com/pub/dani-traphagen/31/93b/b85>  [
>>>>> https://lh5.googleusercontent.com/WcFJcWZHKXnxu01V6zJIQapcG
>>>>> onoazqsv8O7_DtfhW-qbTRHxDjfX2owDNmQhgojRx5Y4mLEc-KiAeeTJjT0V
>>>>> mKiiIld8UP86AgQPJDK2o6oC6BhTmub4NLZ_MO9-E7l9Q] <
>>>>> https://github.com/dtrapezoid>
>>>>>
>>>>> [http://datastax.com/all/images/cs_logo_color_sm.png]
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> -Richard L. Burton III
>>>> @rburton
>>>>
>>>
>>>
>>>
>>
>>
> Benchmarks are great for FUDly blog posts. Real world work loads matter
more. Every NoSQL vendor wins their benchmarks.

Reply via email to