Hello Roshan,

Thank you for your detailed answer. Details are important, because in my
organization, I am often asked to re-assess the reasons why we chose Storm
over its competitors.

Best regards,
Alexandre Vermeerbergen


2017-07-25 23:36 GMT+02:00 roshannaik <g...@git.apache.org>:

> Github user roshannaik commented on the issue:
>
>     https://github.com/apache/storm/pull/2241
>
>     @avermeer Looks like SuperChief blog is relaying the same basic claims
> that Heron has marketed. Since you ask, i will share my opinions wrt
> Heron's claims.
>
>     - Heron has never been a player in the high performance club. They
> have been smart about not comparing themselves with the real top performers
> of the day. I only included them here because they have built they have
> made much noise against Storm. They are smart about not mentioning which
> version of Storm they are comparing with (how does a paper with such
> critical info missing get accepted ?). That creates an illusion in people
> that their perf claims apply to all versions of Storm in general... even if
> Storm [publishes new perf numbers](hortonworks.com/blog/
> microbenchmarking-storm-1-0-performance/) comparing itself to a prior
> version.
>     - Heron's threading model (1 thread per process.. based on what i
> gather from their articles), is really primitive for this application
> domain.  I don't recommend it, but by setting 'topology.workers' equal to
> the number of spout& bolt instances, Storm can be run in Heron mode.
>     -  I find it much easier to debug a process with multiple components
> using a debugger rather start a separate debugger for every instance of
> spout bolt running. Also, I would imagine, having so many processes means
> you have an explosion of log files to deal with when triaging.
>     - Unclear why the recovery model (when worker process crashes) is any
> better ... the same kind of replay from the spout would be required. The
> gains may be minor if any. Making minor optimizations to the failure path
> and penalizing the normal operation path... is backwards.
>     - Cant get a stack from a Storm worker ? Thats clearly false. Try it
> yourself. I do it all the time. Heapdumps, on the other hand, can stall the
> worker and if the heap size is really large the supervisor might feel the
> worker is having a problem. There are timeouts that you can increase to for
> the supervisor to wait longer. I cant imagine that Heron doesn't monitor
> their workers and restart them if they are not responsive.
>     -  Heron's Backpressure model is simply too overweight, but marketed
> as a novel idea.
>     - A quick read of their latest perf blog, noted in the comparison, and
> it was evident that they missed recognizing their real perf problem.
>
>
>
> ---
> If your project is set up for it, you can reply to this email and have your
> reply appear on GitHub as well. If your project does not have this feature
> enabled and wishes so, or if the feature is enabled but not working, please
> contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
> with INFRA.
> ---
>

Reply via email to