[jira] [Commented] (STORM-1772) Create topologies for measuring performance

Roshan Naik (JIRA) Tue, 10 May 2016 16:15:52 -0700

    [ 
https://issues.apache.org/jira/browse/STORM-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15279190#comment-15279190
 ]


Roshan Naik commented on STORM-1772:
------------------------------------

Hi [~mauzhang],
    Yes, that's it. I first observed the perf difference while working on 
STORM-1632, but was not able to get to the bottom of it. The Storm-native 
topology is mentioned here: 
https://github.com/apache/storm/pull/1217#issuecomment-201074919

I can try to locate the benchmark-specific version of the topology, but it's 
a straightforward rewrite.

The Storm-native version showed a difference of ~12% in an A/B test (with 
and without the fix).
The benchmark-specific version of the topology showed 25%, as noted in the 
description of STORM-1632.



IMO, setting the perf diff issue aside for a moment, it would be good to go 
ahead and see what we can incorporate from that benchmark. In this JIRA my 
goal is to add a few topologies for perf testing, not to create a 
benchmarking tool/framework itself. In that sense it does not conflict with 
STORM-642.

*side note:* If we are adding a benchmarking framework, it would be good if it 
can run standard Storm topologies directly and not require topologies to be 
written specifically for it.

> Create topologies for measuring performance
> -------------------------------------------
>
>                 Key: STORM-1772
>                 URL: https://issues.apache.org/jira/browse/STORM-1772
>             Project: Apache Storm
>          Issue Type: Bug
>            Reporter: Roshan Naik
>            Assignee: Roshan Naik
>
> Would be very useful to have some simple reference topologies included with 
> Storm that can be used to measure performance both by devs during development 
> (to start with) and perhaps also on a real storm cluster (subsequently). 
> To start with, the goal is to put the focus on the performance 
> characteristics of individual building blocks such as specific bolts, 
> spouts, grouping options, queues, etc. So, initially biased towards 
> micro-benchmarking but subsequently we could add higher level ones too.
> Although there is a Storm benchmarking tool (originally written by Intel?) 
> that can be used, and I have personally used it, it's better for this to be 
> integrated into Storm proper and also maintained by devs as Storm evolves. 
> On a side note, in some instances I have noticed (to my surprise) that the 
> perf numbers change when topologies written for the Intel benchmark are 
> rewritten without the required wrappers so that they run directly under 
> Storm.
> I have a few topologies in mind for measuring each of these:
> # *Queuing and Spout Emit Performance:* A topology with a Generator Spout but 
> no bolts.
> # *Queuing & Grouping Performance:* Generator Spout -> A grouping method -> 
> DevNull Bolt
> # *Hdfs Bolt:* Generator Spout -> Hdfs Bolt
> # *Hdfs Spout:* Hdfs Spout -> DevNull Bolt
> # *Kafka Spout:* Kafka Spout -> DevNull Bolt
> # *Simple Data Movement:* Kafka Spout -> Hdfs Bolt
> Shall add these for Storm core first. Then we can have the same for Trident 
> also.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
