[jira] [Commented] (STORM-599) UI is slow due to fetching heartbeats from ZK

2015-01-14 Thread Robert Joseph Evans (JIRA)

[ 
https://issues.apache.org/jira/browse/STORM-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277076#comment-14277076
 ] 

Robert Joseph Evans commented on STORM-599:
---

[~sriharsha]
You are correct; I verified it too.  Hmm, we are running with this change on a 
different code base, where it works, so it could have something to do with how 
often the cache is being updated.  I will revert the check-in, reopen this 
issue, and then we can investigate further.  Thanks for finding this.

 UI is slow due to fetching heartbeats from ZK
 -

 Key: STORM-599
 URL: https://issues.apache.org/jira/browse/STORM-599
 Project: Apache Storm
  Issue Type: Improvement
Affects Versions: 0.9.3
Reporter: Derek Dagit
Assignee: Derek Dagit
Priority: Minor
 Fix For: 0.10.0


 The method getTopologyInfo fetches every heartbeat from ZooKeeper in order to 
 gather statistics.  The UI calls this method via thrift to render the 
 topology page.
 When topologies are launched with thousands of executors, this fetching from 
 ZK dramatically slows things down.
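 To illustrate why this scales poorly, here is a minimal Python sketch (not 
 Storm code) that counts simulated ZooKeeper reads. `FakeZk`, 
 `topology_info_from_zk` and `topology_info_from_cache` are hypothetical names, 
 and the one-read-per-heartbeat cost model is an assumption; the real 
 getTopologyInfo lives in Nimbus.

```python
class FakeZk:
    """Hypothetical stand-in for ZooKeeper that counts the reads it serves."""

    def __init__(self, heartbeats):
        self.heartbeats = heartbeats
        self.reads = 0

    def read_heartbeat(self, heartbeat_id):
        self.reads += 1  # each call models one network round trip
        return self.heartbeats[heartbeat_id]


def topology_info_from_zk(zk, heartbeat_ids):
    # One read per heartbeat: the cost grows linearly with topology size.
    return {hb: zk.read_heartbeat(hb) for hb in heartbeat_ids}


def topology_info_from_cache(cache, heartbeat_ids):
    # Served from memory: no ZooKeeper reads at request time.
    return {hb: cache[hb] for hb in heartbeat_ids if hb in cache}


heartbeat_ids = [f"hb-{i}" for i in range(3000)]
beats = {hb: {"uptime": 60} for hb in heartbeat_ids}

zk = FakeZk(beats)
from_zk = topology_info_from_zk(zk, heartbeat_ids)
reads_for_zk_path = zk.reads

cache = dict(beats)  # assumption: an in-memory copy is already available
from_cache = topology_info_from_cache(cache, heartbeat_ids)
reads_for_cache_path = zk.reads - reads_for_zk_path
```

 Both paths return the same data in this sketch; only the number of simulated 
 reads issued while handling the request differs.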



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (STORM-599) UI is slow due to fetching heartbeats from ZK

2015-01-14 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/STORM-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14277273#comment-14277273
 ] 

ASF GitHub Bot commented on STORM-599:
--

GitHub user d2r opened a pull request:

https://github.com/apache/storm/pull/381

[Storm 599] Use nimbus's cached heartbeats rather than fetching again 
from ZK, take two

This is a retry of the reverted pull request #356.

In #356, a merge error (we think?) on the branch used for the pull request 
caused part of the change on one line to go missing.

```Diff
- beats (.executor-beats storm-cluster-state storm-id (:executor-node+port assignment))
+ beats (map-val :heartbeat @(:heartbeats-cache nimbus))
```

The new code should have been:
```Diff
+ beats (map-val :heartbeat (get @(:heartbeats-cache nimbus) storm-id))
```

This caused incorrect retrieval of heartbeats and therefore metrics were 
not populated as noted by @harshach 
[here](https://issues.apache.org/jira/browse/STORM-599?focusedCommentId=14275512&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14275512).
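
As a rough illustration of why the missing `(get ... storm-id)` matters, here 
is a small Python model of the two Clojure expressions. The cache shape 
(topology id, then executor, then an entry with a `heartbeat` key) is an 
assumption inferred from the diff, and `map_val` and `get_heartbeat` are 
stand-ins for Storm's `map-val` and the `:heartbeat` keyword lookup.

```python
def map_val(f, m):
    """Apply f to every value of a dict, keeping the keys."""
    return {k: f(v) for k, v in m.items()}


def get_heartbeat(entry):
    # Models the Clojure keyword lookup (:heartbeat entry); None if absent.
    return entry.get("heartbeat")


# Assumed shape: topology id -> executor -> entry holding a "heartbeat" key.
heartbeats_cache = {
    "topo-a": {(1, 1): {"heartbeat": {"uptime": 30}}},
    "topo-b": {(2, 2): {"heartbeat": {"uptime": 99}}},
}

# Line as merged in #356: maps over the whole cache, so the keys are
# topology ids and the values have no "heartbeat" key at that level.
broken = map_val(get_heartbeat, heartbeats_cache)

# Intended line: select the topology first, then extract each heartbeat.
fixed = map_val(get_heartbeat, heartbeats_cache.get("topo-a", {}))
```

In this model `broken` is keyed by topology id with every value empty, while 
`fixed` is keyed by executor and carries the heartbeat data, which matches the 
symptom of metrics not being populated.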

Testing again with this branch in a clean environment shows that metrics 
are updated as expected once per `nimbus.monitor.freq.secs`, which defaults to 
10s.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/d2r/storm STORM-599-use-cached-hbs

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/storm/pull/381.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #381


commit 1d973c6838118e64024505269b82baffd0f41f8d
Author: Derek Dagit der...@yahoo-inc.com
Date:   2015-01-14T17:12:30Z

use hb cache instead of fetching via zk

commit fc952d4c64f52e29cd162ffacee984ef36ab9447
Author: Derek Dagit der...@yahoo-inc.com
Date:   2015-01-14T17:13:05Z

Remove unnecessary code






[jira] [Commented] (STORM-599) UI is slow due to fetching heartbeats from ZK

2015-01-06 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/STORM-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14266832#comment-14266832
 ] 

ASF GitHub Bot commented on STORM-599:
--

GitHub user asfgit closed the pull request at:

https://github.com/apache/storm/pull/356




[jira] [Commented] (STORM-599) UI is slow due to fetching heartbeats from ZK

2014-12-23 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/STORM-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14257528#comment-14257528
 ] 

ASF GitHub Bot commented on STORM-599:
--

GitHub user revans2 commented on the pull request:

https://github.com/apache/storm/pull/356#issuecomment-67997939
  
The changes look fine to me.  My only concern would be around what happens 
if the cache has not been updated and is empty, but it looks like it will 
return an empty map, and that looks like what happened before when there were 
no heartbeats.

+1
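
The empty-cache case can be sketched in Python as follows. This is a model 
only: `beats_for` is a hypothetical helper, and the cache shape (topology id, 
then executor, then an entry with a `heartbeat` key) is an assumption rather 
than something confirmed by the pull request.

```python
def beats_for(heartbeats_cache, storm_id):
    """Return executor -> heartbeat for one topology, or {} if none are cached."""
    executors = heartbeats_cache.get(storm_id) or {}
    return {ex: entry.get("heartbeat") for ex, entry in executors.items()}


# Cache not yet populated at all.
empty_cache_result = beats_for({}, "topo-a")

# Cache populated, but not for the requested topology.
other_cache = {"topo-b": {(1, 1): {"heartbeat": {"uptime": 5}}}}
unknown_topology_result = beats_for(other_cache, "topo-a")
```

Both lookups yield an empty map rather than an error, which is the behaviour 
described above for the case where there are no heartbeats.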




[jira] [Commented] (STORM-599) UI is slow due to fetching heartbeats from ZK

2014-12-22 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/STORM-599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256209#comment-14256209
 ] 

ASF GitHub Bot commented on STORM-599:
--

GitHub user d2r opened a pull request:

https://github.com/apache/storm/pull/356

[STORM-599] Use nimbus's cached heartbeats rather than fetching again 
from ZK

In my manual testing, this seems to cut the time to load a topology page by 
around 50%.

This also removes some apparently unnecessary code.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/d2r/storm apache-use-hb-cache-for-topo-info

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/storm/pull/356.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #356


commit 3907b2d780b796e754bf3566d29567cff6e77679
Author: Derek Dagit der...@yahoo-inc.com
Date:   2014-12-13T15:36:31Z

use the cache instead of downloading from zk

commit 40be5ddb94b674f8fcf2c804c7af96fbfe6a
Author: Derek Dagit der...@yahoo-inc.com
Date:   2014-12-13T15:36:51Z

Remove unnecessary code



