[ 
https://issues.apache.org/jira/browse/STORM-763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14567421#comment-14567421
 ] 

ASF GitHub Bot commented on STORM-763:
--------------------------------------

Github user eshioji commented on the pull request:

    https://github.com/apache/storm/pull/568#issuecomment-107578216
  
    @revans2 Re: pending metric, I see. I removed it thinking that it 
exclusively had to do with the `pendingMessage` field, but now I realise it was 
actually tracking messages inside Netty's internal buffer, too. I brought it 
back; should be equivalent to the old metrics.
    
    Re: performance, I couldn't see any performance difference on my test 
cluster (5 node, processes around 60K tuples per sec) under "normal" 
conditions. With `v0.9.4` it would start to go haywire after several hours and 
start to consume lots of CPU and a lot of tuples start to fail. That symptom 
seems to have been alleviated with the patch. However, this topology feeds off 
a live data stream so admittedly it's not a strict comparison (because the 
volume of data is not completely the same from run to run). Is there a standard 
way you guys use to verify performance regression?
    
    
      


> nimbus reassigned worker A to another machine, but other worker's netty 
> client can't connect to the new worker A 
> -----------------------------------------------------------------------------------------------------------------
>
>                 Key: STORM-763
>                 URL: https://issues.apache.org/jira/browse/STORM-763
>             Project: Apache Storm
>          Issue Type: Bug
>    Affects Versions: 0.9.4
>         Environment: Debian 3.16.3-2~bpo70+1 (2014-09-21) x86_64 GNU/Linux
> java version "1.7.0_03"
> storm 0.9.4
> cluster 50+ machines
>            Reporter: 3in
>
> Debian 3.16.3-2~bpo70+1 (2014-09-21) x86_64 GNU/Linux
> java version "1.7.0_03"
> storm 0.9.4
> cluster 50+ machines
> my topology have 50+ worker, it can't emit  50000 thousand tuples in ten 
> minutes.
> sometimes one worker is reassigned to another machine by nimbus because of 
> task heartbeat timeout:
> 2015-04-08T16:51:23.026+0800 b.s.d.nimbus [INFO] Executor 
> my_topology-22-1428243953:[440 440] not alive
> 2015-04-08T16:51:23.026+0800 b.s.d.nimbus [INFO] Executor 
> my_topology-22-1428243953:[90 90] not alive
> 2015-04-08T16:51:23.026+0800 b.s.d.nimbus [INFO] Executor 
> my_topology-22-1428243953:[510 510] not alive
> 2015-04-08T16:51:23.026+0800 b.s.d.nimbus [INFO] Executor 
> my_topology-22-1428243953:[160 160] not alive
> i can see the reassigned worker is already started in storm UI,  but  other 
> worker write error log all the time:
> 2015-04-08T16:56:43.091+0800 b.s.m.n.Client [ERROR] dropping 1 message(s) 
> destined for Netty-Client-host_19/192.168.163.19:5700
> 2015-04-08T16:56:45.660+0800 b.s.m.n.Client [ERROR] connection to 
> Netty-Client-host_19/192.168.163.19:5700 is unavailable
> 2015-04-08T16:56:45.660+0800 b.s.m.n.Client [ERROR] dropping 1 message(s) 
> destined for Netty-Client-host_19/192.168.163.19:5700
> 2015-04-08T16:56:45.715+0800 b.s.m.n.Client [ERROR] connection to 
> Netty-Client-host_19/192.168.163.19:5700 is unavailable
> 2015-04-08T16:56:45.716+0800 b.s.m.n.Client [ERROR] dropping 1 message(s) 
> destined for Netty-Client-host_19/192.168.163.19:5700
> 2015-04-08T16:56:46.277+0800 b.s.m.n.Client [ERROR] connection to 
> Netty-Client-host_19/192.168.163.19:5700 is unavailable
> 2015-04-08T16:56:46.278+0800 b.s.m.n.Client [ERROR] dropping 1 message(s) 
> destined for Netty-Client-host_19/192.168.163.19:5700
> 2015-04-08T16:56:46.306+0800 b.s.m.n.Client [ERROR] connection to 
> Netty-Client-host_19/192.168.163.19:5700 is unavailable
> 2015-04-08T16:56:46.306+0800 b.s.m.n.Client [ERROR] dropping 1 message(s) 
> destined for Netty-Client-host_19/192.168.163.19:5700
> 2015-04-08T16:56:46.586+0800 b.s.m.n.Client [ERROR] connection to 
> Netty-Client-host_19/192.168.163.19:5700 is unavailable
> 2015-04-08T16:56:46.586+0800 b.s.m.n.Client [ERROR] dropping 1 message(s) 
> destined for Netty-Client-host_19/192.168.163.19:5700
> 2015-04-08T16:56:46.835+0800 b.s.m.n.Client [ERROR] connection to 
> Netty-Client-host_19/192.168.163.19:5700 is unavailable
> The worker of destined host is already started, and i can telnet 
> 192.168.163.19 5700.
> however, why the netty client can't connect to the ip:port?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to