[jira] [Updated] (DISPATCH-929) dangling connection pointer in epoll proactor(?)

2021-04-02 Thread Jira


 [ 
https://issues.apache.org/jira/browse/DISPATCH-929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jiri Daněk updated DISPATCH-929:

Labels: memory-bug  (was: )

> dangling connection pointer in epoll proactor(?)
> 
>
> Key: DISPATCH-929
> URL: https://issues.apache.org/jira/browse/DISPATCH-929
> Project: Qpid Dispatch
>  Issue Type: Bug
>Reporter: Gordon Sim
>Priority: Major
>  Labels: memory-bug
> Fix For: Backlog
>
>
> As yet have no clear reproducer, but observed crashes under CI for a system 
> that uses the router which under valgrind log the following:
> {noformat}
> 2018-02-16 00:00:38.220403 + SERVER (info) Connection from 
> 172.17.0.15:40592 (to 0.0.0.0:55671) failed: amqp:connection:framing-error 
> SSL Failure: error:140940E5:SSL routines:ssl3_read_bytes:ssl handshake fa
> ilure
> 2018-02-16 00:00:38.222489 + SERVER (info) Connection from 
> 172.17.0.15:40590 (to 0.0.0.0:55671) failed: amqp:connection:framing-error 
> SSL Failure: error:140940E5:SSL routines:ssl3_read_bytes:ssl handshake fa
> ilure
> 2018-02-16 00:00:38.224534 + SERVER (info) Connection from 
> 172.17.0.15:40586 (to 0.0.0.0:55671) failed: amqp:connection:framing-error 
> SSL Failure: Unknown error
> ==1== Invalid read of size 4
> ==1==at 0x50B0E00: pthread_mutex_lock (in /usr/lib64/libpthread-2.24.so)
> ==1==by 0x58C9C68: lock (epoll.c:113)
> ==1==by 0x58C9C68: memory_barrier (epoll.c:142)
> ==1==by 0x58C9C68: proactor_do_epoll (epoll.c:2016)
> ==1==by 0x4E89998: thread_run (server.c:936)
> ==1==by 0x4E89A6D: qd_server_run (server.c:1199)
> ==1==by 0x401853: main_process (main.c:111)
> ==1==by 0x401603: main (main.c:318)
> ==1==  Address 0x8d76878 is 72 bytes inside a block of size 1,824 free'd
> ==1==at 0x4C2ED4A: free (vg_replace_malloc.c:530)
> ==1==by 0x569A985: pn_class_decref (object.c:101)
> ==1==by 0x58CB01B: pconnection_done (epoll.c:965)
> ==1==by 0x58CB01B: pn_proactor_done (epoll.c:2054)
> ==1==by 0x4E899E3: thread_run (server.c:941)
> ==1==by 0x50AE739: start_thread (in /usr/lib64/libpthread-2.24.so)
> ==1==by 0x53CCE7E: clone (in /usr/lib64/libc-2.24.so)
> ==1==  Block was alloc'd at
> ==1==at 0x4C2FA50: calloc (vg_replace_malloc.c:711)
> ==1==by 0x569A851: pn_object_new (object.c:202)
> ==1==by 0x569A8CB: pn_class_new (object.c:61)
> ==1==by 0x58CA677: pn_listener_accept2 (epoll.c:1717)
> ==1==by 0x4E88E6A: on_accept (server.c:550)
> ==1==by 0x4E88E6A: handle_listener (server.c:709)
> ==1==by 0x4E88E6A: handle (server.c:860)
> ==1==by 0x4E899B7: thread_run (server.c:939)
> ==1==by 0x4E89A6D: qd_server_run (server.c:1199)
> ==1==by 0x401853: main_process (main.c:111)
> ==1==by 0x401603: main (main.c:318)
> {noformat}
> All instances of the error did seem to have the ssl handshake failure error. 
> However even if I trigger that error, I can't reproduce the invalid read yet.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@qpid.apache.org
For additional commands, e-mail: dev-h...@qpid.apache.org



[jira] [Updated] (DISPATCH-929) dangling connection pointer in epoll proactor(?)

2018-02-19 Thread Gordon Sim (JIRA)

 [ 
https://issues.apache.org/jira/browse/DISPATCH-929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gordon Sim updated DISPATCH-929:

Description: 
As yet have no clear reproducer, but observed crashes under CI for a system 
that uses the router which under valgrind log the following:

{noformat}
2018-02-16 00:00:38.220403 + SERVER (info) Connection from 
172.17.0.15:40592 (to 0.0.0.0:55671) failed: amqp:connection:framing-error SSL 
Failure: error:140940E5:SSL routines:ssl3_read_bytes:ssl handshake fa
ilure
2018-02-16 00:00:38.222489 + SERVER (info) Connection from 
172.17.0.15:40590 (to 0.0.0.0:55671) failed: amqp:connection:framing-error SSL 
Failure: error:140940E5:SSL routines:ssl3_read_bytes:ssl handshake fa
ilure
2018-02-16 00:00:38.224534 + SERVER (info) Connection from 
172.17.0.15:40586 (to 0.0.0.0:55671) failed: amqp:connection:framing-error SSL 
Failure: Unknown error
==1== Invalid read of size 4
==1==at 0x50B0E00: pthread_mutex_lock (in /usr/lib64/libpthread-2.24.so)
==1==by 0x58C9C68: lock (epoll.c:113)
==1==by 0x58C9C68: memory_barrier (epoll.c:142)
==1==by 0x58C9C68: proactor_do_epoll (epoll.c:2016)
==1==by 0x4E89998: thread_run (server.c:936)
==1==by 0x4E89A6D: qd_server_run (server.c:1199)
==1==by 0x401853: main_process (main.c:111)
==1==by 0x401603: main (main.c:318)
==1==  Address 0x8d76878 is 72 bytes inside a block of size 1,824 free'd
==1==at 0x4C2ED4A: free (vg_replace_malloc.c:530)
==1==by 0x569A985: pn_class_decref (object.c:101)
==1==by 0x58CB01B: pconnection_done (epoll.c:965)
==1==by 0x58CB01B: pn_proactor_done (epoll.c:2054)
==1==by 0x4E899E3: thread_run (server.c:941)
==1==by 0x50AE739: start_thread (in /usr/lib64/libpthread-2.24.so)
==1==by 0x53CCE7E: clone (in /usr/lib64/libc-2.24.so)
==1==  Block was alloc'd at
==1==at 0x4C2FA50: calloc (vg_replace_malloc.c:711)
==1==by 0x569A851: pn_object_new (object.c:202)
==1==by 0x569A8CB: pn_class_new (object.c:61)
==1==by 0x58CA677: pn_listener_accept2 (epoll.c:1717)
==1==by 0x4E88E6A: on_accept (server.c:550)
==1==by 0x4E88E6A: handle_listener (server.c:709)
==1==by 0x4E88E6A: handle (server.c:860)
==1==by 0x4E899B7: thread_run (server.c:939)
==1==by 0x4E89A6D: qd_server_run (server.c:1199)
==1==by 0x401853: main_process (main.c:111)
==1==by 0x401603: main (main.c:318)

{noformat}

All instances of the error did seem to have the ssl handshake failure error. 
However even if I trigger that error, I can't reproduce the invalid read yet.

  was:
As yet have no clear reproducer, but observed crashes under CI for a system 
that uses the router which under valgrind log the following:

{noformat}
2018-02-16 00:00:38.220403 + SERVER (info) Connection from 
172.17.0.15:40592 (to 0.0.0.0:55671) failed: amqp:connection:framing-error SSL 
Failure: error:140940E5:SSL routines:ssl3_read_bytes:ssl handshake fa
ilure
2018-02-16 00:00:38.222489 + SERVER (info) Connection from 
172.17.0.15:40590 (to 0.0.0.0:55671) failed: amqp:connection:framing-error SSL 
Failure: error:140940E5:SSL routines:ssl3_read_bytes:ssl handshake fa
ilure
2018-02-16 00:00:38.224534 + SERVER (info) Connection from 
172.17.0.15:40586 (to 0.0.0.0:55671) failed: amqp:connection:framing-error SSL 
Failure: Unknown error
==1== Invalid read of size 4
==1==at 0x50B0E00: pthread_mutex_lock (in /usr/lib64/libpthread-2.24.so)
==1==by 0x58C9C68: lock (epoll.c:113)
==1==by 0x58C9C68: memory_barrier (epoll.c:142)
==1==by 0x58C9C68: proactor_do_epoll (epoll.c:2016)
==1==by 0x4E89998: thread_run (server.c:936)
==1==by 0x4E89A6D: qd_server_run (server.c:1199)
==1==by 0x401853: main_process (main.c:111)
==1==by 0x401603: main (main.c:318)
==1==  Address 0x8d76878 is 72 bytes inside a block of size 1,824 free'd
==1==at 0x4C2ED4A: free (vg_replace_malloc.c:530)
==1==by 0x569A985: pn_class_decref (object.c:101)
==1==by 0x58CB01B: pconnection_done (epoll.c:965)
==1==by 0x58CB01B: pn_proactor_done (epoll.c:2054)
==1==by 0x4E899E3: thread_run (server.c:941)
==1==by 0x50AE739: start_thread (in /usr/lib64/libpthread-2.24.so)
==1==by 0x53CCE7E: clone (in /usr/lib64/libc-2.24.so)
==1==  Block was alloc'd at
==1==at 0x4C2FA50: calloc (vg_replace_malloc.c:711)
==1==by 0x569A851: pn_object_new (object.c:202)
==1==by 0x569A8CB: pn_class_new (object.c:61)
==1==by 0x58CA677: pn_listener_accept2 (epoll.c:1717)
==1==by 0x4E88E6A: on_accept (server.c:550)
==1==by 0x4E88E6A: handle_listener (server.c:709)
==1==by 0x4E88E6A: handle (server.c:860)
==1==by 0x4E899B7: thread_run (server.c:939)
==1==by 0x4E89A6D: qd_server_run (server.c:1199)
==1==by 0x401853: main_process (main.c:111)
==1==by 0x401603: main (main.c:318)

{noformat}

All instances of the error did seem to have the ssl handshake failure error. 
Not sure how