Re: [ovs-dev] [replication SMv2 7/7] ovsdb: Replication usability improvements
> > I tested the v3 version pulling from your github, and I am not seeing an >> crashes. >> > > > Thanks for testing. I added your name to tested-by in the V3 patch. ___ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev
Re: [ovs-dev] [replication SMv2 7/7] ovsdb: Replication usability improvements
On Wed, Aug 31, 2016 at 5:36 PM, Numan Siddiquewrote: > > > On Wed, Aug 31, 2016 at 12:03 AM, Andy Zhou wrote: > >> >> >> On Tue, Aug 30, 2016 at 4:17 AM, Numan Siddique >> wrote: >> >>> >>> >>> On Tue, Aug 30, 2016 at 1:11 AM, Andy Zhou wrote: >>> On Mon, Aug 29, 2016 at 3:14 AM, Numan Siddique wrote: > > > On Sat, Aug 27, 2016 at 4:45 AM, Andy Zhou wrote: > >> Added the '--no-sync' option base on feedbacks of current >> implementation. >> >> Added appctl command "ovsdb-server/sync-status" based on feedbacks >> of current implementation. >> >> Added a test to simulate the integration of HA manager with OVSDB >> server using replication. >> >> Other documentation and API improvements. >> >> Signed-off-by: Andy Zhou >> -- >> >> I hope to get some review comments on the command line and appctl >> interfaces for replication. Since 2.6 is the first release of those >> interfaces, it is easier to making changes, compare to future >> releases. >> >> >> v1->v2: Fix creashes reported at: >> http://openvswitch.org/pipermail/dev/2016-August/078591.html >> --- >> > > I haven't tested these patches yet. This patch seems to have a white > space warning when applied. > Thanks for the reported. I will fold the fix in the next version when posting. In case it helps, you can also access the patches from my private repo at: https://github.com/azhou-nicira/ovs-review/tree/ovsdb-replic ation-sm-v2 >>> >>> Hi Andy, >>> >>> I am seeing the below crash when >>> >>> - The ovsdb-server changes from >>> master to standby and the active-ovsdb-server it is about to connect to >>> is killed just before that or it is not reachable. >>> >>> - >>> The pacemaker OCF script calls the sync-status cmd soon after that. >>> >>> >>> Please let me know if you need more information. >>> >>> >>> Core was generated by `ovsdb-server -vdbg >>> --log-file=/opt/stack/logs/ovsdb-server-sb.log >>> --remote=puni'. >>> Program terminated with signal SIGSEGV, Segmentation fault. >>> #0 0x0041241d in replication_status () at >>> ovsdb/replication.c:875 >>> 875SHASH_FOR_EACH (node, replication_dbs) { >>> Missing separate debuginfos, use: dnf debuginfo-install >>> glibc-2.23.1-10.fc24.x86_64 openssl-libs-1.0.2h-3.fc24.x86_64 >>> (gdb) bt >>> #0 0x0041241d in replication_status () at >>> ovsdb/replication.c:875 >>> #1 0x00406eda in ovsdb_server_get_sync_status (conn=0x1421fd0, >>> argc=, argv=, config_=) >>> at ovsdb/ovsdb-server.c:1480 >>> #2 0x004324ee in process_command (request=0x1421f30, >>> conn=0x1421fd0) at lib/unixctl.c:313 >>> #3 run_connection (conn=0x1421fd0) at lib/unixctl.c:347 >>> #4 unixctl_server_run (server=server@entry=0x141e140) at >>> lib/unixctl.c:400 >>> #5 0x00405bdc in main_loop (is_backup=0x7fff08062256, >>> exiting=0x7fff08062257, run_process=0x0, remotes=0x7fff080622a0, >>> unixctl=0x141e140, >>> all_dbs=0x7fff080622e0, jsonrpc=0x13f6f00) at >>> ovsdb/ovsdb-server.c:182 >>> #6 main (argc=, argv=) at >>> ovsdb/ovsdb-server.c:430 >>> >>> Numan, thanks for the report. I think I spotted the bug: >> >> Currently, when replication state machine is reset, the state update >> takes place after a round of main loop run. this time lag >> could lead to the back trace in case the unixctl commands was issued >> during this time lag. I have a fix that add another >> state to represent the reset condition. The fix is at: >> >> https://github.com/azhou-nicira/ovs-review/tree/ovsdb-replication-sm-v3 >> >> Would you please let me know if this version works any better?. Thanks! >> > > Sure. I would test and let you know. > > I tested the v3 version pulling from your github, and I am not seeing an crashes. Thanks Numan ___ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev
Re: [ovs-dev] [replication SMv2 7/7] ovsdb: Replication usability improvements
On Wed, Aug 31, 2016 at 12:03 AM, Andy Zhouwrote: > > > On Tue, Aug 30, 2016 at 4:17 AM, Numan Siddique > wrote: > >> >> >> On Tue, Aug 30, 2016 at 1:11 AM, Andy Zhou wrote: >> >>> >>> >>> On Mon, Aug 29, 2016 at 3:14 AM, Numan Siddique >>> wrote: >>> On Sat, Aug 27, 2016 at 4:45 AM, Andy Zhou wrote: > Added the '--no-sync' option base on feedbacks of current > implementation. > > Added appctl command "ovsdb-server/sync-status" based on feedbacks > of current implementation. > > Added a test to simulate the integration of HA manager with OVSDB > server using replication. > > Other documentation and API improvements. > > Signed-off-by: Andy Zhou > -- > > I hope to get some review comments on the command line and appctl > interfaces for replication. Since 2.6 is the first release of those > interfaces, it is easier to making changes, compare to future > releases. > > > v1->v2: Fix creashes reported at: > http://openvswitch.org/pipermail/dev/2016-August/078591.html > --- > I haven't tested these patches yet. This patch seems to have a white space warning when applied. >>> Thanks for the reported. I will fold the fix in the next version when >>> posting. >>> >>> In case it helps, you can also access the patches from my private repo >>> at: >>> https://github.com/azhou-nicira/ovs-review/tree/ovsdb-replic >>> ation-sm-v2 >>> >>> >> >> Hi Andy, >> >> I am seeing the below crash when >> >> - The ovsdb-server changes from >> master to standby and the active-ovsdb-server it is about to connect to >> is killed just before that or it is not reachable. >> >> - >> The pacemaker OCF script calls the sync-status cmd soon after that. >> >> >> Please let me know if you need more information. >> >> >> Core was generated by `ovsdb-server -vdbg >> --log-file=/opt/stack/logs/ovsdb-server-sb.log >> --remote=puni'. >> Program terminated with signal SIGSEGV, Segmentation fault. >> #0 0x0041241d in replication_status () at ovsdb/replication.c:875 >> 875SHASH_FOR_EACH (node, replication_dbs) { >> Missing separate debuginfos, use: dnf debuginfo-install >> glibc-2.23.1-10.fc24.x86_64 openssl-libs-1.0.2h-3.fc24.x86_64 >> (gdb) bt >> #0 0x0041241d in replication_status () at ovsdb/replication.c:875 >> #1 0x00406eda in ovsdb_server_get_sync_status (conn=0x1421fd0, >> argc=, argv=, config_=) >> at ovsdb/ovsdb-server.c:1480 >> #2 0x004324ee in process_command (request=0x1421f30, >> conn=0x1421fd0) at lib/unixctl.c:313 >> #3 run_connection (conn=0x1421fd0) at lib/unixctl.c:347 >> #4 unixctl_server_run (server=server@entry=0x141e140) at >> lib/unixctl.c:400 >> #5 0x00405bdc in main_loop (is_backup=0x7fff08062256, >> exiting=0x7fff08062257, run_process=0x0, remotes=0x7fff080622a0, >> unixctl=0x141e140, >> all_dbs=0x7fff080622e0, jsonrpc=0x13f6f00) at ovsdb/ovsdb-server.c:182 >> #6 main (argc=, argv=) at >> ovsdb/ovsdb-server.c:430 >> >> Numan, thanks for the report. I think I spotted the bug: > > Currently, when replication state machine is reset, the state update > takes place after a round of main loop run. this time lag > could lead to the back trace in case the unixctl commands was issued > during this time lag. I have a fix that add another > state to represent the reset condition. The fix is at: > > https://github.com/azhou-nicira/ovs-review/tree/ovsdb-replication-sm-v3 > > Would you please let me know if this version works any better?. Thanks! > Sure. I would test and let you know. Thanks Numan ___ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev
Re: [ovs-dev] [replication SMv2 7/7] ovsdb: Replication usability improvements
On Tue, Aug 30, 2016 at 4:17 AM, Numan Siddiquewrote: > > > On Tue, Aug 30, 2016 at 1:11 AM, Andy Zhou wrote: > >> >> >> On Mon, Aug 29, 2016 at 3:14 AM, Numan Siddique >> wrote: >> >>> >>> >>> On Sat, Aug 27, 2016 at 4:45 AM, Andy Zhou wrote: >>> Added the '--no-sync' option base on feedbacks of current implementation. Added appctl command "ovsdb-server/sync-status" based on feedbacks of current implementation. Added a test to simulate the integration of HA manager with OVSDB server using replication. Other documentation and API improvements. Signed-off-by: Andy Zhou -- I hope to get some review comments on the command line and appctl interfaces for replication. Since 2.6 is the first release of those interfaces, it is easier to making changes, compare to future releases. v1->v2: Fix creashes reported at: http://openvswitch.org/pipermail/dev/2016-August/078591.html --- >>> >>> I haven't tested these patches yet. This patch seems to have a white >>> space warning when applied. >>> >> Thanks for the reported. I will fold the fix in the next version when >> posting. >> >> In case it helps, you can also access the patches from my private repo at: >> https://github.com/azhou-nicira/ovs-review/tree/ovsdb-replic >> ation-sm-v2 >> >> > > Hi Andy, > > I am seeing the below crash when > > - The ovsdb-server changes from > master to standby and the active-ovsdb-server it is about to connect to > is killed just before that or it is not reachable. > > - > The pacemaker OCF script calls the sync-status cmd soon after that. > > > Please let me know if you need more information. > > > Core was generated by `ovsdb-server -vdbg > --log-file=/opt/stack/logs/ovsdb-server-sb.log > --remote=puni'. > Program terminated with signal SIGSEGV, Segmentation fault. > #0 0x0041241d in replication_status () at ovsdb/replication.c:875 > 875SHASH_FOR_EACH (node, replication_dbs) { > Missing separate debuginfos, use: dnf debuginfo-install > glibc-2.23.1-10.fc24.x86_64 openssl-libs-1.0.2h-3.fc24.x86_64 > (gdb) bt > #0 0x0041241d in replication_status () at ovsdb/replication.c:875 > #1 0x00406eda in ovsdb_server_get_sync_status (conn=0x1421fd0, > argc=, argv=, config_=) > at ovsdb/ovsdb-server.c:1480 > #2 0x004324ee in process_command (request=0x1421f30, > conn=0x1421fd0) at lib/unixctl.c:313 > #3 run_connection (conn=0x1421fd0) at lib/unixctl.c:347 > #4 unixctl_server_run (server=server@entry=0x141e140) at > lib/unixctl.c:400 > #5 0x00405bdc in main_loop (is_backup=0x7fff08062256, > exiting=0x7fff08062257, run_process=0x0, remotes=0x7fff080622a0, > unixctl=0x141e140, > all_dbs=0x7fff080622e0, jsonrpc=0x13f6f00) at ovsdb/ovsdb-server.c:182 > #6 main (argc=, argv=) at > ovsdb/ovsdb-server.c:430 > > Numan, thanks for the report. I think I spotted the bug: Currently, when replication state machine is reset, the state update takes place after a round of main loop run. this time lag could lead to the back trace in case the unixctl commands was issued during this time lag. I have a fix that add another state to represent the reset condition. The fix is at: https://github.com/azhou-nicira/ovs-review/tree/ovsdb-replication-sm-v3 Would you please let me know if this version works any better?. Thanks! ___ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev
Re: [ovs-dev] [replication SMv2 7/7] ovsdb: Replication usability improvements
On Tue, Aug 30, 2016 at 1:11 AM, Andy Zhouwrote: > > > On Mon, Aug 29, 2016 at 3:14 AM, Numan Siddique > wrote: > >> >> >> On Sat, Aug 27, 2016 at 4:45 AM, Andy Zhou wrote: >> >>> Added the '--no-sync' option base on feedbacks of current >>> implementation. >>> >>> Added appctl command "ovsdb-server/sync-status" based on feedbacks >>> of current implementation. >>> >>> Added a test to simulate the integration of HA manager with OVSDB >>> server using replication. >>> >>> Other documentation and API improvements. >>> >>> Signed-off-by: Andy Zhou >>> -- >>> >>> I hope to get some review comments on the command line and appctl >>> interfaces for replication. Since 2.6 is the first release of those >>> interfaces, it is easier to making changes, compare to future >>> releases. >>> >>> >>> v1->v2: Fix creashes reported at: >>> http://openvswitch.org/pipermail/dev/2016-August/078591.html >>> --- >>> >> >> I haven't tested these patches yet. This patch seems to have a white >> space warning when applied. >> > Thanks for the reported. I will fold the fix in the next version when > posting. > > In case it helps, you can also access the patches from my private repo at: > https://github.com/azhou-nicira/ovs-review/tree/ovsdb- > replication-sm-v2 > > Hi Andy, I am seeing the below crash when - The ovsdb-server changes from master to standby and the active-ovsdb-server it is about to connect to is killed just before that or it is not reachable. - The pacemaker OCF script calls the sync-status cmd soon after that. Please let me know if you need more information. Core was generated by `ovsdb-server -vdbg --log-file=/opt/stack/logs/ovsdb-server-sb.log --remote=puni'. Program terminated with signal SIGSEGV, Segmentation fault. #0 0x0041241d in replication_status () at ovsdb/replication.c:875 875SHASH_FOR_EACH (node, replication_dbs) { Missing separate debuginfos, use: dnf debuginfo-install glibc-2.23.1-10.fc24.x86_64 openssl-libs-1.0.2h-3.fc24.x86_64 (gdb) bt #0 0x0041241d in replication_status () at ovsdb/replication.c:875 #1 0x00406eda in ovsdb_server_get_sync_status (conn=0x1421fd0, argc=, argv=, config_=) at ovsdb/ovsdb-server.c:1480 #2 0x004324ee in process_command (request=0x1421f30, conn=0x1421fd0) at lib/unixctl.c:313 #3 run_connection (conn=0x1421fd0) at lib/unixctl.c:347 #4 unixctl_server_run (server=server@entry=0x141e140) at lib/unixctl.c:400 #5 0x00405bdc in main_loop (is_backup=0x7fff08062256, exiting=0x7fff08062257, run_process=0x0, remotes=0x7fff080622a0, unixctl=0x141e140, all_dbs=0x7fff080622e0, jsonrpc=0x13f6f00) at ovsdb/ovsdb-server.c:182 #6 main (argc=, argv=) at ovsdb/ovsdb-server.c:430 Thanks Numan ___ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev
Re: [ovs-dev] [replication SMv2 7/7] ovsdb: Replication usability improvements
On Mon, Aug 29, 2016 at 3:14 AM, Numan Siddiquewrote: > > > On Sat, Aug 27, 2016 at 4:45 AM, Andy Zhou wrote: > >> Added the '--no-sync' option base on feedbacks of current >> implementation. >> >> Added appctl command "ovsdb-server/sync-status" based on feedbacks >> of current implementation. >> >> Added a test to simulate the integration of HA manager with OVSDB >> server using replication. >> >> Other documentation and API improvements. >> >> Signed-off-by: Andy Zhou >> -- >> >> I hope to get some review comments on the command line and appctl >> interfaces for replication. Since 2.6 is the first release of those >> interfaces, it is easier to making changes, compare to future >> releases. >> >> >> v1->v2: Fix creashes reported at: >> http://openvswitch.org/pipermail/dev/2016-August/078591.html >> --- >> > > I haven't tested these patches yet. This patch seems to have a white > space warning when applied. > Thanks for the reported. I will fold the fix in the next version when posting. In case it helps, you can also access the patches from my private repo at: https://github.com/azhou-nicira/ovs-review/tree/ovsdb-replication-sm-v2 ___ dev mailing list dev@openvswitch.org http://openvswitch.org/mailman/listinfo/dev