Darrell Ball wrote:
> +            LIST_FOR_EACH_SAFE (conn, next, exp_node, &ctb->exp_lists[j]) {
> +                if (!zone || *zone == conn->key.zone) {
> +                    ovs_assert(conn->conn_type == CT_CONN_TYPE_DEFAULT);

why do we need this assert?  
Clean the CT_CONN_TYPE_DEFAULT type in conntrack_flush(), and clean 
CT_CONN_TYPE_UN_NAT in nat_clean() like following:
+                if ((!zone || *zone == conn->key.zone) &&
+                    (conn->conn_type == CT_CONN_TYPE_DEFAULT)) {
+                    //ovs_assert(conn->conn_type == CT_CONN_TYPE_DEFAULT);


with the above code, catch an another panic which in ct_clean thread.
I have see the same panic without changeing the code.
Both ct_clean and flush-conntrack, can catch the bad bucket value. 

#0  0x0000562ae7402553 in hmap_remove (node=0x7f871bdc4258, 
hmap=0x7f9498701c68) at ./include/openvswitch/hmap.h:287
287         while (*bucket != node) {
[Current thread is 1 (Thread 0x7f948ffff700 (LWP 2085796))]
(gdb) bt
#0  0x0000562ae7402553 in hmap_remove (node=0x7f871bdc4258, 
hmap=0x7f9498701c68) at ./include/openvswitch/hmap.h:287
#1  conn_clean (ct=ct@entry=0x7f9498700d98, conn=0x7f871bdc41b0, 
ctb=ctb@entry=0x7f9498701c38) at lib/conntrack.c:815
#2  0x0000562ae7402a28 in sweep_bucket (limit=3906, now=1223168469, 
ctb=<optimized out>, ct=0x7f9498700d98) at lib/conntrack.c:1396
#3  conntrack_clean (now=1223168469, ct=0x7f9498700d98) at lib/conntrack.c:1433
#4  clean_thread_main (f_=0x7f9498700d98) at lib/conntrack.c:1488
#5  0x0000562ae737a48f in ovsthread_wrapper (aux_=<optimized out>) at 
lib/ovs-thread.c:354
#6  0x00007f9497715494 in start_thread (arg=0x7f948ffff700) at 
pthread_create.c:333
#7  0x00007f9496d09acf in clone () at 
../sysdeps/unix/sysv/linux/x86_64/clone.S:97
(gdb) p bucket
$1 = (struct hmap_node **) 0x8  <====== why is bucket not a point value?
(gdb) p *(struct hmap *) 0x7f9498701c68
$2 = {buckets = 0x7f8609f8fc00, one = 0x0, mask = 32767, n = 31040}
(gdb) p *(struct hmap_node *) 0x7f871bdc4258
$3 = {hash = 203059667, next = 0x7f8707cfe6c8}
(gdb) p 203059667&32767
$4 = 29139
(gdb) p &hmap->buckets[29139]
$5 = (struct hmap_node **) 0x7f8609fc8a98


> +                    conn_clean(ct, conn, ctb);
> +                }
>              }
>          }
> -        ct_lock_unlock(&ct->buckets[i].lock);
> +        ct_lock_unlock(&ctb->lock);
>      }
> 
>      return 0;
> 
> 
> which yields conntrack_flush(...) as
> 
> int
> conntrack_flush(struct conntrack *ct, const uint16_t *zone)
> {
>     for (unsigned i = 0; i < CONNTRACK_BUCKETS; i++) {
>         struct conntrack_bucket *ctb = &ct->buckets[i];
>         ct_lock_lock(&ctb->lock);
>         for (unsigned j = 0; j < N_CT_TM; j++) {
>             struct conn *conn, *next;
>             LIST_FOR_EACH_SAFE (conn, next, exp_node, &ctb->exp_lists[j]) {
>                 if (!zone || *zone == conn->key.zone) {
>                     ovs_assert(conn->conn_type == CT_CONN_TYPE_DEFAULT);
>                     conn_clean(ct, conn, ctb);
>                 }
>             }
>         }
>         ct_lock_unlock(&ctb->lock);
>     }
> 
>     return 0;
> }
> 
> Thanks Darrell
> 
> 
> 
> On Wed, Mar 6, 2019 at 8:06 PM solomon <[email protected]> wrote:
> 
>>
>>    When i test conntrack, i catch a panic of ovs.
>>
>> Core was generated by `ovs-vswitchd unix:/var/run/openvswitch/db.sock
>> -vconsole:emer -vsyslog:err -vfi'.
>> Program terminated with signal SIGSEGV, Segmentation fault.
>> #0  0x00005605c5cd7553 in hmap_remove (node=0x7f734cde0218,
>> hmap=0x7f8178c7fd70) at ./include/openvswitch/hmap.h:287
>> 287         while (*bucket != node) {
>> [Current thread is 1 (Thread 0x7f8178dccb00 (LWP 2024338))]
>> (gdb) bt
>> #0  0x00005605c5cd7553 in hmap_remove (node=0x7f734cde0218,
>> hmap=0x7f8178c7fd70) at ./include/openvswitch/hmap.h:287
>> #1  conn_clean (ct=ct@entry=0x7f8178c75d98, conn=0x7f734cde0170,
>> ctb=ctb@entry=0x7f8178c7fd40) at lib/conntrack.c:815
>> #2  0x00005605c5cdd66a in conntrack_flush (ct=0x7f8178c75d98, zone=0x0) at
>> lib/conntrack.c:2549
>> #3  0x00005605c5bc2b39 in ct_dpif_flush (dpif=0x5605c68a6430,
>> zone=zone@entry=0x0, tuple=tuple@entry=0x0) at lib/ct-dpif.c:140
>> #4  0x00005605c5ce17a0 in dpctl_flush_conntrack (argc=argc@entry=1,
>> argv=argv@entry=0x5605c697ec30, dpctl_p=dpctl_p@entry=0x7fffee718110) at
>> lib/dpctl.c:1388
>> #5  0x00005605c5cdeb78 in dpctl_unixctl_handler (conn=0x5605c6959ca0,
>> argc=1, argv=0x5605c697ec30, aux=0x5605c5ce1610 <dpctl_flush_conntrack>) at
>> lib/dpctl.c:2312
>> #6  0x00005605c5c806ea in process_command (request=<optimized out>,
>> conn=0x5605c6959ca0) at lib/unixctl.c:308
>> #7  run_connection (conn=0x5605c6959ca0) at lib/unixctl.c:342
>> #8  unixctl_server_run (server=0x5605c6868230) at lib/unixctl.c:393
>> #9  0x00005605c5804217 in main (argc=<optimized out>, argv=<optimized
>> out>) at vswitchd/ovs-vswitchd.c:126
>>
>>
>> Environment:
>> ovs-2.10.1
>> dpdk-18.0.2.2
>>
>> How-To-Repeat:
>> 1. configure ovs with snat aciton.
>>
>> ovs-ofctl  -O OpenFlow15 add-group $br_name "group_id=1, type=select,
>> selection_method=hash
>> bucket=bucket_id=1,weight:100,actions=ct(nat(src=172.16.1.1-172.255.255.255),commit,table=40)
>> "
>>
>> 2. syn-ddos send tcp syn packet to generate connection tracks.
>> 3.
>> #   ovs-appctl dpctl/ct-get-nconns
>> 2063993
>> #   ovs-appctl dpctl/flush-conntrack
>>
>> 2019-03-07T03:52:24Z|00001|unixctl|WARN|error communicating with
>> unix:/var/run/openvswitch/ovs-vswitchd.2024338.ctl: End of file
>> ovs-appctl: ovs-vswitchd: transaction error (End of file)
>>
>>
>> --
>> Thanks
>> Solomon
>>
> 

-- 

Thanks
Solomon
_______________________________________________
dev mailing list
[email protected]
https://mail.openvswitch.org/mailman/listinfo/ovs-dev

Reply via email to