Hi All, I'll adjust a test for this situation in a new patch.
Em qua., 24 de set. de 2025 às 16:22, Mark Michelson <[email protected]> escreveu: > On Wed, Sep 24, 2025 at 11:12 AM Numan Siddique <[email protected]> wrote: > > > > On Wed, Sep 24, 2025 at 8:08 AM Lucas Vargas Dias via dev > > <[email protected]> wrote: > > > > > > Change the logic to save sbflow uuid and just update if > > > the lflow is reused. Otherwise, it's removed. > > > Also, reduce sbflow searching with uuidset instead of > > > searching through all lflow table. > > > Add lflow states: > > > LFLOW_STALE - Lflow is not relevant > > > LFLOW_TO_SYNC - Lflow needs to be synced with SB DB > > > LFLOW_SYNCED - Lflow is synced with SB SB > > > > > > It generates the following results in a scenario with: > > > - LSPs: 56548 > > > - LRPs: 27304 > > > - LRs: 7922 > > > - LSs: 28602 > > > > > > Without the commit: > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb lr-add lr9-1 > > > Time spent on processing nb_cfg 275438: > > > ovn-northd delay before processing: 16069ms > > > ovn-northd completion: 32828ms > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb ls-add bar9-1 > > > Time spent on processing nb_cfg 275439: > > > ovn-northd delay before processing: 15019ms > > > ovn-northd completion: 33207ms > > > > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb lrp-add lr9-1 > rp9-1-bar 00:01:ff:22:00:01 192.168.10.1/24 2801:80:3eaf:4401::1/64 > > > Time spent on processing nb_cfg 275440: > > > ovn-northd delay before processing: 14784ms > > > ovn-northd completion: 33577ms > > > > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb lsp-add bar9-1 > bar-rp9-1 -- set Logical_Switch_Port bar-rp9-1 type=router > options:router-port=rp9-1-bar -- lsp-set-addresses bar-rp9-1 router > > > Time spent on processing nb_cfg 275441: > > > ovn-northd delay before processing: 14598ms > > > ovn-northd completion: 31942ms > > > > > > With the commit: > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb lr-add lr9-1 > > > Time spent on processing nb_cfg 275401: > > > ovn-northd delay before processing: 12602ms > > > ovn-northd completion: 26103ms > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb ls-add bar9-1 > > > Time spent on processing nb_cfg 275402: > > > ovn-northd delay before processing: 12639ms > > > ovn-northd completion: 26759ms > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb lrp-add lr9-1 > rp9-1-bar 00:01:ff:22:00:01 192.168.10.1/24 2801:80:3eaf:4401::1/64 > > > Time spent on processing nb_cfg 275403: > > > ovn-northd delay before processing: 11874ms > > > ovn-northd completion: 29733ms > > > ovn-nbctl --no-leader-only --print-wait-time --wait=sb lsp-add bar9-1 > bar-rp9-1 -- set Logical_Switch_Port bar-rp9-1 type=router > options:router-port=rp9-1-bar -- lsp-set-addresses bar-rp9-1 router > > > Time spent on processing nb_cfg 275404: > > > ovn-northd delay before processing: 4058ms > > > ovn-northd completion: 17323ms > > > > > > Signed-off-by: Lucas Vargas Dias <[email protected]> > > > > Thanks for the improvements. > > > > I've one comment below. > > > > Numan > > > > > --- > > > northd/en-lflow.c | 2 +- > > > northd/lflow-mgr.c | 108 +++++++++++++++++++++++++-------------------- > > > northd/lflow-mgr.h | 10 ++++- > > > northd/northd.c | 8 ++-- > > > 4 files changed, 74 insertions(+), 54 deletions(-) > > > > > > diff --git a/northd/en-lflow.c b/northd/en-lflow.c > > > index 50570b611..13c5e3119 100644 > > > --- a/northd/en-lflow.c > > > +++ b/northd/en-lflow.c > > > @@ -122,7 +122,7 @@ en_lflow_run(struct engine_node *node, void *data) > > > stopwatch_start(BUILD_LFLOWS_STOPWATCH_NAME, time_msec()); > > > > > > struct lflow_data *lflow_data = data; > > > - lflow_table_clear(lflow_data->lflow_table); > > > + lflow_table_clear(lflow_data->lflow_table, false); > > > lflow_reset_northd_refs(&lflow_input); > > > lflow_ref_clear(lflow_input.igmp_lflow_ref); > > > > > > diff --git a/northd/lflow-mgr.c b/northd/lflow-mgr.c > > > index 6a66a9718..95986f9c3 100644 > > > --- a/northd/lflow-mgr.c > > > +++ b/northd/lflow-mgr.c > > > @@ -26,6 +26,7 @@ > > > #include "lflow-mgr.h" > > > #include "lib/ovn-parallel-hmap.h" > > > #include "lib/ovn-util.h" > > > +#include "lib/uuidset.h" > > > > > > VLOG_DEFINE_THIS_MODULE(lflow_mgr); > > > > > > @@ -37,7 +38,8 @@ static void ovn_lflow_init(struct ovn_lflow *, > struct ovn_datapath *od, > > > uint16_t priority, char *match, > > > char *actions, char *io_port, > > > char *ctrl_meter, char *stage_hint, > > > - const char *where, const char *flow_desc); > > > + const char *where, const char *flow_desc, > > > + struct uuid sbuuid); > > > static struct ovn_lflow *ovn_lflow_find(const struct hmap *lflows, > > > enum ovn_stage stage, > > > uint16_t priority, const char > *match, > > > @@ -147,6 +149,13 @@ static struct ovs_mutex > lflow_hash_locks[LFLOW_HASH_LOCK_MASK + 1]; > > > */ > > > extern struct ovs_mutex fake_hash_mutex; > > > > > > + > > > +enum ovn_lflow_state { > > > + LFLOW_STALE, > > > + LFLOW_TO_SYNC, > > > + LFLOW_SYNCED, > > > +}; > > > + > > > /* Represents a logical ovn flow (lflow). > > > * > > > * A logical flow with match 'M' and actions 'A' - L(M, A) is created > > > @@ -181,14 +190,7 @@ struct ovn_lflow { > > > struct hmap dp_refcnts_map; /* Maintains the number of times this > ovn_lflow > > > * is referenced by a given datapath. > > > * Contains 'struct dp_refcnt' in the > map. */ > > > -}; > > > - > > > -/* Logical flow table. */ > > > -struct lflow_table { > > > - struct hmap entries; /* hmap of lflows. */ > > > - struct hmap ls_dp_groups; /* hmap of logical switch dp groups. */ > > > - struct hmap lr_dp_groups; /* hmap of logical router dp groups. */ > > > - ssize_t max_seen_lflow_size; > > > + enum ovn_lflow_state sync_state; > > > }; > > > > > > struct lflow_table * > > > @@ -210,10 +212,15 @@ lflow_table_init(struct lflow_table *lflow_table) > > > } > > > > > > void > > > -lflow_table_clear(struct lflow_table *lflow_table) > > > +lflow_table_clear(struct lflow_table *lflow_table, bool destroy_all) > > > { > > > struct ovn_lflow *lflow; > > > HMAP_FOR_EACH_SAFE (lflow, hmap_node, &lflow_table->entries) { > > > + ovs_assert(lflow->sync_state == LFLOW_SYNCED); > > > > Lets say a logical port change (or any change) was handled incrementally > and > > a new logical flow was added to the lflow_table. Its state when added > > would be LFLOW_TO_SYNC. > > > > And for some reason the function lflow_ref_sync_lflows__() returns > > false [1] . The state of this new flow > > will not be LFLOW_SYNCED. The engine will fall back to recompute and > > we would assert here. > > Good catch, Numan. As the comment says, this won't ever happen under > normal operation. But if someone mistakenly (or maliciously) removes > the Logical_DP_Group from the southbound database, then this could > result in a crash due to the assertion. > > The flow being in an unexpected state is interesting here. I suppose > in this case we should mark the flow as LFLOW_STALE and continue. If > the flow ends up being relevant, we'll end up using it during the > recompute. If not, then it will be deleted when syncing. > > > > > I think this scenario should be addressed. Please see the comments in > [1]. > > > > [1] - https://github.com/ovn-org/ovn/blob/main/northd/lflow-mgr.c#L1135 > > > > > > Other than that the patch LGTM. > > > > Thanks > > Numan > > > > > > > > > + if (!destroy_all) { > > > + lflow->sync_state = LFLOW_STALE; > > > + continue; > > > + } > > > ovn_lflow_destroy(lflow_table, lflow); > > > } > > > > > > @@ -224,7 +231,7 @@ lflow_table_clear(struct lflow_table *lflow_table) > > > void > > > lflow_table_destroy(struct lflow_table *lflow_table) > > > { > > > - lflow_table_clear(lflow_table); > > > + lflow_table_clear(lflow_table, true); > > > hmap_destroy(&lflow_table->entries); > > > ovn_dp_groups_destroy(&lflow_table->ls_dp_groups); > > > ovn_dp_groups_destroy(&lflow_table->lr_dp_groups); > > > @@ -257,16 +264,42 @@ lflow_table_sync_to_sb(struct lflow_table > *lflow_table, > > > const struct sbrec_logical_flow_table > *sb_flow_table, > > > const struct sbrec_logical_dp_group_table > *dpgrp_table) > > > { > > > + struct uuidset sb_uuid_set = UUIDSET_INITIALIZER(&sb_uuid_set); > > > struct hmap lflows_temp = HMAP_INITIALIZER(&lflows_temp); > > > struct hmap *lflows = &lflow_table->entries; > > > struct ovn_lflow *lflow; > > > + const struct sbrec_logical_flow *sbflow; > > > > > > fast_hmap_size_for(&lflows_temp, > > > lflow_table->max_seen_lflow_size); > > > > > > + HMAP_FOR_EACH_SAFE (lflow, hmap_node, lflows) { > > > + if (lflow->sync_state == LFLOW_STALE) { > > > + ovn_lflow_destroy(lflow_table, lflow); > > > + continue; > > > + } > > > + sbflow = NULL; > > > + if (!uuid_is_zero(&lflow->sb_uuid)) { > > > + sbflow = > sbrec_logical_flow_table_get_for_uuid(sb_flow_table, > > > + > &lflow->sb_uuid); > > > + } > > > + sync_lflow_to_sb(lflow, ovnsb_txn, lflow_table, ls_datapaths, > > > + lr_datapaths, ovn_internal_version_changed, > > > + sbflow, dpgrp_table); > > > + uuidset_insert(&sb_uuid_set, &lflow->sb_uuid); > > > + hmap_remove(lflows, &lflow->hmap_node); > > > + hmap_insert(&lflows_temp, &lflow->hmap_node, > > > + hmap_node_hash(&lflow->hmap_node)); > > > + } > > > /* Push changes to the Logical_Flow table to database. */ > > > - const struct sbrec_logical_flow *sbflow; > > > SBREC_LOGICAL_FLOW_TABLE_FOR_EACH_SAFE (sbflow, sb_flow_table) { > > > + struct uuidset_node *node = uuidset_find(&sb_uuid_set, > > > + > &sbflow->header_.uuid); > > > + if (!node) { > > > + sbrec_logical_flow_delete(sbflow); > > > + continue; > > > + } > > > + uuidset_delete(&sb_uuid_set, node); > > > struct sbrec_logical_dp_group *dp_group = > sbflow->logical_dp_group; > > > struct ovn_datapath *logical_datapath_od = NULL; > > > size_t i; > > > @@ -297,38 +330,8 @@ lflow_table_sync_to_sb(struct lflow_table > *lflow_table, > > > sbrec_logical_flow_delete(sbflow); > > > continue; > > > } > > > - > > > - enum ovn_pipeline pipeline > > > - = !strcmp(sbflow->pipeline, "ingress") ? P_IN : P_OUT; > > > - > > > - lflow = ovn_lflow_find( > > > - lflows, > > > - > ovn_stage_build(ovn_datapath_get_type(logical_datapath_od), > > > - pipeline, sbflow->table_id), > > > - sbflow->priority, sbflow->match, sbflow->actions, > > > - sbflow->controller_meter, sbflow->hash); > > > - if (lflow) { > > > - sync_lflow_to_sb(lflow, ovnsb_txn, lflow_table, > ls_datapaths, > > > - lr_datapaths, > ovn_internal_version_changed, > > > - sbflow, dpgrp_table); > > > - > > > - hmap_remove(lflows, &lflow->hmap_node); > > > - hmap_insert(&lflows_temp, &lflow->hmap_node, > > > - hmap_node_hash(&lflow->hmap_node)); > > > - } else { > > > - sbrec_logical_flow_delete(sbflow); > > > - } > > > - } > > > - > > > - HMAP_FOR_EACH_SAFE (lflow, hmap_node, lflows) { > > > - sync_lflow_to_sb(lflow, ovnsb_txn, lflow_table, ls_datapaths, > > > - lr_datapaths, ovn_internal_version_changed, > > > - NULL, dpgrp_table); > > > - > > > - hmap_remove(lflows, &lflow->hmap_node); > > > - hmap_insert(&lflows_temp, &lflow->hmap_node, > > > - hmap_node_hash(&lflow->hmap_node)); > > > } > > > + uuidset_destroy(&sb_uuid_set); > > > hmap_swap(lflows, &lflows_temp); > > > hmap_destroy(&lflows_temp); > > > } > > > @@ -847,7 +850,7 @@ ovn_lflow_init(struct ovn_lflow *lflow, struct > ovn_datapath *od, > > > size_t dp_bitmap_len, enum ovn_stage stage, uint16_t > priority, > > > char *match, char *actions, char *io_port, char > *ctrl_meter, > > > char *stage_hint, const char *where, > > > - const char *flow_desc) > > > + const char *flow_desc, struct uuid sbuuid) > > > { > > > lflow->dpg_bitmap = bitmap_allocate(dp_bitmap_len); > > > lflow->od = od; > > > @@ -861,7 +864,8 @@ ovn_lflow_init(struct ovn_lflow *lflow, struct > ovn_datapath *od, > > > lflow->flow_desc = flow_desc; > > > lflow->dpg = NULL; > > > lflow->where = where; > > > - lflow->sb_uuid = UUID_ZERO; > > > + lflow->sb_uuid = sbuuid; > > > + lflow->sync_state = LFLOW_TO_SYNC; > > > hmap_init(&lflow->dp_refcnts_map); > > > ovs_list_init(&lflow->referenced_by); > > > } > > > @@ -957,13 +961,18 @@ do_ovn_lflow_add(struct lflow_table > *lflow_table, size_t dp_bitmap_len, > > > { > > > struct ovn_lflow *old_lflow; > > > struct ovn_lflow *lflow; > > > + struct uuid sbuuid = UUID_ZERO; > > > > > > ovs_assert(dp_bitmap_len); > > > > > > old_lflow = ovn_lflow_find(&lflow_table->entries, stage, > > > priority, match, actions, ctrl_meter, > hash); > > > if (old_lflow) { > > > - return old_lflow; > > > + if (old_lflow->sync_state != LFLOW_STALE) { > > > + return old_lflow; > > > + } > > > + sbuuid = old_lflow->sb_uuid; > > > + ovn_lflow_destroy(lflow_table, old_lflow); > > > } > > > > > > lflow = xzalloc(sizeof *lflow); > > > @@ -975,14 +984,16 @@ do_ovn_lflow_add(struct lflow_table > *lflow_table, size_t dp_bitmap_len, > > > io_port ? xstrdup(io_port) : NULL, > > > nullable_xstrdup(ctrl_meter), > > > ovn_lflow_hint(stage_hint), where, > > > - flow_desc); > > > + flow_desc, sbuuid); > > > > > > if (parallelization_state != STATE_USE_PARALLELIZATION) { > > > hmap_insert(&lflow_table->entries, &lflow->hmap_node, hash); > > > } else { > > > hmap_insert_fast(&lflow_table->entries, &lflow->hmap_node, > > > hash); > > > - thread_lflow_counter++; > > > + if (uuid_is_zero(&lflow->sb_uuid)) { > > > + thread_lflow_counter++; > > > + } > > > } > > > > > > return lflow; > > > @@ -1169,6 +1180,7 @@ sync_lflow_to_sb(struct ovn_lflow *lflow, > > > ovn_dp_group_release(dp_groups, pre_sync_dpg); > > > } > > > > > > + lflow->sync_state = LFLOW_SYNCED; > > > return true; > > > } > > > > > > diff --git a/northd/lflow-mgr.h b/northd/lflow-mgr.h > > > index 1521270d6..91708c8a9 100644 > > > --- a/northd/lflow-mgr.h > > > +++ b/northd/lflow-mgr.h > > > @@ -26,10 +26,16 @@ struct ovn_datapath; > > > struct ovsdb_idl_row; > > > > > > /* lflow map which stores the logical flows. */ > > > -struct lflow_table; > > > +struct lflow_table { > > > + struct hmap entries; /* hmap of lflows. */ > > > + struct hmap ls_dp_groups; /* hmap of logical switch dp groups. */ > > > + struct hmap lr_dp_groups; /* hmap of logical router dp groups. */ > > > + ssize_t max_seen_lflow_size; > > > +}; > > > + > > > struct lflow_table *lflow_table_alloc(void); > > > void lflow_table_init(struct lflow_table *); > > > -void lflow_table_clear(struct lflow_table *); > > > +void lflow_table_clear(struct lflow_table *, bool); > > > void lflow_table_destroy(struct lflow_table *); > > > void lflow_table_expand(struct lflow_table *); > > > void lflow_table_set_size(struct lflow_table *, size_t); > > > diff --git a/northd/northd.c b/northd/northd.c > > > index 8b5413ef3..6dbf22e18 100644 > > > --- a/northd/northd.c > > > +++ b/northd/northd.c > > > @@ -18012,9 +18012,9 @@ noop_callback(struct worker_pool *pool > OVS_UNUSED, > > > static void > > > fix_flow_table_size(struct lflow_table *lflow_table, > > > struct lswitch_flow_build_info *lsiv, > > > - size_t n_lsiv) > > > + size_t n_lsiv, size_t start) > > > { > > > - size_t total = 0; > > > + size_t total = start; > > > for (size_t i = 0; i < n_lsiv; i++) { > > > total += lsiv[i].thread_lflow_counter; > > > } > > > @@ -18089,8 +18089,10 @@ build_lswitch_and_lrouter_flows( > > > } > > > > > > /* Run thread pool. */ > > > + size_t current_lflow_table_size = > hmap_count(&lflows->entries); > > > run_pool_callback(build_lflows_pool, NULL, NULL, > noop_callback); > > > - fix_flow_table_size(lflows, lsiv, build_lflows_pool->size); > > > + fix_flow_table_size(lflows, lsiv, build_lflows_pool->size, > > > + current_lflow_table_size); > > > > > > for (index = 0; index < build_lflows_pool->size; index++) { > > > ds_destroy(&lsiv[index].match); > > > -- > > > 2.34.1 > > > > > > > > > -- > > > > > > > > > > > > > > > _'Esta mensagem é direcionada apenas para os endereços constantes no > > > cabeçalho inicial. Se você não está listado nos endereços constantes no > > > cabeçalho, pedimos-lhe que desconsidere completamente o conteúdo dessa > > > mensagem e cuja cópia, encaminhamento e/ou execução das ações citadas > estão > > > imediatamente anuladas e proibidas'._ > > > > > > > > > * **'Apesar do Magazine Luiza tomar > > > todas as precauções razoáveis para assegurar que nenhum vírus esteja > > > presente nesse e-mail, a empresa não poderá aceitar a responsabilidade > por > > > quaisquer perdas ou danos causados por esse e-mail ou por seus > anexos'.* > > > > > > > > > > > > _______________________________________________ > > > dev mailing list > > > [email protected] > > > https://mail.openvswitch.org/mailman/listinfo/ovs-dev > > _______________________________________________ > > dev mailing list > > [email protected] > > https://mail.openvswitch.org/mailman/listinfo/ovs-dev > > -- _‘Esta mensagem é direcionada apenas para os endereços constantes no cabeçalho inicial. Se você não está listado nos endereços constantes no cabeçalho, pedimos-lhe que desconsidere completamente o conteúdo dessa mensagem e cuja cópia, encaminhamento e/ou execução das ações citadas estão imediatamente anuladas e proibidas’._ * **‘Apesar do Magazine Luiza tomar todas as precauções razoáveis para assegurar que nenhum vírus esteja presente nesse e-mail, a empresa não poderá aceitar a responsabilidade por quaisquer perdas ou danos causados por esse e-mail ou por seus anexos’.* _______________________________________________ dev mailing list [email protected] https://mail.openvswitch.org/mailman/listinfo/ovs-dev
