On 9/23/21 11:47 AM, Mark Gray wrote:
> On 16/09/2021 16:37, Dumitru Ceara wrote:
>> Add a new command, 'ovsdb-server/log-db-ops DB TABLE on|off', which
>> allows the user to enable/disable transaction logging for specific
>> databases and tables.
>>
>> By default, logging is disabled. Once enabled, logs are generated
>> with level INFO and are also rate limited.
>>
>> If used with care, this command can be useful in analyzing production
>> deployment performance issues, allowing the user to pin point
>> bottlenecks without the need to enable wider debug logs, e.g., jsonrpc.
>>
>> Signed-off-by: Dumitru Ceara <[email protected]>
>> ---
>> A sample use case is an ovn-kubernetes scaled deployment in which
>> we're interesting in reducing time to bring up PODs (represented by
>> OVN logical switch ports). In order to determine exactly where the
>> bottleneck is when provisioning PODs (CMS/ovn-nbctl/client
>> IDLs/ovsdb-server/ovn-controller/etc) we need timestamps of when
>> operations happen at various places in the stack.
>>
>> Without this patch the only option for tracking when transactions
>> happen in the Northbound database is to enable jsonrpc debug logs in
>> ovsdb-server. This generates a rather large amount of data.
>>
>> Instead, now, users would be able to just enable logging for the
>> Logical_Switch_Port table getting more relevant and precise
>> information.
>>
>> V2:
>> - rebased (fixed conflicts in NEWS).
>
> Generally ok and I also did a quick test but I have a few comments on
> the UI which wouldn't block an ACK and one small comment below in the code:
Thanks for the review!
>
> * My personal preference would be that the syntax somewhat followed the
> vlog/set one that seperates terms by ':'. Also a more memorable name.
> For example tlog/set DB:TABLE:on or something like that but feel free to
> choose to completely ignore me :) I just find it can sometimes be
> difficult to remember syntax so I prefer some consistency.
Makes sense, I'll try to come up with something more consistent.
>
> * One limititation is that I don't think we can add a log before a table
> is created (i.e. to see the first entry). For example, let's say I want
> to log the very first creation of a bridge using `ovs-vsctl add-br br0`.
> Is there any way around it? I couldn't think of one.
I'm not sure I follow this point. If you enable logging for the Bridge
table before adding the first bridge you'll get the log, e.g., in an OVS
sandbox:
$ ovs-appctl -t $PWD/sandbox/ovsdb-server.*.ctl ovsdb-server/log-db-ops
Open_vSwitch Bridge on
$ ovs-vsctl show
f43b960d-13f9-48d1-9898-703a3ac3f730
$ ovs-vsctl add-br br9
$ grep transaction sandbox/ovsdb-server.log
2021-09-23T09:57:53.426Z|00007|transaction|INFO|table:Bridge,op:inserted,name:br9,flood_vlans:[],auto_attach:[],ports:[388811ab-13a9-4b50-92c6-10c00a8b5e42],stp_enable:false,rstp_enable:false,_uuid:8a59955c-2a3c-41d3-9e93-66845e51878b,fail_mode:[],rstp_status:{},flow_tables:{},_version:1603df48-3730-4324-a71b-1c9f7eadb865,netflow:[],datapath_type:"",controller:[],other_config:{},external_ids:{},status:{},ipfix:[],datapath_id:[],mirrors:[],mcast_snooping_enable:false,datapath_version:"",sflow:[],protocols:[]
[...]
>
> * Could you make the table and database names case-insensitive. For
> example, I can do `ovs-vsctl list open` even though the table name is
> actually Open_vSwitch? However, with this I need to specify the actual
> table name.
>
We can try but this would be a bit of an orthogonal change. For example
"ovs-appctl -t .. ovsdb-server/remove-db DB" only accepts case sensitive
database names (due to case sensitive shash operations). For using
partial table names and/or, it would also be a more complex change.
What you mentioned (ovs-vsctl) is actually implemented on the client
side in db-ctl-base.c, get_table()/score_partial_match().
> * There is no way to list the tables that are currently 'on'. Something
> like vlog/list.
Good idea, I'll add it!
>
> * Would it be useful to be able to specify which level to log to?: e.g.
>
> tlog/set DB:TABLE:info
>
I'm not sure, maybe, although my initial goal was to make this usable at
a reasonably high log level, e.g., INFO. I don't think there's a point
to use an even higher level though (like WARN/ERR), though.
> * Would it be useful to have a non-verbose mode that only states that an
> insert/delete/update happened?
>
Would you log some row specific information then? E.g., _uuid.
>> ---
>> NEWS | 4 ++++
>> ovsdb/ovsdb-server.c | 38 +++++++++++++++++++++++++++++++++
>> ovsdb/row.c | 17 +++++++++++++++
>> ovsdb/row.h | 1 +
>> ovsdb/table.c | 7 ++++++
>> ovsdb/table.h | 3 +++
>> ovsdb/transaction.c | 51 ++++++++++++++++++++++++++++++++++++++++++++
>> 7 files changed, 121 insertions(+)
>>
>> diff --git a/NEWS b/NEWS
>> index 90f4b15902b8..d56329772276 100644
>> --- a/NEWS
>> +++ b/NEWS
>> @@ -10,6 +10,10 @@ Post-v2.16.0
>> limiting behavior.
>> * Add hardware offload support for matching IPv4/IPv6 frag types
>> (experimental).
>> + - OVSDB:
>> + * New unixctl command 'ovsdb-server/log-db-ops DB TABLE on|off".
>> + If turned on, ovsdb-server will log (at level INFO and rate limited)
>> + all operations that are committed to table TABLE in the DB database.
>>
>>
>> v2.16.0 - 16 Aug 2021
>> diff --git a/ovsdb/ovsdb-server.c b/ovsdb/ovsdb-server.c
>> index 0b3d2bb71432..c48645f7e255 100644
>> --- a/ovsdb/ovsdb-server.c
>> +++ b/ovsdb/ovsdb-server.c
>> @@ -115,6 +115,7 @@ static unixctl_cb_func ovsdb_server_list_remotes;
>> static unixctl_cb_func ovsdb_server_add_database;
>> static unixctl_cb_func ovsdb_server_remove_database;
>> static unixctl_cb_func ovsdb_server_list_databases;
>> +static unixctl_cb_func ovsdb_server_log_db_ops;
>>
>> static void read_db(struct server_config *, struct db *);
>> static struct ovsdb_error *open_db(struct server_config *,
>> @@ -443,6 +444,8 @@ main(int argc, char *argv[])
>> ovsdb_server_remove_database, &server_config);
>> unixctl_command_register("ovsdb-server/list-dbs", "", 0, 0,
>> ovsdb_server_list_databases, &all_dbs);
>> + unixctl_command_register("ovsdb-server/log-db-ops", "DB TABLE on|off",
>> + 3, 3, ovsdb_server_log_db_ops, &all_dbs);
>> unixctl_command_register("ovsdb-server/perf-counters-show", "", 0, 0,
>> ovsdb_server_perf_counters_show, NULL);
>> unixctl_command_register("ovsdb-server/perf-counters-clear", "", 0, 0,
>> @@ -1769,6 +1772,41 @@ ovsdb_server_list_databases(struct unixctl_conn
>> *conn, int argc OVS_UNUSED,
>> ds_destroy(&s);
>> }
>>
>> +static void
>> +ovsdb_server_log_db_ops(struct unixctl_conn *conn, int argc OVS_UNUSED,
>> + const char *argv[], void *all_dbs_)
>> +{
>> + struct shash *all_dbs = all_dbs_;
>> + const char *db_name = argv[1];
>> + const char *tbl_name = argv[2];
>> + const char *command = argv[3];
>> + bool log;
>> +
>> + if (!strcmp(command, "on")) {
>> + log = true;
>> + } else if (!strcmp(command, "off")) {
>> + log = false;
>> + } else {
>> + unixctl_command_reply_error(conn, "invalid argument");
>> + return;
>> + }
>> +
>> + struct db *db = shash_find_data(all_dbs, db_name);
>> + if (!db) {
>> + unixctl_command_reply_error(conn, "no such database");
>> + return;
>> + }
>> +
>> + struct ovsdb_table *table = ovsdb_get_table(db->db, tbl_name);
>> + if (!table) {
>> + unixctl_command_reply_error(conn, "no such table");
>> + return;
>> + }
>> +
>> + ovsdb_table_log_ops(table, log);
>> + unixctl_command_reply(conn, NULL);
>> +}
>> +
>> static void
>> ovsdb_server_get_sync_status(struct unixctl_conn *conn, int argc OVS_UNUSED,
>> const char *argv[] OVS_UNUSED, void *config_)
>> diff --git a/ovsdb/row.c b/ovsdb/row.c
>> index 65a0546211c8..5e31716506bc 100644
>> --- a/ovsdb/row.c
>> +++ b/ovsdb/row.c
>> @@ -278,6 +278,23 @@ ovsdb_row_to_json(const struct ovsdb_row *row,
>> }
>> return json;
>> }
>> +
>> +void
>> +ovsdb_row_to_string(const struct ovsdb_row *row, struct ds *out)
>> +{
>> + struct shash_node *node;
>> +
>> + SHASH_FOR_EACH (node, &row->table->schema->columns) {
>> + const struct ovsdb_column *column = node->data;
>> +
>> + ds_put_format(out, "%s:", column->name);
>> + ovsdb_datum_to_string(&row->fields[column->index], &column->type,
>> out);
>> + ds_put_cstr(out, ",");
>> + }
>> + if (shash_count(&row->table->schema->columns)) {
>> + ds_chomp(out, ',');
>> + }
>> +}
>>
>> void
>> ovsdb_row_set_init(struct ovsdb_row_set *set)
>> diff --git a/ovsdb/row.h b/ovsdb/row.h
>> index 394ac8eb49b6..f22a08ecd197 100644
>> --- a/ovsdb/row.h
>> +++ b/ovsdb/row.h
>> @@ -95,6 +95,7 @@ struct ovsdb_error *ovsdb_row_from_json(struct ovsdb_row *,
>> OVS_WARN_UNUSED_RESULT;
>> struct json *ovsdb_row_to_json(const struct ovsdb_row *,
>> const struct ovsdb_column_set *include);
>> +void ovsdb_row_to_string(const struct ovsdb_row *, struct ds *);
>>
>> static inline const struct uuid *
>> ovsdb_row_get_uuid(const struct ovsdb_row *row)
>> diff --git a/ovsdb/table.c b/ovsdb/table.c
>> index 455a3663fe89..b7b41d139914 100644
>> --- a/ovsdb/table.c
>> +++ b/ovsdb/table.c
>> @@ -301,10 +301,17 @@ ovsdb_table_create(struct ovsdb_table_schema *ts)
>> hmap_init(&table->indexes[i]);
>> }
>> hmap_init(&table->rows);
>> + table->log = false;
>>
>> return table;
>> }
>>
>> +void
>> +ovsdb_table_log_ops(struct ovsdb_table *table, bool enabled)
>> +{
>> + table->log = enabled;
>> +}
>> +
>> void
>> ovsdb_table_destroy(struct ovsdb_table *table)
>> {
>> diff --git a/ovsdb/table.h b/ovsdb/table.h
>> index ce69a5d130bf..be88b7a59279 100644
>> --- a/ovsdb/table.h
>> +++ b/ovsdb/table.h
>> @@ -63,10 +63,13 @@ struct ovsdb_table {
>> * ovsdb_row"s. Each of the hmap_nodes in indexes[i] are at index 'i'
>> at
>> * the end of struct ovsdb_row, following the 'fields' member. */
>> struct hmap *indexes;
>> +
>> + bool log; /* True if logging is enabled for this table. */
>> };
>>
>> struct ovsdb_table *ovsdb_table_create(struct ovsdb_table_schema *);
>> void ovsdb_table_destroy(struct ovsdb_table *);
>> +void ovsdb_table_log_ops(struct ovsdb_table *, bool);
>>
>> const struct ovsdb_row *ovsdb_table_get_row(const struct ovsdb_table *,
>> const struct uuid *);
>> diff --git a/ovsdb/transaction.c b/ovsdb/transaction.c
>> index 8ffefcf7c9d0..dc07e9c00a4b 100644
>> --- a/ovsdb/transaction.c
>> +++ b/ovsdb/transaction.c
>> @@ -29,6 +29,7 @@
>> #include "openvswitch/vlog.h"
>> #include "ovsdb-error.h"
>> #include "ovsdb.h"
>> +#include "ovs-thread.h"
>> #include "row.h"
>> #include "storage.h"
>> #include "table.h"
>> @@ -95,6 +96,7 @@ struct ovsdb_txn_row {
>> static struct ovsdb_error * OVS_WARN_UNUSED_RESULT
>> delete_garbage_row(struct ovsdb_txn *txn, struct ovsdb_txn_row *r);
>> static void ovsdb_txn_row_prefree(struct ovsdb_txn_row *);
>> +static void ovsdb_txn_row_log(const struct ovsdb_txn_row *);
>> static struct ovsdb_error * OVS_WARN_UNUSED_RESULT
>> for_each_txn_row(struct ovsdb_txn *txn,
>> struct ovsdb_error *(*)(struct ovsdb_txn *,
>> @@ -104,6 +106,11 @@ for_each_txn_row(struct ovsdb_txn *txn,
>> * processed. */
>> static unsigned int serial;
>>
>> +/* Used by ovsdb_txn_row_log() to avoid reallocating dynamic strings
>> + * every time a row operation is logged.
>> + */
>> +DEFINE_STATIC_PER_THREAD_DATA(struct ds, row_log_str, DS_EMPTY_INITIALIZER);
>> +
>> struct ovsdb_txn *
>> ovsdb_txn_create(struct ovsdb *db)
>> {
>> @@ -422,6 +429,49 @@ update_ref_counts(struct ovsdb_txn *txn)
>> return for_each_txn_row(txn, check_ref_count);
>> }
>>
>> +static void
>> +ovsdb_txn_row_log(const struct ovsdb_txn_row *txn_row)
>> +{
>> + static struct vlog_rate_limit rl_insert = VLOG_RATE_LIMIT_INIT(30, 60);
>> + static struct vlog_rate_limit rl_update = VLOG_RATE_LIMIT_INIT(30, 60);
>> + static struct vlog_rate_limit rl_delete = VLOG_RATE_LIMIT_INIT(30, 60);
>
> Why do you have 3 different rate limiters?
>
Updates usually happen more often than inserts/deletes (e.g., with OVN
logical ports). I thought it would be more useful if updates didn't
consume tokens from the insert/delete logs. But thinking more about it,
a rate of 30 logs per minute with a burst of 60 doesn't allow too many
transactions to be logged often either. So, I guess, in a real use case
the user would disable rate limiting for the "transaction" vlog module.
We might as well move to a single rate limiter then.
I'll change it in the next revision.
>> +
>> + if (!txn_row->table->log) {
>> + return;
>> + }
>> +
>> + size_t n_columns = shash_count(&txn_row->table->schema->columns);
>> + struct ovsdb_row *log_row;
>> + const char *op = NULL;
>> +
>> + if (!txn_row->old && txn_row->new) {
>> + if (!vlog_should_drop(&this_module, VLL_INFO, &rl_insert)) {
>> + log_row = txn_row->new;
>> + op = "inserted";
>> + }
>> + } else if (txn_row->old && txn_row->new
>> + && !bitmap_is_all_zeros(txn_row->changed, n_columns)) {
>> + if (!vlog_should_drop(&this_module, VLL_INFO, &rl_update)) {
>> + log_row = txn_row->new;
>> + op = "updated";
>> + }
>> + } else if (txn_row->old && !txn_row->new) {
>> + if (!vlog_should_drop(&this_module, VLL_INFO, &rl_delete)) {
>> + log_row = txn_row->old;
>> + op = "deleted";
>> + }
>> + }
>> +
>> + if (op) {
>> + struct ds *ds = row_log_str_get();
>> + ds_clear(ds);
>> + ds_put_format(ds, "table:%s,op:%s,", txn_row->table->schema->name,
>> + op);
>> + ovsdb_row_to_string(log_row, ds);
>> + VLOG_INFO("%s", ds_cstr(ds));
>> + }
>> +}
>> +
>> static struct ovsdb_error *
>> ovsdb_txn_row_commit(struct ovsdb_txn *txn OVS_UNUSED,
>> struct ovsdb_txn_row *txn_row)
>> @@ -445,6 +495,7 @@ ovsdb_txn_row_commit(struct ovsdb_txn *txn OVS_UNUSED,
>> }
>> }
>>
>> + ovsdb_txn_row_log(txn_row);
>> ovsdb_txn_row_prefree(txn_row);
>> if (txn_row->new) {
>> txn_row->new->n_refs = txn_row->n_refs;
>>
>
_______________________________________________
dev mailing list
[email protected]
https://mail.openvswitch.org/mailman/listinfo/ovs-dev