* Sasha Levin <[email protected]> wrote:
> Interval rb-tree allows to directly store interval ranges
> and quickly lookup an overlap with a single point or a range.
>
> The helper is based on the kernel rb-tree implementation
> (located in <linux/rbtree.h>) which alows for the augmention
> of the classical rb-tree to be used as an interval tree.
>
> Signed-off-by: Sasha Levin <[email protected]>
> ---
> tools/kvm/Makefile | 1 +
> tools/kvm/include/kvm/interval-rbtree.h | 27 +++++++++
> tools/kvm/util/interval-rbtree.c | 97
> +++++++++++++++++++++++++++++++
Small nit, please make it rbtree-interval.c - we try to name by using the
higher-order (more generic) concept first.
So we name in a galaxy_planet_country_city_street hierarchical fashion, not in
a street_city_country_planet_galaxy Polish fashion. This applies to function
names as well.
> +struct rb_int_node {
> + struct rb_node node;
> + u64 low;
> + u64 high;
> +
> + /* max_high will store the highest high of it's 2 children. */
> + u64 max_high;
> +};
Another small nit: please align the fields vertically, like we do it for all
other structs.
> +#include <kvm/interval-rbtree.h>
> +#include <stdio.h>
> +#include <stdlib.h>
At first sight i dont think you really need the stdio.h and stlib.h includes -
you added these while having debugging printfs in the code?
> +#define RB_INT(n) container_of(n, struct rb_int_node, node)
please use a name that matches the kernel's rb entry definition:
#define rb_entry(ptr, type, member) container_of(ptr, type, member)
i.e. lower-case and something like:
#define rb_int_entry(ptr) container_of(ptr, struct rb_int_node, node)
But it could also be shortened to rb_int() like you did - as long as it does
not shout at us in all capitals :-)
> +struct rb_int_node *rb_int_search_single(struct rb_root *root, u64 p)
What does 'p' mean here? Please use more descriptive (but still short) names.
> +{
> + struct rb_node *node = root->rb_node;
> + struct rb_node *lowest = NULL;
> +
> + while (node) {
> + struct rb_int_node *cur = RB_INT(node);
> + struct rb_int_node *left;
> + if (node->rb_left)
newline after local variable definitions please, so that it stands out better
visually.
> + return container_of(lowest, struct rb_int_node, node);
Using the new definition above this could be written as rb_int(lowest) i guess?
> +static void update_node_max_high(struct rb_node *node, void *arg)
> +{
> + u64 high_left = 0, high_right = 0;
> + struct rb_int_node *data = RB_INT(node);
> +
> + if (node->rb_left)
> + high_left = RB_INT(node->rb_left)->high;
> + if (node->rb_right)
> + high_right = RB_INT(node->rb_right)->high;
> +
> + data->max_high = (high_left > high_right) ? high_left : high_right;
that's
data->max_high = max(high_left, high_right);
correct?
> + if (data->max_high < RB_INT(node)->high)
> + data->max_high = RB_INT(node)->high;
and that is:
data->max_high = max(data->max_high, RB_INT(node)->high);
right?
so writing:
data->max_high = max(high_left, high_right);
data->max_high = max(data->max_high, rb_int(node)->high);
makes it more obvious that we are really just tracking a maximum of 3
possibilities here.
In fact it could be all written as just a simple:
static void update_node_max_high(struct rb_node *node, void *arg)
{
struct rb_int_node *i_node = rb_int(node);
i_node->max_high = i_node->high;
if (node->rb_left)
i_node->max_high = max(rb_int(node->rb_left)->high,
i_node->max_high);
if (node->rb_right)
i_node->max_high = max(rb_int(node->rb_right)->high,
i_node->max_high);
Which is even more obvious.
Note that i renamed 'data' to 'i_node' - it's much better to use a descriptive
name for local variables not some opaque 'data' which could be anything.
> +int rb_int_insert(struct rb_root *root, struct rb_int_node *data)
i'd suggest to rename 'data' to i_node in other places as well. Here we'd want
to use the name i_node_root i suspect.
(Note, naming it 'inode' would suck for us kernel developers :-)
> + struct rb_node **new = &(root->rb_node), *parent = NULL;
> +
> + while (*new) {
> + struct rb_int_node *this = RB_INT(*new);
So the rb-node iterator is named 'new', while the rb-int-node iterator is
called 'this'? That does not make sense.
Please name them in a matching way: 'node' for the rb iterator, 'i_node' for
the rb-int iterator, or so.
> + int result = data->low - this->low;
and then this would look like:
> + int result = i_node_root->low -
> i_node->low;
which suddenly gains semantic meaning even if all you see is this single line!
That is what proper naming allows.
> +
> + parent = *new;
> + if (result < 0)
> + new = &((*new)->rb_left);
> + else if (result > 0)
> + new = &((*new)->rb_right);
> + else
> + return 0;
> + }
> +
> + rb_link_node(&data->node, parent, new);
> + rb_insert_color(&data->node, root);
> +
> + rb_augment_insert(*new, update_node_max_high, NULL);
> + return 1;
> +}
> +
> +void rb_int_erase(struct rb_root *root, struct rb_int_node *node)
> +{
> + struct rb_node *deepest;
> +
> + deepest = rb_augment_erase_begin(&node->node);
> + rb_erase(&node->node, root);
> + rb_augment_erase_end(deepest, update_node_max_high, NULL);
> +
> +}
Nit: superfluous newline at the end of the function.
Thanks,
Ingo
--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html