On Wed, Oct 09, 2013 at 11:38:06PM +0100, Andreas Herrmann wrote:
> Try to determine a mask that can be used for all StreamIDs of a master
> device. This allows to use just one SMR group instead of
> number-of-streamids SMR groups for a master device.
> 
> Signed-off-by: Andreas Herrmann <andreas.herrm...@calxeda.com>
> ---
>  drivers/iommu/arm-smmu.c |   79 
> ++++++++++++++++++++++++++++++++++++----------
>  1 file changed, 63 insertions(+), 16 deletions(-)
> 
> diff --git a/drivers/iommu/arm-smmu.c b/drivers/iommu/arm-smmu.c
> index 91316a8..4e8ceab 100644
> --- a/drivers/iommu/arm-smmu.c
> +++ b/drivers/iommu/arm-smmu.c
> @@ -28,6 +28,7 @@
>   *   - Context fault reporting
>   */
>  
> +#define DEBUG
>  #define pr_fmt(fmt) "arm-smmu: " fmt
>  
>  #include <linux/delay.h>
> @@ -341,6 +342,9 @@ struct arm_smmu_master {
>       struct rb_node                  node;
>       int                             num_streamids;
>       u16                             streamids[MAX_MASTER_STREAMIDS];
> +     int                             num_smrs;

This is easy to confuse with smmu->num_mapping_groups, but is actually the
number of SMRs in use by this master, right? Maybe tweaking the name
(num_used_smrs?) would make this clearer.

> +     u16                             smr_mask;
> +     u16                             smr_id;
>  
>       /*
>        * We only need to allocate these on the root SMMU, as we
> @@ -530,14 +534,11 @@ static int register_smmu_master(struct arm_smmu_device 
> *smmu,
>                                       ARM_SMMU_OPT_MASK_STREAM_IDS));
>                       return -EINVAL;
>               }
> -             /* set fixed streamid (0) that will be used for masking */
> -             master->num_streamids = 1;
> -             master->streamids[0] = 0;
> -     } else {
> -             for (i = 0; i < master->num_streamids; ++i)
> -                     master->streamids[i] = masterspec->args[i];
>       }
>  
> +     for (i = 0; i < master->num_streamids; ++i)
> +             master->streamids[i] = masterspec->args[i];
> +
>       return insert_smmu_master(smmu, master);
>  }
>  
> @@ -1049,6 +1050,41 @@ static void arm_smmu_domain_destroy(struct 
> iommu_domain *domain)
>       kfree(smmu_domain);
>  }
>  
> +/*
> + * no duplicates streamids please
> + */

We could probably check for that actually in register_smmu_master.

> +static void determine_smr_mapping(struct arm_smmu_device *smmu,
> +                             struct arm_smmu_master *master)
> +{
> +     int nr_sid;
> +     u16 i, v1, v2, const_mask;

The bitwise stuff later on could use some more meaningful identifiers
(although the comments do help).

> +
> +     if (smmu->options & ARM_SMMU_OPT_MASK_STREAM_IDS) {
> +             master->smr_mask = smmu->smr_mask_bits;
> +             master->smr_id = 0;
> +             return;
> +     }
> +
> +     nr_sid = master->num_streamids;
> +     if (!is_power_of_2(nr_sid))
> +             return;

As I mentioned before, we could do better than this if we forced the DT to
contain complete topological information. Then we could round up to the next
power of two and check that we didn't accidentally include another device.
What is your opinion on this?

> +     v1 = 0;
> +     v2 = -1;

I'd rather this was written as 0xffff;

> +     for (i = 0; i < nr_sid; i++) {
> +             v1 |= master->streamids[i];     /* for const 0 bits */
> +             v2 &= ~(master->streamids[i]);  /* const 1 bits */
> +     }
> +     const_mask = (~v1) | v2;        /* const bits (either 0 or 1) */
> +
> +     v1 = hweight16(~const_mask);
> +     if ((1 << v1) == nr_sid) {
> +             /* if smr_mask is set, only 1 SMR group is used smr[0] = 0 */
> +             master->smr_mask = ~const_mask;
> +             master->smr_id = v1 & const_mask;
> +     }

Hehe, this is cool, nice one! I originally thought you could just xor stuff,
but that ends up being slightly nasty because it all has to be done pairwise.

> +}
> +
>  static int arm_smmu_master_configure_smrs(struct arm_smmu_device *smmu,
>                                         struct arm_smmu_master *master)
>  {
> @@ -1062,15 +1098,22 @@ static int arm_smmu_master_configure_smrs(struct 
> arm_smmu_device *smmu,
>       if (master->smrs)
>               return -EEXIST;
>  
> -     smrs = kmalloc(sizeof(*smrs) * master->num_streamids, GFP_KERNEL);
> +     determine_smr_mapping(smmu, master);
> +
> +     if (master->smr_mask)
> +             master->num_smrs = 1;

So the next challenge would be to allocate one SMR using your power-of-2
trick, then mop up what's left with individual SMR entries.

Will
_______________________________________________
iommu mailing list
iommu@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/iommu

Reply via email to