On 11/07/2019 18:19, Will Deacon wrote:
+static int arm_smmu_cmdq_issue_cmdlist(struct arm_smmu_device *smmu,
+ u64 *cmds, int n, bool sync)
+{
+ u64 cmd_sync[CMDQ_ENT_DWORDS];
+ u32 prod;
unsigned long flags;
- bool wfe = !!(smmu->features & ARM_SMMU_FEAT_SEV);
- struct arm_smmu_cmdq_ent ent = { .opcode = CMDQ_OP_CMD_SYNC };
- int ret;
+ bool owner;
+ struct arm_smmu_cmdq *cmdq = &smmu->cmdq;
+ struct arm_smmu_ll_queue llq = {
+ .max_n_shift = cmdq->q.llq.max_n_shift,
+ }, head = llq;
+ int ret = 0;
- arm_smmu_cmdq_build_cmd(cmd, &ent);
+ /* 1. Allocate some space in the queue */
+ local_irq_save(flags);
+ llq.val = READ_ONCE(cmdq->q.llq.val);
+ do {
+ u64 old;
+
+ while (!queue_has_space(&llq, n + sync)) {
+ local_irq_restore(flags);
+ if (arm_smmu_cmdq_poll_until_not_full(smmu, &llq))
+ dev_err_ratelimited(smmu->dev, "CMDQ timeout\n");
+ local_irq_save(flags);
+ }
+
+ head.cons = llq.cons;
+ head.prod = queue_inc_prod_n(&llq, n + sync) |
+ CMDQ_PROD_OWNED_FLAG;
+
+ old = cmpxchg_relaxed(&cmdq->q.llq.val, llq.val, head.val);
I added some basic debug to the stress test on your branch, and this
cmpxchg was failing ~10 times on average on my D06.
So we're not using the spinlock now, but this cmpxchg may lack fairness.
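The debug was basically just a retry counter around the cmpxchg, something
like the below (illustrative only, the counter and the print are my own
additions and not in your patch):

	int retries = 0;

	do {
		u64 old;

		/* [space check / head update as in your patch] */

		old = cmpxchg_relaxed(&cmdq->q.llq.val, llq.val, head.val);
		if (old == llq.val)
			break;

		retries++;	/* another CPU won the race for this chunk */
		llq.val = old;
	} while (1);

	if (retries)
		dev_info_ratelimited(smmu->dev, "prod cmpxchg retried %d times\n",
				     retries);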
Since we're batching commands, I wonder if it's better to restore the
spinlock and send batched commands + CMD_SYNC under the lock, and then
wait for the CMD_SYNC completion outside the lock.
I don't know if it improves the queue contention, but at least the prod
pointer would more closely track the issued commands, so that we don't hold
back many gathered batches of commands while the SMMU HW may be idle (in
terms of command processing).
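Roughly what I mean, as a sketch only (the lock and the helpers below are
made up for illustration, not real functions in the driver):

	/* insert the batch plus its CMD_SYNC, and update prod, under the lock */
	spin_lock_irqsave(&cmdq_lock, flags);
	cmdq_write_batch(smmu, cmds, n);		/* made-up helper */
	sync_prod = cmdq_write_sync(smmu);		/* made-up helper */
	spin_unlock_irqrestore(&cmdq_lock, flags);

	/* only the wait for the CMD_SYNC to be consumed is outside the lock */
	ret = cmdq_poll_for_sync(smmu, sync_prod);	/* made-up helper */

That way prod is published as each batch goes in, rather than only moving
once a whole set of gathered batches is finally written out.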
Cheers,
John
+ if (old == llq.val)
+ break;
+
+ llq.val = old;
+ } while (1);
+ owner = !(llq.prod & CMDQ_PROD_OWNED_FLAG);