On Wed, Mar 16 2005, Guennadi Liakhovetski wrote:
> Hi
> 
> Thanks for reviewing this patch!
> 
> On Wed, 16 Mar 2005, Jens Axboe wrote:
> 
> > On Wed, Mar 16 2005, James Bottomley wrote:
> ...
> > > I agree the kmap is inefficient.  The efficient alternative is to do
> > > dma_map_sg() and use kmap_atomic() in the interrupt routine where we do
> > > the PIO cleanup---I'm afraid I just passed on explaining how to do
> > > this ... unless you care to do the honours ?
> 
> In fact, the first version of my patch (attached below) did exactly this - 
> only when the driver recognises, that it needs to do PIO in the interrupt, 
> I would call kmap_atomic(), do PIO, then kunmap_atomic(). The main reason, 
> why I didn't submit that patch, was that I got confused about various KM_ 
> macros, and I thought, since it is a valuable limited resource, only very 
> "special" drivers are allowed to use them / are allocated one of them. 
> But, I guess now, you can just do
> 
>       local_irq_save(flags);
>       kmap_atomic();
>       ...
>       kunmap_atomic();
>       local_irq_restore(flags);
> 
> Please, have a look. Or should we indeed go the "generic helper functions" 
> way?

In generel it looks ok, comments below.

> > The kmap() isn't just inefficient, it's a problem to iterate over the sg
> > list and kmap all the pages. That is illegal.
> 
> Hm, what do you mean "illegal"? Could you explain why?

You risk deadlocking.

> > But it's not so tricky to get right, if the punting just happens in the
> > isr. Basically just iterate over every sg entry left ala:
> > 
> >         for (i = start; i < sg_entries; i++) {
> >                 unsigned long flags;
> >                 char *ptr;
> > 
> >                 local_irq_save(flags);
> >                 ptr = kmap_atomic(sg->page, KM_BIO_SRC_IRQ);
> > 
> >                 /* transfer to/from ptr + sg->offset, sg->length bytes */
> > 
> >                 kunmap_atomic(ptr, KM_BIO_SRC_IRQ);
> >                 local_irq_restore(flags);
> >         }
> > 
> > I _think_ the sg->length field is universally called so, you should not
> > use sg->dma_length/sg_dma_len() or sg_dma_address(), as we are outside
> > the work of the iommu at this point.
> 
> One more fragment in the driver I wasn't sure about is this:
> 
>               unsigned long mask =
>                   ~((unsigned long)sg->length - 1) & PAGE_MASK;
>               if ((sg_dma_address(sg) & mask) == (psge->address & mask)) {
> 
> Is sg->length guaranteed to be something like a power of 2 or smaller 
> than page? I thought about replacing the above with

No, it's not guaranteed to be a power-of-2.

> +             if (sg_dma_address(sg) <= psge->address && sg_dma_address(sg) + 
> psge->length > psge->address) {

What is it trying to accomplish?

> @@ -1020,11 +1022,11 @@
>                       reqlen, cmd->request_buffer, cmd->use_sg,
>                       srb->sg_count);
>  
> -             srb->virt_addr = page_address(sl->page);
> +             srb->page = sl->page;
> +             srb->offset = sl->offset;
>               for (i = 0; i < srb->sg_count; i++) {
> -                     u32 busaddr = (u32)sg_dma_address(&sl[i]);
> -                     u32 seglen = (u32)sl[i].length;
> -                     sgp[i].address = busaddr;
> +                     u32 seglen = (u32)sg_dma_len(sl + i);
> +                     sgp[i].address = cpu_to_le32(0xffffffff & 
> sg_dma_address(sl +i));
>                       sgp[i].length = seglen;
>                       srb->total_xfer_length += seglen;
>               }

I don't understand this change, why the cpu_to_le32?

> @@ -2297,19 +2287,18 @@
>                   && srb->total_xfer_length <= DC395x_LASTPIO) {
>                       /*u32 addr = (srb->segment_x[srb->sg_index].address); */
>                       /*sg_update_list (srb, d_left_counter); */
> -                     dprintkdbg(DBG_PIO, "data_in_phase0: PIO (%i %s) to "
> -                             "%p for remaining %i bytes:",
> -                             DC395x_read8(acb, TRM_S1040_SCSI_FIFOCNT) & 
> 0x1f,
> -                             (srb->dcb->sync_period & WIDE_SYNC) ?
> -                                 "words" : "bytes",
> -                             srb->virt_addr,
> -                             srb->total_xfer_length);
> +                     char *page_addr, *virt_addr;
> +                     unsigned long flags;
>                       if (srb->dcb->sync_period & WIDE_SYNC)
>                               DC395x_write8(acb, TRM_S1040_SCSI_CONFIG2,
>                                             CFG2_WIDEFIFO);
> +                     local_irq_save(flags);
> +                     page_addr = kmap_atomic(srb->page, KM_USER0);
> +                     virt_addr = page_addr + srb->offset;
> +

You can't use KM_USER0 here, use one of the bio assigned kmap types (you
can use KM_BIO_SRC_IRQ, for instance - the reason there are two is for
the bouncing that needs to kmap both source and destination at the same
time).

>                       while (DC395x_read8(acb, TRM_S1040_SCSI_FIFOCNT) != 
> 0x40) {
>                               u8 byte = DC395x_read8(acb, 
> TRM_S1040_SCSI_FIFO);
> -                             *(srb->virt_addr)++ = byte;
> +                             *(virt_addr)++ = byte;
>                               if (debug_enabled(DBG_PIO))
>                                       printk(" %02x", byte);
>                               d_left_counter--;
> @@ -2320,7 +2309,7 @@
>                  /* Read the last byte ... */
>                               if (srb->total_xfer_length > 0) {
>                                       u8 byte = DC395x_read8(acb, 
> TRM_S1040_SCSI_FIFO);
> -                                     *(srb->virt_addr)++ = byte;
> +                                     *(virt_addr)++ = byte;
>                                       srb->total_xfer_length--;
>                                       if (debug_enabled(DBG_PIO))
>                                               printk(" %02x", byte);
> @@ -2328,6 +2317,8 @@
>  #endif
>                               DC395x_write8(acb, TRM_S1040_SCSI_CONFIG2, 0);
>                       }
> +                     kunmap_atomic(page_addr, KM_IRQ0);
> +                     local_irq_restore(flags);

Here you kunmap_atomic() with a different kmap type than you mapped
with? Must be the same.

Same applies to the matchin section further down.


-- 
Jens Axboe

-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to