On Wed, 2008-01-09 at 23:59 +0100, Krzysztof Helt wrote:
> From: Krzysztof Helt <[EMAIL PROTECTED]>
> 
> This patch fixes two bugs pointed by James Bottomley:
> 
>  1. the if (!sym_data->io_reset).  That variable is only ever filled
>     by a stack based completion.  If we find it non empty it means
>     this code has been entered twice and we have a severe problem,
>     so that should just become a BUG_ON(!sym_data->io_reset).
>  2. sym_data->io_reset should be set to NULL before the routine is
>     exited otherwise the PCI recovery code could end up completing
>     what will be a bogus pointer into the stack.
> 
> Big thanks to James Bottomley for help with the patch.
> 
> Signed-off-by: Krzysztof Helt <[EMAIL PROTECTED]>

Well done .. there's actually just one problem remaining:

> ---
> I do not know if I understood correctly all James' tips.
> 
> diff -urp linux-ref/drivers/scsi/sym53c8xx_2/sym_glue.c 
> linux-new/drivers/scsi/sym53c8xx_2/sym_glue.c
> --- linux-ref/drivers/scsi/sym53c8xx_2/sym_glue.c     2007-12-23 
> 20:39:44.000000000 +0100
> +++ linux-new/drivers/scsi/sym53c8xx_2/sym_glue.c     2008-01-09 
> 22:22:30.000000000 +0100
> @@ -609,22 +609,22 @@ static int sym_eh_handler(int op, char *
>        */
>  #define WAIT_FOR_PCI_RECOVERY        35
>       if (pci_channel_offline(pdev)) {
> -             struct completion *io_reset;
>               int finished_reset = 0;
>               init_completion(&eh_done);
>               spin_lock_irq(shost->host_lock);
>               /* Make sure we didn't race */
>               if (pci_channel_offline(pdev)) {
> -                     if (!sym_data->io_reset)
> -                             sym_data->io_reset = &eh_done;
> -                     io_reset = sym_data->io_reset;
> +                     BUG_ON(!sym_data->io_reset);
> +                     sym_data->io_reset = &eh_done;
>               } else {
>                       finished_reset = 1;
>               }
>               spin_unlock_irq(shost->host_lock);
>               if (!finished_reset)
> -                     finished_reset = wait_for_completion_timeout(io_reset,
> +                     finished_reset = wait_for_completion_timeout
> +                                             (sym_data->io_reset,
>                                               WAIT_FOR_PCI_RECOVERY*HZ);
> +             sym_data->io_reset = NULL;

This has to be cleared under the host_lock to forestall the (tiny) race
where the pci recovery code checks the value of sym_data->io_reset, we
change it to null and then the pci recovery code completes a NULL
pointer.

Other than this one problem, the code looks fine.

James


-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to