On Wed, 2008-01-09 at 23:59 +0100, Krzysztof Helt wrote:
> From: Krzysztof Helt <[EMAIL PROTECTED]>
>
> This patch fixes two bugs pointed by James Bottomley:
>
> 1. the if (!sym_data->io_reset). That variable is only ever filled
> by a stack based completion. If we find it non empty it means
> this code has been entered twice and we have a severe problem,
> so that should just become a BUG_ON(!sym_data->io_reset).
> 2. sym_data->io_reset should be set to NULL before the routine is
> exited otherwise the PCI recovery code could end up completing
> what will be a bogus pointer into the stack.
>
> Big thanks to James Bottomley for help with the patch.
>
> Signed-off-by: Krzysztof Helt <[EMAIL PROTECTED]>
Well done .. there's actually just one problem remaining:
> ---
> I do not know if I understood correctly all James' tips.
>
> diff -urp linux-ref/drivers/scsi/sym53c8xx_2/sym_glue.c
> linux-new/drivers/scsi/sym53c8xx_2/sym_glue.c
> --- linux-ref/drivers/scsi/sym53c8xx_2/sym_glue.c 2007-12-23
> 20:39:44.000000000 +0100
> +++ linux-new/drivers/scsi/sym53c8xx_2/sym_glue.c 2008-01-09
> 22:22:30.000000000 +0100
> @@ -609,22 +609,22 @@ static int sym_eh_handler(int op, char *
> */
> #define WAIT_FOR_PCI_RECOVERY 35
> if (pci_channel_offline(pdev)) {
> - struct completion *io_reset;
> int finished_reset = 0;
> init_completion(&eh_done);
> spin_lock_irq(shost->host_lock);
> /* Make sure we didn't race */
> if (pci_channel_offline(pdev)) {
> - if (!sym_data->io_reset)
> - sym_data->io_reset = &eh_done;
> - io_reset = sym_data->io_reset;
> + BUG_ON(!sym_data->io_reset);
> + sym_data->io_reset = &eh_done;
> } else {
> finished_reset = 1;
> }
> spin_unlock_irq(shost->host_lock);
> if (!finished_reset)
> - finished_reset = wait_for_completion_timeout(io_reset,
> + finished_reset = wait_for_completion_timeout
> + (sym_data->io_reset,
> WAIT_FOR_PCI_RECOVERY*HZ);
> + sym_data->io_reset = NULL;
This has to be cleared under the host_lock to forestall the (tiny) race
where the pci recovery code checks the value of sym_data->io_reset, we
change it to null and then the pci recovery code completes a NULL
pointer.
Other than this one problem, the code looks fine.
James
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at http://vger.kernel.org/majordomo-info.html