    scsi: qla2xxx: Fix premature hw access after PCI error · e35920ab
    Quinn Tran authored
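    After a recoverable PCI error has been detected and recovered, the qla2xxx
    driver needs to check whether the error condition still persists and/or
    wait for the OS to give the resume signal before it accesses the hardware.
    Otherwise the driver can touch the hardware too early, as in the trace
    below, where a firmware dump is attempted from the mmio_enabled callback
    and fails: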
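
The guard described above can be sketched in user-space C. This is only an illustration under stated assumptions, not the code from this commit: `struct fake_hba`, `recovery_pending`, `ctrl_reg`, and every `fake_*` function are hypothetical names invented for the example. The one hardware fact it relies on is that reads from a disconnected PCI function return all ones, which matches the `iocontrol=ffffffff` and `0xffff` mailbox values in the log.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for an adapter; not the driver's real structure. */
struct fake_hba {
    bool recovery_pending;  /* set when an error is reported, cleared on resume */
    uint32_t ctrl_reg;      /* stand-in for the value read from one MMIO register */
};

/* A read that returns all ones suggests the device is not reachable. */
static bool fake_hba_is_disconnected(const struct fake_hba *hba)
{
    assert(hba != NULL);
    return hba->ctrl_reg == UINT32_MAX;
}

/* Hypothetical hook for the point where the PCI error is first reported. */
static void fake_on_error_detected(struct fake_hba *hba)
{
    assert(hba != NULL);
    hba->recovery_pending = true;
}

/* Hypothetical hook for the point where the OS gives the resume signal. */
static void fake_on_resume(struct fake_hba *hba)
{
    assert(hba != NULL);
    hba->recovery_pending = false;
}

/*
 * Attempt a firmware dump only when recovery has finished and the
 * register no longer reads as all ones. Returns 0 if the dump would
 * proceed and -1 if it is skipped.
 */
static int fake_try_fw_dump(const struct fake_hba *hba)
{
    if (hba->recovery_pending || fake_hba_is_disconnected(hba))
        return -1;
    return 0;
}
```

In this sketch, a call to `fake_try_fw_dump()` made between `fake_on_error_detected()` and `fake_on_resume()` is skipped rather than touching the register, which is the behavior the commit message asks for in general terms.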
    
    Sep  8 22:26:03 localhost kernel: WARNING: CPU: 9 PID: 124606 at qla_tmpl.c:440
    qla27xx_fwdt_entry_t266+0x55/0x60 [qla2xxx]
    Sep  8 22:26:03 localhost kernel: RIP: 0010:qla27xx_fwdt_entry_t266+0x55/0x60
    [qla2xxx]
    Sep  8 22:26:03 localhost kernel: Call Trace:
    Sep  8 22:26:03 localhost kernel: ? qla27xx_walk_template+0xb1/0x1b0 [qla2xxx]
    Sep  8 22:26:03 localhost kernel: ? qla27xx_execute_fwdt_template+0x12a/0x160
    [qla2xxx]
    Sep  8 22:26:03 localhost kernel: ? qla27xx_fwdump+0xa0/0x1c0 [qla2xxx]
    Sep  8 22:26:03 localhost kernel: ? qla2xxx_pci_mmio_enabled+0xfb/0x120
    [qla2xxx]
    Sep  8 22:26:03 localhost kernel: ? report_mmio_enabled+0x44/0x80
    Sep  8 22:26:03 localhost kernel: ? report_slot_reset+0x80/0x80
    Sep  8 22:26:03 localhost kernel: ? pci_walk_bus+0x70/0x90
    Sep  8 22:26:03 localhost kernel: ? aer_dev_correctable_show+0xc0/0xc0
    Sep  8 22:26:03 localhost kernel: ? pcie_do_recovery+0x1bb/0x240
    Sep  8 22:26:03 localhost kernel: ? aer_recover_work_func+0xaa/0xd0
    Sep  8 22:26:03 localhost kernel: ? process_one_work+0x1a7/0x360
    ..
    Sep  8 22:26:03 localhost kernel: qla2xxx [0000:42:00.2]-8041:22: detected PCI
    disconnect.
    Sep  8 22:26:03 localhost kernel: qla2xxx [0000:42:00.2]-107ff:22:
    qla27xx_fwdt_entry_t262: dump ram MB failed. Area 5h start 198013h end 198013h
    Sep  8 22:26:03 localhost kernel: qla2xxx [0000:42:00.2]-107ff:22: Unable to
    capture FW dump
    Sep  8 22:26:03 localhost kernel: qla2xxx [0000:42:00.2]-1015:22: cmd=0x0,
    waited 5221 msecs
    Sep  8 22:26:03 localhost kernel: qla2xxx [0000:42:00.2]-680d:22: mmio
    enabled returning.
    Sep  8 22:26:03 localhost kernel: qla2xxx [0000:42:00.2]-d04c:22: MBX
    Command timeout for cmd 0, iocontrol=ffffffff jiffies=10140f2e5
    mb[0-3]=[0xffff 0xffff 0xffff 0xffff]
    
    Link: https://lore.kernel.org/r/20220110050218.3958-6-njavali@marvell.com
    Cc: stable@vger.kernel.org
    Reviewed-by: Himanshu Madhani <himanshu.madhani@oracle.com>
    Signed-off-by: Quinn Tran <qutran@marvell.com>
    Signed-off-by: Nilesh Javali <njavali@marvell.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
qla_tmpl.c 29.4 KB