From: Justin Tee justin.tee@broadcom.com
[ Upstream commit f408dde2468b3957e92b25e7438f74c8e9fb9e73 ]
If lpfc_reset_flush_io_context fails to execute, then the wrong return status code may be passed back to upper layers when issuing a target reset TMF command. Fix by checking the return status from lpfc_reset_flush_io_context() first in order to properly return FAILED or FAST_IO_FAIL.
Signed-off-by: Justin Tee justin.tee@broadcom.com Message-ID: 20250915180811.137530-7-justintee8345@gmail.com Signed-off-by: Martin K. Petersen martin.petersen@oracle.com Signed-off-by: Sasha Levin sashal@kernel.org ---
LLM Generated explanations, may be completely bogus:
YES - returning FAIL/Fast-IO correctly from target reset avoids trapping lpfc in a half-reset state.
- drivers/scsi/lpfc/lpfc_scsi.c:6112-6119 now propagates the status from lpfc_reset_flush_io_context(), so a flush failure surfaces as FAILED instead of always falling through to FAST_IO_FAIL; previously FAST_IO_FAIL was reported even when cnt != 0, leaving orphaned contexts behind. - In the SCSI EH core, FAST_IO_FAIL is treated as a completed reset (drivers/scsi/scsi_error.c:1680-1694), so the old code caused the error handler to stop escalation while the adapter still had outstanding I/O—users would see hung commands after a target reset TMF. - A FAILED return triggers the midlayer to keep escalating (bus/host reset), which is the only safe recovery once lpfc_reset_flush_io_context() reports 0x2003 (see its failure path at drivers/scsi/lpfc/lpfc_scsi.c:5969-5975); the fix therefore prevents long-lived I/O leaks and recovery deadlocks. - Remaining changes are cosmetic (typo fix at drivers/scsi/lpfc/lpfc_scsi.c:5938 and cleaned log text at drivers/scsi/lpfc/lpfc_scsi.c:6210) and pose no regression risk. - Patch is small, self-contained in lpfc, and has no dependencies—ideal for stable backporting.
drivers/scsi/lpfc/lpfc_scsi.c | 14 ++++++++++---- 1 file changed, 10 insertions(+), 4 deletions(-)
diff --git a/drivers/scsi/lpfc/lpfc_scsi.c b/drivers/scsi/lpfc/lpfc_scsi.c index 508ceeecf2d95..6d9d8c196936a 100644 --- a/drivers/scsi/lpfc/lpfc_scsi.c +++ b/drivers/scsi/lpfc/lpfc_scsi.c @@ -5935,7 +5935,7 @@ lpfc_chk_tgt_mapped(struct lpfc_vport *vport, struct fc_rport *rport) /** * lpfc_reset_flush_io_context - * @vport: The virtual port (scsi_host) for the flush context - * @tgt_id: If aborting by Target contect - specifies the target id + * @tgt_id: If aborting by Target context - specifies the target id * @lun_id: If aborting by Lun context - specifies the lun id * @context: specifies the context level to flush at. * @@ -6109,8 +6109,14 @@ lpfc_target_reset_handler(struct scsi_cmnd *cmnd) pnode->nlp_fcp_info &= ~NLP_FCP_2_DEVICE; spin_unlock_irqrestore(&pnode->lock, flags); } - lpfc_reset_flush_io_context(vport, tgt_id, lun_id, - LPFC_CTX_TGT); + status = lpfc_reset_flush_io_context(vport, tgt_id, lun_id, + LPFC_CTX_TGT); + if (status != SUCCESS) { + lpfc_printf_vlog(vport, KERN_ERR, LOG_FCP, + "0726 Target Reset flush status x%x\n", + status); + return status; + } return FAST_IO_FAIL; }
@@ -6202,7 +6208,7 @@ lpfc_host_reset_handler(struct scsi_cmnd *cmnd) int rc, ret = SUCCESS;
lpfc_printf_vlog(vport, KERN_ERR, LOG_FCP, - "3172 SCSI layer issued Host Reset Data:\n"); + "3172 SCSI layer issued Host Reset\n");
lpfc_offline_prep(phba, LPFC_MBX_WAIT); lpfc_offline(phba);