[PATCH 6.1 110/118] nvme-rdma: stop auth work after tearing down queues in error recovery

20 Feb 2023

From: Sagi Grimberg <sagi@grimberg.me>

[ Upstream commit 91c11d5f32547a08d462934246488fe72f3d44c3 ]

When starting error recovery there might be an authentication work
running, and it involves I/O commands. Given the controller is tearing
down there is no chance for the I/O to complete other than timing out
which may unnecessarily take a full io timeout.

So first tear down the queues, fail/cancel all inflight I/O (including
potentially authentication) and only then stop authentication. This
ensures that failover is not stalled due to blocked authentication I/O.

Signed-off-by: Sagi Grimberg <sagi@grimberg.me>
Reviewed-by: Chaitanya Kulkarni <kch@nvidia.com>
Signed-off-by: Christoph Hellwig <hch@lst.de>
Signed-off-by: Sasha Levin <sashal@kernel.org>
---
 drivers/nvme/host/rdma.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/drivers/nvme/host/rdma.c b/drivers/nvme/host/rdma.c
index 6f918e61b6aef..80383213b8828 100644
--- a/drivers/nvme/host/rdma.c
+++ b/drivers/nvme/host/rdma.c
@@ -1154,13 +1154,13 @@ static void nvme_rdma_error_recovery_work(struct work_struct *work)
 	struct nvme_rdma_ctrl *ctrl = container_of(work,
 			struct nvme_rdma_ctrl, err_work);
 
-	nvme_auth_stop(&ctrl->ctrl);
 	nvme_stop_keep_alive(&ctrl->ctrl);
 	flush_work(&ctrl->ctrl.async_event_work);
 	nvme_rdma_teardown_io_queues(ctrl, false);
 	nvme_start_queues(&ctrl->ctrl);
 	nvme_rdma_teardown_admin_queue(ctrl, false);
 	nvme_start_admin_queue(&ctrl->ctrl);
+	nvme_auth_stop(&ctrl->ctrl);
 
 	if (!nvme_change_ctrl_state(&ctrl->ctrl, NVME_CTRL_CONNECTING)) {
 		/* state change failure is ok if we started ctrl delete */
-- 
2.39.0
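
The ordering argument in the commit message can be illustrated with a
small userspace toy model. This is only a sketch and not kernel code:
struct toy_ctrl, toy_teardown_queues(), toy_auth_stop() and the two
recover_*() helpers are hypothetical names invented for illustration,
and the model assumes that stopping the auth work has to wait for any
auth I/O that is still in flight.

```c
#include <stdbool.h>

/* Hypothetical toy model of a controller; not the kernel's struct. */
struct toy_ctrl {
	bool auth_io_inflight;   /* auth work is blocked on an I/O command */
	unsigned int io_timeout; /* simulated seconds until a stuck command times out */
};

/* Models queue teardown: every in-flight command is failed/cancelled. */
static void toy_teardown_queues(struct toy_ctrl *c)
{
	c->auth_io_inflight = false;
}

/*
 * Models stopping the auth work. If its I/O is still in flight, the
 * only way for the work to finish is the I/O timing out.
 * Returns the simulated number of seconds spent waiting.
 */
static unsigned int toy_auth_stop(struct toy_ctrl *c)
{
	unsigned int waited = 0;

	if (c->auth_io_inflight) {
		waited = c->io_timeout;
		c->auth_io_inflight = false;
	}
	return waited;
}

/* Old order: stop auth first, then tear down the queues. */
static unsigned int recover_old_order(struct toy_ctrl c)
{
	unsigned int waited = toy_auth_stop(&c);

	toy_teardown_queues(&c);
	return waited;
}

/* New order: tear down the queues first, then stop auth. */
static unsigned int recover_new_order(struct toy_ctrl c)
{
	toy_teardown_queues(&c);
	return toy_auth_stop(&c);
}
```

With a controller whose auth I/O is in flight and an illustrative
io_timeout of 30, recover_old_order() reports a 30 second stall in this
model while recover_new_order() reports none, which is the behaviour
the reordering is meant to achieve.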




    
