From: Ming Lei <ming.lei@redhat.com>
Move start_freeze into nvme_tcp_configure_io_queues(); there are at least two benefits:

1) fixes unbalanced freeze and unfreeze, since the re-connection work may fail or be broken by removal

2) IO during error recovery can fail fast, because nvme fabrics unquiesces the queues after teardown

One side effect is that a !mpath request may time out during connecting because of the queue topology change, but that does not look like a big deal:

1) the same problem exists with the current code base

2) compared with !mpath, the mpath use case is dominant
Fixes: 2875b0aecabe ("nvme-tcp: fix controller reset hang during traffic")
Cc: stable@vger.kernel.org
Signed-off-by: Ming Lei <ming.lei@redhat.com>
Tested-by: Yi Zhang <yi.zhang@redhat.com>
Reviewed-by: Sagi Grimberg <sagi@grimberg.me>
Signed-off-by: Keith Busch <kbusch@kernel.org>
---
 drivers/nvme/host/tcp.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)
diff --git a/drivers/nvme/host/tcp.c b/drivers/nvme/host/tcp.c
index 96d8d7844e84..c2e037644ad1 100644
--- a/drivers/nvme/host/tcp.c
+++ b/drivers/nvme/host/tcp.c
@@ -1882,6 +1882,7 @@ static int nvme_tcp_configure_io_queues(struct nvme_ctrl *ctrl, bool new)
 		goto out_cleanup_connect_q;
 
 	if (!new) {
+		nvme_start_freeze(ctrl);
 		nvme_start_queues(ctrl);
 		if (!nvme_wait_freeze_timeout(ctrl, NVME_IO_TIMEOUT)) {
 			/*
@@ -1890,6 +1891,7 @@ static int nvme_tcp_configure_io_queues(struct nvme_ctrl *ctrl, bool new)
 			 * to be safe.
 			 */
 			ret = -ENODEV;
+			nvme_unfreeze(ctrl);
 			goto out_wait_freeze_timed_out;
 		}
 		blk_mq_update_nr_hw_queues(ctrl->tagset,
@@ -2008,7 +2010,6 @@ static void nvme_tcp_teardown_io_queues(struct nvme_ctrl *ctrl,
 	if (ctrl->queue_count <= 1)
 		return;
 	blk_mq_quiesce_queue(ctrl->admin_q);
-	nvme_start_freeze(ctrl);
 	nvme_stop_queues(ctrl);
 	nvme_sync_io_queues(ctrl);
 	nvme_tcp_stop_io_queues(ctrl);
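The unbalanced freeze/unfreeze that this patch fixes can be illustrated with a small stand-alone model. This is a deliberately simplified userspace sketch, not kernel code: freeze_depth, toy_start_freeze(), old_recovery() and new_recovery() are invented names standing in for the real freeze reference count and for the teardown/reconnect paths.

#include <stdio.h>

/* Toy stand-in for the controller-wide request-queue freeze depth.
 * All names here are made up for illustration; this is not kernel code. */
static int freeze_depth;

static void toy_start_freeze(void) { freeze_depth++; }
static void toy_unfreeze(void)     { freeze_depth--; }

/* Old flow: teardown_io_queues() always freezes, but only a successful
 * re-connection ever unfreezes, so every failed or interrupted reconnect
 * attempt leaks one freeze reference. */
static void old_recovery(int reconnect_ok)
{
	toy_start_freeze();		/* was in teardown_io_queues() */
	if (reconnect_ok)
		toy_unfreeze();		/* only reached on success */
}

/* New flow: configure_io_queues() takes the freeze and drops it on both
 * the success path and the timeout/error path, so every attempt is
 * balanced. */
static void new_recovery(void)
{
	toy_start_freeze();		/* now in configure_io_queues() */
	toy_unfreeze();			/* success or error path */
}

int main(void)
{
	int i;

	for (i = 0; i < 3; i++)
		old_recovery(0);	/* three failed reconnect attempts */
	printf("old flow after 3 failures: freeze depth = %d (leaked)\n",
	       freeze_depth);

	freeze_depth = 0;
	for (i = 0; i < 3; i++)
		new_recovery();
	printf("new flow after 3 failures: freeze depth = %d (balanced)\n",
	       freeze_depth);
	return 0;
}

Compiled and run, the old flow ends at depth 3 after three failed reconnect attempts while the new flow ends at 0, which is the imbalance the patch removes.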
From: Ming Lei <ming.lei@redhat.com>
Move start_freeze into nvme_rdma_configure_io_queues(); there are at least two benefits:

1) fixes unbalanced freeze and unfreeze, since the re-connection work may fail or be broken by removal

2) IO during error recovery can fail fast, because nvme fabrics unquiesces the queues after teardown

One side effect is that a !mpath request may time out during connecting because of the queue topology change, but that does not look like a big deal:

1) the same problem exists with the current code base

2) compared with !mpath, the mpath use case is dominant
Fixes: 9f98772ba307 ("nvme-rdma: fix controller reset hang during traffic")
Cc: stable@vger.kernel.org
Signed-off-by: Ming Lei <ming.lei@redhat.com>
Tested-by: Yi Zhang <yi.zhang@redhat.com>
Reviewed-by: Sagi Grimberg <sagi@grimberg.me>
Signed-off-by: Keith Busch <kbusch@kernel.org>
---
 drivers/nvme/host/rdma.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)
diff --git a/drivers/nvme/host/rdma.c b/drivers/nvme/host/rdma.c
index 2db9c166a1b7..b76e1d4adcc7 100644
--- a/drivers/nvme/host/rdma.c
+++ b/drivers/nvme/host/rdma.c
@@ -989,6 +989,7 @@ static int nvme_rdma_configure_io_queues(struct nvme_rdma_ctrl *ctrl, bool new)
 		goto out_cleanup_connect_q;
 
 	if (!new) {
+		nvme_start_freeze(&ctrl->ctrl);
 		nvme_start_queues(&ctrl->ctrl);
 		if (!nvme_wait_freeze_timeout(&ctrl->ctrl, NVME_IO_TIMEOUT)) {
 			/*
@@ -997,6 +998,7 @@ static int nvme_rdma_configure_io_queues(struct nvme_rdma_ctrl *ctrl, bool new)
 			 * to be safe.
 			 */
 			ret = -ENODEV;
+			nvme_unfreeze(&ctrl->ctrl);
 			goto out_wait_freeze_timed_out;
 		}
 		blk_mq_update_nr_hw_queues(ctrl->ctrl.tagset,
@@ -1038,7 +1040,6 @@ static void nvme_rdma_teardown_io_queues(struct nvme_rdma_ctrl *ctrl,
 		bool remove)
 {
 	if (ctrl->ctrl.queue_count > 1) {
-		nvme_start_freeze(&ctrl->ctrl);
 		nvme_stop_queues(&ctrl->ctrl);
 		nvme_sync_io_queues(&ctrl->ctrl);
 		nvme_rdma_stop_io_queues(ctrl);
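The second stated benefit, IO failing fast during error recovery, follows from the same move. Below is a similarly simplified userspace sketch (struct toy_queue and submit_during_recovery() are invented for illustration and do not reflect the real block-layer API) of why a queue that is unquiesced but still frozen blocks new IO, while an unfrozen one lets it fail over:

#include <stdio.h>
#include <stdbool.h>

/* Toy model of what happens to a request submitted while the controller
 * is reconnecting.  The two flags loosely mirror blk-mq semantics:
 * freeze blocks new submitters at queue entry, quiesce only pauses
 * dispatch.  Names and behaviour are simplified for illustration. */
struct toy_queue {
	bool frozen;
	bool quiesced;
};

static const char *submit_during_recovery(const struct toy_queue *q)
{
	if (q->frozen)
		return "blocks until unfreeze; multipath cannot fail over";
	if (q->quiesced)
		return "sits in the queue until unquiesce";
	/* Neither frozen nor quiesced: the request reaches the (dead)
	 * transport, fails quickly, and mpath can retry another path. */
	return "fails fast; mpath retries on another path";
}

int main(void)
{
	/* Old teardown froze and quiesced; fabrics unquiesces afterwards,
	 * but the queue stays frozen until a successful reconnect. */
	struct toy_queue old_flow = { .frozen = true,  .quiesced = false };
	/* New teardown only quiesces; after the post-teardown unquiesce
	 * nothing holds new IO back. */
	struct toy_queue new_flow = { .frozen = false, .quiesced = false };

	printf("old flow: IO %s\n", submit_during_recovery(&old_flow));
	printf("new flow: IO %s\n", submit_during_recovery(&new_flow));
	return 0;
}

With the freeze removed from teardown, the post-teardown unquiesce is enough for new requests to reach the transport and fail quickly, so multipath can retry them on another path.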
On Sun, Aug 13, 2023 at 05:45:09PM +0300, Sagi Grimberg wrote:
> From: Ming Lei <ming.lei@redhat.com>
>
> Move start_freeze into nvme_tcp_configure_io_queues(); there are at least
> two benefits:
>
> - fixes unbalanced freeze and unfreeze, since the re-connection work may
>   fail or be broken by removal
>
> - IO during error recovery can fail fast, because nvme fabrics
>   unquiesces the queues after teardown
>
> One side effect is that a !mpath request may time out during connecting
> because of the queue topology change, but that does not look like a big
> deal:
>
> - the same problem exists with the current code base
>
> - compared with !mpath, the mpath use case is dominant
>
> Fixes: 2875b0aecabe ("nvme-tcp: fix controller reset hang during traffic")
> Cc: stable@vger.kernel.org
> Signed-off-by: Ming Lei <ming.lei@redhat.com>
> Tested-by: Yi Zhang <yi.zhang@redhat.com>
> Reviewed-by: Sagi Grimberg <sagi@grimberg.me>
> Signed-off-by: Keith Busch <kbusch@kernel.org>
> ---
>  drivers/nvme/host/tcp.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
All now queued up, thanks.
greg k-h