[PATCH 6.1 118/149] net/mlx5: Skip clock update work when device is in error state

13 Aug 2023

From: Moshe Shemesh moshe@nvidia.com
commit d006207625657322ba8251b6e7e829f9659755dc upstream.
When device is in error state, marked by the flag
MLX5_DEVICE_STATE_INTERNAL_ERROR, the HW and PCI may not be accessible
and so clock update work should be skipped. Furthermore, such access
through PCI in error state, after calling mlx5_pci_disable_device() can
result in failing to recover from pci errors.
Fixes: ef9814deafd0 ("net/mlx5e: Add HW timestamping (TS) support")
Reported-and-tested-by: Ganesh G R ganeshgr@linux.ibm.com
Closes: https://lore.kernel.org/netdev/9bdb9b9d-140a-7a28-f0de-2e64e873c068@nvidia.c...
Signed-off-by: Moshe Shemesh moshe@nvidia.com
Reviewed-by: Aya Levin ayal@nvidia.com
Signed-off-by: Saeed Mahameed saeedm@nvidia.com
Signed-off-by: Greg Kroah-Hartman gregkh@linuxfoundation.org
---
 drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c |    5 +++++
 1 file changed, 5 insertions(+)

--- a/drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c
+++ b/drivers/net/ethernet/mellanox/mlx5/core/lib/clock.c
@@ -198,10 +198,15 @@ static void mlx5_timestamp_overflow(stru
    clock = container_of(timer, struct mlx5_clock, timer);
    mdev = container_of(clock, struct mlx5_core_dev, clock);
+	if (mdev->state == MLX5_DEVICE_STATE_INTERNAL_ERROR)
+		goto out;
+
    write_seqlock_irqsave(&clock->lock, flags);
    timecounter_read(&timer->tc);
    mlx5_update_clock_info_page(mdev);
    write_sequnlock_irqrestore(&clock->lock, flags);
+
+out:
    schedule_delayed_work(&timer->overflow_work, timer->overflow_period);
 }

    

2025

2024

2023

2022

2021

2020

2019

2018

2017

[PATCH 6.1 118/149] net/mlx5: Skip clock update work when device is in error state