[PATCH net V2 0/3] net/mlx5: LAG bug fixes

From: Tariq Toukan

Date: Tue Jun 30 2026 - 07:32:13 EST


Hi,

Three bug fixes by Shay in the mlx5 LAG subsystem.

Patch 1 fixes an off-by-one in the error rollback path of
mlx5_lag_create_single_fdb_filter(): the loop started from the
failed index i, potentially operating on uninitialized state or
double-tearing-down an entry that had already self-rolled-back.
The rollback should start from i - 1.

Patch 2 fixes a hang in mlx5_mpesw_work(): when
mlx5_lag_get_devcom_comp() returns NULL the function returned
early without calling complete(), blocking any caller waiting on
mpesww->comp indefinitely.

Patch 3 fixes a kernel crash during teardown when
mlx5_lag_get_dev_seq() returns an error because no device is
marked as master or the peer is no longer in the LAG. The peer
flow cleanup is now skipped instead of proceeding with a bad
pointer.

This series by Shay fixes three bugs in the mlx5 LAG subsystem.

Regards,
Tariq

V2:
- Rebase.
- Patch 3: simplify to a single 'continue' on seq lookup failure.

V1:
https://lore.kernel.org/all/20260617063204.547427-2-tariqt@xxxxxxxxxx/

Find replies to previous Sashiko comments here:
https://lore.kernel.org/all/e18662ac-413e-43f6-ac65-a4e15fd47bb7@xxxxxxxxxx/

Shay Drory (3):
net/mlx5: LAG, Fix off-by-one in single-FDB error rollback
net/mlx5: LAG, MPESW, Fix missing complete() on devcom error
net/mlx5e: TC, skip peer flow cleanup when LAG seq is unavailable

drivers/net/ethernet/mellanox/mlx5/core/en_tc.c | 3 +++
drivers/net/ethernet/mellanox/mlx5/core/lag/mpesw.c | 7 +++++--
drivers/net/ethernet/mellanox/mlx5/core/lag/shared_fdb.c | 2 +-
3 files changed, 9 insertions(+), 3 deletions(-)


base-commit: dbf803bc4a8b0522c9a12560c20905a5952d1cb9
--
2.44.0