Re: [PATCH rdma-next] RDMA/mlx5: Fix UMR hang in LAG error state unload

From: Leon Romanovsky

Date: Sun Jan 18 2026 - 11:23:12 EST

On Tue, 13 Jan 2026 15:37:10 +0200, Edward Srouji wrote:
> During firmware reset in LAG mode, a race condition causes the driver
> to hang indefinitely while waiting for UMR completion during device
> unload. See [1].
>
> In LAG mode the bond device is only registered on the master, so it
> never sees sys_error events from the slave.
> During firmware reset this causes UMR waits to hang forever on unload
> as the slave is dead but the master hasn't entered error state yet, so
> UMR posts succeed but completions never arrive.
>
> [...]

Applied, thanks!

[1/1] RDMA/mlx5: Fix UMR hang in LAG error state unload
https://git.kernel.org/rdma/rdma/c/ebc2164a4cd431

Best regards,
--
Leon Romanovsky <leon@xxxxxxxxxx>