Re: [PATCH net] net/smc: Avoid overwriting the copies of clcsock callback functions

From: Tony Lu
Date: Wed Feb 09 2022 - 21:50:15 EST


On Wed, Feb 09, 2022 at 10:10:53PM +0800, Wen Gu wrote:
> The callback functions of clcsock will be saved and replaced during
> the fallback. But if the fallback happens more than once, then the
> copies of these callback functions will be overwritten incorrectly,
> resulting in a loop call issue:
>
> clcsk->sk_error_report
>  |- smc_fback_error_report() <------------------------------|
>      |- smc_fback_forward_wakeup()                           | (loop)
>          |- clcsock_callback()  (incorrectly overwritten)    |
>              |- smc->clcsk_error_report() ------------------|
>
> So this patch fixes the issue by saving these function pointers only
> once in the fallback and avoiding overwriting.
>
> Reported-by: syzbot+4de3c0e8a263e1e499bc@xxxxxxxxxxxxxxxxxxxxxxxxx
> Fixes: 341adeec9ada ("net/smc: Forward wakeup to smc socket waitqueue after fallback")
> Link: https://lore.kernel.org/r/0000000000006d045e05d78776f6@xxxxxxxxxx
> Signed-off-by: Wen Gu <guwen@xxxxxxxxxxxxxxxxx>
> ---
> net/smc/af_smc.c | 10 +++++++---
> 1 file changed, 7 insertions(+), 3 deletions(-)
>
> diff --git a/net/smc/af_smc.c b/net/smc/af_smc.c
> index 8c89d0b..306d9e8c 100644
> --- a/net/smc/af_smc.c
> +++ b/net/smc/af_smc.c
> @@ -667,14 +667,17 @@ static void smc_fback_error_report(struct sock *clcsk)
>  static int smc_switch_to_fallback(struct smc_sock *smc, int reason_code)
>  {
>  	struct sock *clcsk;
> +	int rc = 0;
>
>  	mutex_lock(&smc->clcsock_release_lock);
>  	if (!smc->clcsock) {
> -		mutex_unlock(&smc->clcsock_release_lock);
> -		return -EBADF;
> +		rc = -EBADF;
> +		goto out;
>  	}
>  	clcsk = smc->clcsock->sk;
>
> +	if (smc->use_fallback)
> +		goto out;
>  	smc->use_fallback = true;

I wonder whether there is a potential race here. If ->use_fallback is
set to true while the rest of the replacement is still in progress,
others who test ->use_fallback and see it set could still get the old
values from before the replacement.
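
To make the concern concrete, here is a rough userspace sketch of the
window I mean. It is not kernel code: struct model_sock, the two
switch_step_*() helpers and reader_sees_stale_cb() are all made up for
illustration, and it assumes a reader that tests the flag without
holding clcsock_release_lock. Whether such a reader really exists
depends on which locks the callers take.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace model; none of these names come from af_smc.c. */
typedef void (*cb_t)(void);

static void orig_cb(void) {}	/* stands in for the original clcsock callback */
static void fback_cb(void) {}	/* stands in for an smc_fback_*() callback */

struct model_sock {
	bool use_fallback;	/* models smc->use_fallback */
	cb_t active_cb;		/* models a callback installed on clcsk */
	cb_t saved_cb;		/* models the saved copy, e.g. smc->clcsk_error_report */
};

/* Step 1 of the switch: the flag is set first, as in the quoted hunk. */
static void switch_step_set_flag(struct model_sock *s)
{
	s->use_fallback = true;
}

/* Step 2 of the switch: save the original callback, install the fallback one. */
static void switch_step_replace_cb(struct model_sock *s)
{
	s->saved_cb = s->active_cb;
	s->active_cb = fback_cb;
}

/* A reader that only tests the flag: true if it would see a stale callback. */
static bool reader_sees_stale_cb(const struct model_sock *s)
{
	return s->use_fallback && s->active_cb != fback_cb;
}

/* Walk through the three points in time where a reader could look. */
static void demo(void)
{
	struct model_sock s = { false, orig_cb, NULL };

	assert(!reader_sees_stale_cb(&s));	/* before the switch: consistent */
	switch_step_set_flag(&s);
	assert(reader_sees_stale_cb(&s));	/* the window: flag set, old callback */
	switch_step_replace_cb(&s);
	assert(!reader_sees_stale_cb(&s));	/* after the switch: consistent again */
}
```

The middle assert in demo() is the state I am worried about. Setting
the flag only after the callbacks are replaced, or making readers take
the same lock, would close the window in this model.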

Thanks,
Tony Lu