Re: [PATCH net-next v6 00/10] Decouple receive and transmit enablement in team driver

From: Kuniyuki Iwashima

Date: Wed Apr 08 2026 - 21:50:30 EST


From: Marc Harvey <marcharvey@xxxxxxxxxx>
Date: Wed, 8 Apr 2026 17:10:05 -0700
> On Wed, Apr 8, 2026 at 9:40 AM Jakub Kicinski <kuba@xxxxxxxxxx> wrote:
> >
> > It pains me to report on non-debug kernels:
>
> I'm sorry to have pained you. Despite my best efforts to run with the
> exact same environment and conditions as your CI, my teamd can be
> killed with "teamd -k" but yours hangs (both are version 1.32 on
> Fedora with the same kernel config).

Considering the subsequent "kill" works on the dbg instance (thanks
to 2400s timeout), I guess teamd is somehow stuck at SIGTERM handling
removing team devices in teamd_port_remove_all(). (SIGTERM being masked
sounds unlikely)

https://netdev-ctrl.bots.linux.dev/logs/vmksft/bonding-dbg/results/593802/4-teamd-activebackup-sh/stdout
https://netdev-ctrl.bots.linux.dev/logs/vmksft/bonding-dbg/results/593802/4-teamd-activebackup-sh/stderr
---8<---
[ 759.819815][T21724] test_team1: Port device eth1 removed
[ 759.822323][T21724] test_team1: Port device eth0 removed
[ 790.615687][T21728] test_team2: Port device eth1 removed
[ 790.617445][T21728] test_team2: Port device eth0 removed
---8<---

Adding -N and letting "ip netns del" release the last netns refcnt
and defer device destruction to cleanup_net() may help.


> For v7, I’ll invoke "teamd -k"
> using the timeout utility, or just increase the test timeout.

+1 for the latter, maybe set timeout=300.

daemon_pid_file_kill_wait(SIGTERM, 30) * 2 = 120s, but just in case.

See these files for howto:

$ find tools/testing/selftests/net/ -name settings