Re: TIOCGWINSZ retuns old pty size after receiving SIGWINCH

From: Ico Doornekamp
Date: Tue Aug 19 2008 - 16:13:33 EST




* On 2008-08-19 Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> wrote :

> On Tue, 19 Aug 2008 19:56:39 +0200
> Ico Doornekamp <lkml@xxxxxxx> wrote:
>
> >
> >
> > * On 2008-08-19 Ico Doornekamp <lkml@xxxxxxx> wrote :
> >
> > >
> > >
> > > * On 2008-08-19 Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> wrote :
> > >
> > > > On Tue, 19 Aug 2008 09:54:14 +0200 Ico Doornekamp <lkml@xxxxxxx> wrote:
> > > > >
> > > > > * On 2008-08-19 Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> wrote :
> > > > >
> > > > > > On Sun, 10 Aug 2008 17:08:59 +0200 Ico Doornekamp <lkml@xxxxxxx> wrote:
> > > > > >
> > > > > > > Recently my X terminals showed annoying behaviour where the application
> > > > > > > in the terminal was not resized properly to the actual size of the X
> > > > > > > terminal emulator window, resulting in a lot of misaligned text on the
> > > > > > > screen. Hunting the issue down from the windowmanager and the terminal
> > > > > > > emulator program, I suspect the problem might lie in the kernel. I'm
> > > > > > > running 2.6.26 on a dual core i386.
> > > > > > >
> > > > > > > What I see is this: the userspace application receives a SIGWINCH signal
> > > > > > > and acquires the terminal size usign the TIOCGWINSZ ioctl. It seems that
> > > > > > > in some cases the old instead of the new terminal size is returned.
> > > > > > > A small delay before the ioctl seems to 'fix' this behaviour.
> > > > > > >
> > > > > > > I noticed some changes involving locking in the the pty code in the last
> > > > > > > kernel verions, could one of these changes cause the above behaviour ? If
> > > > > > > so, wouldn't this affect much more users ?
> > > > > >
> > > > > > hm, that code is pretty simple and although it does the SIGWINCH and
> > > > > > the window-size setting in a peculiar order, it looks to be race-free.
> > > > > >
> > > > > > Approximately what proportion of the time does it go wrong?
> > > > >
> > > > > I guess about 10 to 20% of the resizes. I happen to be using a tiling
> > > > > window manager which causes resizing more often and more agressive then
> > > > > 'normal' window managers, I guess this helps triggering the problem.
> > > > >
> > > > > I temporary worked around this issue this by changing the order of the
> > > > > signal and the updating of the pty size in tty_io.c's tiocswinsz(), but
> > > > > this is not much of a real fix.
> > > > >
> > > >
> > > > Well damn. Are you sure? The code looks solid to me.
> > > >
> > > > At least, it does after
> > > >
> > > > Author: Alan Cox <alan@xxxxxxxxxx> 2008-08-15 02:39:38
> > > > Committer: Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx> 2008-08-15 10:34:07
> > > > Parent: 000b9151d7851cc1e490b2a76d0206e524f43cca (Fix race/oops in tty layer after BKL pushdown)
> > > > Branches: git-cifs, git-ia64, git-nfs, git-powerpc-merge, linux-next, remotes/origin/master
> > > > Follows: v2.6.27-rc3
> > > > Precedes: next-20080818
> > > >
> > > > tty: remove resize window special case
> > > >
> > > > perhaps you're still running a kernel which is earlier than that?
> > >
> > > I reported the behaviour on 2.6.26.2, but I was not aware this issue was
> > > adressed already in the 2.6.27 tree. I will update to the latest 2.6.27
> > > rc and report if the problem still persists.
> >
> > I am not able to reproduce the problem with git 2.6.27-rc3-next-20080819.
>
> That's linux-next, yes?
>
> linux-next has additional locking in there:
>
> commit 2283faa9cec083b6ddc1fa02a974ce1c797e847f
> Author: Alan Cox <alan@xxxxxxxxxx>
> Date: Thu Aug 14 09:53:22 2008 +1000
>
> tty-fix-pty-termios-race
>
> Kanru Chen posted a patch versus the old code which deals with the case
> where you resize the pty side of a pty/tty pair. In that situation the
> termios data is updated for both pty and tty but the locks are not held
> on both sides.
>
> This reimplements the fix against the updated tty code. Patch by self but
> the hard bit (noticing and fixing the bug) is thanks to Kanru Chen.
>
> Signed-off-by: Alan Cox <alan@xxxxxxxxxx>
>
> diff --git a/drivers/char/tty_io.c b/drivers/char/tty_io.c
> index a8ddcba..779c6b5 100644
> --- a/drivers/char/tty_io.c
> +++ b/drivers/char/tty_io.c
> @@ -2068,7 +2068,7 @@ static int tiocgwinsz(struct tty_struct *tty, struct winsize __user *arg)
> /**
> * tty_do_resize - resize event
> * @tty: tty being resized
> - * @real_tty: real tty (if using a pty/tty pair)
> + * @real_tty: real tty (not the same as tty if using a pty/tty pair)
> * @rows: rows (character)
> * @cols: cols (character)
> *
> @@ -2085,6 +2085,14 @@ int tty_do_resize(struct tty_struct *tty, struct tty_struct *real_tty,
> mutex_lock(&tty->termios_mutex);
> if (!memcmp(ws, &tty->winsize, sizeof(*ws)))
> goto done;
> +
> + /* If a pty/tty pair is updated we will have a real_tty defined
> + which doesn't match the tty. In this case as we will update
> + both of the tty termios sets. We can lock both mutex safely here
> + as in this case real_tty is the tty, tty is the pty side and we
> + have lock ordering */
> + if (real_tty != tty)
> + mutex_lock(&real_tty->termios_mutex);
> /* Get the PID values and reference them so we can
> avoid holding the tty ctrl lock while sending signals */
> spin_lock_irqsave(&tty->ctrl_lock, flags);
> @@ -2102,6 +2110,8 @@ int tty_do_resize(struct tty_struct *tty, struct tty_struct *real_tty,
>
> tty->winsize = *ws;
> real_tty->winsize = *ws;
> + if (real_tty != tty)
> + mutex_unlock(&real_tty->termios_mutex);
> done:
> mutex_unlock(&tty->termios_mutex);
> return 0;
>
> which perhaps fixed the problem.
>
>
> This bug may have been present in released kernels for some time - I
> think I read in another thread that CPU scheduler changes might have
> caused it to surface. Doesn't matter really - we don't want
> user-visible races like this affecting desktop applications in stable
> kernels!
>
> Can you please help us to complete this list?

2.6.25.9: good
2.6.26-rc1 bad ( +/- 10% of the resizes)
2.6.26.2: still bad ( +/- 10% of the resizes )
2.6.27-rc3: ugly bad ( > 75% of the resizes )
linux-next good

Any other versions to test ?

--
:wq
^X^Cy^K^X^C^C^C^C
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/