Re: wait() and strace -f

From: OGAWA Hirofumi (hirofumi@mail.parknet.co.jp)
Date: Wed Dec 19 2001 - 10:26:38 EST


Kurt Roeckx <Q@ping.be> writes:

> > Probably, it's feature (or bug) of strace. I'm seems, if strace has
                                                             ^^^^^^
Sorry, s/strace/the trace process/

> > child, trace of a child is started before wait() of parent. Then,
> > exit() of child continue wait() of parent.
>
> If I understand what you're saying, sleep(1) in child1, and
> sleep(2) in the parent should fix the problem, which it doesn't.
>
> And it still doesn't explain why it only happens with 2 childs.

As far as I read the source, it seems strace is not counting the
zombie. And strace wait exit() of child2 before restarting the wait()
of parent.

        strace parent child1 child2
                                      zombie

                       sleep(1) sleep(10)
                     before wait()
      trap wait()

                                                 before exit()
      trap exit()
     restart child2
                                                   run exit()
     restart parent
                      run wait()

> Maybe I should have mentioned this before: the wait will clean up
> the first child at the time the second child dies, or atleast
> that's what wait() returns.
>
> > > if (!fork())
> > > {
> > > /* Child 1. */
> > sleep(2);
> > > return 0;
> > > }
> >
> > The above change is continued the parent after 2 seconds.
>
> I know that too, as I said, only when child 1 dies before the
> parent calls wait().

strace-4.4/process.c in strace_4.4-1.tar.gz

diff -u /tmp/t/strace-4.4/process.c.orig /tmp/t/strace-4.4/process.c
--- /tmp/t/strace-4.4/process.c.orig Fri Aug 3 20:51:28 2001
+++ /tmp/t/strace-4.4/process.c Wed Dec 19 08:20:05 2001
@@ -1349,7 +1349,7 @@
                 /* WTA: fix bug with hanging children */
                 if (!(tcp->u_arg[2] & WNOHANG) && tcp->nchildren > 0) {
                         /* There are traced children */
- tcp->flags |= TCB_SUSPENDED;
+ /* tcp->flags |= TCB_SUSPENDED; */
                         tcp->waitpid = tcp->u_arg[0];
                 }
         }

Try the above patch. This restart wait() immediately. However, probably
it will break something of other. ;)

-- 
OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Sun Dec 23 2001 - 21:00:19 EST