Re: BUG: NTPL: waitpid() doesn't return?

From: Roland McGrath (roland_at_redhat.com)
Date: 01/31/04

  • Next message: Willem Riede: "Re: The survival of ide-scsi in 2.6.x [PATCH 3/3]"
    Date:	Sat, 31 Jan 2004 13:52:43 -0800
    To: "Matthias Urlichs" <smurf@smurf.noris.de>
    
    

    > Your test program works... except that it reports, when I strace it,
    >
    > [pid 10629] waitpid(10631, Process 10629 suspended
    > <unfinished ...>
    > [pid 10628] <... mmap2 resumed> ) = 0x41966000
    > [pid 10630] waitpid(10632, Process 10630 suspended
    > <unfinished ...>
    >
    > Those "Process ### suspended" messages did NOT happen with the Python
    > script that exhibits the bug.

    This is an strace bug. Because of goofy ptrace interactions, strace does
    funny business with threads doing wait calls. strace should resume those
    threads when the pids they are waiting for exit.

    Make sure you are using the most current strace and if it's still not
    different then report the strace bug (<strace-devel@lists.sourceforge.net>
    or https://bugzilla.redhat.com are fine).
      
    To research the issue thoroughly, you may have to avoid relying on strace
    to tell you what calls your programs make.
    -
    To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at http://vger.kernel.org/majordomo-info.html
    Please read the FAQ at http://www.tux.org/lkml/


  • Next message: Willem Riede: "Re: The survival of ide-scsi in 2.6.x [PATCH 3/3]"

    Relevant Pages

    • strace, accept(), ERESTARTSYS and EINTR
      ... as I understand it this is an artefact of how strace works, and my code will not have seen accept return at all at that point. ... the strace output does not show any other return from the call to acceptbefore reporting that thread's call to fcntl(). ... http://lkml.org/lkml/2001/11/22/65 - Phil Howard reports getting ERESTARTSYS returned from accept, not only in the strace output, and fixed his problem by treating it like EINTR. ... It's quite possible that some process resource limit had been reached; could this have confused the glibc syscall wrapper, causing it to return the mysterious -512? ...
      (Linux-Kernel)
    • Re: skype: Internet telephony
      ... > hosts it accesses and how. ... There was a thread about it in linux-il, most reports were about debian ... strace but a problem connecting strace to am already running process. ...
      (Debian-User)
    • Re: skype: Internet telephony
      ... I've only used it on windows 2000, but for what it's worth, I've had ... > There was a thread about it in linux-il, most reports were about debian ... > strace but a problem connecting strace to am already running process. ...
      (Debian-User)
    • Re: the sys call top
      ... but are you sure the command is "strace who" ... because it doesn`t work on my comp ... it reports the error can`t open dev/log ...
      (comp.os.linux.development.apps)
    • Re: BUG: NTPL: waitpid() doesnt return?
      ... > The strace case I'm more than willing to pass off as a strace problem. ... > I find it quite common that strace doesn't detach from processes, ... bert's test program exhibits exactly the same clone ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)