Re: Hard kill a process?



x0054 wrote in <Xns97D2EEAEA1F1Cx0054indexcom@xxxxxxxxxxxxxx> on Tue, May 30
2006 02:32:

Just wondering, if you can point me to an article that discusses this
topic more I would be grateful. It sounds very interesting to me.


Sure thing -- this is just the first hit back from Google:

,----[ http://linuxgazette.net/issue83/tag/6.html Quote ]
| A process which ends up in "D" state for any measurable length of time is
| trapped in the midst of a system call (usually an I/O operation on a
| device --- thus the initial in the ps output).
|
| Such a process cannot be killed --- it would risk leaving the kernel in an
| inconsistent state, leading to a panic. In general you can consider this
| to be a bug in the device driver that the process is accessing.
|
| Once case that I can't consider to be a Linux bug occurs when one is
| attempting to access a hard mounted NFS exported share without the intr
| (mount( 8)) flag. If the NFS server (or the network connection thereto)
| becomes unavailable all processes that try to access any part of that
| share will be set into D state. (Use intr or soft mount options on NFS to
| avoid all that). I might consider that to be a design bug in the NFS
| protocol --- but that might be contentious. That particular NFS behavior
| can actually be a feature in some cases; I think it should NOT be the
| default, though.
|
| When it comes to scsi and similarly local device drivers --- I would
| report cases of "D" state as bugs (after due diligence of checking for
| prior reports, updates, and perhaps trying to troubleshoot it a bit).
`----

This article covers all sorts of other situations, centering around the
unkillable process. Some of the info is pretty good, actually.

Google should be able to tell you more, if you need it.

- Mike

--
Registered Linux User #417338, machine #325045.

....and that is how we know the Earth to be banana-shaped.
.



Relevant Pages

  • Re: Please help!
    ... This bug check is issued if paged memory is accessed ... installation of a faulty device driver, system service, or BIOS. ...
    (microsoft.public.windowsxp.help_and_support)
  • Re: sshfs/nfs cause server lockup
    ... he stated he only gets the bug on sis rl and fxp. ... but I think that the 'NFS via TCP' thread ... I've tested the cards above, and the person I replied to encountered the ... So far I've not run into an NFS server deadlock you ...
    (freebsd-stable)
  • can device drivers return non-ram via vm_ops->nopage?
    ... definitive confirmation from you about the ->nopage API. ... the bug triggering of course is the ... If we want to return non-ram, we could, but I believe we should change ... a device driver is returning a non-ram page via ->nopage. ...
    (Linux-Kernel)
  • Re: Linux v2.6.17-rc6
    ... Trond Myklebust have created a patch which we have verified solves this ... on submitting it for 2.6.17 since it is not a critical bug. ... I guess that depends on how much it slows down and how much you depend on the speed of NFS. ... The initial bug report makes it look as if it's about two orders of magnitude slower, but doesn't quantify the effect on more common sequential access operations. ...
    (Linux-Kernel)
  • Re: sshfs/nfs cause server lockup
    ... ie the inability to mount NFS v3 ... I've tested the cards above, and the person I replied to ... encountered the same bug with a bge card. ... So far I've not run into an NFS server ...
    (freebsd-stable)