Re: OOM killer gripe (was Re: What still uses the block layer?)



On Monday 15 October 2007 19:52, Rob Landley wrote:
On Monday 15 October 2007 8:37:44 am Nick Piggin wrote:
Virtual memory isn't perfect. I've _always_ been able to come up with
examples where it just doesn't work for me. This doesn't mean VM
overcommit should be abolished, because it's useful more often than
not.

I hate to go completely offtopic here, but disks are so incredibly
slow when compared to RAM that there is really nothing the kernel
can do about this.

I know.

Presumably the job will finish, given infinite
time.

I gave it about half an hour, then it locked solid and stopped writing to
the disk at all. (I gave it another 5 minutes at that point, then held
down the power button.)

Maybe it was a bug then. Hard to say without backtraces ;)


You really shouldn't configure
so much unless you do want the kernel to actually use it all, right?

Two words: "Software suspend". I've actually been thinking of increasing
it on the next install...

Kernel doesn't know that you want to use it for suspend but not
regular swapping, unfortunately.


Because if we're not really conservative about OOM killing, then the
user who actually really did want to use all the swap they configured
gets angry when we kill their jobs without using it all.

I tend to lower "swappiness" and when that happens all sorts of stuff goes
weird. Software suspend used to say says it can't free enough memory if I
put swappiness at 0 (dunno if it still does). This time the OOM killer
never triggered before hard deadlock. (I think I had it around 20 or 40 or
some such.)

Would an oom-kill-someone-now sysrq be of help, I wonder?

*shrug* It might. I was a letting it run hoping it would complete itself
when it locked solid. (The keyboard LEDs weren't flashing, so I don't
_think_ it paniced. I was in X so I wouldn't have seen a message...)

If you can work out where things are spinning/sleeping when that happens,
along with sysrq+M data, then it could make for a useful bug report. Not
entirely helpful, but if it is a reproducible problem for you, then you
might be able to get that data from outside X.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/