Re: 2.6.19 file content corruption on ext3





On Thu, 28 Dec 2006, Marc Haber wrote:

After being up for ten days, I have now encountered the file
corruption of pkgcache.bin for the first time again. The 256 MB i386
box is like 26M in swap, is under very moderate load.

I am running plain vanilla 2.6.19.1. Is there a patch that I should
apply against 2.6.19.1 that would help in debugging?

Not right now.

And I have a test-program that shows the corruption _much_ easier (at
least according to my own testing, and that of several reporters that back
me up), and that seems to show the corruption going way way back (ie going
back to Linux-2.6.5 at least, according to one tester).

So it just got a lot _easier_ to trigger in 2.6.19, but it's not a new
bug.

What we need now is actually looking at the source code, and people who
understand the VM, I'm afraid. I'm gathering traces now that I have a good
test-case. I'll post my trace tools once I've tested that they work, in
case others want to help.

(And hey, you don't have to be a VM expert to help: this could be a
learning experience. However, I'll warn you: this is _the_ most grotty
part of the whole kernel. It's not even ugly, it's just damn hard and
complex).

Linus
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: [2.4] NMI WD detected lockup during page alloc
    ... you may want to disable compression, this sounds like mm corruption and ... I assume this is the first time you've got the nmi watchdog ... send the line "unsubscribe linux-kernel" in ...
    (Linux-Kernel)
  • Re: Access 97 / XP sharing problem
    ... Thanks for the reply, Steve. ... It looks as though the corruption may have ... been introduced the first time the XP user touched it. ... Prev by Date: ...
    (comp.databases.ms-access)
  • RE: where do I find sku1e3.cab?
    ... Is this the first time this happens? ... this is your first update then this may be caused by corruption of your Local ... LIS TOOL ...
    (microsoft.public.officeupdate)
  • Corrupt Word 2000 Document
    ... Sorry about the accidental empty post preceding this one. ... What causes this corruption to happen, because it isn't the first time ...
    (microsoft.public.word.docmanagement)