Re: XFS: how to NOT null files on fsck?
From: Helge Hafting (helge.hafting_at_hist.no)
Date: 08/05/04
- Previous message: Andrew Morton: "Re: [RFC/PATCH] FUSYN Realtime & robust mutexes for Linux, v2.3.1"
- In reply to: L A Walsh: "Re: XFS: how to NOT null files on fsck?"
- Next in thread: Nathan Scott: "Re: XFS: how to NOT null files on fsck?"
- Reply: Nathan Scott: "Re: XFS: how to NOT null files on fsck?"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Date: Thu, 05 Aug 2004 10:16:02 +0200 To: L A Walsh <lkml@tlinx.org>
L A Walsh wrote:
>
>> The problem relates to an updated inode size
>> being flushed ahead of the data behind it (hence a size update
>> can make it out before delayed allocate extents do, and we end
>> up with a hole beyond the end of file, which reads as zeroes).
>>
> I believe I understand the scenario you are talking about, but I don't
> think it fits the examples I have referred to. In particular,
> "/etc/fstab".
> I update 'fstab' on Tuesday, say, it works fine...gets backed up just
> fine...and I forget about it and move on. Then, 2-3 days later, my
> system crashes and doesn't want to some up. That's odd, usually after
> a crash, it just burps a bit and comes back up. I grumble and go for
> single user. Turns out my 1.2k fstab file is all "nulls".
> Coinidentally,
> I find, _maybe_, a couple of other files written around the same time,
> also nulled, including times when the nulls appeared in the system log
> for that time period!
> Now I know it takes a while before data may end up on disk and that it
> may not go out to disk in an ordered fashion, but 2-3 days?
Seems strange to me, but the amount of delay is entirely up to the
filesystem.
> This isn't
> a case of a multi-extent file. My current fstab is only 1335 bytes long.
> I doubt it has ever been more than 2.
> My filesystems all use the Allocation unit (AU) size allowed. I wish
> for something larger than a 4k AU size but I'm told it is limited by
> the linux page size and to find a PC that uses the IA64 page size to
> use larger file AU size (but I haven't seen to many of these IA64
> machines
> available from Dell or Gateway...:-) Maybe the code in FAT32 that
> handles
> larger AU's could be ported to XFS? If FAT32 can do it...nevermind...
> I'm sure there are more important issues on the plate.
>
>>> Apparently not easily reproduced, no one has a clue why it does it.
>>> Just does.
>>
>> No, its actually well known why it behaves this way.
>> We are looking into ways to address this, and have some
>> ideas - the trick is fixing it without hurting write
>> performance - which we will do, its just not trivial.
>>
> You could increase the max AU size :-) But more seriously, is my
> example of writing a 1 AU sized file that becomes zeroed days later
> an example of the problem you are speaking of?
>
>> There are several techiques to reduce the impact of this
>> behaviour, as others have described (or see the linux-xfs
>> archives).
>>
> Like setting the disk for synchronous writes? Why not something
> in between, like guaranteeing the info on a mostly quiescent machine
> will be written to disk within an hour or so? Or is that not "it"?
>
This should be trivial. Edit your crontab, so that cron will
run "sync" once per hour. Everything queued for writing when the
"sync" command is issued will be on disk when the command finishes.
So this guarantees that nothing waits more than 1 hour. (Sync is usually
over in a few seconds on a home machine. There should be no more
lost "old" files unless the fs has a bug.)
You may also want to run "sync" manually before doing something that
risks crashing. (Such as moving a live machine, dubious hotplugging,
testing beta device drivers . . .)
> I haven't seen an incidence of this behavior in several months on
> my machines so my particular problem may have been fixed and the
> problem you speak of is unrelated to my own, but the number of
> unplanned shutdowns on my system has only increased recently, since I
> upgraded
> to the stable 2.6 series, whereas before, with 2.4, it could be months
> between "blue screens".
You may want to keep using 2.4 for a while then - it probably _is_ a lot
more stable.
It has been stabilizing for the entire 2.5 development time,
2.6 stabilization has just begun!
>
> Sad was the day that it was decided that the linux-kernel "corp" decided
> on feature development vs. stability in the "stable" kernel series.
> Isn't that criticism lodged most often against MS.
Not a big problem in this case. If XFS isn't stable enough for you,
consider one
of the many other filesystems. ext3 or reiserfs for journalling, or
plain old
ext2. The nice thing about having many features, is that you have a set
to choose from.
Helge Hafting
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
- Previous message: Andrew Morton: "Re: [RFC/PATCH] FUSYN Realtime & robust mutexes for Linux, v2.3.1"
- In reply to: L A Walsh: "Re: XFS: how to NOT null files on fsck?"
- Next in thread: Nathan Scott: "Re: XFS: how to NOT null files on fsck?"
- Reply: Nathan Scott: "Re: XFS: how to NOT null files on fsck?"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Relevant Pages
|