Re: bad blocks on raid5 cause filesystem failure
From: alazarev (alazarev_at_itg.uiuc.edu)
Date: 09/21/05
- Previous message: Alex Bazan: "Re: System freezing"
- In reply to: kermit: "Re: bad blocks on raid5 cause filesystem failure"
- Next in thread: Michael: "Re: bad blocks on raid5 cause filesystem failure"
- Reply: Michael: "Re: bad blocks on raid5 cause filesystem failure"
Date: 21 Sep 2005 12:01:53 -0700
Thanks for the informative post. I've got a few questions though.
1) Do you have a link to the report you read that describes the
probability of a double fault? Sounds like an interesting read for me.
2) Correct me if I'm wrong, but if two blocks on a drive happen to
fail at the same time, before the rebuild can finish recovering the
first from parity, then you will have a problem unless you have double
parity? Fine, but then what about 3 bad blocks in a row? At some point, the RAID
controller should, like you say, stop all host IO and report the drive
failed, and then rebuild the drive from parity. How many bad blocks in
a row should cause this drive failure, three or more, right? Since we
saw about 10 bad block failures all with the same time stamp, double
parity would not have helped us at all. The only thing that would have
helped us is a RAID controller that would stop IO to the host. Instead,
our RAID still provided "fake access" for the host and thus the fs
failure. Sound legit to you? Any idea what this functionality is
called, so I know to avoid it when shopping around for a new RAID? I
suppose SCSI provides much better reliability in this respect. Too bad,
we are already in the SATA hole. Too much data to afford moving it to
SCSI.
3) Double parity is also called RAID 6, right? Does RAID 6 provide
double parity at the block level? Or only at the drive level?
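To make the parity questions in 2) and 3) concrete, here is a minimal sketch of single-parity reconstruction, assuming plain XOR parity computed per stripe (the textbook RAID 5 scheme). The names xor_blocks and rebuild_stripe are made up for illustration, and nothing here models how any particular controller decides to fail a drive or stop host IO.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR a list of equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def rebuild_stripe(stripe):
    """Repair one stripe (data blocks plus one parity block).

    An unreadable block is represented as None. With single parity,
    at most one block per stripe can be recomputed from the survivors.
    """
    missing = [i for i, block in enumerate(stripe) if block is None]
    if len(missing) > 1:
        # Two unknowns, one parity equation: not recoverable with single parity.
        raise ValueError("more than one block lost in this stripe")
    repaired = list(stripe)
    if missing:
        survivors = [block for block in stripe if block is not None]
        # XOR of all surviving blocks (data and parity) yields the lost block.
        repaired[missing[0]] = xor_blocks(survivors)
    return repaired


# Three data blocks on three drives, parity on a fourth (hypothetical contents).
data = [b"AAAA", b"BBBB", b"CCCC"]
stripe = data + [xor_blocks(data)]

# One bad block in the stripe: recoverable.
one_lost = [stripe[0], None, stripe[2], stripe[3]]
recovered = rebuild_stripe(one_lost)

# Two bad blocks in the same stripe (on different drives): not recoverable.
two_lost = [None, None, stripe[2], stripe[3]]
try:
    rebuild_stripe(two_lost)
    double_fault_recoverable = True
except ValueError:
    double_fault_recoverable = False
```

Under this model, what matters is how many blocks are lost within the same stripe, not how many bad blocks a single drive reports: several bad blocks on one drive land in different stripes, and each stripe is still missing only one block. A second parity block per stripe, as in RAID 6, would allow two losses per stripe.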
Thanks,
Alex