Adaptec 2400A going offline...

From: S?bastien (s.jousse_at_free.fr)
Date: 01/28/04


Date: 28 Jan 2004 09:31:12 -0800

Hello,

I have a problem with an Adaptec 2400A on gentoo linux with a
gs-sources 2.4.25_pre6
I have compiled into the kernel the Adaptec I2O raid low level driver
I have 4 disks working in Raid 5, i have the latest bios update, and
it still doesn't work:

Here is my problem, it works fine for 1 hour to 16 hours max for the
moment and i can't access anymore
my disks...

Here is what it shows when it works:
------------------------------------

#cat /proc/scsi/dpt_i2o/0

Adaptec I2O RAID Driver Version: 2.4 Build 5

Vendor: Adaptec Model: 2400A FW:3A0L
SCSI Host=scsi0 Control Node=/dev/dpti0 irq=17
        post fifo size = 255
        reply fifo size = 255
        sg table size = 56

Devices:
        ADAPTEC RAID-5 Rev: 3A0L
        TID=523, (Channel=0, Target=0, Lun=0) (online)

Here is what it shows when it doesn't work:
-------------------------------------------

#cat /proc/scsi/dpt_i2o/0

Adaptec I2O RAID Driver Version: 2.4 Build 5

Vendor: Adaptec Model: 2400A FW:3A0L
SCSI Host=scsi0 Control Node=/dev/dpti0 irq=17
        post fifo size = 255
        reply fifo size = 255
        sg table size = 56

Devices:
        ADAPTEC RAID-5 Rev: 3A0L
        TID=523, (Channel=0, Target=0, Lun=0) (offline)

Here is list of error in syslog:
--------------------------------

Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Trying to Abort cmd=4123
Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Abort cmd not supported
Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Trying to Abort cmd=4133
Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Abort cmd not supported
Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Trying to reset device
Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Device reset not supported
Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Bus reset: SCSI Bus 0:
tid: 11
Jan 28 14:03:30 phonam-fr001 kernel: dpti0: Bus reset success.
Jan 28 14:03:45 phonam-fr001 kernel: dpti0: Trying to Abort cmd=4123
Jan 28 14:03:45 phonam-fr001 kernel: dpti0: Abort cmd not supported
Jan 28 14:03:45 phonam-fr001 kernel: scsi: device set offline - not
ready or command retry failed after bus reset: host 0 channel 0 id
0 lun 0
Jan 28 14:03:55 phonam-fr001 kernel: dpti0: Trying to Abort cmd=4133
Jan 28 14:03:55 phonam-fr001 kernel: dpti0: Abort cmd not supported
Jan 28 14:03:55 phonam-fr001 kernel: scsi: device set offline - not
ready or command retry failed after bus reset: host 0 channel 0 id
0 lun 0
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
10016
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
52690944
Jan 28 14:03:55 phonam-fr001 kernel: EXT3-fs error (device sd(8,1)):
read_block_bitmap: Cannot read block bitmap - block_group = 201, b
lock_bitmap = 6586368
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector 0
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
84700368
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
84700472
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
84700720
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
84701296
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
84701584
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
101187648
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
109838352
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
133431408
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
160694344
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
252706856
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
10232
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector
52690944
Jan 28 14:03:55 phonam-fr001 kernel: EXT3-fs error (device sd(8,1)):
read_block_bitmap: Cannot read block bitmap - block_group = 201, b
lock_bitmap = 6586368
Jan 28 14:03:55 phonam-fr001 kernel: I/O error: dev 08:01, sector 0
Jan 28 14:04:00 phonam-fr001 kernel: I/O error: dev 08:01, sector
10240
Jan 28 14:04:00 phonam-fr001 kernel: I/O error: dev 08:01, sector
84695960
Jan 28 14:04:00 phonam-fr001 kernel: I/O error: dev 08:01, sector
84701408
Jan 28 14:04:00 phonam-fr001 kernel: I/O error: dev 08:01, sector
84701600
Jan 28 14:04:00 phonam-fr001 kernel: I/O error: dev 08:01, sector
84701704
Jan 28 14:04:29 phonam-fr001 kernel: EXT3-fs error (device sd(8,1)):
ext3_readdir: directory #5849319 contains a hole at offset 0
Jan 28 14:04:29 phonam-fr001 kernel: I/O error: dev 08:01, sector 0
Jan 28 14:04:29 phonam-fr001 kernel: I/O error: dev 08:01, sector
93615784
Jan 28 14:04:29 phonam-fr001 kernel: EXT3-fs error (device sd(8,1)):
ext3_readdir: directory #5849323 contains a hole at offset 0
Jan 28 14:04:29 phonam-fr001 kernel: I/O error: dev 08:01, sector 0
Jan 28 14:04:29 phonam-fr001 kernel: I/O error: dev 08:01, sector
93615800
Jan 28 14:04:29 phonam-fr001 kernel: EXT3-fs error (device sd(8,1)):
ext3_readdir: directory #5849325 contains a hole at offset 0
Jan 28 14:04:29 phonam-fr001 kernel: I/O error: dev 08:01, sector 0
Jan 28 14:04:30 phonam-fr001 kernel: I/O error: dev 08:01, sector
235143200
Jan 28 14:04:30 phonam-fr001 kernel: EXT3-fs error (device sd(8,1)):
ext3_get_inode_loc: unable to read inode block - inode=14696530, b
lock=29392900

I have no idea why it does that, I have a gigabyte 7N400-L and there
is no way to disable APM or ACPI... :(

Thanks for all the help!