bad blocks... random death

From: Thierry ITTY (thierry.itty_at_besancon.org)
Date: 08/13/04

  • Next message: Crucificator: "Re: Keep losing RAID device"
    Date: Fri, 13 Aug 2004 11:32:45
    To: redhat-list@redhat.com
    
    

    this continues discussions about bad disk blocks not really bad and redhat
    9 dying randomly

    we're now a few on this list experiencing various symptoms (dma errors, bad
    blocks on disks, system freeze or death) that look like hardware problems.
    after talking togeteher we can now say that those problems are pure OS
    problems.

    the disks with bad blocks work actually fine elswhere (in my case I ran the
    manufacturer low-level diags and no disk had any problem. and, ain't it
    very strange that 10 disks get the same problems at the same time ?!!!)

    the problem happens on various machines (gigabyte, asus, athlon, pentium,
    maxtor, western...).

    it seems it is related to high load periods (in my case a heavily used file
    server).

    we've been advised to change dma disks settings. I tried various things (no
    dma at all, forcing mdma0 or udma2). the system behave differently (either
    no errors or other errors as dma timeouts), but it's not working quite well
    (for example deactivating dma on disks lowers the average network
    throughput from 50 MB/s to 1.5 !!! almost 40 times slower !!!

    we really need help to investigate this problem which causes io errors and
    fs corruption !

    tia

    -- 
    redhat-list mailing list
    unsubscribe mailto:redhat-list-request@redhat.com?subject=unsubscribe
    https://www.redhat.com/mailman/listinfo/redhat-list
    

  • Next message: Crucificator: "Re: Keep losing RAID device"

    Relevant Pages

    • Re: DMA settings
      ... "Dick Snow" wrote in message ... > I was having some problems recently writing to DVD+RW disks for my ... > I was able to make DVD-R disks OK and the CD-RW drive was functioning OK ... > DMA if Available, but the current transfer mode is always showing PIO ...
      (microsoft.public.windowsxp.setup_deployment)
    • dma_intr: error=0x84 DriveReady SeekComplete Error
      ... error messages referring to a DMA issue. ... unsupported controller or dying disks. ... ide1: reset: success ...
      (comp.unix.admin)
    • Re: dma problems with Serverworks CSB5 chipset
      ... Just want to report that after changing the disks to two Maxtor 6Y160P0 ... > same DMA timeout error and often it locks up completely. ... > If i disable DMA with hdparm the system works fine. ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: NetBSD1.6.1 booting sometimes and sometimes not, may be IDE problem?
      ... > IMHO this means problems with either DMA or IRQ. ... > disks, looks more a problem with the controller. ... The compatibility mode is only used to ...
      (comp.unix.bsd.netbsd.misc)
    • Re: Weird system locks
      ... >> I am expreriencing a weird system locking issue on my RedHat 8, ... I am assuming that because i can reproduce the situation when ... Could this be a memory related problem? ... thrashing the disks around searching/scanning for ...
      (comp.os.linux.misc)