Re: Bonnie++ with 1024k stripe SW/RAID5 causes kernel to goto D-state



Justin Piszcz wrote:
Kernel: 2.6.23-rc8 (older kernels do this as well)

When running the following command:
/usr/bin/time /usr/sbin/bonnie++ -d /x/test -s 16384 -m p34 -n 16:100000:16:64

It hangs unless I increase various parameters md/raid such as the stripe_cache_size etc..

# ps auxww | grep D
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
root 276 0.0 0.0 0 0 ? D 12:14 0:00 [pdflush]
root 277 0.0 0.0 0 0 ? D 12:14 0:00 [pdflush]
root 1639 0.0 0.0 0 0 ? D< 12:14 0:00 [xfsbufd]
root 1767 0.0 0.0 8100 420 ? Ds 12:14 0:00 root 2895 0.0 0.0 5916 632 ? Ds 12:15 0:00 /sbin/syslogd -r

See the bottom for more details.

Is this normal? Does md only work without tuning up to a certain stripe size? I use a RAID 5 with 1024k stripe which works fine with many optimizations, but if I just boot the system and run bonnie++ on it without applying the optimizations, it will hang in d-state. When I run the optimizations, then it exits out of D-state, pretty weird?

Not at all. 1024k stripes are way outside the norm. If you do something way outside the norm, and don't tune for it in advance, don't be terribly surprised when something like bonnie++ brings your box to its knees.

That's not to say we couldn't make md auto-tune itself more intelligently, but this isn't really a bug. With a sufficiently huge amount of RAM, you'd be able to dynamically allocate the buffers that you're not pre-allocating with stripe_cache_size, but bonnie++ is eating that up in this case.

-- Chris
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: Tk application hangs
    ... explicitly reached eof and it hangs? ... |>after running just one command. ... |>"List Packages" is supposed to list all available DarwinPorts packages ...
    (comp.lang.tcl)
  • Re: linux bewbie lame question
    ... It just hangs, even once I have logged out. ... just type "mozilla" from command line or "netscape" this is more stable if you ... konqueror, but this is how I get arround my system. ...
    (Debian-User)
  • Re: Tk application hangs
    ... KW> explicitly reached eof and it hangs? ... KW> | Kevin Walzer wrote: ... KW> |>after running just one command. ... KW> |>"List Packages" is supposed to list all available DarwinPorts packages ...
    (comp.lang.tcl)
  • Executing a remote command with ssh sometimes hangs
    ... where ssh _sometimes_ hangs when executing a command on a remote machine. ... I have a shell script, which deletes some files on remote machines. ...
    (SSH)
  • [HPADM] Re: How to determine stripe size and stripe width on HPUX 11.11 for MBRC
    ... How can I determine the values for stripe width and stripe size on HP-UX 11.11? ... I thought perhaps the lvdisplay command might have the info. ... I also looked at the vgdisplay command, part of which I have included below. ...
    (HP-UX-Admin)