Re: Random reboots



On Wed, Feb 15, 2006 at 04:41:39PM +0100, Jean Delvare wrote:

Hi Ryan,

On 2006-02-15, Ryan Richter wrote:
The sensors report a bunch of obvious nonsesne as always... I keep them
configured in with the hope that one day they'll report useful
information, but that day hasn't come yet. I just checked, and all the
fans are still fine. It's in a huge case with lots of fans and it's
hardly warmer than room temp. The opteron 240s don't put out much heat.

The sensors might just need some board-specific configuration. May I ask
which motherboard this is?

I may help you (in private) setup your sensors. If you're interested,
send the output of "sensors-detect" and "sensors" to me and I'll
see what can be done to improve the reported values.

It's a Tyan S2880, and I'm using their sensors.conf:

ftp://ftp.tyan.com/software/lms/lms_s2880.tgz

Here's what sensors reports:

w83627hf-isa-0290
Adapter: ISA adapter
VCore 1: +1.54 V (min = +1.47 V, max = +1.62 V) ALARM
VCore 2: +1.54 V (min = +1.47 V, max = +1.62 V) ALARM
+3.3V: +3.33 V (min = +3.14 V, max = +3.46 V)
+5V: +4.97 V (min = +4.73 V, max = +5.24 V)
+12V: +4.56 V (min = +10.82 V, max = +13.19 V)
-12V: -2.25 V (min = -13.18 V, max = -10.88 V)
-5V: -3.94 V (min = -5.25 V, max = -4.75 V)
V5SB: +5.51 V (min = +4.73 V, max = +5.24 V)
VBat: +1.28 V (min = +2.40 V, max = +3.60 V)
fan1: 4354 RPM (min = -1 RPM, div = 2)
fan2: 3479 RPM (min = 5273 RPM, div = 2)
fan3: 0 RPM (min = 30681 RPM, div = 2)
temp1: +77°C (high = -128°C, hyst = -128°C) sensor = thermistor
temp2: +77.5°C (high = +80°C, hyst = +75°C) sensor = thermistor
temp3: +77.5°C (high = +80°C, hyst = +75°C) sensor = thermistor
vid: +1.550 V (VRM Version 2.4)

The temps and +/-12V readings are obviously wrong, and always have been
AFAIR. I've run the machine with 6 more 10krpm old full-height drives
than it currently has. I checked the max 5V and 12V current draw of the
drives and specced the power supply carefully when we bought it a couple
years ago, and it has lots of headroom on both of those rails.

Two more random thoughts:

Any reason why you run 2.6.15 rather than 2.6.15.4? That's where I would
start if I was suspecting a kernel bug.

Did you already update the BIOS to the latest version available? There
are a few kernel complaints in your dmesg which might be solved by a
newer BIOS (and/or parameter changes in the BIOS setup).

I'll be booting 2.6.15.4 this weekend. The BIOS is indeed old, and I
see there's a newer one that came out a year ago. It'll be a while
before I can try it, I need to scare up a keyboard, video card, and
monitor, not to mention a DOS disk. You can tell why I haven't flashed
the BIOS in years...

Still, I don't see why the new kernel shouldn't be stable if 2.6.11.3
was.

Thanks,
-ryan
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: Reporting ThinkPad BIOS bugs
    ... I have standard kernel logs with the behaviors, ... on a T60p/T61p and test it on a Linux box that isn't one of those types ... Ask the BIOS authors. ... Even if the person reading the report doesn't know Linux well, ...
    (Linux-Kernel)
  • Re: a long rebuttal to the Linux-is-the-engine fallacy
    ... It does rely on probing and some BIOS configuration tables to ... hardware, printing its output pixel-by-pixel on the screen? ... The difference between memtest and the kernel is that memtest *relies* on bios ...
    (Fedora)
  • Re: [opensuse] Kernel only see 3 of 4GB
    ... But in order to get a 32bit kernel that supports PAE, ... because the BIOS reserves some address space for PCI ... configuration option. ... So maybe BIOS is configured for PAE and has problems with 64bit kernels? ...
    (SuSE)
  • Re: Linux 2.6.25 (coretemp reads high temperatures)
    ... it's concerning much higher temperatures being read out by the ... The reason is that bios did report same temperatures as coretemp in 2.6.24, ... And the driver did say in kernel log that TJMAX is 85C ...
    (Linux-Kernel)
  • Re: speedstep-centrino: ENODEV
    ... Pallipadi, Venkatesh wrote: ... Things seem to be fine with the BIOS. ... kernel. ... I'll report back, ...
    (Linux-Kernel)