Re: [PATCH] NFS regression in 2.6.26?, "task blocked for more than 120 seconds"



On Mon, Oct 20, 2008 at 4:51 AM, Max Kellermann <mk@xxxxxxxxxx> wrote:
On 2008/10/17 16:33, Glauber Costa <glommer@xxxxxxxxxx> wrote:
That's probably something related to apic congestion.
Does the problem go away if the only thing you change is this:


@@ -891,11 +897,6 @@ do_rest:
store_NMI_vector(&nmi_high, &nmi_low);

smpboot_setup_warm_reset_vector(start_ip);
- /*
- * Be paranoid about clearing APIC errors.
- */
- apic_write(APIC_ESR, 0);
- apic_read(APIC_ESR);
}


Please let me know.

Hello Glauber,

I have rebooted the server with 2.6.27.1 + this patchlet an hour ago.
No problems since.

Hardware: Compaq P4 Xeon server, Broadcom CMIC-WS / CIOB-X2 board.
Tell me if you need more detailed information.


There's a patch in flight from cyrill that probably fixes your problem:
http://lkml.org/lkml/2008/9/15/93

The checks are obviously there for a reason, and we can't just wipe
them out unconditionally ;-) So can you check please that you are also
covered by the case provided?

On 2008/10/20 08:27, Ian Campbell <ijc@xxxxxxxxxxxxxx> wrote:
The issue I see still occurs well before those changesets. I have
seen it with v2.6.25 but v2.6.24 survived for 7 days without issue
(my threshold for a good kernel is 7 days, hence bisecting is a bit
slow...).

Hello Ian,

it seems we're hunting down different bugs after all. Too bad, I
hoped I could have solved your problem, too. Our machine has been
running well over the weekend with the patch I posted; with faulty
kernels, the problem would occur after a few minutes.

Max
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/




--
Glauber Costa.
"Free as in Freedom"
http://glommer.net

"The less confident you are, the more serious you have to act."
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: [Kgdb-bugreport] 2.6.23-rc3-mm1: kgdb build failure on powerpc
    ... Plus we should never add power-burners like that into the kernel ... Where is the kgdb git tree? ... More majordomo info at http://vger.kernel.org/majordomo-info.html ... Please read the FAQ at http://www.tux.org/lkml/ ...
    (Linux-Kernel)
  • Re: : unclean backward scrolling
    ... at 1280x1024 on a i386 system, with a 2.6.16.17 kernel. ... fact some lengthier lines are not erased scrolling backward and some ... More majordomo info at http://vger.kernel.org/majordomo-info.html ... Please read the FAQ at http://www.tux.org/lkml/ ...
    (Linux-Kernel)
  • Re: 2.6.14.5 to 2.6.15 patch
    ... >> this documented explicitly in the kernel but not on the kernel.org FAQ. ... necessary to completely explain the "patch nightmare" to everybody concerned. ... *Patch Process* ...
    (Linux-Kernel)
  • Re: [PATCH] ide-cd: fix endianity for the error message in cdrom_read_capacity
    ... Bart owns this patch now. ... switch { ... More majordomo info at http://vger.kernel.org/majordomo-info.html ... Please read the FAQ at http://www.tux.org/lkml/ ...
    (Linux-Kernel)
  • Re: [PATCH] ibmphp: Fix module ref count underflow
    ... I've forwarded the patch on to Jesse. ... More majordomo info at http://vger.kernel.org/majordomo-info.html ... Please read the FAQ at http://www.tux.org/lkml/ ...
    (Linux-Kernel)