Re: PATCH/RFC: [kdump] fix APIC shutdown sequence



Eric W. Biederman wrote:

Ok. Later in the thread it sounds like you have retried this and
irqpoll is working now.

Yes. I'd give a lot to know what went wrong when I tried that in April.
It'd have saved me many hours of work if I had discovered this workaround
before.

Have you done any looking at moving where the kernel initalizes
io_apics? One of the todo items on the path is to leave
io_apic mode enabled and just startup the kernel in io_apic
mode.
I have tried to recover from the "IRR set" situation in several ways by
changing setup_IO_APIC_irq(). But I haven't found a way to recover from
this situation once disable_IO_APIC() had been called.

Yes. The long term goal is to remove the need for calling
disable_IO_APIC(). Because that makes the code simpler etc.

I think a lot would be gained if disable_IO_APIC() would just mask the IRQs
(like the function in my patch does), and perhaps fix the dest ID, instead of
totally clearing the registers.

Moreover, it'd be reasonable to separate out the code that restores virtual
wire mode from disable_IO_APIC().

It is quite possible. I have observed a lot of obscure bugs in the
corner cases of the state machines, although it is possible
this is correct behavior and it is just specific to level
triggered interrupts which are almost exclusively not on
the first ioapic in a system like you describe.

Even if my patch in the form in which I submitted it is unusable,
I think the basic idea that IRQs should be masked bottom-up
(IO-APIC first, then local APIC, then CPU) is correct.

Or is there any specific reason why the current code does it vice-versa?

Martin


--
Martin Wilck
PRIMERGY System Software Engineer
FSC IP ESP DE6

Fujitsu Siemens Computers GmbH
Heinz-Nixdorf-Ring 1
33106 Paderborn
Germany

Tel: ++49 5251 8 15113
Fax: ++49 5251 8 20409
Email: mailto:martin.wilck@xxxxxxxxxxxxxxxxxxx
Internet: http://www.fujitsu-siemens.com
Company Details: http://www.fujitsu-siemens.com/imprint.html
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: [patch] lockf(3) user-exploitable kernel panic
    ... I know my patch fails ... that libutil tries to provide this interface. ... The reason I asked was because I don't have access to many boxes of ... different architectures or operating systems. ...
    (freebsd-arch)
  • Re: Why are so many built-in types inheritable?
    ... reason why FunctionType is not subclassable is that nobody bothered to ... why is there a need for such a patch? ... The reason why it doesn't work then seems to boil down to the ... I know about practicality beating purity, ...
    (comp.lang.python)
  • Re: [PATCH 0/4] add task handling notifier
    ... For some reason neither ever made a lot of progess (performance ... it adds runtime overhead purely for the convenience of kernel ... While I (obviously, since I submitted the patch disagree), ...
    (Linux-Kernel)
  • Re: EGNOS working at last!
    ... EGNOS satellites, so it may be worth you checking their site to see if ... there's a patch for your model. ... The reason it's so difficult and unreliable right now is that it isn't fully ... it seems a lot of money to spend on whizz-bang ...
    (uk.rec.walking)
  • Re: [PATCH] x86 (64): make calibrate_APIC_clock() SMI-safe (take 2)
    ... Cyrill Gorcunov wrote: ... I don't really mind against your patch but we better ... I have to say that my simplified patch failed to do the calibration correctly on our test system. ... PRIMERGY System Software Engineer ...
    (Linux-Kernel)