kernel-ark/arch
Jack F Vogel 67701ae976 [PATCH] check nmi watchdog is broken
A bug against an xSeries system showed up recently noting that the
check_nmi_watchdog() test was failing.

I have been investigating it and discovered in both i386 and x86_64 the
recent change to the routine to use the cpu_callin_map has uncovered a
problem.  Prior to that change, on an SMP box, the test was trivally
passing because all cpu's were found to not yet be online, but now with the
callin_map they are discovered, it goes on to test the counter and they
have not yet begun to increment, so it announces a CPU is stuck and bails
out.

On all the systems I have access to test, the announcement of failure is
also bougs...  by the time you can login and check /proc/interrupts, the
NMI count is happily incrementing on all CPUs.  Its just that the test is
being done too early.

I have tried moving the call to the test around a bit, and it was always
too early.  I finally hit on this proposed solution, it delays the routine
via a late_initcall(), seems like the right solution to me.

Signed-off-by: Adrian Bunk <bunk@stusta.de>
Cc: Andi Kleen <ak@muc.de>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2005-05-01 08:58:48 -07:00
..
alpha
arm [PATCH] ARM: IntegratorCP: Fix CLCD MUX selection values 2005-04-30 23:32:38 +01:00
arm26
cris
frv
h8300
i386 [PATCH] check nmi watchdog is broken 2005-05-01 08:58:48 -07:00
ia64
m32r
m68k
m68knommu
mips
parisc
ppc [PATCH] ppc32: workaround for spurious IRQs on PQ2 2005-05-01 08:58:42 -07:00
ppc64 [PATCH] ppc64: use smp_mb and smp_wmb 2005-05-01 08:58:47 -07:00
s390
sh
sh64
sparc
sparc64
um
v850
x86_64 [PATCH] check nmi watchdog is broken 2005-05-01 08:58:48 -07:00