• Laurent Dufour's avatar
    powerpc/pseries/mobility: set NMI watchdog factor during an LPM · 118b1366
    Laurent Dufour authored
    During an LPM, while the memory transfer is in progress on the arrival
    side, some latencies are generated when accessing not yet transferred
    pages on the arrival side. Thus, the NMI watchdog may be triggered too
    frequently, which increases the risk to hit an NMI interrupt in a bad
    place in the kernel, leading to a kernel panic.
    
    Disabling the Hard Lockup Watchdog until the memory transfer could be a
    too strong work around, some users would want this timeout to be
    eventually triggered if the system is hanging even during an LPM.
    
    Introduce a new sysctl variable nmi_watchdog_factor. It allows to apply
    a factor to the NMI watchdog timeout during an LPM. Just before the CPUs
    are stopped for the switchover sequence, the NMI watchdog timer is set
    to watchdog_thresh + factor%
    
    A value of 0 has no effect. The default value is 200, meaning that the
    NMI watchdog is set to 30s during LPM (based on a 10s watchdog_thresh
    value). Once the memory transfer is achieved, the factor is reset to 0.
    
    Setting this value to a high number is like disabling the NMI watchdog
    during an LPM.
    Signed-off-by: default avatarLaurent Dufour <ldufour@linux.ibm.com>
    Reviewed-by: default avatarNicholas Piggin <npiggin@gmail.com>
    Signed-off-by: default avatarMichael Ellerman <mpe@ellerman.id.au>
    Link: https://lore.kernel.org/r/20220713154729.80789-5-ldufour@linux.ibm.com
    118b1366
kernel.rst 49.4 KB