    x86/mce: Improve error message when kernel cannot recover · c7d606f5
    Tony Luck authored
    Since we added support for recovering from some errors inside the kernel in:
    
    commit b2f9d678 ("x86/mce: Check for faults tagged in EXTABLE_CLASS_FAULT exception table entries")
    
    we have done a less than stellar job at reporting the cause of recoverable
    machine checks that occur in other parts of the kernel. The user just gets
    the unhelpful message:
    
    	mce: [Hardware Error]: Machine check: Action required: unknown MCACOD
    
    This is doubly unhelpful when they check the manual for the reported
    IA32_MSR_STATUS.MCACOD and see that it is listed as one of the standard
    recoverable values.
    
    Add an extra rule to the MCE severity table to catch this case and report it
    as:
    
    	mce: [Hardware Error]: Machine check: Data load in unrecoverable area of kernel
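
    For readers unfamiliar with how a first-match severity table behaves, the
    following is a minimal standalone sketch, not the kernel's actual code. All
    names (sk_rule, sk_classify, the SK_* constants) and the bit layout are
    hypothetical, and the first two messages are invented for illustration.
    Only the last two messages are taken from this commit. The point is that a
    more specific rule placed ahead of the generic "Action required" rule gives
    the user a precise message instead of the catch-all one.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Illustrative bit layout only; NOT the real machine check status encoding. */
#define SK_STATUS_UC    (1u << 16)   /* uncorrected error (hypothetical bit) */
#define SK_STATUS_AR    (1u << 17)   /* action required (hypothetical bit) */
#define SK_MCACOD_MASK  0xffffu      /* error code field (hypothetical width) */
#define SK_MCACOD_DATA  0x0134u      /* "data load" code used by this sketch */

/* Where the CPU was executing when the machine check arrived. */
enum sk_context { SK_ANY, SK_USER, SK_KERNEL_RECOV, SK_KERNEL };

struct sk_rule {
    uint32_t mask;        /* which status bits this rule looks at */
    uint32_t result;      /* required value of those bits */
    enum sk_context ctx;  /* required context, or SK_ANY */
    const char *msg;      /* message reported on a match */
};

#define SK_DATA_MASK  (SK_STATUS_UC | SK_STATUS_AR | SK_MCACOD_MASK)
#define SK_DATA_VALUE (SK_STATUS_UC | SK_STATUS_AR | SK_MCACOD_DATA)

/* Rules are tried top to bottom; the first match wins. */
static const struct sk_rule sk_rules[] = {
    { SK_DATA_MASK, SK_DATA_VALUE, SK_USER,
      "Action required: data load in user space (sketch)" },
    { SK_DATA_MASK, SK_DATA_VALUE, SK_KERNEL_RECOV,
      "Action required: data load in recoverable kernel area (sketch)" },
    /* The extra rule: same error code, but in a kernel area with no recovery. */
    { SK_DATA_MASK, SK_DATA_VALUE, SK_KERNEL,
      "Data load in unrecoverable area of kernel" },
    /* Generic catch-all that previously swallowed the case above. */
    { SK_STATUS_UC | SK_STATUS_AR, SK_STATUS_UC | SK_STATUS_AR, SK_ANY,
      "Action required: unknown MCACOD" },
};

/* Return the message of the first rule matching status and context. */
static const char *sk_classify(uint32_t status, enum sk_context ctx)
{
    size_t i;

    assert(ctx != SK_ANY);  /* callers must pass a concrete context */

    for (i = 0; i < sizeof(sk_rules) / sizeof(sk_rules[0]); i++) {
        const struct sk_rule *r = &sk_rules[i];

        if (r->ctx != SK_ANY && r->ctx != ctx)
            continue;
        if ((status & r->mask) != r->result)
            continue;
        return r->msg;
    }
    return "No matching rule";
}
```

    In this sketch, calling sk_classify() with the data load code and the
    SK_KERNEL context yields the new message, while an unrecognized code in the
    same context still falls through to the generic "unknown MCACOD" rule.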
    
    Fixes: b2f9d678 ("x86/mce: Check for faults tagged in EXTABLE_CLASS_FAULT exception table entries")
    Signed-off-by: Tony Luck <tony.luck@intel.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Cc: Qiuxu Zhuo <qiuxu.zhuo@intel.com>
    Cc: Ashok Raj <ashok.raj@intel.com>
    Cc: stable@vger.kernel.org # 4.6+
    Cc: Dan Williams <dan.j.williams@intel.com>
    Cc: Borislav Petkov <bp@suse.de>
    Link: https://lkml.kernel.org/r/4cc7c465150a9a48b8b9f45d0b840278e77eb9b5.1527283897.git.tony.luck@intel.com
mce-severity.c 10.8 KB