• Ian Munsie's avatar
    cxl: Return error to PSL if IRQ demultiplexing fails & print clearer warning · bc78b05b
    Ian Munsie authored
    If an AFU has a hardware bug that causes it to acknowledge a context
    terminate or remove while that context has outstanding transactions, it
    is possible for the kernel to receive an interrupt for that context
    after we have removed it from the context list.
    
    The kernel will not be able to demultiplex the interrupt (or worse - if
    we have already reallocated the process handle we could mis-attribute it
    to the new context), and printed a big scary warning.
    
    It did not acknowledge the interrupt, which would effectively halt
    further translation fault processing on the PSL.
    
    This patch makes the warning clearer about the likely cause of the issue
    (i.e. hardware bug) to make it obvious to future AFU designers of what
    needs to be fixed. It also prints out the process handle which can then
    be matched up with hardware and software traces for debugging.
    
    It also acknowledges the interrupt to the PSL with either an address
    error or acknowledge, so that the PSL can continue with other
    translations.
    Signed-off-by: default avatarIan Munsie <imunsie@au1.ibm.com>
    Signed-off-by: default avatarMichael Ellerman <mpe@ellerman.id.au>
    bc78b05b
irq.c 11.3 KB