• Tony Krowiak's avatar
    s390/vfio-ap: implement in-use callback for vfio_ap driver · 3f85d1df
    Tony Krowiak authored
    Let's implement the callback to indicate when an APQN
    is in use by the vfio_ap device driver. The callback is
    invoked whenever a change to the apmask or aqmask would
    result in one or more queue devices being removed from the driver. The
    vfio_ap device driver will indicate a resource is in use
    if the APQN of any of the queue devices to be removed are assigned to
    any of the matrix mdevs under the driver's control.
    
    There is potential for a deadlock condition between the
    matrix_dev->guests_lock used to lock the guest during assignment of
    adapters and domains and the ap_perms_mutex locked by the AP bus when
    changes are made to the sysfs apmask/aqmask attributes.
    
    The AP Perms lock controls access to the objects that store the adapter
    numbers (ap_perms) and domain numbers (aq_perms) for the sysfs
    /sys/bus/ap/apmask and /sys/bus/ap/aqmask attributes. These attributes
    identify which queues are reserved for the zcrypt default device drivers.
    Before allowing a bit to be removed from either mask, the AP bus must check
    with the vfio_ap device driver to verify that none of the queues are
    assigned to any of its mediated devices.
    
    The apmask/aqmask attributes can be written or read at any time from
    userspace, so care must be taken to prevent a deadlock with asynchronous
    operations that might be taking place in the vfio_ap device driver. For
    example, consider the following:
    
    1. A system administrator assigns an adapter to a mediated device under the
       control of the vfio_ap device driver. The driver will need to first take
       the matrix_dev->guests_lock to potentially hot plug the adapter into
       the KVM guest.
    2. At the same time, a system administrator sets a bit in the sysfs
       /sys/bus/ap/ap_mask attribute. To complete the operation, the AP bus
       must:
       a. Take the ap_perms_mutex lock to update the object storing the values
          for the /sys/bus/ap/ap_mask attribute.
       b. Call the vfio_ap device driver's in-use callback to verify that the
          queues now being reserved for the default zcrypt drivers are not
          assigned to a mediated device owned by the vfio_ap device driver. To
          do the verification, the in-use callback function takes the
          matrix_dev->guests_lock, but has to wait because it is already held
          by the operation in 1 above.
    3. The vfio_ap device driver calls an AP bus function to verify that the
       new queues resulting from the assignment of the adapter in step 1 are
       not reserved for the default zcrypt device driver. This AP bus function
       tries to take the ap_perms_mutex lock but gets stuck waiting for the
       waiting for the lock due to step 2a above.
    
    Consequently, we have the following deadlock situation:
    
    matrix_dev->guests_lock locked (1)
    ap_perms_mutex lock locked (2a)
    Waiting for matrix_dev->gusts_lock (2b) which is currently held (1)
    Waiting for ap_perms_mutex lock (3) which is currently held (2a)
    
    To prevent this deadlock scenario, the function called in step 3 will no
    longer take the ap_perms_mutex lock and require the caller to take the
    lock. The lock will be the first taken by the adapter/domain assignment
    functions in the vfio_ap device driver to maintain the proper locking
    order.
    Signed-off-by: default avatarTony Krowiak <akrowiak@linux.ibm.com>
    Reviewed-by: default avatarJason J. Herne <jjherne@linux.ibm.com>
    Signed-off-by: default avatarAlexander Gordeev <agordeev@linux.ibm.com>
    3f85d1df
vfio_ap_private.h 4.93 KB