    drm/scheduler: fix timeout worker setup for out of order job completions · 4823e5da
    Lucas Stach authored
    drm_sched_job_finish() is a work item scheduled for each finished job on
    an unbound system workqueue. This means the workers can execute out of order
    with regard to the real hardware job completions.
    
    If this happens, queueing a timeout worker for the first job on the ring
    mirror list is wrong, as this may be a job which has already finished
    executing. Fix this by reorganizing the code to always queue the worker
    for the next job on the list, if that next job hasn't finished yet. This is
    robust against a potential reordering of the finish workers.
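
The reordering case can be modeled in userspace. The following is a minimal sketch, not the kernel code: `struct job`, `struct sched`, `sched_init()`, `sched_push()` and `job_finish()` are hypothetical stand-ins, the `timeout_armed` flag stands in for a queued timeout worker, and the job list lock is omitted because the model is single-threaded.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace stand-in for a scheduler job; not the kernel struct. */
struct job {
    struct job *prev, *next;   /* position on the modeled ring mirror list */
    bool hw_done;              /* hardware has completed this job */
    bool timeout_armed;        /* a timeout worker is queued for this job */
};

/* Circular list with a sentinel head, modeling the ring mirror list. */
struct sched {
    struct job head;
};

static void sched_init(struct sched *s)
{
    s->head.prev = s->head.next = &s->head;
    s->head.hw_done = false;
    s->head.timeout_armed = false;
}

/* Append a freshly submitted job to the tail of the list. */
static void sched_push(struct sched *s, struct job *j)
{
    j->hw_done = false;
    j->timeout_armed = false;
    j->prev = s->head.prev;
    j->next = &s->head;
    s->head.prev->next = j;
    s->head.prev = j;
}

/* Finish handler for one job; may run in any order relative to other jobs. */
static void job_finish(struct sched *s, struct job *j)
{
    assert(j->hw_done);         /* only completed jobs get a finish handler */
    j->timeout_armed = false;   /* models cancelling this job's timeout worker */

    /* Arm the timeout for the successor, but only if it is still running. */
    if (j->next != &s->head && !j->next->hw_done)
        j->next->timeout_armed = true;

    /* Unlink without reinitializing j's own pointers; j is about to be freed. */
    j->prev->next = j->next;
    j->next->prev = j->prev;
}
```

With jobs A, B, C on the list, A and B completed by the hardware and B's handler running before A's, the timeout is armed for C in both calls and never for A, which is the job that arming "the first job on the list" would have picked.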
    
    Also move the timeout worker cancellation out, so that we don't need to
    take the job list lock twice. As a small optimization, list_del is used
    to remove the job from the ring mirror list, as there is no need to
    reinit the list head in the job we are about to free.
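
The difference between the two removal flavours can be illustrated with a simplified userspace list. This is a sketch under assumptions: `struct node`, `node_init()`, `node_add_tail()`, `node_del()` and `node_del_init()` are hypothetical names that only mirror the shape of the kernel helpers. The kernel's list_del additionally poisons the removed entry's pointers, which this model does not do.

```c
#include <stddef.h>

/* Hypothetical minimal doubly linked list node, modeled on a list head. */
struct node {
    struct node *prev, *next;
};

/* Make n an empty list that points at itself. */
static void node_init(struct node *n)
{
    n->prev = n->next = n;
}

/* Insert n just before head, i.e. at the tail of the list. */
static void node_add_tail(struct node *head, struct node *n)
{
    n->prev = head->prev;
    n->next = head;
    head->prev->next = n;
    head->prev = n;
}

/* Plain unlink: the neighbours are fixed up, n's own pointers are left stale. */
static void node_del(struct node *n)
{
    n->prev->next = n->next;
    n->next->prev = n->prev;
}

/* Unlink and reinitialize: n is a valid empty list again afterwards. */
static void node_del_init(struct node *n)
{
    node_del(n);
    node_init(n);
}
```

The extra reinitialization only pays off when the entry is inspected or reused after removal, which is not the case for a job that is freed right away.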
    
    Signed-off-by: Lucas Stach <l.stach@pengutronix.de>
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
gpu_scheduler.c 26.2 KB