1. 05 Mar, 2024 4 commits
    • Karol Herbst's avatar
      drm/nouveau: move more missing UAPI bits · 460be1d5
      Karol Herbst authored
      Those are already de-facto UAPI, so let's just move it into the uapi
      header.
      Signed-off-by: default avatarKarol Herbst <kherbst@redhat.com>
      Reviewed-by: default avatarLyude Paul <lyude@redhat.com>
      Reviewed-by: default avatarDanilo Krummrich <dakr@redhat.com>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240305133853.2214268-2-kherbst@redhat.com
      460be1d5
    • Jagan Teki's avatar
      drm/bridge: Document bridge init order with pre_enable_prev_first · 113cc3ad
      Jagan Teki authored
      In order to satisfy the MIPI DSI initialization sequence the bridge
      init order has been altered with the help of pre_enable_prev_first
      in pre_enable and post_disable bridge operations.
      
      Document the affected bridge init order with an example on the
      bridge operations helpers.
      Signed-off-by: default avatarJagan Teki <jagan@amarulasolutions.com>
      Reviewed-by: default avatarDave Stevenson <dave.stevenson@raspberrypi.com>
      Signed-off-by: default avatarRobert Foss <rfoss@kernel.org>
      Link: https://patchwork.freedesktop.org/patch/msgid/20230328170752.1102347-2-jagan@amarulasolutions.com
      113cc3ad
    • Jagan Teki's avatar
      drm/bridge: Fix improper bridge init order with pre_enable_prev_first · e18aeeda
      Jagan Teki authored
      For a given bridge pipeline if any bridge sets pre_enable_prev_first
      flag then the pre_enable for the previous bridge will be called before
      pre_enable of this bridge and opposite is done for post_disable.
      
      These are the potential bridge flags to alter bridge init order in order
      to satisfy the MIPI DSI host and downstream panel or bridge to function.
      However the existing pre_enable_prev_first logic with associated bridge
      ordering has broken for both pre_enable and post_disable calls.
      
      [pre_enable]
      
      The altered bridge ordering has failed if two consecutive bridges on a
      given pipeline enables the pre_enable_prev_first flag.
      
      Example:
      - Panel
      - Bridge 1
      - Bridge 2 pre_enable_prev_first
      - Bridge 3
      - Bridge 4 pre_enable_prev_first
      - Bridge 5 pre_enable_prev_first
      - Bridge 6
      - Encoder
      
      In this example, Bridge 4 and Bridge 5 have pre_enable_prev_first.
      
      The logic looks for a bridge which enabled pre_enable_prev_first flag
      on each iteration and assigned the previou bridge to limit pointer
      if the bridge doesn't enable pre_enable_prev_first flags.
      
      If control found Bridge 2 is pre_enable_prev_first then the iteration
      looks for Bridge 3 and found it is not pre_enable_prev_first and assigns
      it's previous Bridge 4 to limit pointer and calls pre_enable of Bridge 3
      and Bridge 2 and assign iter pointer with limit which is Bridge 4.
      
      Here is the actual problem, for the next iteration control look for
      Bridge 5 instead of Bridge 4 has iter pointer in previous iteration
      moved to Bridge 4 so this iteration skips the Bridge 4. The iteration
      found Bridge 6 doesn't pre_enable_prev_first flags so the limit assigned
      to Encoder. From next iteration Encoder skips as it is the last bridge
      for reverse order pipeline.
      
      So, the resulting pre_enable bridge order would be,
      - Panel, Bridge 1, Bridge 3, Bridge 2, Bridge 6, Bridge 5.
      
      This patch fixes this by assigning limit to next pointer instead of
      previous bridge since the iteration always looks for bridge that does
      NOT request prev so assigning next makes sure the last bridge on a
      given iteration what exactly the limit bridge is.
      
      So, the resulting pre_enable bridge order with fix would be,
      - Panel, Bridge 1, Bridge 3, Bridge 2, Bridge 6, Bridge 5, Bridge 4,
        Encoder.
      
      [post_disable]
      
      The altered bridge ordering has failed if two consecutive bridges on a
      given pipeline enables the pre_enable_prev_first flag.
      
      Example:
      - Panel
      - Bridge 1
      - Bridge 2 pre_enable_prev_first
      - Bridge 3
      - Bridge 4 pre_enable_prev_first
      - Bridge 5 pre_enable_prev_first
      - Bridge 6
      - Encoder
      
      In this example Bridge 5 and Bridge 4 have pre_enable_prev_first.
      
      The logic looks for a bridge which enabled pre_enable_prev_first flags
      on each iteration and assigned the previou bridge to next and next to
      limit pointer if the bridge does enable pre_enable_prev_first flag.
      
      If control starts from Bridge 6 then it found next Bridge 5 is
      pre_enable_prev_first and immediately the next assigned to previous
      Bridge 6 and limit assignments to next Bridge 6 and call post_enable
      of Bridge 6 even though the next consecutive Bridge 5 is enabled with
      pre_enable_prev_first. This clearly misses the logic to find the state
      of next conducive bridge as everytime the next and limit assigns
      previous bridge if given bridge enabled pre_enable_prev_first.
      
      So, the resulting post_disable bridge order would be,
      - Encoder, Bridge 6, Bridge 5, Bridge 4, Bridge 3, Bridge 2, Bridge 1,
        Panel.
      
      This patch fixes this by assigning next with previou bridge only if the
      bridge doesn't enable pre_enable_prev_first flag and the next further
      assign it to limit. This way we can find the bridge that NOT requested
      prev to disable last.
      
      So, the resulting pre_enable bridge order with fix would be,
      - Encoder, Bridge 4, Bridge 5, Bridge 6, Bridge 2, Bridge 3, Bridge 1,
        Panel.
      
      Validated the bridge init ordering by incorporating dummy bridges in
      the sun6i-mipi-dsi pipeline
      
      Fixes: 4fb912e5 ("drm/bridge: Introduce pre_enable_prev_first to alter bridge init order")
      Signed-off-by: default avatarJagan Teki <jagan@amarulasolutions.com>
      Tested-by: default avatarMichael Trimarchi <michael@amarulasolutions.com>
      Reviewed-by: default avatarDave Stevenson <dave.stevenson@raspberrypi.com>
      Signed-off-by: default avatarRobert Foss <rfoss@kernel.org>
      Link: https://patchwork.freedesktop.org/patch/msgid/20230328170752.1102347-1-jagan@amarulasolutions.com
      e18aeeda
    • Andrew Halaney's avatar
      drm/tidss: Use dev_err_probe() over dev_dbg() when failing to probe the port · b2ec429b
      Andrew Halaney authored
      This gets logged out to /sys/kernel/debug/devices_deferred in the
      -EPROBE_DEFER case and as an error otherwise. The message here provides
      useful information to the user when troubleshooting why their display is
      not working in either case, so let's make it output appropriately.
      Signed-off-by: default avatarAndrew Halaney <ahalaney@redhat.com>
      Reviewed-by: default avatarJavier Martinez Canillas <javierm@redhat.com>
      Tested-by: default avatarEnric Balletbo i Serra <eballetbo@kernel.org>
      Link: https://lore.kernel.org/r/20240228-tidss-dev-err-probe-v1-1-5482252326d3@redhat.comSigned-off-by: default avatarMaxime Ripard <mripard@kernel.org>
      b2ec429b
  2. 04 Mar, 2024 1 commit
  3. 01 Mar, 2024 16 commits
    • Christian König's avatar
      drm/amdgpu: use GTT only as fallback for VRAM|GTT · 216c1282
      Christian König authored
      Try to fill up VRAM as well by setting the busy flag on GTT allocations.
      
      This fixes the issue that when VRAM was evacuated for suspend it's never
      filled up again unless the application is restarted.
      Signed-off-by: default avatarChristian König <christian.koenig@amd.com>
      Reviewed-by: default avatarZack Rusin <zack.rusin@broadcom.com>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229134003.3688-2-christian.koenig@amd.com
      216c1282
    • Christian König's avatar
      drm/ttm: improve idle/busy handling v5 · cc941c70
      Christian König authored
      Previously we would never try to move a BO into the preferred placements
      when it ever landed in a busy placement since those were considered
      compatible.
      
      Rework the whole handling and finally unify the idle and busy handling.
      ttm_bo_validate() is now responsible to try idle placement first and then
      use the busy placement if that didn't worked.
      
      Drawback is that we now always try the idle placement first for each
      validation which might cause some additional CPU overhead on overcommit.
      
      v2: fix kerneldoc warning and coding style
      v3: take care of XE as well
      v4: keep the ttm_bo_mem_space functionality as it is for now, only add
          new handling for ttm_bo_validate as suggested by Thomas
      v5: fix bug pointed out by Matthew
      Signed-off-by: default avatarChristian König <christian.koenig@amd.com>
      Reviewed-by: Zack Rusin <zack.rusin@broadcom.com> v3
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229134003.3688-1-christian.koenig@amd.comReviewed-by: default avatarThomas Hellström <thomas.hellstrom@linux.intel.com>
      cc941c70
    • Boris Brezillon's avatar
      drm/panthor: Add an entry to MAINTAINERS · 6f6eebcf
      Boris Brezillon authored
      Add an entry for the Panthor driver to the MAINTAINERS file.
      
      v6:
      - Add Maxime's and Heiko's acks
      
      v4:
      - Add Steve's R-b
      
      v3:
      - Add bindings document as an 'F:' line.
      - Add Steven and Liviu as co-maintainers.
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-15-boris.brezillon@collabora.com
      6f6eebcf
    • Liviu Dudau's avatar
      dt-bindings: gpu: mali-valhall-csf: Add support for Arm Mali CSF GPUs · 64bc5bd0
      Liviu Dudau authored
      Arm has introduced a new v10 GPU architecture that replaces the Job Manager
      interface with a new Command Stream Frontend. It adds firmware driven
      command stream queues that can be used by kernel and user space to submit
      jobs to the GPU.
      
      Add the initial schema for the device tree that is based on support for
      RK3588 SoC. The minimum number of clocks is one for the IP, but on Rockchip
      platforms they will tend to expose the semi-independent clocks for better
      power management.
      
      v6:
      - Add Maxime's and Heiko's acks
      
      v5:
      - Move the opp-table node under the gpu node
      
      v4:
      - Fix formatting issue
      
      v3:
      - Cleanup commit message to remove redundant text
      - Added opp-table property and re-ordered entries
      - Clarified power-domains and power-domain-names requirements for RK3588.
      - Cleaned up example
      
      Note: power-domains and power-domain-names requirements for other platforms
      are still work in progress, hence the bindings are left incomplete here.
      
      v2:
      - New commit
      Signed-off-by: default avatarLiviu Dudau <liviu.dudau@arm.com>
      Cc: Krzysztof Kozlowski <krzysztof.kozlowski+dt@linaro.org>
      Cc: Rob Herring <robh+dt@kernel.org>
      Cc: Conor Dooley <conor+dt@kernel.org>
      Cc: devicetree@vger.kernel.org
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Reviewed-by: default avatarRob Herring <robh@kernel.org>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-14-boris.brezillon@collabora.com
      64bc5bd0
    • Boris Brezillon's avatar
      drm/panthor: Allow driver compilation · d72f0490
      Boris Brezillon authored
      Now that all blocks are available, we can add/update Kconfig/Makefile
      files to allow compilation.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Keep source files alphabetically ordered in the Makefile
      
      v4:
      - Add Steve's R-b
      
      v3:
      - Add a dep on DRM_GPUVM
      - Fix dependencies in Kconfig
      - Expand help text to (hopefully) describe which GPUs are to be
        supported by this driver and which are for panfrost.
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-13-boris.brezillon@collabora.com
      d72f0490
    • Boris Brezillon's avatar
      drm/panthor: Add the driver frontend block · 4bdca115
      Boris Brezillon authored
      This is the last piece missing to expose the driver to the outside
      world.
      
      This is basically a wrapper between the ioctls and the other logical
      blocks.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Return a page-aligned BO size to userspace
      - Keep header inclusion alphabetically ordered
      
      v5:
      - Account for the drm_exec_init() prototype change
      - Include platform_device.h
      
      v4:
      - Add an ioctl to let the UMD query the VM state
      - Fix kernel doc
      - Let panthor_device_init() call panthor_device_init()
      - Fix cleanup ordering in the panthor_init() error path
      - Add Steve's and Liviu's R-b
      
      v3:
      - Add acks for the MIT/GPL2 relicensing
      - Fix 32-bit support
      - Account for panthor_vm and panthor_sched changes
      - Simplify the resv preparation/update logic
      - Use a linked list rather than xarray for list of signals.
      - Simplify panthor_get_uobj_array by returning the newly allocated
        array.
      - Drop the "DOC" for job submission helpers and move the relevant
        comments to panthor_ioctl_group_submit().
      - Add helpers sync_op_is_signal()/sync_op_is_wait().
      - Simplify return type of panthor_submit_ctx_add_sync_signal() and
        panthor_submit_ctx_get_sync_signal().
      - Drop WARN_ON from panthor_submit_ctx_add_job().
      - Fix typos in comments.
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Reviewed-by: default avatarLiviu Dudau <liviu.dudau@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-12-boris.brezillon@collabora.com
      4bdca115
    • Boris Brezillon's avatar
      drm/panthor: Add the scheduler logical block · de854881
      Boris Brezillon authored
      This is the piece of software interacting with the FW scheduler, and
      taking care of some scheduling aspects when the FW comes short of slots
      scheduling slots. Indeed, the FW only expose a few slots, and the kernel
      has to give all submission contexts, a chance to execute their jobs.
      
      The kernel-side scheduler is timeslice-based, with a round-robin queue
      per priority level.
      
      Job submission is handled with a 1:1 drm_sched_entity:drm_gpu_scheduler,
      allowing us to delegate the dependency tracking to the core.
      
      All the gory details should be documented inline.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Make sure the scheduler is initialized before queueing the tick work
        in the MMU fault handler
      - Keep header inclusion alphabetically ordered
      
      v5:
      - Fix typos
      - Call panthor_kernel_bo_destroy(group->syncobjs) unconditionally
      - Don't move the group to the waiting list tail when it was already
        waiting for a different syncobj
      - Fix fatal_queues flagging in the tiler OOM path
      - Don't warn when more than one job timesout on a group
      - Add a warning message when we fail to allocate a heap chunk
      - Add Steve's R-b
      
      v4:
      - Check drmm_mutex_init() return code
      - s/drm_gem_vmap_unlocked/drm_gem_vunmap_unlocked/ in
        panthor_queue_put_syncwait_obj()
      - Drop unneeded WARN_ON() in cs_slot_sync_queue_state_locked()
      - Use atomic_xchg() instead of atomic_fetch_and(0)
      - Fix typos
      - Let panthor_kernel_bo_destroy() check for IS_ERR_OR_NULL() BOs
      - Defer TILER_OOM event handling to a separate workqueue to prevent
        deadlocks when the heap chunk allocation is blocked on mem-reclaim.
        This is just a temporary solution, until we add support for
        non-blocking/failable allocations
      - Pass the scheduler workqueue to drm_sched instead of instantiating
        a separate one (no longer needed now that heap chunk allocation
        happens on a dedicated wq)
      - Set WQ_MEM_RECLAIM on the scheduler workqueue, so we can handle
        job timeouts when the system is under mem pressure, and hopefully
        free up some memory retained by these jobs
      
      v3:
      - Rework the FW event handling logic to avoid races
      - Make sure MMU faults kill the group immediately
      - Use the panthor_kernel_bo abstraction for group/queue buffers
      - Make in_progress an atomic_t, so we can check it without the reset lock
        held
      - Don't limit the number of groups per context to the FW scheduler
        capacity. Fix the limit to 128 for now.
      - Add a panthor_job_vm() helper
      - Account for panthor_vm changes
      - Add our job fence as DMA_RESV_USAGE_WRITE to all external objects
        (was previously DMA_RESV_USAGE_BOOKKEEP). I don't get why, given
        we're supposed to be fully-explicit, but other drivers do that, so
        there must be a good reason
      - Account for drm_sched changes
      - Provide a panthor_queue_put_syncwait_obj()
      - Unconditionally return groups to their idle list in
        panthor_sched_suspend()
      - Condition of sched_queue_{,delayed_}work fixed to be only when a reset
        isn't pending or in progress.
      - Several typos in comments fixed.
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-11-boris.brezillon@collabora.com
      de854881
    • Boris Brezillon's avatar
      drm/panthor: Add the heap logical block · 9cca48fa
      Boris Brezillon authored
      Tiler heap growing requires some kernel driver involvement: when the
      tiler runs out of heap memory, it will raise an exception which is
      either directly handled by the firmware if some free heap chunks are
      available in the heap context, or passed back to the kernel otherwise.
      The heap helpers will be used by the scheduler logic to allocate more
      heap chunks to a heap context, when such a situation happens.
      
      Heap context creation is explicitly requested by userspace (using
      the TILER_HEAP_CREATE ioctl), and the returned context is attached to a
      queue through some command stream instruction.
      
      All the kernel does is keep the list of heap chunks allocated to a
      context, so they can be freed when TILER_HEAP_DESTROY is called, or
      extended when the FW requests a new chunk.
      
      v6:
      - Add Maxime's and Heiko's acks
      
      v5:
      - Fix FIXME comment
      - Add Steve's R-b
      
      v4:
      - Rework locking to allow concurrent calls to panthor_heap_grow()
      - Add a helper to return a heap chunk if we couldn't pass it to the
        FW because the group was scheduled out
      
      v3:
      - Add a FIXME for the heap OOM deadlock
      - Use the panthor_kernel_bo abstraction for the heap context and heap
        chunks
      - Drop the panthor_heap_gpu_ctx struct as it is opaque to the driver
      - Ensure that the heap context is aligned to the GPU cache line size
      - Minor code tidy ups
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-10-boris.brezillon@collabora.com
      9cca48fa
    • Boris Brezillon's avatar
      drm/panthor: Add the FW logical block · 2718d918
      Boris Brezillon authored
      Contains everything that's FW related, that includes the code dealing
      with the microcontroller unit (MCU) that's running the FW, and anything
      related to allocating memory shared between the FW and the CPU.
      
      A few global FW events are processed in the IRQ handler, the rest is
      forwarded to the scheduler, since scheduling is the primary reason for
      the FW existence, and also the main source of FW <-> kernel
      interactions.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Keep header inclusion alphabetically ordered
      
      v5:
      - Fix typo in GLB_PERFCNT_SAMPLE definition
      - Fix unbalanced panthor_vm_idle/active() calls
      - Fallback to a slow reset when the fast reset fails
      - Add extra information when reporting a FW boot failure
      
      v4:
      - Add a MODULE_FIRMWARE() entry for gen 10.8
      - Fix a wrong return ERR_PTR() in panthor_fw_load_section_entry()
      - Fix typos
      - Add Steve's R-b
      
      v3:
      - Make the FW path more future-proof (Liviu)
      - Use one waitqueue for all FW events
      - Simplify propagation of FW events to the scheduler logic
      - Drop the panthor_fw_mem abstraction and use panthor_kernel_bo instead
      - Account for the panthor_vm changes
      - Replace magic number with 0x7fffffff with ~0 to better signify that
        it's the maximum permitted value.
      - More accurate rounding when computing the firmware timeout.
      - Add a 'sub iterator' helper function. This also adds a check that a
        firmware entry doesn't overflow the firmware image.
      - Drop __packed from FW structures, natural alignment is good enough.
      - Other minor code improvements.
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-9-boris.brezillon@collabora.com
      2718d918
    • Boris Brezillon's avatar
      drm/panthor: Add the MMU/VM logical block · 647810ec
      Boris Brezillon authored
      MMU and VM management is related and placed in the same source file.
      
      Page table updates are delegated to the io-pgtable-arm driver that's in
      the iommu subsystem.
      
      The VM management logic is based on drm_gpuva_mgr, and is assuming the
      VA space is mostly managed by the usermode driver, except for a reserved
      portion of this VA-space that's used for kernel objects (like the heap
      contexts/chunks).
      
      Both asynchronous and synchronous VM operations are supported, and
      internal helpers are exposed to allow other logical blocks to map their
      buffers in the GPU VA space.
      
      There's one VM_BIND queue per-VM (meaning the Vulkan driver can only
      expose one sparse-binding queue), and this bind queue is managed with
      a 1:1 drm_sched_entity:drm_gpu_scheduler, such that each VM gets its own
      independent execution queue, avoiding VM operation serialization at the
      device level (things are still serialized at the VM level).
      
      The rest is just implementation details that are hopefully well explained
      in the documentation.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Add Steve's R-b
      - Adjust the TRANSCFG value to account for SW VA space limitation on
        32-bit systems
      - Keep header inclusion alphabetically ordered
      
      v5:
      - Fix a double panthor_vm_cleanup_op_ctx() call
      - Fix a race between panthor_vm_prepare_map_op_ctx() and
        panthor_vm_bo_put()
      - Fix panthor_vm_pool_destroy_vm() kernel doc
      - Fix paddr adjustment in panthor_vm_map_pages()
      - Fix bo_offset calculation in panthor_vm_get_bo_for_va()
      
      v4:
      - Add an helper to return the VM state
      - Check drmm_mutex_init() return code
      - Remove the VM from the AS reclaim list when panthor_vm_active() is
        called
      - Count the number of active VM users instead of considering there's
        at most one user (several scheduling groups can point to the same
        vM)
      - Pre-allocate a VMA object for unmap operations (unmaps can trigger
        a sm_step_remap() call)
      - Check vm->root_page_table instead of vm->pgtbl_ops to detect if
        the io-pgtable is trying to allocate the root page table
      - Don't memset() the va_node in panthor_vm_alloc_va(), make it a
        caller requirement
      - Fix the kernel doc in a few places
      - Drop the panthor_vm::base offset constraint and modify
        panthor_vm_put() to explicitly check for a NULL value
      - Fix unbalanced vm_bo refcount in panthor_gpuva_sm_step_remap()
      - Drop stale comments about the shared_bos list
      - Patch mmu_features::va_bits on 32-bit builds to reflect the
        io_pgtable limitation and let the UMD know about it
      
      v3:
      - Add acks for the MIT/GPL2 relicensing
      - Propagate MMU faults to the scheduler
      - Move pages pinning/unpinning out of the dma_signalling path
      - Fix 32-bit support
      - Rework the user/kernel VA range calculation
      - Make the auto-VA range explicit (auto-VA range doesn't cover the full
        kernel-VA range on the MCU VM)
      - Let callers of panthor_vm_alloc_va() allocate the drm_mm_node
        (embedded in panthor_kernel_bo now)
      - Adjust things to match the latest drm_gpuvm changes (extobj tracking,
        resv prep and more)
      - Drop the per-AS lock and use slots_lock (fixes a race on vm->as.id)
      - Set as.id to -1 when reusing an address space from the LRU list
      - Drop misleading comment about page faults
      - Remove check for irq being assigned in panthor_mmu_unplug()
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-8-boris.brezillon@collabora.com
      647810ec
    • Boris Brezillon's avatar
      drm/panthor: Add the devfreq logical block · fac9b22d
      Boris Brezillon authored
      Every thing related to devfreq in placed in panthor_devfreq.c, and
      helpers that can be called by other logical blocks are exposed through
      panthor_devfreq.h.
      
      This implementation is loosely based on the panfrost implementation,
      the only difference being that we don't count device users, because
      the idle/active state will be managed by the scheduler logic.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Keep header inclusion alphabetically ordered
      
      v4:
      - Add Clément's A-b for the relicensing
      
      v3:
      - Add acks for the MIT/GPL2 relicensing
      
      v2:
      - Added in v2
      
      Cc: Clément Péron <peron.clem@gmail.com> # MIT+GPL2 relicensing
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Acked-by: Clément Péron <peron.clem@gmail.com> # MIT+GPL2 relicensing
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-7-boris.brezillon@collabora.com
      fac9b22d
    • Boris Brezillon's avatar
      drm/panthor: Add GEM logical block · 8a1cc075
      Boris Brezillon authored
      Anything relating to GEM object management is placed here. Nothing
      particularly interesting here, given the implementation is based on
      drm_gem_shmem_object, which is doing most of the work.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Return a page-aligned BO size to userspace when creating a BO
      - Keep header inclusion alphabetically ordered
      
      v5:
      - Add Liviu's and Steve's R-b
      
      v4:
      - Force kernel BOs to be GPU mapped
      - Make panthor_kernel_bo_destroy() robust against ERR/NULL BO pointers
        to simplify the call sites
      
      v3:
      - Add acks for the MIT/GPL2 relicensing
      - Provide a panthor_kernel_bo abstraction for buffer objects managed by
        the kernel (will replace panthor_fw_mem and be used everywhere we were
        using panthor_gem_create_and_map() before)
      - Adjust things to match drm_gpuvm changes
      - Change return of panthor_gem_create_with_handle() to int
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Reviewed-by: default avatarLiviu Dudau <liviu.dudau@arm.com>
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-6-boris.brezillon@collabora.com
      8a1cc075
    • Boris Brezillon's avatar
      drm/panthor: Add the GPU logical block · 5cd894e2
      Boris Brezillon authored
      Handles everything that's not related to the FW, the MMU or the
      scheduler. This is the block dealing with the GPU property retrieval,
      the GPU block power on/off logic, and some global operations, like
      global cache flushing.
      
      v6:
      - Add Maxime's and Heiko's acks
      
      v5:
      - Fix GPU_MODEL() kernel doc
      - Fix test in panthor_gpu_block_power_off()
      - Add Steve's R-b
      
      v4:
      - Expose CORE_FEATURES through DEV_QUERY
      
      v3:
      - Add acks for the MIT/GPL2 relicensing
      - Use macros to extract GPU ID info
      - Make sure we reset clear pending_reqs bits when wait_event_timeout()
        times out but the corresponding bit is cleared in GPU_INT_RAWSTAT
        (can happen if the IRQ is masked or HW takes to long to call the IRQ
        handler)
      - GPU_MODEL now takes separate arch and product majors to be more
        readable.
      - Drop GPU_IRQ_MCU_STATUS_CHANGED from interrupt mask.
      - Handle GPU_IRQ_PROTM_FAULT correctly (don't output registers that are
        not updated for protected interrupts).
      - Minor code tidy ups
      
      Cc: Alexey Sheplyakov <asheplyakov@basealt.ru> # MIT+GPL2 relicensing
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-5-boris.brezillon@collabora.com
      5cd894e2
    • Boris Brezillon's avatar
      drm/panthor: Add the device logical block · 5fe909ca
      Boris Brezillon authored
      The panthor driver is designed in a modular way, where each logical
      block is dealing with a specific HW-block or software feature. In order
      for those blocks to communicate with each other, we need a central
      panthor_device collecting all the blocks, and exposing some common
      features, like interrupt handling, power management, reset, ...
      
      This what this panthor_device logical block is about.
      
      v6:
      - Add Maxime's and Heiko's acks
      - Keep header inclusion alphabetically ordered
      
      v5:
      - Suspend the MMU/GPU blocks if panthor_fw_resume() fails in
        panthor_device_resume()
      - Move the pm_runtime_use_autosuspend() call before drm_dev_register()
      - Add Liviu's R-b
      
      v4:
      - Check drmm_mutex_init() return code
      - Fix panthor_device_reset_work() out path
      - Fix the race in the unplug logic
      - Fix typos
      - Unplug blocks when something fails in panthor_device_init()
      - Add Steve's R-b
      
      v3:
      - Add acks for the MIT+GPL2 relicensing
      - Fix 32-bit support
      - Shorten the sections protected by panthor_device::pm::mmio_lock to fix
        lock ordering issues.
      - Rename panthor_device::pm::lock into panthor_device::pm::mmio_lock to
        better reflect what this lock is protecting
      - Use dev_err_probe()
      - Make sure we call drm_dev_exit() when something fails half-way in
        panthor_device_reset_work()
      - Replace CSF_GPU_LATEST_FLUSH_ID_DEFAULT with a constant '1' and a
        comment to explain. Also remove setting the dummy flush ID on suspend.
      - Remove drm_WARN_ON() in panthor_exception_name()
      - Check pirq->suspended in panthor_xxx_irq_raw_handler()
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Reviewed-by: default avatarLiviu Dudau <liviu.dudau@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-4-boris.brezillon@collabora.com
      5fe909ca
    • Boris Brezillon's avatar
      drm/panthor: Add GPU register definitions · 546b3666
      Boris Brezillon authored
      Those are the registers directly accessible through the MMIO range.
      
      FW registers are exposed in panthor_fw.h.
      
      v6:
      - Add Maxime's and Heiko's acks
      
      v4:
      - Add the CORE_FEATURES register (needed for GPU variants)
      - Add Steve's R-b
      
      v3:
      - Add macros to extract GPU ID info
      - Formatting changes
      - Remove AS_TRANSCFG_ADRMODE_LEGACY - it doesn't exist post-CSF
      - Remove CSF_GPU_LATEST_FLUSH_ID_DEFAULT
      - Add GPU_L2_FEATURES_LINE_SIZE for extracting the GPU cache line size
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Acked-by: Steven Price <steven.price@arm.com> # MIT+GPL2 relicensing,Arm
      Acked-by: Grant Likely <grant.likely@linaro.org> # MIT+GPL2 relicensing,Linaro
      Acked-by: Boris Brezillon <boris.brezillon@collabora.com> # MIT+GPL2 relicensing,Collabora
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-3-boris.brezillon@collabora.com
      546b3666
    • Boris Brezillon's avatar
      drm/panthor: Add uAPI · 0f25e493
      Boris Brezillon authored
      Panthor follows the lead of other recently submitted drivers with
      ioctls allowing us to support modern Vulkan features, like sparse memory
      binding:
      
      - Pretty standard GEM management ioctls (BO_CREATE and BO_MMAP_OFFSET),
        with the 'exclusive-VM' bit to speed-up BO reservation on job submission
      - VM management ioctls (VM_CREATE, VM_DESTROY and VM_BIND). The VM_BIND
        ioctl is loosely based on the Xe model, and can handle both
        asynchronous and synchronous requests
      - GPU execution context creation/destruction, tiler heap context creation
        and job submission. Those ioctls reflect how the hardware/scheduler
        works and are thus driver specific.
      
      We also have a way to expose IO regions, such that the usermode driver
      can directly access specific/well-isolate registers, like the
      LATEST_FLUSH register used to implement cache-flush reduction.
      
      This uAPI intentionally keeps usermode queues out of the scope, which
      explains why doorbell registers and command stream ring-buffers are not
      directly exposed to userspace.
      
      v6:
      - Add Maxime's and Heiko's acks
      
      v5:
      - Fix typo
      - Add Liviu's R-b
      
      v4:
      - Add a VM_GET_STATE ioctl
      - Fix doc
      - Expose the CORE_FEATURES register so we can deal with variants in the
        UMD
      - Add Steve's R-b
      
      v3:
      - Add the concept of sync-only VM operation
      - Fix support for 32-bit userspace
      - Rework drm_panthor_vm_create to pass the user VA size instead of
        the kernel VA size (suggested by Robin Murphy)
      - Typo fixes
      - Explicitly cast enums with top bit set to avoid compiler warnings in
        -pedantic mode.
      - Drop property core_group_count as it can be easily calculated by the
        number of bits set in l2_present.
      Co-developed-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarSteven Price <steven.price@arm.com>
      Signed-off-by: default avatarBoris Brezillon <boris.brezillon@collabora.com>
      Reviewed-by: default avatarSteven Price <steven.price@arm.com>
      Reviewed-by: default avatarLiviu Dudau <liviu.dudau@arm.com>
      Acked-by: default avatarMaxime Ripard <mripard@kernel.org>
      Acked-by: default avatarHeiko Stuebner <heiko@sntech.de>
      Link: https://patchwork.freedesktop.org/patch/msgid/20240229162230.2634044-2-boris.brezillon@collabora.com
      0f25e493
  4. 29 Feb, 2024 3 commits
  5. 28 Feb, 2024 16 commits