1. 28 Dec, 2021 3 commits
  2. 27 Dec, 2021 3 commits
  3. 23 Dec, 2021 1 commit
  4. 17 Dec, 2021 6 commits
    • Alex Deucher's avatar
      drm/amdgpu: add support for IP discovery gc_info table v2 · 5e713c6a
      Alex Deucher authored
      Used on gfx9 based systems. Fixes incorrect CU counts reported
      in the kernel log.
      
      Bug: https://gitlab.freedesktop.org/drm/amd/-/issues/1833Reviewed-by: default avatarHawking Zhang <Hawking.Zhang@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      Cc: stable@vger.kernel.org
      5e713c6a
    • chen gong's avatar
      drm/amdgpu: When the VCN(1.0) block is suspended, powergating is explicitly enabled · b7865173
      chen gong authored
      Play a video on the raven (or PCO, raven2) platform, and then do the S3
      test. When resume, the following error will be reported:
      
      amdgpu 0000:02:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring
      vcn_dec test failed (-110)
      [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block
      <vcn_v1_0> failed -110
      amdgpu 0000:02:00.0: amdgpu: amdgpu_device_ip_resume failed (-110).
      PM: dpm_run_callback(): pci_pm_resume+0x0/0x90 returns -110
      
      [why]
      When playing the video: The power state flag of the vcn block is set to
      POWER_STATE_ON.
      
      When doing suspend: There is no change to the power state flag of the
      vcn block, it is still POWER_STATE_ON.
      
      When doing resume: Need to open the power gate of the vcn block and set
      the power state flag of the VCN block to POWER_STATE_ON.
      But at this time, the power state flag of the vcn block is already
      POWER_STATE_ON. The power status flag check in the "8f2cdef drm/amd/pm:
      avoid duplicate powergate/ungate setting" patch will return the
      amdgpu_dpm_set_powergating_by_smu function directly.
      As a result, the gate of the power was not opened, causing the
      subsequent ring test to fail.
      
      [how]
      In the suspend function of the vcn block, explicitly change the power
      state flag of the vcn block to POWER_STATE_OFF.
      
      BugLink: https://gitlab.freedesktop.org/drm/amd/-/issues/1828Signed-off-by: default avatarchen gong <curry.gong@amd.com>
      Reviewed-by: default avatarEvan Quan <evan.quan@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      Cc: stable@vger.kernel.org
      b7865173
    • Lijo Lazar's avatar
      drm/amd/pm: Fix xgmi link control on aldebaran · 19e66d51
      Lijo Lazar authored
      Fix the message argument.
      	0: Allow power down
      	1: Disallow power down
      Signed-off-by: default avatarLijo Lazar <lijo.lazar@amd.com>
      Reviewed-by: default avatarHawking Zhang <Hawking.Zhang@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      19e66d51
    • Huang Rui's avatar
      drm/amdgpu: introduce new amdgpu_fence object to indicate the job embedded fence · bf67014d
      Huang Rui authored
      The job embedded fence donesn't initialize the flags at
      dma_fence_init(). Then we will go a wrong way in
      amdgpu_fence_get_timeline_name callback and trigger a null pointer panic
      once we enabled the trace event here. So introduce new amdgpu_fence
      object to indicate the job embedded fence.
      
      [  156.131790] BUG: kernel NULL pointer dereference, address: 00000000000002a0
      [  156.131804] #PF: supervisor read access in kernel mode
      [  156.131811] #PF: error_code(0x0000) - not-present page
      [  156.131817] PGD 0 P4D 0
      [  156.131824] Oops: 0000 [#1] PREEMPT SMP PTI
      [  156.131832] CPU: 6 PID: 1404 Comm: sdma0 Tainted: G           OE     5.16.0-rc1-custom #1
      [  156.131842] Hardware name: Gigabyte Technology Co., Ltd. Z170XP-SLI/Z170XP-SLI-CF, BIOS F20 11/04/2016
      [  156.131848] RIP: 0010:strlen+0x0/0x20
      [  156.131859] Code: 89 c0 c3 0f 1f 80 00 00 00 00 48 01 fe eb 0f 0f b6 07 38 d0 74 10 48 83 c7 01 84 c0 74 05 48 39 f7 75 ec 31 c0 c3 48 89 f8 c3 <80> 3f 00 74 10 48 89 f8 48 83 c0 01 80 38 00 75 f7 48 29 f8 c3 31
      [  156.131872] RSP: 0018:ffff9bd0018dbcf8 EFLAGS: 00010206
      [  156.131880] RAX: 00000000000002a0 RBX: ffff8d0305ef01b0 RCX: 000000000000000b
      [  156.131888] RDX: ffff8d03772ab924 RSI: ffff8d0305ef01b0 RDI: 00000000000002a0
      [  156.131895] RBP: ffff9bd0018dbd60 R08: ffff8d03002094d0 R09: 0000000000000000
      [  156.131901] R10: 000000000000005e R11: 0000000000000065 R12: ffff8d03002094d0
      [  156.131907] R13: 000000000000001f R14: 0000000000070018 R15: 0000000000000007
      [  156.131914] FS:  0000000000000000(0000) GS:ffff8d062ed80000(0000) knlGS:0000000000000000
      [  156.131923] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      [  156.131929] CR2: 00000000000002a0 CR3: 000000001120a005 CR4: 00000000003706e0
      [  156.131937] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      [  156.131942] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
      [  156.131949] Call Trace:
      [  156.131953]  <TASK>
      [  156.131957]  ? trace_event_raw_event_dma_fence+0xcc/0x200
      [  156.131973]  ? ring_buffer_unlock_commit+0x23/0x130
      [  156.131982]  dma_fence_init+0x92/0xb0
      [  156.131993]  amdgpu_fence_emit+0x10d/0x2b0 [amdgpu]
      [  156.132302]  amdgpu_ib_schedule+0x2f9/0x580 [amdgpu]
      [  156.132586]  amdgpu_job_run+0xed/0x220 [amdgpu]
      
      v2: fix mismatch warning between the prototype and function name (Ray, kernel test robot)
      Signed-off-by: default avatarHuang Rui <ray.huang@amd.com>
      Reviewed-by: default avatarChristian König <christian.koenig@amd.com>
      Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
      bf67014d
    • Dave Airlie's avatar
      Merge tag 'amd-drm-fixes-5.16-2021-12-15' of... · a2fbfd51
      Dave Airlie authored
      Merge tag 'amd-drm-fixes-5.16-2021-12-15' of https://gitlab.freedesktop.org/agd5f/linux into drm-fixes
      
      amd-drm-fixes-5.16-2021-12-15:
      
      amdgpu:
      - Fix RLC register offset
      - GMC fix
      - Properly cache SMU FW version on Yellow Carp
      - Fix missing callback on DCN3.1
      - Reset DMCUB before HW init
      - Fix for GMC powergating on PCO
      - Fix a possible memory leak in GPU metrics table handling on RN
      Signed-off-by: default avatarDave Airlie <airlied@redhat.com>
      From: Alex Deucher <alexander.deucher@amd.com>
      Link: https://patchwork.freedesktop.org/patch/msgid/20211216035239.5787-1-alexander.deucher@amd.com
      a2fbfd51
    • Dave Airlie's avatar
      Merge tag 'drm-misc-fixes-2021-12-16-1' of... · 78fed39a
      Dave Airlie authored
      Merge tag 'drm-misc-fixes-2021-12-16-1' of ssh://git.freedesktop.org/git/drm/drm-misc into drm-fixes
      
      One null pointer dereference fix for ast, a pixel clock unit fix for
      simpledrm and a user-space regression revert for fb-helper
      Signed-off-by: default avatarDave Airlie <airlied@redhat.com>
      
      From: Maxime Ripard <maxime@cerno.tech>
      Link: https://patchwork.freedesktop.org/patch/msgid/20211216082603.pm6yzlckmxvwnqyv@houat
      78fed39a
  5. 16 Dec, 2021 1 commit
  6. 14 Dec, 2021 9 commits
  7. 13 Dec, 2021 1 commit
  8. 12 Dec, 2021 14 commits
  9. 11 Dec, 2021 2 commits
    • Linus Torvalds's avatar
      Merge tag 'perf-tools-fixes-for-v5.16-2021-12-11' of... · bbdff6d5
      Linus Torvalds authored
      Merge tag 'perf-tools-fixes-for-v5.16-2021-12-11' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux
      
      Pull perf tools fixes from Arnaldo Carvalho de Melo:
      
       - Prevent out-of-bounds access to per sample registers.
      
       - Fix NULL vs IS_ERR_OR_NULL() checking on the python binding.
      
       - Intel PT fixes, half of those are one-liners:
            - Fix some PGE (packet generation enable/control flow packets) usage.
            - Fix sync state when a PSB (synchronization) packet is found.
            - Fix intel_pt_fup_event() assumptions about setting state type.
            - Fix state setting when receiving overflow (OVF) packet.
            - Fix next 'err' value, walking trace.
            - Fix missing 'instruction' events with 'q' option.
            - Fix error timestamp setting on the decoder error path.
      
      * tag 'perf-tools-fixes-for-v5.16-2021-12-11' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux:
        perf python: Fix NULL vs IS_ERR_OR_NULL() checking
        perf intel-pt: Fix error timestamp setting on the decoder error path
        perf intel-pt: Fix missing 'instruction' events with 'q' option
        perf intel-pt: Fix next 'err' value, walking trace
        perf intel-pt: Fix state setting when receiving overflow (OVF) packet
        perf intel-pt: Fix intel_pt_fup_event() assumptions about setting state type
        perf intel-pt: Fix sync state when a PSB (synchronization) packet is found
        perf intel-pt: Fix some PGE (packet generation enable/control flow packets) usage
        perf tools: Prevent out-of-bounds access to registers
      bbdff6d5
    • Linus Torvalds's avatar
      Merge tag 'block-5.16-2021-12-10' of git://git.kernel.dk/linux-block · eccea80b
      Linus Torvalds authored
      Pull block fixes from Jens Axboe:
       "A few block fixes that should go into this release:
      
         - NVMe pull request:
              - set ana_log_size to 0 after freeing ana_log_buf (Hou Tao)
              - show subsys nqn for duplicate cntlids (Keith Busch)
              - disable namespace access for unsupported metadata (Keith
                Busch)
              - report write pointer for a full zone as zone start + zone len
                (Niklas Cassel)
              - fix use after free when disconnecting a reconnecting ctrl
                (Ruozhu Li)
              - fix a list corruption in nvmet-tcp (Sagi Grimberg)
      
         - Fix for a regression on DIO single bio async IO (Pavel)
      
         - ioprio seteuid fix (Davidlohr)
      
         - mtd fix that subsequently got reverted as it was broken, will get
           re-done and submitted for the next round
      
         - Two MD fixes via Song (Markus, zhangyue)"
      
      * tag 'block-5.16-2021-12-10' of git://git.kernel.dk/linux-block:
        Revert "mtd_blkdevs: don't scan partitions for plain mtdblock"
        block: fix ioprio_get(IOPRIO_WHO_PGRP) vs setuid(2)
        md: fix double free of mddev->private in autorun_array()
        md: fix update super 1.0 on rdev size change
        nvmet-tcp: fix possible list corruption for unexpected command failure
        block: fix single bio async DIO error handling
        nvme: fix use after free when disconnecting a reconnecting ctrl
        nvme-multipath: set ana_log_size to 0 after free ana_log_buf
        mtd_blkdevs: don't scan partitions for plain mtdblock
        nvme: report write pointer for a full zone as zone start + zone len
        nvme: disable namespace access for unsupported metadata
        nvme: show subsys nqn for duplicate cntlids
      eccea80b