1. 01 Aug, 2022 16 commits
  2. 31 Jul, 2022 6 commits
  3. 30 Jul, 2022 2 commits
  4. 29 Jul, 2022 16 commits
    • Linus Torvalds's avatar
      Merge tag 'block-5.19-2022-07-29' of git://git.kernel.dk/linux-block · 8a91f86f
      Linus Torvalds authored
      Pull block fix from Jens Axboe:
       "Just a single fix for NVMe, yet another quirk addition"
      
      * tag 'block-5.19-2022-07-29' of git://git.kernel.dk/linux-block:
        nvme-pci: Crucial P2 has bogus namespace ids
      8a91f86f
    • Linus Torvalds's avatar
      Merge tag 'drm-fixes-2022-07-30' of git://anongit.freedesktop.org/drm/drm · e65c6a46
      Linus Torvalds authored
      Pull more drm fixes from Dave Airlie:
       "Maxime had the dog^Wmailing list server eat his homework^Wmisc pull
        request.
      
        Two more small fixes, one in nouveau svm code and the other in
        simpledrm.
      
        nouveau:
         - page migration fix
      
        simpledrm:
         - fix mode_valid return value"
      
      * tag 'drm-fixes-2022-07-30' of git://anongit.freedesktop.org/drm/drm:
        nouveau/svm: Fix to migrate all requested pages
        drm/simpledrm: Fix return type of simpledrm_simple_display_pipe_mode_valid()
      e65c6a46
    • Dave Airlie's avatar
      Merge tag 'drm-misc-fixes-2022-07-29' of git://anongit.freedesktop.org/drm/drm-misc into drm-fixes · ce156c8a
      Dave Airlie authored
      One fix to fix simpledrm mode_valid return value, and one for page
      migration in nouveau
      Signed-off-by: default avatarDave Airlie <airlied@redhat.com>
      
      From: Maxime Ripard <maxime@cerno.tech>
      Link: https://patchwork.freedesktop.org/patch/msgid/20220729094514.sfzhc3gqjgwgal62@penduick
      ce156c8a
    • Linus Torvalds's avatar
      Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi · 1c8ac1c4
      Linus Torvalds authored
      Pull SCSI fixes from James Bottomley:
       "Four fixes, three in drivers.
      
        The two biggest fixes are ufs and the remaining driver and core fix
        are small and obvious (and the core fix is low risk)"
      
      * tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi:
        scsi: ufs: core: Fix a race condition related to device management
        scsi: core: Fix warning in scsi_alloc_sgtables()
        scsi: ufs: host: Hold reference returned by of_parse_phandle()
        scsi: mpt3sas: Stop fw fault watchdog work item during system shutdown
      1c8ac1c4
    • Eiichi Tsukata's avatar
      docs/kernel-parameters: Update descriptions for "mitigations=" param with retbleed · ea304a8b
      Eiichi Tsukata authored
      Updates descriptions for "mitigations=off" and "mitigations=auto,nosmt"
      with the respective retbleed= settings.
      Signed-off-by: default avatarEiichi Tsukata <eiichi.tsukata@nutanix.com>
      Signed-off-by: default avatarBorislav Petkov <bp@suse.de>
      Cc: corbet@lwn.net
      Link: https://lore.kernel.org/r/20220728043907.165688-1-eiichi.tsukata@nutanix.com
      ea304a8b
    • Ralph Campbell's avatar
      mm/hmm: fault non-owner device private entries · 8a295dbb
      Ralph Campbell authored
      If hmm_range_fault() is called with the HMM_PFN_REQ_FAULT flag and a
      device private PTE is found, the hmm_range::dev_private_owner page is used
      to determine if the device private page should not be faulted in. 
      However, if the device private page is not owned by the caller,
      hmm_range_fault() returns an error instead of calling migrate_to_ram() to
      fault in the page.
      
      For example, if a page is migrated to GPU private memory and a RDMA fault
      capable NIC tries to read the migrated page, without this patch it will
      get an error.  With this patch, the page will be migrated back to system
      memory and the NIC will be able to read the data.
      
      Link: https://lkml.kernel.org/r/20220727000837.4128709-2-rcampbell@nvidia.com
      Link: https://lkml.kernel.org/r/20220725183615.4118795-2-rcampbell@nvidia.com
      Fixes: 08ddddda ("mm/hmm: check the device private page owner in hmm_range_fault()")
      Signed-off-by: default avatarRalph Campbell <rcampbell@nvidia.com>
      Reported-by: default avatarFelix Kuehling <felix.kuehling@amd.com>
      Reviewed-by: default avatarAlistair Popple <apopple@nvidia.com>
      Cc: Philip Yang <Philip.Yang@amd.com>
      Cc: Jason Gunthorpe <jgg@nvidia.com>
      Cc: <stable@vger.kernel.org>
      Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
      8a295dbb
    • Jaewon Kim's avatar
      page_alloc: fix invalid watermark check on a negative value · 9282012f
      Jaewon Kim authored
      There was a report that a task is waiting at the
      throttle_direct_reclaim. The pgscan_direct_throttle in vmstat was
      increasing.
      
      This is a bug where zone_watermark_fast returns true even when the free
      is very low. The commit f27ce0e1 ("page_alloc: consider highatomic
      reserve in watermark fast") changed the watermark fast to consider
      highatomic reserve. But it did not handle a negative value case which
      can be happened when reserved_highatomic pageblock is bigger than the
      actual free.
      
      If watermark is considered as ok for the negative value, allocating
      contexts for order-0 will consume all free pages without direct reclaim,
      and finally free page may become depleted except highatomic free.
      
      Then allocating contexts may fall into throttle_direct_reclaim. This
      symptom may easily happen in a system where wmark min is low and other
      reclaimers like kswapd does not make free pages quickly.
      
      Handle the negative case by using MIN.
      
      Link: https://lkml.kernel.org/r/20220725095212.25388-1-jaewon31.kim@samsung.com
      Fixes: f27ce0e1 ("page_alloc: consider highatomic reserve in watermark fast")
      Signed-off-by: default avatarJaewon Kim <jaewon31.kim@samsung.com>
      Reported-by: default avatarGyeongHwan Hong <gh21.hong@samsung.com>
      Acked-by: default avatarMel Gorman <mgorman@techsingularity.net>
      Cc: Minchan Kim <minchan@kernel.org>
      Cc: Baoquan He <bhe@redhat.com>
      Cc: Vlastimil Babka <vbabka@suse.cz>
      Cc: Johannes Weiner <hannes@cmpxchg.org>
      Cc: Michal Hocko <mhocko@kernel.org>
      Cc: Yong-Taek Lee <ytk.lee@samsung.com>
      Cc: <stable@vger.kerenl.org>
      Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
      9282012f
    • Linus Torvalds's avatar
      Merge tag 'perf-tools-fixes-for-v5.19-2022-07-29' of... · bb83c99d
      Linus Torvalds authored
      Merge tag 'perf-tools-fixes-for-v5.19-2022-07-29' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux
      
      Pull perf tools fixes from Arnaldo Carvalho de Melo:
      
       - Fix addresses for bss symbols, describing variables used in resolving
         data access in tools such as 'perf c2c' and 'perf mem'.
      
       - Skip symbols if SHF_ALLOC flag is not set, a technique used for
         listing deprecated symbols, its addresses are zeros, so not useful.
      
       - Remove undefined behavior from bpf_perf_object__next() when dealing
         with an empty bpf_objects_list list.
      
       - Make a ARM CoreSight disasm script work with both python2 and
         python3.
      
       - Sync x86's cpufeatures header with with the kernel sources.
      
      * tag 'perf-tools-fixes-for-v5.19-2022-07-29' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux:
        perf bpf: Remove undefined behavior from bpf_perf_object__next()
        perf symbol: Skip symbols if SHF_ALLOC flag is not set
        perf symbol: Correct address for bss symbols
        perf scripts python: Let script to be python2 compliant
        tools headers cpufeatures: Sync with the kernel sources
      bb83c99d
    • Linus Torvalds's avatar
      Merge tag 'wq-for-5.19-rc8-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/wq · 4b20426d
      Linus Torvalds authored
      Pull workqueue fix from Tejun Heo:
       "Just one commit to suppress a spurious warning added during the 5.19
        cycle"
      
      * tag 'wq-for-5.19-rc8-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/wq:
        workqueue: Avoid a false warning in unbind_workers()
      4b20426d
    • Linus Torvalds's avatar
      Merge tag 'pm-5.19-rc9' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm · 506e6dfb
      Linus Torvalds authored
      Pull power management fix from Rafael Wysocki:
       "Make some false positive RCU splats resulting from a recent intel_idle
        driver change go away (Waiman Long)"
      
      * tag 'pm-5.19-rc9' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm:
        intel_idle: Fix false positive RCU splats due to incorrect hardirqs state
      506e6dfb
    • Lai Jiangshan's avatar
      workqueue: Avoid a false warning in unbind_workers() · 46a4d679
      Lai Jiangshan authored
      Doing set_cpus_allowed_ptr() with wq_unbound_cpumask can be possible
      fails and trigger the false warning.
      
      Use cpu_possible_mask instead when wq_unbound_cpumask has no active CPUs.
      
      It is very easy to trigger the warning:
        Set wq_unbound_cpumask to a small set of CPUs.
        Offline all the CPUs of wq_unbound_cpumask.
        Offline an extra CPU and trigger the warning.
      
      Fixes: 10a5a651 ("workqueue: Restrict kworker in the offline CPU pool running on housekeeping CPUs")
      Signed-off-by: default avatarLai Jiangshan <jiangshan.ljs@antgroup.com>
      Signed-off-by: default avatarTejun Heo <tj@kernel.org>
      46a4d679
    • Linus Torvalds's avatar
      Merge tag 'riscv-for-linus-5.19-rc9' of git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux · e4d8b09d
      Linus Torvalds authored
      Pull RISC-V fix from Palmer Dabbelt:
       "A build fix for 'make vdso_install' that avoids an issue trying to
        install the compat VDSO"
      
      * tag 'riscv-for-linus-5.19-rc9' of git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux:
        riscv: compat: vdso: Fix vdso_install target
      e4d8b09d
    • Linus Torvalds's avatar
      Merge tag 'loongarch-fixes-5.19-5' of... · a95eb1d0
      Linus Torvalds authored
      Merge tag 'loongarch-fixes-5.19-5' of git://git.kernel.org/pub/scm/linux/kernel/git/chenhuacai/linux-loongson
      
      Pull LoongArch fixes from Huacai Chen:
      
       - Fix cache size calculation, stack protection attributes, ptrace's
         fpr_set and "ROM Size" in boardinfo
      
       - Some cleanups and improvements of assembly
      
       - Some cleanups of unused code and useless code
      
      * tag 'loongarch-fixes-5.19-5' of git://git.kernel.org/pub/scm/linux/kernel/git/chenhuacai/linux-loongson:
        LoongArch: Fix wrong "ROM Size" of boardinfo
        LoongArch: Fix missing fcsr in ptrace's fpr_set
        LoongArch: Fix shared cache size calculation
        LoongArch: Disable executable stack by default
        LoongArch: Remove unused variables
        LoongArch: Remove clock setting during cpu hotplug stage
        LoongArch: Remove useless header compiler.h
        LoongArch: Remove several syntactic sugar macros for branches
        LoongArch: Re-tab the assembly files
        LoongArch: Simplify "BGT foo, zero" with BGTZ
        LoongArch: Simplify "BLT foo, zero" with BLTZ
        LoongArch: Simplify "BEQ/BNE foo, zero" with BEQZ/BNEZ
        LoongArch: Use the "move" pseudo-instruction where applicable
        LoongArch: Use the "jr" pseudo-instruction where applicable
        LoongArch: Use ABI names of registers where appropriate
      a95eb1d0
    • Linus Torvalds's avatar
      Merge tag 'powerpc-5.19-6' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux · 9d928d9b
      Linus Torvalds authored
      Pull powerpc fixes from Michael Ellerman:
      
       - Re-enable the new amdgpu display engine for powerpc, as long as the
         compiler is correctly configured.
      
       - Disable stack variable initialisation in prom_init to fix GCC 12
         allmodconfig.
      
      Thanks to Dan Horák and Sudip Mukherjee.
      
      * tag 'powerpc-5.19-6' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux:
        drm/amdgpu: Re-enable DCN for 64-bit powerpc
        powerpc/64s: Disable stack variable initialisation for prom_init
      9d928d9b
    • Zhengjun Xing's avatar
      perf stat: Add topdown metrics in the default perf stat on the hybrid machine · 9a0b3626
      Zhengjun Xing authored
      Topdown metrics are missed in the default perf stat on the hybrid machine,
      add Topdown metrics in default perf stat for hybrid systems.
      
      Currently, we support the perf metrics Topdown for the p-core PMU in the
      perf stat default, the perf metrics Topdown support for e-core PMU will be
      implemented later separately. Refactor the code adds two x86 specific
      functions. Widen the size of the event name column by 7 chars, so that all
      metrics after the "#" become aligned again.
      
      The perf metrics topdown feature is supported on the cpu_core of ADL. The
      dedicated perf metrics counter and the fixed counter 3 are used for the
      topdown events. Adding the topdown metrics doesn't trigger multiplexing.
      
      Before:
      
       # ./perf  stat  -a true
      
       Performance counter stats for 'system wide':
      
                   53.70 msec cpu-clock                 #   25.736 CPUs utilized
                      80      context-switches          #    1.490 K/sec
                      24      cpu-migrations            #  446.951 /sec
                      52      page-faults               #  968.394 /sec
               2,788,555      cpu_core/cycles/          #   51.931 M/sec
                 851,129      cpu_atom/cycles/          #   15.851 M/sec
               2,974,030      cpu_core/instructions/    #   55.385 M/sec
                 416,919      cpu_atom/instructions/    #    7.764 M/sec
                 586,136      cpu_core/branches/        #   10.916 M/sec
                  79,872      cpu_atom/branches/        #    1.487 M/sec
                  14,220      cpu_core/branch-misses/   #  264.819 K/sec
                   7,691      cpu_atom/branch-misses/   #  143.229 K/sec
      
             0.002086438 seconds time elapsed
      
      After:
      
       # ./perf stat  -a true
      
       Performance counter stats for 'system wide':
      
                   61.39 msec cpu-clock                        #   24.874 CPUs utilized
                      76      context-switches                 #    1.238 K/sec
                      24      cpu-migrations                   #  390.968 /sec
                      52      page-faults                      #  847.097 /sec
               2,753,695      cpu_core/cycles/                 #   44.859 M/sec
                 903,899      cpu_atom/cycles/                 #   14.725 M/sec
               2,927,529      cpu_core/instructions/           #   47.690 M/sec
                 428,498      cpu_atom/instructions/           #    6.980 M/sec
                 581,299      cpu_core/branches/               #    9.470 M/sec
                  83,409      cpu_atom/branches/               #    1.359 M/sec
                  13,641      cpu_core/branch-misses/          #  222.216 K/sec
                   8,008      cpu_atom/branch-misses/          #  130.453 K/sec
              14,761,308      cpu_core/slots/                  #  240.466 M/sec
               3,288,625      cpu_core/topdown-retiring/       #     22.3% retiring
               1,323,323      cpu_core/topdown-bad-spec/       #      9.0% bad speculation
               5,477,470      cpu_core/topdown-fe-bound/       #     37.1% frontend bound
               4,679,199      cpu_core/topdown-be-bound/       #     31.7% backend bound
                 646,194      cpu_core/topdown-heavy-ops/      #      4.4% heavy operations       #     17.9% light operations
               1,244,999      cpu_core/topdown-br-mispredict/  #      8.4% branch mispredict      #      0.5% machine clears
               3,891,800      cpu_core/topdown-fetch-lat/      #     26.4% fetch latency          #     10.7% fetch bandwidth
               1,879,034      cpu_core/topdown-mem-bound/      #     12.7% memory bound           #     19.0% Core bound
      
             0.002467839 seconds time elapsed
      Reviewed-by: default avatarKan Liang <kan.liang@linux.intel.com>
      Signed-off-by: default avatarXing Zhengjun <zhengjun.xing@linux.intel.com>
      Acked-by: default avatarIan Rogers <irogers@google.com>
      Acked-by: default avatarNamhyung Kim <namhyung@kernel.org>
      Cc: Alexander Shishkin <alexander.shishkin@intel.com>
      Cc: Andi Kleen <ak@linux.intel.com>
      Cc: Ingo Molnar <mingo@redhat.com>
      Cc: Jiri Olsa <jolsa@kernel.org>
      Cc: Peter Zijlstra <peterz@infradead.org>
      Link: https://lore.kernel.org/r/20220721065706.2886112-6-zhengjun.xing@linux.intel.comSigned-off-by: default avatarArnaldo Carvalho de Melo <acme@redhat.com>
      9a0b3626
    • Kan Liang's avatar
      perf x86 evlist: Add default hybrid events for perf stat · cdb204ad
      Kan Liang authored
      Provide a new solution to replace the reverted commit ac2dc29e
      ("perf stat: Add default hybrid events")
      
      For the default software attrs, nothing is changed.
      
      For the default hardware attrs, create a new evsel for each hybrid pmu.
      
      With the new solution, adding a new default attr will not require the
      special support for the hybrid platform anymore.
      
      Also, the "--detailed" is supported on the hybrid platform
      
      With the patch,
      
        $ perf stat -a -ddd sleep 1
      
         Performance counter stats for 'system wide':
      
               32,231.06 msec cpu-clock                 #   32.056 CPUs utilized
                     529      context-switches          #   16.413 /sec
                      32      cpu-migrations            #    0.993 /sec
                      69      page-faults               #    2.141 /sec
             176,754,151      cpu_core/cycles/          #    5.484 M/sec          (41.65%)
             161,695,280      cpu_atom/cycles/          #    5.017 M/sec          (49.92%)
              48,595,992      cpu_core/instructions/    #    1.508 M/sec          (49.98%)
              32,363,337      cpu_atom/instructions/    #    1.004 M/sec          (58.26%)
              10,088,639      cpu_core/branches/        #  313.010 K/sec          (58.31%)
               6,390,582      cpu_atom/branches/        #  198.274 K/sec          (58.26%)
                 846,201      cpu_core/branch-misses/   #   26.254 K/sec          (66.65%)
                 676,477      cpu_atom/branch-misses/   #   20.988 K/sec          (58.27%)
              14,290,070      cpu_core/L1-dcache-loads/ #  443.363 K/sec          (66.66%)
               9,983,532      cpu_atom/L1-dcache-loads/ #  309.749 K/sec          (58.27%)
                 740,725      cpu_core/L1-dcache-load-misses/ #   22.982 K/sec    (66.66%)
         <not supported>      cpu_atom/L1-dcache-load-misses/
                 480,441      cpu_core/LLC-loads/       #   14.906 K/sec          (66.67%)
                 326,570      cpu_atom/LLC-loads/       #   10.132 K/sec          (58.27%)
                     329      cpu_core/LLC-load-misses/ #   10.208 /sec           (66.68%)
                       0      cpu_atom/LLC-load-misses/ #    0.000 /sec           (58.32%)
         <not supported>      cpu_core/L1-icache-loads/
              21,982,491      cpu_atom/L1-icache-loads/ #  682.028 K/sec          (58.43%)
               4,493,189      cpu_core/L1-icache-load-misses/ #  139.406 K/sec    (33.34%)
               4,711,404      cpu_atom/L1-icache-load-misses/ #  146.176 K/sec    (50.08%)
              13,713,090      cpu_core/dTLB-loads/      #  425.462 K/sec          (33.34%)
               9,384,727      cpu_atom/dTLB-loads/      #  291.170 K/sec          (50.08%)
                 157,387      cpu_core/dTLB-load-misses/ #    4.883 K/sec         (33.33%)
                 108,328      cpu_atom/dTLB-load-misses/ #    3.361 K/sec         (50.08%)
         <not supported>      cpu_core/iTLB-loads/
         <not supported>      cpu_atom/iTLB-loads/
                  37,655      cpu_core/iTLB-load-misses/ #    1.168 K/sec         (33.32%)
                  61,661      cpu_atom/iTLB-load-misses/ #    1.913 K/sec         (50.03%)
         <not supported>      cpu_core/L1-dcache-prefetches/
         <not supported>      cpu_atom/L1-dcache-prefetches/
         <not supported>      cpu_core/L1-dcache-prefetch-misses/
         <not supported>      cpu_atom/L1-dcache-prefetch-misses/
      
               1.005466919 seconds time elapsed
      Signed-off-by: default avatarKan Liang <kan.liang@linux.intel.com>
      Acked-by: default avatarIan Rogers <irogers@google.com>
      Acked-by: default avatarNamhyung Kim <namhyung@kernel.org>
      Cc: Alexander Shishkin <alexander.shishkin@intel.com>
      Cc: Andi Kleen <ak@linux.intel.com>
      Cc: Ingo Molnar <mingo@redhat.com>
      Cc: Jiri Olsa <jolsa@kernel.org>
      Cc: Peter Zijlstra <peterz@infradead.org>
      Link: https://lore.kernel.org/r/20220721065706.2886112-5-zhengjun.xing@linux.intel.comSigned-off-by: default avatarXing Zhengjun <zhengjun.xing@linux.intel.com>
      Signed-off-by: default avatarArnaldo Carvalho de Melo <acme@redhat.com>
      cdb204ad