1. 10 Feb, 2021 1 commit
    • wanghongzhe's avatar
      seccomp: Improve performace by optimizing rmb() · a381b70a
      wanghongzhe authored
      According to Kees's suggest, we started with the patch that just replaces
      rmb() with smp_rmb() and did a performance test with UnixBench. The
      results showed the overhead about 2.53% in rmb() test compared to the
      smp_rmb() one, in a x86-64 kernel with CONFIG_SMP enabled running inside a
      qemu-kvm vm. The test is a "syscall" testcase in UnixBench, which executes
      5 syscalls in a loop during a certain timeout (100 second in our test) and
      counts the total number of executions of this 5-syscall sequence. We set
      a seccomp filter with all allow rule for all used syscalls in this test
      (which will go bitmap path) to make sure the rmb() will be executed. The
      details for the test:
      
      with rmb():
      /txm # ./syscall_allow_min 100
      COUNT|35861159|1|lps
      /txm # ./syscall_allow_min 100
      COUNT|35545501|1|lps
      /txm # ./syscall_allow_min 100
      COUNT|35664495|1|lps
      
      with smp_rmb():
      /txm # ./syscall_allow_min 100
      COUNT|36552771|1|lps
      /txm # ./syscall_allow_min 100
      COUNT|36491247|1|lps
      /txm # ./syscall_allow_min 100
      COUNT|36504746|1|lps
      
      For a x86-64 kernel with CONFIG_SMP enabled, the smp_rmb() is just a
      compiler barrier() which have no impact in runtime, while rmb() is a
      lfence which will prevent all memory access operations (not just load
      according the recently claim by Intel) behind itself. We can also figure
      it out in disassembly:
      
      with rmb():
      0000000000001430 <__seccomp_filter>:
          1430:   41 57                   push   %r15
          1432:   41 56                   push   %r14
          1434:   41 55                   push   %r13
          1436:   41 54                   push   %r12
          1438:   55                      push   %rbp
          1439:   53                      push   %rbx
          143a:   48 81 ec 90 00 00 00    sub    $0x90,%rsp
          1441:   89 7c 24 10             mov    %edi,0x10(%rsp)
          1445:   89 54 24 14             mov    %edx,0x14(%rsp)
          1449:   65 48 8b 04 25 28 00    mov    %gs:0x28,%rax
          1450:   00 00
          1452:   48 89 84 24 88 00 00    mov    %rax,0x88(%rsp)
          1459:   00
          145a:   31 c0                   xor    %eax,%eax
      *   145c:   0f ae e8                lfence
          145f:   48 85 f6                test   %rsi,%rsi
          1462:   49 89 f4                mov    %rsi,%r12
          1465:   0f 84 42 03 00 00       je     17ad <__seccomp_filter+0x37d>
          146b:   65 48 8b 04 25 00 00    mov    %gs:0x0,%rax
          1472:   00 00
          1474:   48 8b 98 80 07 00 00    mov    0x780(%rax),%rbx
          147b:   48 85 db                test   %rbx,%rbx
      
      with smp_rmb();
      0000000000001430 <__seccomp_filter>:
          1430:   41 57                   push   %r15
          1432:   41 56                   push   %r14
          1434:   41 55                   push   %r13
          1436:   41 54                   push   %r12
          1438:   55                      push   %rbp
          1439:   53                      push   %rbx
          143a:   48 81 ec 90 00 00 00    sub    $0x90,%rsp
          1441:   89 7c 24 10             mov    %edi,0x10(%rsp)
          1445:   89 54 24 14             mov    %edx,0x14(%rsp)
          1449:   65 48 8b 04 25 28 00    mov    %gs:0x28,%rax
          1450:   00 00
          1452:   48 89 84 24 88 00 00    mov    %rax,0x88(%rsp)
          1459:   00
          145a:   31 c0                   xor    %eax,%eax
          145c:   48 85 f6                test   %rsi,%rsi
          145f:   49 89 f4                mov    %rsi,%r12
          1462:   0f 84 42 03 00 00       je     17aa <__seccomp_filter+0x37a>
          1468:   65 48 8b 04 25 00 00    mov    %gs:0x0,%rax
          146f:   00 00
          1471:   48 8b 98 80 07 00 00    mov    0x780(%rax),%rbx
          1478:   48 85 db                test   %rbx,%rbx
      Signed-off-by: default avatarwanghongzhe <wanghongzhe@huawei.com>
      Signed-off-by: default avatarKees Cook <keescook@chromium.org>
      Link: https://lore.kernel.org/r/1612496049-32507-1-git-send-email-wanghongzhe@huawei.com
      a381b70a
  2. 11 Jan, 2021 1 commit
  3. 03 Jan, 2021 1 commit
  4. 02 Jan, 2021 3 commits
    • Linus Torvalds's avatar
      Merge tag 's390-5.11-3' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux · 3516bd72
      Linus Torvalds authored
      Pull s390 cleanups from Vasily Gorbik:
       "Update defconfigs and sort config select list"
      
      * tag 's390-5.11-3' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux:
        s390/Kconfig: sort config S390 select list once again
        s390: update defconfigs
      3516bd72
    • Linus Torvalds's avatar
      Merge tag 'pm-5.11-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm · d9296a7b
      Linus Torvalds authored
      Pull power management fixes from Rafael Wysocki:
       "These fix a crash in intel_pstate during resume from suspend-to-RAM
        that may occur after recent changes and two resource leaks in error
        paths in the operating performance points (OPP) framework, add a new
        C-states table to intel_idle and update the cpuidle MAINTAINERS entry
        to cover the governors too.
      
        Specifics:
      
         - Fix recently introduced crash in the intel_pstate driver that
           occurs if scale-invariance is disabled during resume from
           suspend-to-RAM due to inconsistent changes of APERF or MPERF MSR
           values made by the platform firmware (Rafael Wysocki).
      
         - Fix a memory leak and add a missing clk_put() in error paths in the
           OPP framework (Quanyang Wang, Viresh Kumar).
      
         - Add new C-states table for SnowRidge processors to the intel_idle
           driver (Artem Bityutskiy).
      
         - Update the MAINTAINERS entry for cpuidle to make it clear that the
           governors are covered by it too (Lukas Bulwahn)"
      
      * tag 'pm-5.11-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm:
        intel_idle: add SnowRidge C-state table
        cpufreq: intel_pstate: Fix fast-switch fallback path
        opp: Call the missing clk_put() on error
        opp: fix memory leak in _allocate_opp_table
        MAINTAINERS: include governors into CPU IDLE TIME MANAGEMENT FRAMEWORK
      d9296a7b
    • Rafael J. Wysocki's avatar
      Merge branches 'pm-cpufreq' and 'pm-cpuidle' · 89ecf09e
      Rafael J. Wysocki authored
      * pm-cpufreq:
        cpufreq: intel_pstate: Fix fast-switch fallback path
      
      * pm-cpuidle:
        intel_idle: add SnowRidge C-state table
        MAINTAINERS: include governors into CPU IDLE TIME MANAGEMENT FRAMEWORK
      89ecf09e
  5. 01 Jan, 2021 4 commits
    • Linus Torvalds's avatar
      Merge tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi · eda809ae
      Linus Torvalds authored
      Pull SCSI fixes from James Bottomley:
       "This is a load of driver fixes (12 ufs, 1 mpt3sas, 1 cxgbi).
      
        The big core two fixes are for power management ("block: Do not accept
        any requests while suspended" and "block: Fix a race in the runtime
        power management code") which finally sorts out the resume problems
        we've occasionally been having.
      
        To make the resume fix, there are seven necessary precursors which
        effectively renames REQ_PREEMPT to REQ_PM, so every "special" request
        in block is automatically a power management exempt one.
      
        All of the non-PM preempt cases are removed except for the one in the
        SCSI Parallel Interface (spi) domain validation which is a genuine
        case where we have to run requests at high priority to validate the
        bus so this becomes an autopm get/put protected request"
      
      * tag 'scsi-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi: (22 commits)
        scsi: cxgb4i: Fix TLS dependency
        scsi: ufs: Un-inline ufshcd_vops_device_reset function
        scsi: ufs: Re-enable WriteBooster after device reset
        scsi: ufs-mediatek: Use correct path to fix compile error
        scsi: mpt3sas: Signedness bug in _base_get_diag_triggers()
        scsi: block: Do not accept any requests while suspended
        scsi: block: Remove RQF_PREEMPT and BLK_MQ_REQ_PREEMPT
        scsi: core: Only process PM requests if rpm_status != RPM_ACTIVE
        scsi: scsi_transport_spi: Set RQF_PM for domain validation commands
        scsi: ide: Mark power management requests with RQF_PM instead of RQF_PREEMPT
        scsi: ide: Do not set the RQF_PREEMPT flag for sense requests
        scsi: block: Introduce BLK_MQ_REQ_PM
        scsi: block: Fix a race in the runtime power management code
        scsi: ufs-pci: Enable UFSHCD_CAP_RPM_AUTOSUSPEND for Intel controllers
        scsi: ufs-pci: Fix recovery from hibernate exit errors for Intel controllers
        scsi: ufs-pci: Ensure UFS device is in PowerDown mode for suspend-to-disk ->poweroff()
        scsi: ufs-pci: Fix restore from S4 for Intel controllers
        scsi: ufs-mediatek: Keep VCC always-on for specific devices
        scsi: ufs: Allow regulators being always-on
        scsi: ufs: Clear UAC for RPMB after ufshcd resets
        ...
      eda809ae
    • Linus Torvalds's avatar
      Merge tag 'block-5.11-2021-01-01' of git://git.kernel.dk/linux-block · 8b4805c6
      Linus Torvalds authored
      Pull block fixes from Jens Axboe:
       "Two minor block fixes from this last week that should go into 5.11:
      
         - Add missing NOWAIT debugfs definition (Andres)
      
         - Fix kerneldoc warning introduced this merge window (Randy)"
      
      * tag 'block-5.11-2021-01-01' of git://git.kernel.dk/linux-block:
        block: add debugfs stanza for QUEUE_FLAG_NOWAIT
        fs: block_dev.c: fix kernel-doc warnings from struct block_device changes
      8b4805c6
    • Linus Torvalds's avatar
      Merge tag 'io_uring-5.11-2021-01-01' of git://git.kernel.dk/linux-block · dc3e24b2
      Linus Torvalds authored
      Pull io_uring fixes from Jens Axboe:
       "A few fixes that should go into 5.11, all marked for stable as well:
      
         - Fix issue around identity COW'ing and users that share a ring
           across processes
      
         - Fix a hang associated with unregistering fixed files (Pavel)
      
         - Move the 'process is exiting' cancelation a bit earlier, so
           task_works aren't affected by it (Pavel)"
      
      * tag 'io_uring-5.11-2021-01-01' of git://git.kernel.dk/linux-block:
        kernel/io_uring: cancel io_uring before task works
        io_uring: fix io_sqe_files_unregister() hangs
        io_uring: add a helper for setting a ref node
        io_uring: don't assume mm is constant across submits
      dc3e24b2
    • Linus Torvalds's avatar
      depmod: handle the case of /sbin/depmod without /sbin in PATH · cedd1862
      Linus Torvalds authored
      Commit 436e980e ("kbuild: don't hardcode depmod path") stopped
      hard-coding the path of depmod, but in the process caused trouble for
      distributions that had that /sbin location, but didn't have it in the
      PATH (generally because /sbin is limited to the super-user path).
      
      Work around it for now by just adding /sbin to the end of PATH in the
      depmod.sh script.
      Reported-and-tested-by: default avatarSedat Dilek <sedat.dilek@gmail.com>
      Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
      cedd1862
  6. 31 Dec, 2020 3 commits
  7. 30 Dec, 2020 6 commits
  8. 29 Dec, 2020 20 commits
  9. 28 Dec, 2020 1 commit