1. 02 Aug, 2022 40 commits
    • Logan Gunthorpe's avatar
      md/raid5: Pivot raid5_make_request() · 7e55c60a
      Logan Gunthorpe authored
      raid5_make_request() loops through every page in the request,
      finds the appropriate stripe and adds the bio for that page in the
      disk.
      
      This causes a great deal of contention on the hash_lock and extra
      work seeing each stripe must be found once for every data disk.
      
      The number of times a stripe must be found can be reduced by pivoting
      raid5_make_request() so that it loops through every stripe and then
      loops through every disk in that stripe to see if the bio must be
      added. This reduces the number of times the hash lock must be taken
      by a factor equal to the number of data disks.
      
      To accomplish this, the logical sectors that have already been added
      must be tracked. Tracking them is done with a bitmap: the bits
      for all pages are set at the start of the request and each bit
      is cleared once the bio is added to a stripe.
      
      Finding the next sector to be done is then just a call to
      find_first_bit() so that sectors that have been done can simply be
      skipped.
      
      One minor downside is that the maximum sectors for a request must be
      limited so that the bitmap can be appropriately sized on the stack.
      This limit is arbitrarily chosen to be 256 stripe pages which works out
      to 1MB if PAGE_SIZE == DEFAULT_STRIPE_SIZE. This doesn't actually
      restrict the maximum request further seeing the default block queue
      settings are used which restricts the number of segments to 128 (which
      results in request sizes that are approximately 512KB).
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      7e55c60a
    • Logan Gunthorpe's avatar
      md/raid5: Check all disks in a stripe_head for reshape progress · 486f6055
      Logan Gunthorpe authored
      When testing if a previous stripe has had reshape expand past it, use
      the earliest or latest logical sector in all the disks for that stripe
      head. This will allow adding multiple disks at a time in a subesquent
      patch.
      
      To do this cleaner, refactor the check into a helper function called
      stripe_ahead_of_reshape().
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      486f6055
    • Logan Gunthorpe's avatar
      md/raid5: Refactor add_stripe_bio() · 4ad1d984
      Logan Gunthorpe authored
      Factor out two helper functions from add_stripe_bio(): one to check for
      overlap (stripe_bio_overlaps()), and one to actually add the bio to the
      stripe (__add_stripe_bio()). The latter function will always succeed.
      
      This will be useful in the next patch so that overlap can be checked for
      multiple disks before adding any
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      4ad1d984
    • Logan Gunthorpe's avatar
      md/raid5: Keep a reference to last stripe_head for batch · 3312e6c8
      Logan Gunthorpe authored
      When batching, every stripe head has to find the previous stripe head to
      add to the batch list. This involves taking the hash lock which is
      highly contended during IO.
      
      Instead of finding the previous stripe_head each time, store a
      reference to the previous stripe_head in a pointer so that it doesn't
      require taking the contended lock another time.
      
      The reference to the previous stripe must be released before scheduling
      and waiting for work to get done. Otherwise, it can hold up
      raid5_activate_delayed() and deadlock.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Acked-by: default avatarGuoqing Jiang <guoqing.jiang@linux.dev>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      3312e6c8
    • Logan Gunthorpe's avatar
      md/raid5: Refactor for loop in raid5_make_request() into while loop · 0a2d1694
      Logan Gunthorpe authored
      The for loop with retry label can be more cleanly expressed as a while
      loop by moving the logical_sector increment into the success path.
      
      No functional changes intended.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      0a2d1694
    • Logan Gunthorpe's avatar
      md/raid5: Move read_seqcount_begin() into make_stripe_request() · 4f354560
      Logan Gunthorpe authored
      Now that prepare_to_wait() isn't in the way, move read_sequcount_begin()
      into make_stripe_request().
      
      No functional changes intended.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Reviewed-by: default avatarGuoqing Jiang <guoqing.jiang@linux.dev>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      4f354560
    • Logan Gunthorpe's avatar
      md/raid5: Drop the do_prepare flag in raid5_make_request() · 1cdb5b41
      Logan Gunthorpe authored
      prepare_to_wait() can be reasonably called after schedule instead of
      setting a flag and preparing in the next loop iteration.
      
      This means that prepare_to_wait() will be called before
      read_seqcount_begin(), but there shouldn't be any reason that the order
      matters here. On the first iteration of the loop prepare_to_wait() is
      already called first.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Reviewed-by: default avatarGuoqing Jiang <guoqing.jiang@linux.dev>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      1cdb5b41
    • Logan Gunthorpe's avatar
      md/raid5: Factor out helper from raid5_make_request() loop · f4aec6a0
      Logan Gunthorpe authored
      Factor out the inner loop of raid5_make_request() into it's own helper
      called make_stripe_request().
      
      The helper returns a number of statuses: SUCCESS, RETRY,
      SCHEDULE_AND_RETRY and FAIL. This makes the code a bit easier to
      understand and allows the SCHEDULE_AND_RETRY path to be made common.
      
      A context structure is added to contain do_flush. It will be used
      more in subsequent patches for state that needs to be kept
      outside the loop.
      
      No functional changes intended. This will be cleaned up further in
      subsequent patches to untangle the gen_lock and do_prepare logic
      further.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      f4aec6a0
    • Logan Gunthorpe's avatar
      md/raid5: Move common stripe get code into new find_get_stripe() helper · 1baa1126
      Logan Gunthorpe authored
      Both uses of find_stripe() require a fairly complicated dance to
      increment the reference count. Move this into a common find_get_stripe()
      helper.
      
      No functional changes intended.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Acked-by: default avatarGuoqing Jiang <guoqing.jiang@linux.dev>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      1baa1126
    • Logan Gunthorpe's avatar
      md/raid5: Move stripe_add_to_batch_list() call out of add_stripe_bio() · 8757fef6
      Logan Gunthorpe authored
      stripe_add_to_batch_list() is better done in the loop in make_request
      instead of inside add_stripe_bio(). This is clearer and allows for
      storing the batch_head state outside the loop in a subsequent patch.
      
      The call to add_stripe_bio() in retry_aligned_read() is for read
      and batching only applies to write. So it's impossible for batching
      to happen at that call site.
      
      No functional changes intended.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Reviewed-by: default avatarGuoqing Jiang <guoqing.jiang@linux.dev>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      8757fef6
    • Logan Gunthorpe's avatar
      md/raid5: Refactor raid5_make_request loop · 27fb7010
      Logan Gunthorpe authored
      Break immediately if raid5_get_active_stripe() returns NULL and deindent
      the rest of the loop. Annotate this check with an unlikely().
      
      This makes the code easier to read and reduces the indentation level.
      
      No functional changes intended.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Reviewed-by: default avatarGuoqing Jiang <guoqing.jiang@linux.dev>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      27fb7010
    • Logan Gunthorpe's avatar
      md/raid5: Factor out ahead_of_reshape() function · a8bb304c
      Logan Gunthorpe authored
      There are a few uses of an ugly ternary operator in raid5_make_request()
      to check if a sector is a head of a reshape sector.
      
      Factor this out into a simple helper called ahead_of_reshape().
      
      No functional changes intended.
      Suggested-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      a8bb304c
    • Logan Gunthorpe's avatar
      md/raid5: Make logic blocking check consistent with logic that blocks · 6e3f50d3
      Logan Gunthorpe authored
      The check in raid5_make_request differs very slightly from the logic
      that causes it to block lower down. This likely does not cause a bug
      as the check is fuzzy anyway (as reshape may move on between the first
      check and the subsequent check). However, make it consistent so it can
      be cleaned up in a subsequent patch.
      
      The condition which causes the schedule is:
      
       !(mddev->reshape_backwards ? logical_sector < conf->reshape_progress :
         logical_sector >= conf->reshape_progress) &&
        (mddev->reshape_backwards ? logical_sector < conf->reshape_safe :
         logical_sector >= conf->reshape_safe)
      
      The condition that causes the early bailout is made to match this.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      6e3f50d3
    • Guoqing Jiang's avatar
      md: unlock mddev before reap sync_thread in action_store · 9dfbdafd
      Guoqing Jiang authored
      Since the bug which commit 8b48ec23 ("md: don't unregister sync_thread
      with reconfig_mutex held") fixed is related with action_store path, other
      callers which reap sync_thread didn't need to be changed.
      
      Let's pull md_unregister_thread from md_reap_sync_thread, then fix previous
      bug with belows.
      
      1. unlock mddev before md_reap_sync_thread in action_store.
      2. save reshape_position before unlock, then restore it to ensure position
         not changed accidentally by others.
      Signed-off-by: default avatarGuoqing Jiang <guoqing.jiang@linux.dev>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      9dfbdafd
    • Chris Webb's avatar
      md: Explicitly create command-line configured devices · 05ce7fb9
      Chris Webb authored
      Boot-time assembly of arrays with md= command-line arguments breaks when
      CONFIG_BLOCK_LEGACY_AUTOLOAD is unset. md_setup_drive() in md-autodetect.c
      calls blkdev_get_by_dev(), assuming this implicitly creates the block
      device.
      
      Fix this by attempting to md_alloc() the array first. As in the probe path,
      ignore any error as failure is caught by blkdev_get_by_dev() anyway.
      Signed-off-by: default avatarChris Webb <chris@arachsys.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      05ce7fb9
    • Logan Gunthorpe's avatar
      md: Notify sysfs sync_completed in md_reap_sync_thread() · 9973f0fa
      Logan Gunthorpe authored
      The mdadm test 07layouts randomly produces a kernel hung task deadlock.
      The deadlock is caused by the suspend_lo/suspend_hi files being set by
      the mdadm background process during reshape and not being cleared
      because the process hangs. (Leaving aside the issue of the fragility of
      freezing kernel tasks by buggy userspace processes...)
      
      When the background mdadm process hangs it, is waiting (without a
      timeout) on a change to the sync_completed file signalling that the
      reshape has completed. The process is woken up a couple times when
      the reshape finishes but it is woken up before MD_RECOVERY_RUNNING
      is cleared so sync_completed_show() reports 0 instead of "none".
      
      To fix this, notify the sysfs file in md_reap_sync_thread() after
      MD_RECOVERY_RUNNING has been cleared. This wakes up mdadm and causes
      it to continue and write to suspend_lo/suspend_hi to allow IO to
      continue.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      9973f0fa
    • Logan Gunthorpe's avatar
      md: Ensure resync is reported after it starts · b368856a
      Logan Gunthorpe authored
      The 07layouts test in mdadm fails on some systems. The failure
      presents itself as the backup file not being removed before the next
      layout is grown into:
      
        mdadm: /dev/md0: cannot create backup file /tmp/md-test-backup:
            File exists
      
      This is because the background mdadm process, which is responsible for
      cleaning up this backup file gets into an infinite loop waiting for
      the reshape to start. mdadm checks the mdstat file if a reshape is
      going and, if it is not, it waits for an event on the file or times
      out in 5 seconds. On faster machines, the reshape may complete before
      the 5 seconds times out, and thus the background mdadm process loops
      waiting for a reshape to start that has already occurred.
      
      mdadm reads the mdstat file to start, but mdstat does not report that the
      reshape has begun, even though it has indeed begun. So the mdstat_wait()
      call (in mdadm) which polls on the mdstat file won't ever return until
      timing out.
      
      The reason mdstat reports the reshape has started is due to an issue
      in status_resync(). recovery_active is subtracted from curr_resync which
      will result in a value of zero for the first chunk of reshaped data, and
      the resulting read will report no reshape in progress.
      
      To fix this, if "resync - recovery_active" is an overloaded value, force
      the value to be MD_RESYNC_ACTIVE so the code reports a resync in progress.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      b368856a
    • Logan Gunthorpe's avatar
      md: Use enum for overloaded magic numbers used by mddev->curr_resync · eac58d08
      Logan Gunthorpe authored
      Comments in the code document special values used for
      mddev->curr_resync. Make this clearer by using an enum to label these
      values.
      
      The only functional change is a couple places use the wrong comparison
      operator that implied 3 is another special value. They are all
      fixed to imply that 3 or greater is an active resync.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      eac58d08
    • Logan Gunthorpe's avatar
      md/raid5-cache: Annotate pslot with __rcu notation · 6f28c5c3
      Logan Gunthorpe authored
      radix_tree_lookup_slot() and radix_tree_replace_slot() API expect the
      slot returned and looked up to be marked with __rcu. Otherwise
      sparse warnings are generated:
      
        drivers/md/raid5-cache.c:2939:23: warning: incorrect type in
      			assignment (different address spaces)
        drivers/md/raid5-cache.c:2939:23:    expected void **pslot
        drivers/md/raid5-cache.c:2939:23:    got void [noderef] __rcu **
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      6f28c5c3
    • Logan Gunthorpe's avatar
      md/raid5-cache: Clear conf->log after finishing work · b13015af
      Logan Gunthorpe authored
      A NULL pointer dereferlence on conf->log is seen randomly with
      the mdadm test 21raid5cache. Kasan reporst:
      
      BUG: KASAN: null-ptr-deref in r5l_reclaimable_space+0xf5/0x140
      Read of size 8 at addr 0000000000000860 by task md0_reclaim/3086
      
      Call Trace:
        dump_stack_lvl+0x5a/0x74
        kasan_report.cold+0x5f/0x1a9
        __asan_load8+0x69/0x90
        r5l_reclaimable_space+0xf5/0x140
        r5l_do_reclaim+0xf4/0x5e0
        r5l_reclaim_thread+0x69/0x3b0
        md_thread+0x1a2/0x2c0
        kthread+0x177/0x1b0
        ret_from_fork+0x22/0x30
      
      This is caused by conf->log being cleared in r5l_exit_log() before
      stopping the reclaim thread.
      
      To fix this, clear conf->log after the reclaim_thread is unregistered
      and after flushing disable_writeback_work.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      b13015af
    • Logan Gunthorpe's avatar
      md/raid5-cache: Drop RCU usage of conf->log · 7769085c
      Logan Gunthorpe authored
      The only place that uses RCU to access conf->log is in
      r5l_log_disk_error(). This function is mostly used in the IO path
      and once with mddev_lock() held in raid5_change_consistency_policy().
      
      It is known that the IO will be suspended before the log is freed and
      r5l_log_exit() is called with the mddev_lock() held.
      
      This should mean that conf->log can not be freed while the function is
      being called, so the RCU protection is not necessary. Drop the
      rcu_read_lock() as well as the synchronize_rcu() and
      rcu_assign_pointer() usage.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      7769085c
    • Logan Gunthorpe's avatar
      md/raid5-cache: Take mddev_lock in r5c_journal_mode_show() · 78ede6a0
      Logan Gunthorpe authored
      The mddev->lock spinlock doesn't protect against the removal of
      conf->log in r5l_exit_log() so conf->log may be freed before it
      is used.
      
      To fix this, take the mddev_lock() insteaad of the mddev->lock spinlock.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      78ede6a0
    • Logan Gunthorpe's avatar
      md/raid5: suspend the array for calls to log_exit() · c629f345
      Logan Gunthorpe authored
      The raid5-cache code relies on there being no IO in flight when
      log_exit() is called. There are two places where this is not
      guaranteed so add mddev_suspend() and mddev_resume() calls to these
      sites.
      
      The site in raid5_change_consistency_policy() is in the error path,
      and another similar call site already has suspend/resume calls just
      below it; so it should be equally safe to make that change here.
      
      There is one remaining site in raid5_remove_disk() that we call log_exit()
      without suspending the array. Unfortunately, as the comment stated, we
      cannot call mddev_suspend from raid5d.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      c629f345
    • Logan Gunthorpe's avatar
      md/raid5-ppl: Drop unused argument from ppl_handle_flush_request() · e0fccdaf
      Logan Gunthorpe authored
      ppl_handle_flush_request() takes an struct r5log argument but doesn't
      use it. It has no buisiness taking this argument as it is only used
      by raid5-cache and has no way to derference it anyway. Remove
      the argument.
      
      No functional changes intended.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      e0fccdaf
    • Logan Gunthorpe's avatar
      md/raid5-log: Drop extern decorators for function prototypes · ed0c6a5f
      Logan Gunthorpe authored
      extern is not necessary and recommended against when defining prototype
      functions in headers. checkpatch.pl complains about these. So remove
      them.
      Signed-off-by: default avatarLogan Gunthorpe <logang@deltatee.com>
      Reviewed-by: default avatarChristoph Hellwig <hch@lst.de>
      Signed-off-by: default avatarSong Liu <song@kernel.org>
      Signed-off-by: default avatarJens Axboe <axboe@kernel.dk>
      ed0c6a5f
    • Song Liu's avatar
    • Lars Ellenberg's avatar
      drbd: bm_page_async_io: fix spurious bitmap "IO error" on large volumes · 66757001
      Lars Ellenberg authored
      We usually do all our bitmap IO in units of PAGE_SIZE.
      
      With very small or oddly sized external meta data, or with
      PAGE_SIZE != 4k, it can happen that our last on-disk bitmap page
      is not fully PAGE_SIZE aligned, so we may need to adjust the size
      of the IO.
      
      We used to do that with
        min_t(unsigned int, PAGE_SIZE,
      	last_allowed_sector - current_offset);
      And for just the right diff, (unsigned int)(diff) will result in 0.
      
      A bio of length 0 will correctly be rejected with an IO error
      (and some scary WARN_ON_ONCE()) by the scsi layer.
      
      Do the calculation properly.
      Signed-off-by: default avatarLars Ellenberg <lars.ellenberg@linbit.com>
      Signed-off-by: default avatarChristoph Böhmwalder <christoph.boehmwalder@linbit.com>
      Link: https://lore.kernel.org/r/20220622204932.196830-1-christoph.boehmwalder@linbit.comSigned-off-by: default avatarJens Axboe <axboe@kernel.dk>
      66757001
    • Linus Torvalds's avatar
      Merge tag 'for-6.0/dm-changes' of... · 8374cfe6
      Linus Torvalds authored
      Merge tag 'for-6.0/dm-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm
      
      Pull device mapper updates from Mike Snitzer:
      
       - Refactor DM core's mempool allocation so that it clearer by not being
         split acorss files.
      
       - Improve DM core's BLK_STS_DM_REQUEUE and BLK_STS_AGAIN handling.
      
       - Optimize DM core's more common bio splitting by eliminating the use
         of bio cloning with bio_split+bio_chain. Shift that cloning cost to
         the relatively unlikely dm_io requeue case that only occurs during
         error handling. Introduces dm_io_rewind() that will clone a bio that
         reflects the subset of the original bio that must be requeued.
      
       - Remove DM core's dm_table_get_num_targets() wrapper and audit all
         dm_table_get_target() callers.
      
       - Fix potential for OOM with DM writecache target by setting a default
         MAX_WRITEBACK_JOBS (set to 256MiB or 1/16 of total system memory,
         whichever is smaller).
      
       - Fix DM writecache target's stats that are reported through
         DM-specific table info.
      
       - Fix use-after-free crash in dm_sm_register_threshold_callback().
      
       - Refine DM core's Persistent Reservation handling in preparation for
         broader work Mike Christie is doing to add compatibility with
         Microsoft Windows Failover Cluster.
      
       - Fix various KASAN reported bugs in the DM raid target.
      
       - Fix DM raid target crash due to md_handle_request() bio splitting
         that recurses to block core without properly initializing the bio's
         bi_dev.
      
       - Fix some code comment typos and fix some Documentation formatting.
      
      * tag 'for-6.0/dm-changes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm: (29 commits)
        dm: fix dm-raid crash if md_handle_request() splits bio
        dm raid: fix address sanitizer warning in raid_resume
        dm raid: fix address sanitizer warning in raid_status
        dm: Start pr_preempt from the same starting path
        dm: Fix PR release handling for non All Registrants
        dm: Start pr_reserve from the same starting path
        dm: Allow dm_call_pr to be used for path searches
        dm: return early from dm_pr_call() if DM device is suspended
        dm thin: fix use-after-free crash in dm_sm_register_threshold_callback
        dm writecache: count number of blocks discarded, not number of discard bios
        dm writecache: count number of blocks written, not number of write bios
        dm writecache: count number of blocks read, not number of read bios
        dm writecache: return void from functions
        dm kcopyd: use __GFP_HIGHMEM when allocating pages
        dm writecache: set a default MAX_WRITEBACK_JOBS
        Documentation: dm writecache: Render status list as list
        Documentation: dm writecache: add blank line before optional parameters
        dm snapshot: fix typo in snapshot_map() comment
        dm raid: remove redundant "the" in parse_raid_params() comment
        dm cache: fix typo in 2 comment blocks
        ...
      8374cfe6
    • Linus Torvalds's avatar
      Merge tag 'for-5.20/block-2022-07-29' of git://git.kernel.dk/linux-block · c013d0af
      Linus Torvalds authored
      Pull block updates from Jens Axboe:
      
       - Improve the type checking of request flags (Bart)
      
       - Ensure queue mapping for a single queues always picks the right queue
         (Bart)
      
       - Sanitize the io priority handling (Jan)
      
       - rq-qos race fix (Jinke)
      
       - Reserved tags handling improvements (John)
      
       - Separate memory alignment from file/disk offset aligment for O_DIRECT
         (Keith)
      
       - Add new ublk driver, userspace block driver using io_uring for
         communication with the userspace backend (Ming)
      
       - Use try_cmpxchg() to cleanup the code in various spots (Uros)
      
       - Finally remove bdevname() (Christoph)
      
       - Clean up the zoned device handling (Christoph)
      
       - Clean up independent access range support (Christoph)
      
       - Clean up and improve block sysfs handling (Christoph)
      
       - Clean up and improve teardown of block devices.
      
         This turns the usual two step process into something that is simpler
         to implement and handle in block drivers (Christoph)
      
       - Clean up chunk size handling (Christoph)
      
       - Misc cleanups and fixes (Bart, Bo, Dan, GuoYong, Jason, Keith, Liu,
         Ming, Sebastian, Yang, Ying)
      
      * tag 'for-5.20/block-2022-07-29' of git://git.kernel.dk/linux-block: (178 commits)
        ublk_drv: fix double shift bug
        ublk_drv: make sure that correct flags(features) returned to userspace
        ublk_drv: fix error handling of ublk_add_dev
        ublk_drv: fix lockdep warning
        block: remove __blk_get_queue
        block: call blk_mq_exit_queue from disk_release for never added disks
        blk-mq: fix error handling in __blk_mq_alloc_disk
        ublk: defer disk allocation
        ublk: rewrite ublk_ctrl_get_queue_affinity to not rely on hctx->cpumask
        ublk: fold __ublk_create_dev into ublk_ctrl_add_dev
        ublk: cleanup ublk_ctrl_uring_cmd
        ublk: simplify ublk_ch_open and ublk_ch_release
        ublk: remove the empty open and release block device operations
        ublk: remove UBLK_IO_F_PREFLUSH
        ublk: add a MAINTAINERS entry
        block: don't allow the same type rq_qos add more than once
        mmc: fix disk/queue leak in case of adding disk failure
        ublk_drv: fix an IS_ERR() vs NULL check
        ublk: remove UBLK_IO_F_INTEGRITY
        ublk_drv: remove unneeded semicolon
        ...
      c013d0af
    • Linus Torvalds's avatar
      Merge tag 'for-5.20/io_uring-zerocopy-send-2022-07-29' of git://git.kernel.dk/linux-block · 42df1cbf
      Linus Torvalds authored
      Pull io_uring zerocopy support from Jens Axboe:
       "This adds support for efficient support for zerocopy sends through
        io_uring. Both ipv4 and ipv6 is supported, as well as both TCP and
        UDP.
      
        The core network changes to support this is in a stable branch from
        Jakub that both io_uring and net-next has pulled in, and the io_uring
        changes are layered on top of that.
      
        All of the work has been done by Pavel"
      
      * tag 'for-5.20/io_uring-zerocopy-send-2022-07-29' of git://git.kernel.dk/linux-block: (34 commits)
        io_uring: notification completion optimisation
        io_uring: export req alloc from core
        io_uring/net: use unsigned for flags
        io_uring/net: make page accounting more consistent
        io_uring/net: checks errors of zc mem accounting
        io_uring/net: improve io_get_notif_slot types
        selftests/io_uring: test zerocopy send
        io_uring: enable managed frags with register buffers
        io_uring: add zc notification flush requests
        io_uring: rename IORING_OP_FILES_UPDATE
        io_uring: flush notifiers after sendzc
        io_uring: sendzc with fixed buffers
        io_uring: allow to pass addr into sendzc
        io_uring: account locked pages for non-fixed zc
        io_uring: wire send zc request type
        io_uring: add notification slot registration
        io_uring: add rsrc referencing for notifiers
        io_uring: complete notifiers in tw
        io_uring: cache struct io_notif
        io_uring: add zc notification infrastructure
        ...
      42df1cbf
    • Linus Torvalds's avatar
      Merge tag 'for-5.20/io_uring-buffered-writes-2022-07-29' of git://git.kernel.dk/linux-block · 98e24746
      Linus Torvalds authored
      Pull io_uring buffered writes support from Jens Axboe:
       "This contains support for buffered writes, specifically for XFS. btrfs
        is in progress, will be coming in the next release.
      
        io_uring does support buffered writes on any file type, but since the
        buffered write path just always -EAGAIN (or -EOPNOTSUPP) any attempt
        to do so if IOCB_NOWAIT is set, any buffered write will effectively be
        handled by io-wq offload. This isn't very efficient, and we even have
        specific code in io-wq to serialize buffered writes to the same inode
        to avoid further inefficiencies with thread offload.
      
        This is particularly sad since most buffered writes don't block, they
        simply copy data to a page and dirty it. With this pull request, we
        can handle buffered writes a lot more effiently.
      
        If balance_dirty_pages() needs to block, we back off on writes as
        indicated.
      
        This improves buffered write support by 2-3x.
      
        Jan Kara helped with the mm bits for this, and Stefan handled the
        fs/iomap/xfs/io_uring parts of it"
      
      * tag 'for-5.20/io_uring-buffered-writes-2022-07-29' of git://git.kernel.dk/linux-block:
        mm: honor FGP_NOWAIT for page cache page allocation
        xfs: Add async buffered write support
        xfs: Specify lockmode when calling xfs_ilock_for_iomap()
        io_uring: Add tracepoint for short writes
        io_uring: fix issue with io_write() not always undoing sb_start_write()
        io_uring: Add support for async buffered writes
        fs: Add async write file modification handling.
        fs: Split off inode_needs_update_time and __file_update_time
        fs: add __remove_file_privs() with flags parameter
        fs: add a FMODE_BUF_WASYNC flags for f_mode
        iomap: Return -EAGAIN from iomap_write_iter()
        iomap: Add async buffered write support
        iomap: Add flags parameter to iomap_page_create()
        mm: Add balance_dirty_pages_ratelimited_flags() function
        mm: Move updates of dirty_exceeded into one place
        mm: Move starting of background writeback into the main balancing loop
      98e24746
    • Linus Torvalds's avatar
      Merge tag 'for-5.20/io_uring-2022-07-29' of git://git.kernel.dk/linux-block · b349b118
      Linus Torvalds authored
      Pull io_uring updates from Jens Axboe:
      
       - As per (valid) complaint in the last merge window, fs/io_uring.c has
         grown quite large these days. io_uring isn't really tied to fs
         either, as it supports a wide variety of functionality outside of
         that.
      
         Move the code to io_uring/ and split it into files that either
         implement a specific request type, and split some code into helpers
         as well. The code is organized a lot better like this, and io_uring.c
         is now < 4K LOC (me).
      
       - Deprecate the epoll_ctl opcode. It'll still work, just trigger a
         warning once if used. If we don't get any complaints on this, and I
         don't expect any, then we can fully remove it in a future release
         (me).
      
       - Improve the cancel hash locking (Hao)
      
       - kbuf cleanups (Hao)
      
       - Efficiency improvements to the task_work handling (Dylan, Pavel)
      
       - Provided buffer improvements (Dylan)
      
       - Add support for recv/recvmsg multishot support. This is similar to
         the accept (or poll) support for have for multishot, where a single
         SQE can trigger everytime data is received. For applications that
         expect to do more than a few receives on an instantiated socket, this
         greatly improves efficiency (Dylan).
      
       - Efficiency improvements for poll handling (Pavel)
      
       - Poll cancelation improvements (Pavel)
      
       - Allow specifiying a range for direct descriptor allocations (Pavel)
      
       - Cleanup the cqe32 handling (Pavel)
      
       - Move io_uring types to greatly cleanup the tracing (Pavel)
      
       - Tons of great code cleanups and improvements (Pavel)
      
       - Add a way to do sync cancelations rather than through the sqe -> cqe
         interface, as that's a lot easier to use for some use cases (me).
      
       - Add support to IORING_OP_MSG_RING for sending direct descriptors to a
         different ring. This avoids the usually problematic SCM case, as we
         disallow those. (me)
      
       - Make the per-command alloc cache we use for apoll generic, place
         limits on it, and use it for netmsg as well (me).
      
       - Various cleanups (me, Michal, Gustavo, Uros)
      
      * tag 'for-5.20/io_uring-2022-07-29' of git://git.kernel.dk/linux-block: (172 commits)
        io_uring: ensure REQ_F_ISREG is set async offload
        net: fix compat pointer in get_compat_msghdr()
        io_uring: Don't require reinitable percpu_ref
        io_uring: fix types in io_recvmsg_multishot_overflow
        io_uring: Use atomic_long_try_cmpxchg in __io_account_mem
        io_uring: support multishot in recvmsg
        net: copy from user before calling __get_compat_msghdr
        net: copy from user before calling __copy_msghdr
        io_uring: support 0 length iov in buffer select in compat
        io_uring: fix multishot ending when not polled
        io_uring: add netmsg cache
        io_uring: impose max limit on apoll cache
        io_uring: add abstraction around apoll cache
        io_uring: move apoll cache to poll.c
        io_uring: consolidate hash_locked io-wq handling
        io_uring: clear REQ_F_HASH_LOCKED on hash removal
        io_uring: don't race double poll setting REQ_F_ASYNC_DATA
        io_uring: don't miss setting REQ_F_DOUBLE_POLL
        io_uring: disable multishot recvmsg
        io_uring: only trace one of complete or overflow
        ...
      b349b118
    • Linus Torvalds's avatar
      Merge branch 'turbostat' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux · efb28830
      Linus Torvalds authored
      Pull turbostat updates from Len Brown:
       "Only updating the turbostat tool here, no kernel changes"
      
      * 'turbostat' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux:
        tools/power turbostat: version 2022.07.28
        tools/power turbostat: do not decode ACC for ICX and SPR
        tools/power turbostat: fix SPR PC6 limits
        tools/power turbostat: cleanup 'automatic_cstate_conversion_probe()'
        tools/power turbostat: separate SPR from ICX
        tools/power turbosstat: fix comment
        tools/power turbostat: Support RAPTORLAKE P
        tools/power turbostat: add support for ALDERLAKE_N
        tools/power turbostat: dump secondary Turbo-Ratio-Limit
        tools/power turbostat: simplify dump_turbo_ratio_limits()
        tools/power turbostat: dump CPUID.7.EDX.Hybrid
        tools/power turbostat: update turbostat.8
        tools/power turbostat: Show uncore frequency
        tools/power turbostat: Fix file pointer leak
        tools/power turbostat: replace strncmp with single character compare
        tools/power turbostat: print the kernel boot commandline
        tools/power turbostat: Introduce support for RaptorLake
      efb28830
    • Linus Torvalds's avatar
      Merge tag 'thermal-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm · c1dbe9a1
      Linus Torvalds authored
      Pull thermal control updates from Rafael Wysocki:
       "These start a rework of the handling of trip points in the thermal
        core, improve the cpufreq/devfreq cooling device handling, update some
        thermal control drivers and the tmon utility and clean up code.
      
        Specifics:
      
         - Consolidate the thermal core code by beginning to move the thermal
           trip structure from the thermal OF code as a generic structure to
           be used by the different sensors when registering a thermal zone
           (Daniel Lezcano).
      
         - Make per cpufreq / devfreq cooling device ops instead of using a
           global variable, fix comments and rework the trace information
           (Lukasz Luba).
      
         - Add the include/dt-bindings/thermal.h under the area covered by the
           thermal maintainer in the MAINTAINERS file (Lukas Bulwahn).
      
         - Improve the error output by giving the sensor identification when a
           thermal zone failed to initialize, the DT bindings by changing the
           positive logic and adding the r8a779f0 support on the rcar3
           (Wolfram Sang).
      
         - Convert the QCom tsens DT binding to the dtsformat format
           (Krzysztof Kozlowski).
      
         - Remove the pointless get_trend() function in the QCom, Ux500 and
           tegra thermal drivers, along with the unused DROP_FULL and
           RAISE_FULL trends definitions. Simplify the code by using clamp()
           macros (Daniel Lezcano).
      
         - Fix ref_table memory leak at probe time on the k3_j72xx bandgap
           (Bryan Brattlof).
      
         - Fix array underflow in prep_lookup_table (Dan Carpenter).
      
         - Add static annotation to the k3_j72xx_bandgap_j7* data structure
           (Jin Xiaoyun).
      
         - Fix typos in comments detected on sun8i by Coccinelle (Julia
           Lawall).
      
         - Fix typos in comments on rzg2l (Biju Das).
      
         - Remove as unnecessary call to dev_err() as the error is already
           printed by the failing function on u8500 (Yang Li).
      
         - Register the thermal zones as hwmon sensors for the Qcom thermal
           sensors (Dmitry Baryshkov).
      
         - Fix 'tmon' tool compilation issue by adding phtread.h include
           (Markus Mayer).
      
         - Fix typo in the comments for the 'tmon' tool (Slark Xiao).
      
         - Make the thermal core use ida_alloc()/free() directly instead of
           ida_simple_get()/ida_simple_remove() that have been deprecated
           (keliu).
      
         - Drop ACPI_FADT_LOW_POWER_S0 check from the Intel PCH thermal
           control driver (Rafael Wysocki)"
      
      * tag 'thermal-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm: (39 commits)
        thermal/of: Initialize trip points separately
        thermal/of: Use thermal trips stored in the thermal zone
        thermal/core: Add thermal_trip in thermal_zone
        thermal/core: Rename 'trips' to 'num_trips'
        thermal/core: Move thermal_set_delay_jiffies to static
        thermal/core: Remove unneeded EXPORT_SYMBOLS
        thermal/of: Move thermal_trip structure to thermal.h
        thermal/of: Remove the device node pointer for thermal_trip
        thermal/of: Replace device node match with device node search
        thermal/core: Remove duplicate information when an error occurs
        thermal/core: Avoid calling ->get_trip_temp() unnecessarily
        thermal/tools/tmon: Fix typo 'the the' in comment
        thermal/tools/tmon: Include pthread and time headers in tmon.h
        thermal/ti-soc-thermal: Fix comment typo
        thermal/drivers/qcom/spmi-adc-tm5: Register thermal zones as hwmon sensors
        thermal/drivers/qcom/temp-alarm: Register thermal zones as hwmon sensors
        thermal/drivers/u8500: Remove unnecessary print function dev_err()
        thermal/drivers/rzg2l: Fix comments
        thermal/drivers/sun8i: Fix typo in comment
        thermal/drivers/k3_j72xx_bandgap: Make k3_j72xx_bandgap_j721e_data and k3_j72xx_bandgap_j7200_data static
        ...
      c1dbe9a1
    • Linus Torvalds's avatar
      Merge tag 'pm-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm · a771ea64
      Linus Torvalds authored
      Pull power management updates from Rafael Wysocki:
       "These are mostly minor improvements all over including new CPU IDs for
        the Intel RAPL driver, an Energy Model rework to use micro-Watt as the
        power unit, cpufreq fixes and cleanus, cpuidle updates, devfreq
        updates, documentation cleanups and a new version of the pm-graph
        suite of utilities.
      
        Specifics:
      
         - Make cpufreq_show_cpus() more straightforward (Viresh Kumar).
      
         - Drop unnecessary CPU hotplug locking from store() used by cpufreq
           sysfs attributes (Viresh Kumar).
      
         - Make the ACPI cpufreq driver support the boost control interface on
           Zhaoxin/Centaur processors (Tony W Wang-oc).
      
         - Print a warning message on attempts to free an active cpufreq
           policy which should never happen (Viresh Kumar).
      
         - Fix grammar in the Kconfig help text for the loongson2 cpufreq
           driver (Randy Dunlap).
      
         - Use cpumask_var_t for an on-stack CPU mask in the ondemand cpufreq
           governor (Zhao Liu).
      
         - Add trace points for guest_halt_poll_ns grow/shrink to the haltpoll
           cpuidle driver (Eiichi Tsukata).
      
         - Modify intel_idle to treat C1 and C1E as independent idle states on
           Sapphire Rapids (Artem Bityutskiy).
      
         - Extend support for wakeirq to callback wrappers used during system
           suspend and resume (Ulf Hansson).
      
         - Defer waiting for device probe before loading a hibernation image
           till the first actual device access to avoid possible deadlocks
           reported by syzbot (Tetsuo Handa).
      
         - Unify device_init_wakeup() for PM_SLEEP and !PM_SLEEP (Bjorn
           Helgaas).
      
         - Add Raptor Lake-P to the list of processors supported by the Intel
           RAPL driver (George D Sworo).
      
         - Add Alder Lake-N and Raptor Lake-P to the list of processors for
           which Power Limit4 is supported in the Intel RAPL driver (Sumeet
           Pawnikar).
      
         - Make pm_genpd_remove() check genpd_debugfs_dir against NULL before
           attempting to remove it (Hsin-Yi Wang).
      
         - Change the Energy Model code to represent power in micro-Watts and
           adjust its users accordingly (Lukasz Luba).
      
         - Add new devfreq driver for Mediatek CCI (Cache Coherent
           Interconnect) (Johnson Wang).
      
         - Convert the Samsung Exynos SoC Bus bindings to DT schema of
           exynos-bus.c (Krzysztof Kozlowski).
      
         - Address kernel-doc warnings by adding the description for unused
           function parameters in devfreq core (Mauro Carvalho Chehab).
      
         - Use NULL to pass a null pointer rather than zero according to the
           function propotype in imx-bus.c (Colin Ian King).
      
         - Print error message instead of error interger value in
           tegra30-devfreq.c (Dmitry Osipenko).
      
         - Add checks to prevent setting negative frequency QoS limits for
           CPUs (Shivnandan Kumar).
      
         - Update the pm-graph suite of utilities to the latest revision 5.9
           including multiple improvements (Todd Brandt).
      
         - Drop pme_interrupt reference from the PCI power management
           documentation (Mario Limonciello)"
      
      * tag 'pm-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm: (27 commits)
        powercap: RAPL: Add Power Limit4 support for Alder Lake-N and Raptor Lake-P
        PM: QoS: Add check to make sure CPU freq is non-negative
        PM: hibernate: defer device probing when resuming from hibernation
        intel_idle: make SPR C1 and C1E be independent
        cpufreq: ondemand: Use cpumask_var_t for on-stack cpu mask
        cpufreq: loongson2: fix Kconfig "its" grammar
        pm-graph v5.9
        cpufreq: Warn users while freeing active policy
        cpufreq: scmi: Support the power scale in micro-Watts in SCMI v3.1
        firmware: arm_scmi: Get detailed power scale from perf
        Documentation: EM: Switch to micro-Watts scale
        PM: EM: convert power field to micro-Watts precision and align drivers
        PM / devfreq: tegra30: Add error message for devm_devfreq_add_device()
        PM / devfreq: imx-bus: use NULL to pass a null pointer rather than zero
        PM / devfreq: shut up kernel-doc warnings
        dt-bindings: interconnect: samsung,exynos-bus: convert to dtschema
        PM / devfreq: mediatek: Introduce MediaTek CCI devfreq driver
        dt-bindings: interconnect: Add MediaTek CCI dt-bindings
        PM: domains: Ensure genpd_debugfs_dir exists before remove
        PM: runtime: Extend support for wakeirq for force_suspend|resume
        ...
      a771ea64
    • Linus Torvalds's avatar
      Merge tag 'acpi-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm · 8fa0db3a
      Linus Torvalds authored
      Pull ACPI updates from Rafael Wysocki:
       "These rework the handling of ACPI device objects to use the driver
        core facilities for managing child ones instead of some questionable
        home-grown ways without the requisite locking and reference counting,
        clean up the EC driver, improve suspend-to-idle handling on x86, add
        some systems to the ACPI backlight quirk list, fix some assorted
        issues, clean up code and improve documentation.
      
        Specifics:
      
         - Use facilities provided by the driver core and some additional
           helpers to handle the children of a given ACPI device object in
           multiple places instead of using the children and node list heads
           in struct acpi_device which is error prone (Rafael Wysocki).
      
         - Fix ACPI-related device reference counting issue in the hisi_lpc
           bus driver (Yang Yingliang).
      
         - Drop the children and node list heads that are not needed any more
           from struct acpi_device (Rafael Wysocki).
      
         - Drop driver member from struct acpi_device (Uwe Kleine-König).
      
         - Drop redundant check from acpi_device_remove() (Uwe Kleine-König).
      
         - Prepare the CPPC library for handling backwards-compatible future
           _CPC return package formats gracefully (Rafael Wysocki).
      
         - Clean up the ACPI EC driver after previous changes in it (Hans de
           Goede).
      
         - Drop leftover acpi_processor_get_limit_info() declaration (Riwen
           Lu).
      
         - Split out thermal initialization from ACPI PSS (Riwen Lu).
      
         - Annotate more functions in the ACPI CPU idle driver to live in the
           cpuidle section (Guilherme G. Piccoli).
      
         - Fix _EINJ vs "special purpose" EFI memory regions (Dan Williams).
      
         - Implement a better fix to avoid spamming the console with old error
           logs (Tony Luck).
      
         - Fix typo in a comment in the APEI code (Xiang wangx).
      
         - Save NVS memory during transitions into S3 on Lenovo G40-45 (Manyi
           Li).
      
         - Add support for upcoming AMD uPEP device ID AMDI008 to the ACPI
           suspend-to-idle driver for x86 platforms (Shyam Sundar S K).
      
         - Clean up checks related to the ACPI_FADT_LOW_POWER_S0 platform flag
           in the LPIT table driver and the suspend-to-idle driver for x86
           platforms (Rafael Wysocki).
      
         - Print information messages regarding declared LPS0 idle support in
           the platform firmware (Rafael Wysocki).
      
         - Fix missing check in register_device_clock() in the ACPI driver for
           Intel SoCs (huhai).
      
         - Fix ACS setup in the VIOT table parser (Eric Auger).
      
         - Skip IRQ override on AMD Zen platforms where it's harmful
           (Chuanhong Guo).
      
         - Use native backlight on Dell Inspiron N4010 (Hans de Goede).
      
         - Use native backlight on some TongFang devices (Werner Sembach).
      
         - Drop X86 dependency from the ACPI backlight driver Kconfig (Riwen
           Lu).
      
         - Shorten the quirk list in the ACPI backlight driver by identifying
           Clevo by board_name only (Werner Sembach).
      
         - Remove useless NULL pointer checks from 2 ACPI PCI link management
           functions (Andrey Strachuk).
      
         - Fix obsolete example in the ACPI EINJ documentation (Qifu Zhang).
      
         - Update links and references to _DSD-related documents (Sudeep
           Holla)"
      
      * tag 'acpi-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm: (46 commits)
        ACPI/PCI: Remove useless NULL pointer checks
        ACPI: CPPC: Do not prevent CPPC from working in the future
        ACPI: PM: x86: Print messages regarding LPS0 idle support
        ACPI: resource: skip IRQ override on AMD Zen platforms
        Documentation: ACPI: EINJ: Fix obsolete example
        ACPI: video: Use native backlight on Dell Inspiron N4010
        ACPI: PM: s2idle: Use LPS0 idle if ACPI_FADT_LOW_POWER_S0 is unset
        Revert "ACPI / PM: LPIT: Register sysfs attributes based on FADT"
        ACPI: video: Shortening quirk list by identifying Clevo by board_name only
        ACPI: video: Force backlight native for some TongFang devices
        ACPI: PM: s2idle: Add support for upcoming AMD uPEP HID AMDI008
        ACPI: VIOT: Fix ACS setup
        ACPI: bus: Drop unused list heads from struct acpi_device
        hisi_lpc: Use acpi_dev_for_each_child()
        bus: hisi_lpc: fix missing platform_device_put() in hisi_lpc_acpi_probe()
        ACPI: bus: Drop driver member of struct acpi_device
        ACPI: bus: Drop redundant check in acpi_device_remove()
        ACPI: APEI: Fix _EINJ vs EFI_MEMORY_SP
        ACPI: LPSS: Fix missing check in register_device_clock()
        ACPI: APEI: Better fix to avoid spamming the console with old error logs
        ...
      8fa0db3a
    • Linus Torvalds's avatar
      Merge tag 'hwmon-for-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging · 64ae88ff
      Linus Torvalds authored
      Pull hwmon updates from Guenter Roeck:
      
       - Substantial rewrite of lm90 driver to support several additional
         chips and improve support for existing chips.
      
       - Add support of ROG ZENITH II EXTREME, Maximus XI Hero, and
         Strix Z690-a D4 to asus-ec-sensors driver
      
       - Add support of F71858AD to f71882fg driver
      
       - Add support of Aquacomputer Quadro to aquacomputer_d5next driver
      
       - Improved assembler code and add support for Dell G5 5590 as well as
         XPS 13 7390 in dell-smm driver
      
       - Add support for ASUS TUF GAMING B550-PLUS WIFI II to nct775 driver
      
       - Add support for IEEE 754 half precision to PMBus core. Also support
         for Analog Devices LT7182S, improve regulator support, and report
         various MFR register values in debugfs.
      
       - Various other minor improvements and fixes
      
      * tag 'hwmon-for-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging: (85 commits)
        hwmon: (aquacomputer_d5next) Add support for Aquacomputer Quadro fan controller
        hwmon: (dell-smm) Improve documentation
        hwmon: (nct6775) add ASUS TUF GAMING B550-PLUS WIFI II
        hwmon: (occ) Replace open-coded variant of %*phN specifier
        hwmon: (sht15) Fix wrong assumptions in device remove callback
        hwmon: (aquacomputer_d5next) Add support for reading the +12V voltage sensor on D5 Next
        hwmon: (tps23861) fix byte order in current and voltage registers
        hwmon: (aspeed-pwm-tacho) increase fan tach period (again)
        hwmon: (aquacomputer_d5next) Add D5 Next fan control support
        hwmon: (mcp3021) improve driver support for newer hwmon interface
        hwmon: (asus-ec-sensors) add definitions for ROG ZENITH II EXTREME
        hwmon: (aquacomputer_d5next) Move device-specific data into struct aqc_data
        hwmon: (asus-ec-sensors) add missing sensors for X570-I GAMING
        hwmon: (drivetemp) Add module alias
        hwmon: (asus_wmi_sensors) Save a few bytes of memory
        hwmon: (lm90) Use worker for alarm notifications
        hwmon: (asus-ec-sensors) add support for Maximus XI Hero
        hwmon: (dell-smm) Improve assembly code
        hwmon: (pmbus/ltc2978) Set voltage resolution
        hwmon: (pmbus) Add list_voltage to pmbus ops
        ...
      64ae88ff
    • Linus Torvalds's avatar
      Merge tag 'pwm/for-5.20-rc1' of... · 530c28df
      Linus Torvalds authored
      Merge tag 'pwm/for-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/thierry.reding/linux-pwm
      
      Pull pwm updates from Thierry Reding:
       "After v5.19 had all drivers converted to the new atomic API and nobody
        has reported any breakage, this set of changes starts by dropping the
        legacy support.
      
        Some existing drivers get improvements and broader chip support and a
        new driver is added that emulates a PWM controller using a clock
        output.
      
        Other than that there's the usual bits of cleanups and minor fixes"
      
      * tag 'pwm/for-5.20-rc1' of git://git.kernel.org/pub/scm/linux/kernel/git/thierry.reding/linux-pwm: (21 commits)
        pwm: lpc18xx: Fix period handling
        pwm: lpc18xx: Convert to use dev_err_probe()
        pwm: twl-led: Document some limitations and link to the reference manual
        MAINTAINERS: Remove myself as PWM maintainer
        MAINTAINERS: Add include/dt-bindings/pwm to PWM SUBSYSTEM
        dt-bindings: pwm: mediatek: Add compatible string for MT8195
        pwm: Add clock based PWM output driver
        dt-bindings: pwm: Document clk based PWM controller
        pwm: sifive: Shut down hardware only after pwmchip_remove() completed
        pwm: sifive: Ensure the clk is enabled exactly once per running PWM
        pwm: sifive: Simplify clk handling
        pwm: sifive: Enable clk only after period check in .apply()
        pwm: sifive: Reduce time the controller lock is held
        pwm: sifive: Fold pwm_sifive_enable() into its only caller
        pwm: sifive: Simplify offset calculation for PWMCMP registers
        pwm: mediatek: Add MT8365 support
        dt-bindings: pwm: Add MT8365 SoC binding
        pwm: Drop unused forward declaration from pwm.h
        pwm: Reorder header file to get rid of struct pwm_capture forward declaration
        pwm: atmel-tcb: Fix typo in comment
        ...
      530c28df
    • Linus Torvalds's avatar
      Merge tag 'spi-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/spi · 0805c6fb
      Linus Torvalds authored
      Pull spi updates from Mark Brown:
       "The big update this time around is some excellent work from David
        Jander who went through the fast path and really eliminated overheads,
        meaning that we are seeing a huge reduction in the time spent between
        transfers for single threaded clients.
      
        Benchmarking has been coming out at about a halving of overhead which
        is clearly visible in system level usage that stresses SPI like some
        CAN and IIO applications, especially with small transfers. Thanks to
        David for taking the time to drill down into this and push the work
        upstream.
      
        Otherwise there's been a bunch of new device support and the usual
        updates.
      
         - Optimisation of the fast path, particularly around the number and
           types of locking operations, from David Jander.
      
         - Support for Arbel NPCM845, HP GXP, Intel Meteor Lake and Thunder
           Bay, MediaTek MT8188 and MT8365, Microchip FPGAs, nVidia Tegra 241
           and Samsung Exynos Auto v9 and 4210"
      
      * tag 'spi-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/spi: (97 commits)
        MAINTAINERS: add spi support to GXP
        spi: dt-bindings: add documentation for hpe,gxp-spifi
        spi: spi-gxp: Add support for HPE GXP SoCs
        spi: a3700: support BE for AC5 SPI driver
        spi/panel: dt-bindings: drop CPHA and CPOL from common properties
        spi: bcm2835: enable shared interrupt support
        spi: dt-bindings: spi-controller: correct example indentation
        spi: dt-bindings: qcom,spi-geni-qcom: allow three interconnects
        spi: npcm-fiu: Add NPCM8XX support
        dt-binding: spi: Add npcm845 compatible to npcm-fiu document
        spi: npcm-fiu: Modify direct read dummy configuration
        spi: atmel: remove #ifdef CONFIG_{PM, SLEEP}
        spi: dt-bindings: Add compatible for MediaTek MT8188
        spi: dt-bindings: mediatek,spi-mtk-nor: Update bindings for nor flash
        spi: dt-bindings: atmel,at91rm9200-spi: convert to json-schema
        spi: tegra20-slink: fix UAF in tegra_slink_remove()
        spi: Fix simplification of devm_spi_register_controller
        spi: microchip-core: switch to use dev_err_probe()
        spi: microchip-core: switch to use devm_spi_alloc_master()
        spi: microchip-core: fix UAF in mchp_corespi_remove()
        ...
      0805c6fb
    • Linus Torvalds's avatar
      Merge tag 'regulator-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regulator · 416e05e5
      Linus Torvalds authored
      Pull regulator updates from Mark Brown:
       "This has been a fairly quiet release for the regulator API, a few new
        drivers and a small API update:
      
         - Support for specifying an initial load as part of requesting
           regulators through the bulk API
      
         - Support for Maxim MAX597x, Qualcomm PM8074, PM8909 and Realtek
           RT5120 devices"
      
      * tag 'regulator-v5.20' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regulator: (35 commits)
        regulator: core: Allow drivers to define their init data as const
        regulator: core: Allow specifying an initial load w/ the bulk API
        regulator: mt6380: Fix unused array warning
        regulator: Add missing type for 'regulator-microvolt-offset'
        regulator: core: Fix off-on-delay-us for always-on/boot-on regulators
        regulator: of: Fix refcount leak bug in of_get_regulation_constraints()
        regulator: pwm: Update Lee Jones' email address
        regulator: max597x: Don't return uninitialized variable in .probe
        regulator: qcom,spmi-regulator: add PMP8074 PMIC
        regulator: qcom,spmi-regulator: Convert to dtschema
        regulator: qcom_spmi: add support for PMP8074 regulators
        regulator: qcom_spmi: add support for HT_P600
        regulator: qcom_spmi: add support for HT_P150
        regulator: max597x: Remove unused including <linux/version.h>
        regulator: Fix MFD_MAX597X dependency
        regulator: Fix parameter declaration and spelling mistake.
        regulator: max597x: Add support for max597x regulator
        regulator: scmi: Add missing of_node_get()
        regulator: qcom_smd: Add PM8909 RPM regulators
        regulator: dt-bindings: qcom,smd-rpm: Add PM8909
        ...
      416e05e5