• Ryan Roberts's avatar
    mm: thp: introduce multi-size THP sysfs interface · 3485b883
    Ryan Roberts authored
    In preparation for adding support for anonymous multi-size THP, introduce
    new sysfs structure that will be used to control the new behaviours.  A
    new directory is added under transparent_hugepage for each supported THP
    size, and contains an `enabled` file, which can be set to "inherit" (to
    inherit the global setting), "always", "madvise" or "never".  For now, the
    kernel still only supports PMD-sized anonymous THP, so only 1 directory is
    populated.
    
    The first half of the change converts transhuge_vma_suitable() and
    hugepage_vma_check() so that they take a bitfield of orders for which the
    user wants to determine support, and the functions filter out all the
    orders that can't be supported, given the current sysfs configuration and
    the VMA dimensions.  The resulting functions are renamed to
    thp_vma_suitable_orders() and thp_vma_allowable_orders() respectively. 
    Convenience functions that take a single, unencoded order and return a
    boolean are also defined as thp_vma_suitable_order() and
    thp_vma_allowable_order().
    
    The second half of the change implements the new sysfs interface.  It has
    been done so that each supported THP size has a `struct thpsize`, which
    describes the relevant metadata and is itself a kobject.  This is pretty
    minimal for now, but should make it easy to add new per-thpsize files to
    the interface if needed in future (e.g.  per-size defrag).  Rather than
    keep the `enabled` state directly in the struct thpsize, I've elected to
    directly encode it into huge_anon_orders_[always|madvise|inherit]
    bitfields since this reduces the amount of work required in
    thp_vma_allowable_orders() which is called for every page fault.
    
    See Documentation/admin-guide/mm/transhuge.rst, as modified by this
    commit, for details of how the new sysfs interface works.
    
    [ryan.roberts@arm.com: fix build warning when CONFIG_SYSFS is disabled]
      Link: https://lkml.kernel.org/r/20231211125320.3997543-1-ryan.roberts@arm.com
    Link: https://lkml.kernel.org/r/20231207161211.2374093-4-ryan.roberts@arm.comSigned-off-by: default avatarRyan Roberts <ryan.roberts@arm.com>
    Reviewed-by: default avatarBarry Song <v-songbaohua@oppo.com>
    Tested-by: default avatarKefeng Wang <wangkefeng.wang@huawei.com>
    Tested-by: default avatarJohn Hubbard <jhubbard@nvidia.com>
    Acked-by: default avatarDavid Hildenbrand <david@redhat.com>
    Cc: Alistair Popple <apopple@nvidia.com>
    Cc: Anshuman Khandual <anshuman.khandual@arm.com>
    Cc: Catalin Marinas <catalin.marinas@arm.com>
    Cc: David Rientjes <rientjes@google.com>
    Cc: "Huang, Ying" <ying.huang@intel.com>
    Cc: Hugh Dickins <hughd@google.com>
    Cc: Itaru Kitayama <itaru.kitayama@gmail.com>
    Cc: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
    Cc: Luis Chamberlain <mcgrof@kernel.org>
    Cc: Matthew Wilcox (Oracle) <willy@infradead.org>
    Cc: Vlastimil Babka <vbabka@suse.cz>
    Cc: Yang Shi <shy828301@gmail.com>
    Cc: Yin Fengwei <fengwei.yin@intel.com>
    Cc: Yu Zhao <yuzhao@google.com>
    Cc: Zi Yan <ziy@nvidia.com>
    Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
    3485b883
task_mmu.c 67.5 KB