• Ryan Roberts's avatar
    arm64: mm: Don't remap pgtables for allocate vs populate · 0e9df1c9
    Ryan Roberts authored
    During linear map pgtable creation, each pgtable is fixmapped /
    fixunmapped twice; once during allocation to zero the memory, and a
    again during population to write the entries. This means each table has
    2 TLB invalidations issued against it. Let's fix this so that each table
    is only fixmapped/fixunmapped once, halving the number of TLBIs, and
    improving performance.
    
    Achieve this by separating allocation and initialization (zeroing) of
    the page. The allocated page is now fixmapped directly by the walker and
    initialized, before being populated and finally fixunmapped.
    
    This approach keeps the change small, but has the side effect that late
    allocations (using __get_free_page()) must also go through the generic
    memory clearing routine. So let's tell __get_free_page() not to zero the
    memory to avoid duplication.
    
    Additionally this approach means that fixmap/fixunmap is still used for
    late pgtable modifications. That's not technically needed since the
    memory is all mapped in the linear map by that point. That's left as a
    possible future optimization if found to be needed.
    
    Execution time of map_mem(), which creates the kernel linear map page
    tables, was measured on different machines with different RAM configs:
    
                   | Apple M2 VM | Ampere Altra| Ampere Altra| Ampere Altra
                   | VM, 16G     | VM, 64G     | VM, 256G    | Metal, 512G
    ---------------|-------------|-------------|-------------|-------------
                   |   ms    (%) |   ms    (%) |   ms    (%) |    ms    (%)
    ---------------|-------------|-------------|-------------|-------------
    before         |   11   (0%) |  161   (0%) |  656   (0%) |  1654   (0%)
    after          |   10 (-11%) |  104 (-35%) |  438 (-33%) |  1223 (-26%)
    Signed-off-by: default avatarRyan Roberts <ryan.roberts@arm.com>
    Suggested-by: default avatarMark Rutland <mark.rutland@arm.com>
    Tested-by: default avatarItaru Kitayama <itaru.kitayama@fujitsu.com>
    Tested-by: default avatarEric Chanudet <echanude@redhat.com>
    Reviewed-by: default avatarMark Rutland <mark.rutland@arm.com>
    Reviewed-by: default avatarArd Biesheuvel <ardb@kernel.org>
    Link: https://lore.kernel.org/r/20240412131908.433043-4-ryan.roberts@arm.comSigned-off-by: default avatarWill Deacon <will@kernel.org>
    0e9df1c9
mmu.c 40.5 KB