1. 30 Oct, 2023 14 commits
    • Oliver Upton's avatar
      Merge branch kvm-arm64/writable-id-regs into kvmarm/next · a87a3643
      Oliver Upton authored
      * kvm-arm64/writable-id-regs:
        : Writable ID registers, courtesy of Jing Zhang
        :
        : This series significantly expands the architectural feature set that
        : userspace can manipulate via the ID registers. A new ioctl is defined
        : that makes the mutable fields in the ID registers discoverable to
        : userspace.
        KVM: selftests: Avoid using forced target for generating arm64 headers
        tools headers arm64: Fix references to top srcdir in Makefile
        KVM: arm64: selftests: Test for setting ID register from usersapce
        tools headers arm64: Update sysreg.h with kernel sources
        KVM: selftests: Generate sysreg-defs.h and add to include path
        perf build: Generate arm64's sysreg-defs.h and add to include path
        tools: arm64: Add a Makefile for generating sysreg-defs.h
        KVM: arm64: Document vCPU feature selection UAPIs
        KVM: arm64: Allow userspace to change ID_AA64ZFR0_EL1
        KVM: arm64: Allow userspace to change ID_AA64PFR0_EL1
        KVM: arm64: Allow userspace to change ID_AA64MMFR{0-2}_EL1
        KVM: arm64: Allow userspace to change ID_AA64ISAR{0-2}_EL1
        KVM: arm64: Bump up the default KVM sanitised debug version to v8p8
        KVM: arm64: Reject attempts to set invalid debug arch version
        KVM: arm64: Advertise selected DebugVer in DBGDIDR.Version
        KVM: arm64: Use guest ID register values for the sake of emulation
        KVM: arm64: Document KVM_ARM_GET_REG_WRITABLE_MASKS
        KVM: arm64: Allow userspace to get the writable masks for feature ID registers
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      a87a3643
    • Oliver Upton's avatar
      KVM: selftests: Avoid using forced target for generating arm64 headers · 70c7b704
      Oliver Upton authored
      The 'prepare' target that generates the arm64 sysreg headers had no
      prerequisites, so it wound up forcing a rebuild of all KVM selftests
      each invocation. Add a rule for the generated headers and just have
      dependents use that for a prerequisite.
      Reported-by: default avatarNina Schoetterl-Glausch <nsg@linux.ibm.com>
      Fixes: 9697d84c ("KVM: selftests: Generate sysreg-defs.h and add to include path")
      Tested-by: default avatarNina Schoetterl-Glausch <nsg@linux.ibm.com>
      Link: https://lore.kernel.org/r/20231027005439.3142015-3-oliver.upton@linux.devSigned-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      70c7b704
    • Oliver Upton's avatar
      tools headers arm64: Fix references to top srcdir in Makefile · fbb075c1
      Oliver Upton authored
      Aishwarya reports that KVM selftests for arm64 fail with the following
      error:
      
       | make[4]: Entering directory '/tmp/kci/linux/tools/testing/selftests/kvm'
       | Makefile:270: warning: overriding recipe for target
       | '/tmp/kci/linux/build/kselftest/kvm/get-reg-list'
       | Makefile:265: warning: ignoring old recipe for target
       | '/tmp/kci/linux/build/kselftest/kvm/get-reg-list'
       | make -C ../../../../tools/arch/arm64/tools/
       | make[5]: Entering directory '/tmp/kci/linux/tools/arch/arm64/tools'
       | Makefile:10: ../tools/scripts/Makefile.include: No such file or directory
       | make[5]: *** No rule to make target '../tools/scripts/Makefile.include'.
       |  Stop.
      
      It would appear that this only affects builds from the top-level
      Makefile (e.g. make kselftest-all), as $(srctree) is set to ".". Work
      around the issue by shadowing the kselftest naming scheme for the source
      tree variable.
      Reported-by: default avatarAishwarya TCV <aishwarya.tcv@arm.com>
      Fixes: 0359c946 ("tools headers arm64: Update sysreg.h with kernel sources")
      Link: https://lore.kernel.org/r/20231027005439.3142015-2-oliver.upton@linux.devSigned-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      fbb075c1
    • Oliver Upton's avatar
      Merge branch kvm-arm64/sgi-injection into kvmarm/next · 54b44ad2
      Oliver Upton authored
      * kvm-arm64/sgi-injection:
        : vSGI injection improvements + fixes, courtesy Marc Zyngier
        :
        : Avoid linearly searching for vSGI targets using a compressed MPIDR to
        : index a cache. While at it, fix some egregious bugs in KVM's mishandling
        : of vcpuid (user-controlled value) and vcpu_idx.
        KVM: arm64: Clarify the ordering requirements for vcpu/RD creation
        KVM: arm64: vgic-v3: Optimize affinity-based SGI injection
        KVM: arm64: Fast-track kvm_mpidr_to_vcpu() when mpidr_data is available
        KVM: arm64: Build MPIDR to vcpu index cache at runtime
        KVM: arm64: Simplify kvm_vcpu_get_mpidr_aff()
        KVM: arm64: Use vcpu_idx for invalidation tracking
        KVM: arm64: vgic: Use vcpu_idx for the debug information
        KVM: arm64: vgic-v2: Use cpuid from userspace as vcpu_id
        KVM: arm64: vgic-v3: Refactor GICv3 SGI generation
        KVM: arm64: vgic-its: Treat the collection target address as a vcpu_id
        KVM: arm64: vgic: Make kvm_vgic_inject_irq() take a vcpu pointer
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      54b44ad2
    • Oliver Upton's avatar
      Merge branch kvm-arm64/stage2-vhe-load into kvmarm/next · df26b779
      Oliver Upton authored
      * kvm-arm64/stage2-vhe-load:
        : Setup stage-2 MMU from vcpu_load() for VHE
        :
        : Unlike nVHE, there is no need to switch the stage-2 MMU around on guest
        : entry/exit in VHE mode as the host is running at EL2. Despite this KVM
        : reloads the stage-2 on every guest entry, which is needless.
        :
        : This series moves the setup of the stage-2 MMU context to vcpu_load()
        : when running in VHE mode. This is likely to be a win across the board,
        : but also allows us to remove an ISB on the guest entry path for systems
        : with one of the speculative AT errata.
        KVM: arm64: Move VTCR_EL2 into struct s2_mmu
        KVM: arm64: Load the stage-2 MMU context in kvm_vcpu_load_vhe()
        KVM: arm64: Rename helpers for VHE vCPU load/put
        KVM: arm64: Reload stage-2 for VMID change on VHE
        KVM: arm64: Restore the stage-2 context in VHE's __tlb_switch_to_host()
        KVM: arm64: Don't zero VTTBR in __tlb_switch_to_host()
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      df26b779
    • Oliver Upton's avatar
      Merge branch kvm-arm64/nv-trap-fixes into kvmarm/next · 51e60796
      Oliver Upton authored
      * kvm-arm64/nv-trap-fixes:
        : NV trap forwarding fixes, courtesy Miguel Luis and Marc Zyngier
        :
        :  - Explicitly define the effects of HCR_EL2.NV on EL2 sysregs in the
        :    NV trap encoding
        :
        :  - Make EL2 registers that access AArch32 guest state UNDEF or RAZ/WI
        :    where appropriate for NV guests
        KVM: arm64: Handle AArch32 SPSR_{irq,abt,und,fiq} as RAZ/WI
        KVM: arm64: Do not let a L1 hypervisor access the *32_EL2 sysregs
        KVM: arm64: Refine _EL2 system register list that require trap reinjection
        arm64: Add missing _EL2 encodings
        arm64: Add missing _EL12 encodings
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      51e60796
    • Oliver Upton's avatar
      Merge branch kvm-arm64/smccc-filter-cleanups into kvmarm/next · 25a35c1a
      Oliver Upton authored
      * kvm-arm64/smccc-filter-cleanups:
        : Cleanup the management of KVM's SMCCC maple tree
        :
        : Avoid the cost of maintaining the SMCCC filter maple tree if userspace
        : hasn't writen a rule to the filter. While at it, rip out the now
        : unnecessary VM flag to indicate whether or not the SMCCC filter was
        : configured.
        KVM: arm64: Use mtree_empty() to determine if SMCCC filter configured
        KVM: arm64: Only insert reserved ranges when SMCCC filter is used
        KVM: arm64: Add a predicate for testing if SMCCC filter is configured
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      25a35c1a
    • Oliver Upton's avatar
      Merge branch kvm-arm64/pmevtyper-filter into kvmarm/next · 7ff7dfe9
      Oliver Upton authored
      * kvm-arm64/pmevtyper-filter:
        : Fixes to KVM's handling of the PMUv3 exception level filtering bits
        :
        :  - NSH (count at EL2) and M (count at EL3) should be stateful when the
        :    respective EL is advertised in the ID registers but have no effect on
        :    event counting.
        :
        :  - NSU and NSK modify the event filtering of EL0 and EL1, respectively.
        :    Though the kernel may not use these bits, other KVM guests might.
        :    Implement these bits exactly as written in the pseudocode if EL3 is
        :    advertised.
        KVM: arm64: Add PMU event filter bits required if EL3 is implemented
        KVM: arm64: Make PMEVTYPER<n>_EL0.NSH RES0 if EL2 isn't advertised
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      7ff7dfe9
    • Oliver Upton's avatar
      Merge branch kvm-arm64/feature-flag-refactor into kvmarm/next · d47dcb67
      Oliver Upton authored
      * kvm-arm64/feature-flag-refactor:
        : vCPU feature flag cleanup
        :
        : Clean up KVM's handling of vCPU feature flags to get rid of the
        : vCPU-scoped bitmaps and remove failure paths from kvm_reset_vcpu().
        KVM: arm64: Get rid of vCPU-scoped feature bitmap
        KVM: arm64: Remove unused return value from kvm_reset_vcpu()
        KVM: arm64: Hoist NV+SVE check into KVM_ARM_VCPU_INIT ioctl handler
        KVM: arm64: Prevent NV feature flag on systems w/o nested virt
        KVM: arm64: Hoist PAuth checks into KVM_ARM_VCPU_INIT ioctl
        KVM: arm64: Hoist SVE check into KVM_ARM_VCPU_INIT ioctl handler
        KVM: arm64: Hoist PMUv3 check into KVM_ARM_VCPU_INIT ioctl handler
        KVM: arm64: Add generic check for system-supported vCPU features
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      d47dcb67
    • Oliver Upton's avatar
      Merge branch kvm-arm64/misc into kvmarm/next · 054056bf
      Oliver Upton authored
      * kvm-arm64/misc:
        : Miscellaneous updates
        :
        :  - Put an upper bound on the number of I-cache invalidations by
        :    cacheline to avoid soft lockups
        :
        :  - Get rid of bogus refererence count transfer for THP mappings
        :
        :  - Do a local TLB invalidation on permission fault race
        :
        :  - Fixes for page_fault_test KVM selftest
        :
        :  - Add a tracepoint for detecting MMIO instructions unsupported by KVM
        KVM: arm64: Add tracepoint for MMIO accesses where ISV==0
        KVM: arm64: selftest: Perform ISB before reading PAR_EL1
        KVM: arm64: selftest: Add the missing .guest_prepare()
        KVM: arm64: Always invalidate TLB for stage-2 permission faults
        KVM: arm64: Do not transfer page refcount for THP adjustment
        KVM: arm64: Avoid soft lockups due to I-cache maintenance
        arm64: tlbflush: Rename MAX_TLBI_OPS
        KVM: arm64: Don't use kerneldoc comment for arm64_check_features()
      Signed-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      054056bf
    • Oliver Upton's avatar
      KVM: arm64: Add tracepoint for MMIO accesses where ISV==0 · d11974dc
      Oliver Upton authored
      It is a pretty well known fact that KVM does not support MMIO emulation
      without valid instruction syndrome information (ESR_EL2.ISV == 0). The
      current kvm_pr_unimpl() is pretty useless, as it contains zero context
      to relate the event to a vCPU.
      
      Replace it with a precise tracepoint that dumps the relevant context
      so the user can make sense of what the guest is doing.
      Acked-by: default avatarZenghui Yu <yuzenghui@huawei.com>
      Acked-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20231026205306.3045075-1-oliver.upton@linux.devSigned-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      d11974dc
    • Zenghui Yu's avatar
      KVM: arm64: selftest: Perform ISB before reading PAR_EL1 · 06899aa5
      Zenghui Yu authored
      It looks like a mistake to issue ISB *after* reading PAR_EL1, we should
      instead perform it between the AT instruction and the reads of PAR_EL1.
      
      As according to DDI0487J.a IJTYVP,
      
      "When an address translation instruction is executed, explicit
       synchronization is required to guarantee the result is visible to
       subsequent direct reads of PAR_EL1."
      
      Otherwise all guest_at testcases fail on my box with
      
      ==== Test Assertion Failure ====
        aarch64/page_fault_test.c:142: par & 1 == 0
        pid=1355864 tid=1355864 errno=4 - Interrupted system call
           1	0x0000000000402853: vcpu_run_loop at page_fault_test.c:681
           2	0x0000000000402cdb: run_test at page_fault_test.c:730
           3	0x0000000000403897: for_each_guest_mode at guest_modes.c:100
           4	0x00000000004019f3: for_each_test_and_guest_mode at page_fault_test.c:1105
           5	 (inlined by) main at page_fault_test.c:1131
           6	0x0000ffffb153c03b: ?? ??:0
           7	0x0000ffffb153c113: ?? ??:0
           8	0x0000000000401aaf: _start at ??:?
        0x1 != 0x0 (par & 1 != 0)
      Signed-off-by: default avatarZenghui Yu <yuzenghui@huawei.com>
      Acked-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20231007124043.626-2-yuzenghui@huawei.comSigned-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      06899aa5
    • Zenghui Yu's avatar
      KVM: arm64: selftest: Add the missing .guest_prepare() · beaf35b4
      Zenghui Yu authored
      Running page_fault_test on a Cortex A72 fails with
      
      Test: ro_memslot_no_syndrome_guest_cas
      Testing guest mode: PA-bits:40,  VA-bits:48,  4K pages
      Testing memory backing src type: anonymous
      ==== Test Assertion Failure ====
        aarch64/page_fault_test.c:117: guest_check_lse()
        pid=1944087 tid=1944087 errno=4 - Interrupted system call
           1	0x00000000004028b3: vcpu_run_loop at page_fault_test.c:682
           2	0x0000000000402d93: run_test at page_fault_test.c:731
           3	0x0000000000403957: for_each_guest_mode at guest_modes.c:100
           4	0x00000000004019f3: for_each_test_and_guest_mode at page_fault_test.c:1108
           5	 (inlined by) main at page_fault_test.c:1134
           6	0x0000ffff868e503b: ?? ??:0
           7	0x0000ffff868e5113: ?? ??:0
           8	0x0000000000401aaf: _start at ??:?
        guest_check_lse()
      
      because we don't have a guest_prepare stage to check the presence of
      FEAT_LSE and skip the related guest_cas testing, and we end-up failing in
      GUEST_ASSERT(guest_check_lse()).
      
      Add the missing .guest_prepare() where it's indeed required.
      Signed-off-by: default avatarZenghui Yu <yuzenghui@huawei.com>
      Acked-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20231007124043.626-1-yuzenghui@huawei.comSigned-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      beaf35b4
    • Oliver Upton's avatar
      KVM: arm64: Always invalidate TLB for stage-2 permission faults · be097997
      Oliver Upton authored
      It is possible for multiple vCPUs to fault on the same IPA and attempt
      to resolve the fault. One of the page table walks will actually update
      the PTE and the rest will return -EAGAIN per our race detection scheme.
      KVM elides the TLB invalidation on the racing threads as the return
      value is nonzero.
      
      Before commit a12ab137 ("KVM: arm64: Use local TLBI on permission
      relaxation") KVM always used broadcast TLB invalidations when handling
      permission faults, which had the convenient property of making the
      stage-2 updates visible to all CPUs in the system. However now we do a
      local invalidation, and TLBI elision leads to the vCPU thread faulting
      again on the stale entry. Remember that the architecture permits the TLB
      to cache translations that precipitate a permission fault.
      
      Invalidate the TLB entry responsible for the permission fault if the
      stage-2 descriptor has been relaxed, regardless of which thread actually
      did the job.
      Acked-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20230922223229.1608155-1-oliver.upton@linux.devSigned-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      be097997
  2. 25 Oct, 2023 5 commits
  3. 24 Oct, 2023 2 commits
  4. 23 Oct, 2023 1 commit
    • Marc Zyngier's avatar
      KVM: arm64: Move VTCR_EL2 into struct s2_mmu · fe49fd94
      Marc Zyngier authored
      We currently have a global VTCR_EL2 value for each guest, even
      if the guest uses NV. This implies that the guest's own S2 must
      fit in the host's. This is odd, for multiple reasons:
      
      - the PARange values and the number of IPA bits don't necessarily
        match: you can have 33 bits of IPA space, and yet you can only
        describe 32 or 36 bits of PARange
      
      - When userspace set the IPA space, it creates a contract with the
        kernel saying "this is the IPA space I'm prepared to handle".
        At no point does it constraint the guest's own IPA space as
        long as the guest doesn't try to use a [I]PA outside of the
        IPA space set by userspace
      
      - We don't even try to hide the value of ID_AA64MMFR0_EL1.PARange.
      
      And then there is the consequence of the above: if a guest tries
      to create a S2 that has for input address something that is larger
      than the IPA space defined by the host, we inject a fatal exception.
      
      This is no good. For all intent and purposes, a guest should be
      able to have the S2 it really wants, as long as the *output* address
      of that S2 isn't outside of the IPA space.
      
      For that, we need to have a per-s2_mmu VTCR_EL2 setting, which
      allows us to represent the full PARange. Move the vctr field into
      the s2_mmu structure, which has no impact whatsoever, except for NV.
      
      Note that once we are able to override ID_AA64MMFR0_EL1.PARange
      from userspace, we'll also be able to restrict the size of the
      shadow S2 that NV uses.
      Signed-off-by: default avatarMarc Zyngier <maz@kernel.org>
      Link: https://lore.kernel.org/r/20231012205108.3937270-1-maz@kernel.orgSigned-off-by: default avatarOliver Upton <oliver.upton@linux.dev>
      fe49fd94
  5. 20 Oct, 2023 5 commits
  6. 18 Oct, 2023 5 commits
  7. 05 Oct, 2023 3 commits
  8. 04 Oct, 2023 5 commits