Commits · f4f510508741680e423524c222f615276ca6222c · nexedi / linux

24 Oct, 2010 40 commits

KVM: Convert PIC lock from raw spinlock to ordinary spinlock · f4f51050

Avi Kivity authored Sep 19, 2010

The PIC code used to be called from preempt_disable() context, which
wasn't very good for PREEMPT_RT.  That is no longer the case, so move
back from raw_spinlock_t to spinlock_t.
Signed-off-by: Avi Kivity <avi@redhat.com>
Acked-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

f4f51050

KVM: x86: Fix kvmclock bug · 28e4639a

Zachary Amsden authored Sep 18, 2010

If preempted after kvmclock values are updated, but before hardware
virtualization is entered, the last tsc time as read by the guest is
never set.  It underflows the next time kvmclock is updated if there
has not yet been a successful entry / exit into hardware virt.

Fix this by simply setting last_tsc to the newly read tsc value so
that any computed nsec advance of kvmclock is nulled.
Signed-off-by: Zachary Amsden <zamsden@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

28e4639a

KVM: MMU: Don't track nested fault info in error-code · 0959ffac

Joerg Roedel authored Sep 14, 2010

This patch moves the detection whether a page-fault was
nested or not out of the error code and moves it into a
separate variable in the fault struct.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

0959ffac

KVM: VMX: Move fixup_rmode_irq() to avoid forward declaration · 625831a3
Avi Kivity authored Jul 22, 2010
```
No code changes.
Signed-off-by: Avi Kivity <avi@redhat.com>
```
625831a3

KVM: Non-atomic interrupt injection · b463a6f7

Avi Kivity authored Jul 20, 2010

Change the interrupt injection code to work from preemptible, interrupts
enabled context.  This works by adding a ->cancel_injection() operation
that undoes an injection in case we were not able to actually enter the guest
(this condition could never happen with atomic injection).
Signed-off-by: Avi Kivity <avi@redhat.com>

b463a6f7

KVM: VMX: Parameterize vmx_complete_interrupts() for both exit and entry · 83422e17

Avi Kivity authored Jul 20, 2010

Currently vmx_complete_interrupts() can decode event information from vmx
exit fields into the generic kvm event queues. Make it able to decode
the information from the entry fields as well by parametrizing it.
Signed-off-by: Avi Kivity <avi@redhat.com>

83422e17

KVM: VMX: Move real-mode interrupt injection fixup to vmx_complete_interrupts() · 537b37e2
Avi Kivity authored Jul 22, 2010
```
This allows reuse of vmx_complete_interrupts() for cancelling injections.
Signed-off-by: Avi Kivity <avi@redhat.com>
```
537b37e2

KVM: VMX: Split up vmx_complete_interrupts() · 51aa01d1

Avi Kivity authored Jul 20, 2010

vmx_complete_interrupts() does too much, split it up:
 - vmx_vcpu_run() gets the "cache important vmcs fields" part
 - a new vmx_complete_atomic_exit() gets the parts that must be done atomically
 - a new vmx_recover_nmi_blocking() does what its name says
 - vmx_complete_interrupts() retains the event injection recovery code

This helps in reducing the work done in atomic context.
Signed-off-by: Avi Kivity <avi@redhat.com>

51aa01d1

KVM: Check for pending events before attempting injection · 3842d135

Avi Kivity authored Jul 27, 2010

Instead of blindly attempting to inject an event before each guest entry,
check for a possible event first in vcpu->requests.  Sites that can trigger
event injection are modified to set KVM_REQ_EVENT:

- interrupt, nmi window opening
- ppr updates
- i8259 output changes
- local apic irr changes
- rflags updates
- gif flag set
- event set on exit

This improves non-injecting entry performance, and sets the stage for
non-atomic injection.
Signed-off-by: Avi Kivity <avi@redhat.com>

3842d135

KVM: MMU: Fix regression with ept memory types merged into non-ept page tables · b0bc3ee2

Avi Kivity authored Sep 13, 2010

Commit "KVM: MMU: Make tdp_enabled a mmu-context parameter" made real-mode
set ->direct_map, and changed the code that merges in the memory type depend
on direct_map instead of tdp_enabled. However, in this case what really
matters is tdp, not direct_map, since tdp changes the pte format regardless
of whether the mapping is direct or not.

As a result, real-mode shadow mappings got corrupted with ept memory types.
The result was a huge slowdown, likely due to the cache being disabled.

Change it back as the simplest fix for the regression (real fix is to move
all that to vmx code, and not use tdp_enabled as a synonym for ept).
Signed-off-by: Avi Kivity <avi@redhat.com>

b0bc3ee2

KVM: Document that KVM_GET_SUPPORTED_CPUID may return emulated values · c39cbd2a
Avi Kivity authored Sep 12, 2010
```
Signed-off-by: Avi Kivity <avi@redhat.com>
```
c39cbd2a

KVM: X86: Report SVM bit to userspace only when supported · 4c62a2dc

Joerg Roedel authored Sep 10, 2010

This patch fixes a bug in KVM where it _always_ reports the
support of the SVM feature to userspace. But KVM only
supports SVM on AMD hardware and only when it is enabled in
the kernel module. This patch fixes the wrong reporting.

Cc: stable@kernel.org
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

4c62a2dc

KVM: SVM: Report Nested Paging support to userspace · 3d4aeaad

Joerg Roedel authored Sep 10, 2010

This patch implements the reporting of the nested paging
feature support to userspace.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

3d4aeaad

KVM: SVM: Expect two more candiates for exit_int_info · 55c5e464

Joerg Roedel authored Sep 10, 2010

This patch adds INTR and NMI intercepts to the list of
expected intercepts with an exit_int_info set. While this
can't happen on bare metal it is architectural legal and may
happen with KVMs SVM emulation.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

55c5e464

KVM: SVM: Initialize Nested Nested MMU context on VMRUN · 4b16184c

Joerg Roedel authored Sep 10, 2010

This patch adds code to initialize the Nested Nested Paging
MMU context when the L1 guest executes a VMRUN instruction
and has nested paging enabled in its VMCB.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

4b16184c

KVM: SVM: Implement MMU helper functions for Nested Nested Paging · 5bd2edc3

Joerg Roedel authored Sep 10, 2010

This patch adds the helper functions which will be used in
the mmu context for handling nested nested page faults.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

5bd2edc3

KVM: MMU: Track NX state in struct kvm_mmu · 2d48a985

Joerg Roedel authored Sep 10, 2010

With Nested Paging emulation the NX state between the two
MMU contexts may differ. To make sure that always the right
fault error code is recorded this patch moves the NX state
into struct kvm_mmu so that the code can distinguish between
L1 and L2 NX state.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

2d48a985

KVM: MMU: Allow long mode shadows for legacy page tables · 81407ca5

Joerg Roedel authored Sep 10, 2010

Currently the KVM softmmu implementation can not shadow a 32
bit legacy or PAE page table with a long mode page table.
This is a required feature for nested paging emulation
because the nested page table must alway be in host format.
So this patch implements the missing pieces to allow long
mode page tables for page table types.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

81407ca5

KVM: MMU: Refactor mmu_alloc_roots function · 651dd37a

Joerg Roedel authored Sep 10, 2010

This patch factors out the direct-mapping paths of the
mmu_alloc_roots function into a seperate function. This
makes it a lot easier to avoid all the unnecessary checks
done in the shadow path which may break when running direct.
In fact, this patch already fixes a problem when running PAE
guests on a PAE shadow page table.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

651dd37a

KVM: MMU: Introduce kvm_pdptr_read_mmu · d41d1895

Joerg Roedel authored Sep 10, 2010

This function is implemented to load the pdptr pointers of
the currently running guest (l1 or l2 guest). Therefore it
takes care about the current paging mode and can read pdptrs
out of l2 guest physical memory.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

d41d1895

KVM: MMU: Add kvm_mmu parameter to load_pdptrs function · ff03a073

Joerg Roedel authored Sep 10, 2010

This function need to be able to load the pdptrs from any
mmu context currently in use. So change this function to
take an kvm_mmu parameter to fit these needs.
As a side effect this patch also moves the cached pdptrs
from vcpu_arch into the kvm_mmu struct.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

ff03a073

KVM: X86: Propagate fetch faults · d47f00a6

Joerg Roedel authored Sep 10, 2010

KVM currently ignores fetch faults in the instruction
emulator. With nested-npt we could have such faults. This
patch adds the code to handle these.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

d47f00a6

KVM: MMU: Propagate the right fault back to the guest after gva_to_gpa · d4f8cf66

Joerg Roedel authored Sep 10, 2010

This patch implements logic to make sure that either a
page-fault/page-fault-vmexit or a nested-page-fault-vmexit
is propagated back to the guest.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

d4f8cf66

KVM: MMU: Introduce init_kvm_nested_mmu() · 02f59dc9

Joerg Roedel authored Sep 10, 2010

This patch introduces the init_kvm_nested_mmu() function
which is used to re-initialize the nested mmu when the l2
guest changes its paging mode.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

02f59dc9

KVM: MMU: Introduce kvm_read_nested_guest_page() · 3d06b8bf

Joerg Roedel authored Sep 10, 2010

This patch introduces the kvm_read_guest_page_x86 function
which reads from the physical memory of the guest. If the
guest is running in guest-mode itself with nested paging
enabled it will read from the guest's guest physical memory
instead.
The patch also changes changes the code to use this function
where it is necessary.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

3d06b8bf

KVM: MMU: Make walk_addr_generic capable for two-level walking · 2329d46d

Joerg Roedel authored Sep 10, 2010

This patch uses kvm_read_guest_page_tdp to make the
walk_addr_generic functions suitable for two-level page
table walking.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

2329d46d

KVM: X86: Add kvm_read_guest_page_mmu function · ec92fe44

Joerg Roedel authored Sep 10, 2010

This patch adds a function which can read from the guests
physical memory or from the guest's guest physical memory.
This will be used in the two-dimensional page table walker.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

ec92fe44

KVM: MMU: Implement nested gva_to_gpa functions · 6539e738

Joerg Roedel authored Sep 10, 2010

This patch adds the functions to do a nested l2_gva to
l1_gpa page table walk.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

6539e738

KVM: X86: Introduce pointer to mmu context used for gva_to_gpa · 14dfe855

Joerg Roedel authored Sep 10, 2010

This patch introduces the walk_mmu pointer which points to
the mmu-context currently used for gva_to_gpa translations.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

14dfe855

KVM: MMU: Add infrastructure for two-level page walker · c30a358d

Joerg Roedel authored Sep 10, 2010

This patch introduces a mmu-callback to translate gpa
addresses in the walk_addr code. This is later used to
translate l2_gpa addresses into l1_gpa addresses.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

c30a358d

KVM: MMU: Introduce generic walk_addr function · 1e301feb

Joerg Roedel authored Sep 10, 2010

This is the first patch in the series towards a generic
walk_addr implementation which could walk two-dimensional
page tables in the end. In this first step the walk_addr
function is renamed into walk_addr_generic which takes a
mmu context as an additional parameter.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

1e301feb

KVM: MMU: Track page fault data in struct vcpu · 8df25a32

Joerg Roedel authored Sep 10, 2010

This patch introduces a struct with two new fields in
vcpu_arch for x86:

	* fault.address
	* fault.error_code

This will be used to correctly propagate page faults back
into the guest when we could have either an ordinary page
fault or a nested page fault. In the case of a nested page
fault the fault-address is different from the original
address that should be walked. So we need to keep track
about the real fault-address.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

8df25a32

KVM: MMU: Let is_rsvd_bits_set take mmu context instead of vcpu · 3241f22d

Joerg Roedel authored Sep 10, 2010

This patch changes is_rsvd_bits_set() function prototype to
take only a kvm_mmu context instead of a full vcpu.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

3241f22d

KVM: MMU: Introduce kvm_init_shadow_mmu helper function · 52fde8df

Joerg Roedel authored Sep 10, 2010

Some logic of the init_kvm_softmmu function is required to
build the Nested Nested Paging context. So factor the
required logic into a seperate function and export it.
Also make the whole init path suitable for more than one mmu
context.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

52fde8df

KVM: MMU: Introduce inject_page_fault function pointer · cb659db8

Joerg Roedel authored Sep 10, 2010

This patch introduces an inject_page_fault function pointer
into struct kvm_mmu which will be used to inject a page
fault. This will be used later when Nested Nested Paging is
implemented.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

cb659db8

KVM: MMU: Introduce get_cr3 function pointer · 5777ed34

Joerg Roedel authored Sep 10, 2010

This function pointer in the MMU context is required to
implement Nested Nested Paging.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

5777ed34

KVM: X86: Introduce a tdp_set_cr3 function · 1c97f0a0

Joerg Roedel authored Sep 10, 2010

This patch introduces a special set_tdp_cr3 function pointer
in kvm_x86_ops which is only used for tpd enabled mmu
contexts. This allows to remove some hacks from svm code.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

1c97f0a0

KVM: MMU: Make set_cr3 a function pointer in kvm_mmu · f43addd4

Joerg Roedel authored Sep 10, 2010

This is necessary to implement Nested Nested Paging. As a
side effect this allows some cleanups in the SVM nested
paging code.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

f43addd4

KVM: MMU: Make tdp_enabled a mmu-context parameter · c5a78f2b

Joerg Roedel authored Sep 10, 2010

This patch changes the tdp_enabled flag from its global
meaning to the mmu-context and renames it to direct_map
there. This is necessary for Nested SVM with emulation of
Nested Paging where we need an extra MMU context to shadow
the Nested Nested Page Table.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

c5a78f2b

KVM: MMU: Check for root_level instead of long mode · 957446af

Joerg Roedel authored Sep 10, 2010

The walk_addr function checks for !is_long_mode in its 64
bit version. But what is meant here is a check for pae
paging. Change the condition to really check for pae paging
so that it also works with nested nested paging.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

957446af