Commits · 597a5f551ec4cd0aa0966e4fff4684ecc8c31c0d · nexedi / linux

20 Jul, 2008 40 commits

KVM: Adjust smp_call_function_mask() callers to new requirements · 597a5f55

Avi Kivity authored Jul 20, 2008

smp_call_function_mask() now complains when called in a preemptible context;
adjust its callers accordingly.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: MMU: Fix potential race setting upper shadow ptes on nonpae hosts · 722c05f2

Avi Kivity authored Jul 13, 2008

The direct mapped shadow code (used for real mode and two dimensional paging)
sets upper-level ptes using direct assignment rather than calling
set_shadow_pte().  A nonpae host will split this into two writes, which opens
up a race if another vcpu accesses the same memory area.

Fix by calling set_shadow_pte() instead of assigning directly.

Noticed by Izik Eidus.
Signed-off-by: Avi Kivity <avi@qumranet.com>
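
The race described above can be illustrated in userspace. The following is a minimal sketch, not the kernel's set_shadow_pte(): `shadow_pte_t`, `torn_intermediate()` and `set_pte_atomic()` are hypothetical names, and C11 atomics stand in for whatever 64-bit store primitive the kernel uses.

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical stand-in for a 64-bit shadow pte slot. */
typedef _Atomic uint64_t shadow_pte_t;

/*
 * Models a plain 64-bit assignment compiled as two 32-bit stores:
 * returns the value another CPU could observe after the low half
 * has been written but before the high half has.
 */
static uint64_t torn_intermediate(uint64_t old_pte, uint64_t new_pte)
{
    uint32_t lo = (uint32_t)new_pte;          /* low half already stored */
    uint32_t hi = (uint32_t)(old_pte >> 32);  /* high half still the old one */
    return ((uint64_t)hi << 32) | lo;
}

/* Publishes the whole 64-bit value in a single indivisible store. */
static void set_pte_atomic(shadow_pte_t *slot, uint64_t new_pte)
{
    atomic_store(slot, new_pte);
}
```

The intermediate value mixes halves of two different entries, which is why a single atomic store is preferable whenever another vcpu may read the slot concurrently.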

KVM: x86 emulator: emulate clflush · 2a7c5b8b

Glauber Costa authored Jul 10, 2008

If the guest issues a clflush to an MMIO address, the instruction
can trap into the hypervisor. Currently, we do not decode clflush
properly, causing the guest to hang. This patch fixes this by emulating
clflush (opcode 0f ae).
Signed-off-by: Glauber Costa <gcosta@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
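
As a rough illustration of what decoding this opcode involves, here is a minimal sketch that is not taken from the patch. `is_clflush()` is a hypothetical helper; it assumes no prefixes and relies on the architectural encoding of clflush as 0f ae /7 with a memory operand.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Returns true if the bytes start with clflush (0f ae /7, memory operand).
 * Prefixes are ignored in this sketch.
 */
static bool is_clflush(const uint8_t *insn, size_t len)
{
    if (len < 3 || insn[0] != 0x0f || insn[1] != 0xae)
        return false;

    uint8_t modrm = insn[2];
    uint8_t mod = modrm >> 6;         /* 3 means register operand */
    uint8_t reg = (modrm >> 3) & 7;   /* opcode extension for 0f ae */

    /* 0f ae is a group opcode: /7 with mod == 3 is sfence, not clflush. */
    return reg == 7 && mod != 3;
}
```

Because 0f ae is shared by several instructions, checking only the two opcode bytes would misclassify encodings such as sfence or ldmxcsr.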

KVM: MMU: improve invalid shadow root page handling · 376c53c2

Marcelo Tosatti authored Jul 10, 2008

Harden kvm_mmu_zap_page() against invalid root pages that
had been shadowed from memslots that are gone.
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: MMU: nuke shadowed pgtable pages and ptes on memslot destruction · 34d4cb8f

Marcelo Tosatti authored Jul 10, 2008

Flush the shadow mmu before removing regions to avoid stale entries.
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: Prefix some x86 low level function with kvm_, to avoid namespace issues · d6e88aec

Avi Kivity authored Jul 10, 2008

Fixes compilation with CONFIG_VMI enabled.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: check injected pic irq within valid pic irqs · c65bbfa1

Ben-Ami Yassour authored Jul 06, 2008

Check that an injected pic irq is between 0 and 15.
Signed-off-by: Ben-Ami Yassour <benami@il.ibm.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
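
The check amounts to a simple bounds test. A minimal sketch, with `PIC_NUM_PINS` and `pic_irq_valid()` as hypothetical names rather than identifiers from the patch:

```c
#include <stdbool.h>

/* Hypothetical constant: two cascaded 8259 PICs expose 16 irq lines. */
#define PIC_NUM_PINS 16

/* Returns true only for irq numbers 0..15. */
static bool pic_irq_valid(int irq)
{
    return irq >= 0 && irq < PIC_NUM_PINS;
}
```

Rejecting out-of-range values before they are used as an index or bit position keeps a bad userspace request from touching unrelated state.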

KVM: x86 emulator: Fix HLT instruction · 19fdfa0d

Mohammed Gamal authored Jul 06, 2008

This patch fixes an issue encountered with the HLT instruction
under FreeDOS's HIMEM XMS Driver.

The HLT instruction jumped directly to the done label and
skipped updating the EIP value, causing the guest
to spin endlessly on the same instruction.

The patch changes the instruction so that it writes back
the updated EIP value.
Signed-off-by: Mohammed Gamal <m.gamal005@gmail.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
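
The control-flow problem can be shown with a toy emulator. This is a sketch under stated assumptions, not the KVM emulator: `struct toy_vcpu` and `toy_emulate()` are hypothetical, and only the one-byte opcodes hlt (f4) and nop (90) are handled.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical, minimal vcpu state. */
struct toy_vcpu {
    uint32_t eip;
    bool halted;
};

/* Emulates the single instruction at insn, which is located at vcpu->eip. */
static void toy_emulate(struct toy_vcpu *vcpu, const uint8_t *insn)
{
    /* Decode cursor: private until the writeback step at the end. */
    uint32_t next_eip = vcpu->eip + 1;

    switch (insn[0]) {
    case 0xf4:               /* hlt */
        vcpu->halted = true;
        break;               /* must still reach the writeback below */
    case 0x90:               /* nop */
        break;
    default:
        return;              /* unhandled: leave all state untouched */
    }

    vcpu->eip = next_eip;    /* writeback: skipping this re-executes hlt */
}
```

If the hlt case returned early instead of falling through to the writeback, the guest would resume at the same address and execute hlt again on every wakeup.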

KVM: Apply the kernel sigmask to vcpus blocked due to being uninitialized · ac9f6dc0

Avi Kivity authored Jul 06, 2008

Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: VMX: Add ept_sync_context in flush_tlb · 4e1096d2

Sheng Yang authored Jul 06, 2008

Fix a potential issue caused by kvm_mmu_slot_remove_write_access(). The
old behavior did not sync the EPT TLB with modified EPT entries, which resulted
in inconsistent contents between the EPT TLB and the EPT table.
Signed-off-by: Sheng Yang <sheng.yang@intel.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: mmu_shrink: kvm_mmu_zap_page requires slots_lock to be held · 5a4c9288

Marcelo Tosatti authored Jul 03, 2008

kvm_mmu_zap_page() needs slots lock held (rmap_remove->gfn_to_memslot,
for example).

Since kvm_lock spinlock is held in mmu_shrink(), do a non-blocking
down_read_trylock().

Untested.
Signed-off-by: Avi Kivity <avi@qumranet.com>
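
The locking pattern, trying for a read lock without sleeping while already inside a non-sleeping context, can be sketched in userspace. Everything here is hypothetical (`struct toy_rwsem`, `toy_down_read_trylock()`, `toy_shrink_one()`); the toy lock is built on C11 atomics purely for illustration and is not the kernel rwsem.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Toy reader/writer lock: -1 = held by a writer, n >= 0 = n readers. */
struct toy_rwsem {
    atomic_int state;
};

/* Takes a read lock only if no writer holds it; never blocks. */
static bool toy_down_read_trylock(struct toy_rwsem *sem)
{
    int cur = atomic_load(&sem->state);

    while (cur >= 0) {
        /* On failure, cur is reloaded and the loop condition rechecked. */
        if (atomic_compare_exchange_weak(&sem->state, &cur, cur + 1))
            return true;
    }
    return false;
}

static void toy_up_read(struct toy_rwsem *sem)
{
    atomic_fetch_sub(&sem->state, 1);
}

/* Frees at most one page, but only if the lock is available right now. */
static int toy_shrink_one(struct toy_rwsem *slots_lock, int *nr_pages)
{
    if (!toy_down_read_trylock(slots_lock))
        return 0;            /* contended: skip instead of sleeping */

    int freed = 0;
    if (*nr_pages > 0) {
        (*nr_pages)--;
        freed = 1;
    }
    toy_up_read(slots_lock);
    return freed;
}
```

Skipping the work when the lock is contended trades a missed reclaim opportunity for the guarantee that the caller never sleeps.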

x86: KVM guest: make kvm_smp_prepare_boot_cpu() static · 7e37c299

Adrian Bunk authored Jul 01, 2008

This patch makes the needlessly global kvm_smp_prepare_boot_cpu() static.
Signed-off-by: Adrian Bunk <bunk@kernel.org>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: SVM: fix suspend/resume support · 0da1db75

Joerg Roedel authored Jul 02, 2008

On suspend, the svm_hardware_disable function is called, which frees all svm_data
variables. On resume they are not re-allocated. This patch moves the
deallocation of svm_data from the hardware_disable function to the
hardware_unsetup function, which is not called on suspend.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: s390: rename private structures · 180c12fb

Christian Borntraeger authored Jun 27, 2008

While doing some tests with our lcrash implementation I have seen a
naming conflict with prefix_info in kvm_host.h vs. addrconf.h.

To avoid future conflicts, let's rename private definitions in
asm/kvm_host.h by adding the kvm_s390 prefix.
Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com>
Signed-off-by: Carsten Otte <cotte@de.ibm.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: s390: Set guest storage limit and offset to sane values · 4da29e90

Christian Borntraeger authored Jun 27, 2008

Some machines do not accept 16EB as the guest storage limit. Let's change the
default for the guest storage limit to a sane value. We should also set
the guest_origin to what userspace thinks it is. This allows guests
starting at an address != 0.
Signed-off-by: Christian Borntraeger <borntraeger@de.ibm.com>
Signed-off-by: Carsten Otte <cotte@de.ibm.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: Fix memory leak on guest exit · dfdded7c

Carsten Otte authored Jun 27, 2008

This patch fixes a memory leak: we want to free the physmem when destroying
the vm.
Signed-off-by: Carsten Otte <cotte@de.ibm.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: s390: dont allocate dirty bitmap · eff0114a

Carsten Otte authored Jun 27, 2008

This patch #ifdefs the bitmap array for dirty tracking. We don't have dirty
tracking on s390 today, and we'd love to use our storage keys to store the
dirty information for migration. Therefore, we won't need this array at all,
and because our vmalloc space is limited, allocating it limits the number of
guests we can run.
Signed-off-by: Carsten Otte <cotte@de.ibm.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: move slots_lock acquision down to vapic_exit · f8b78fa3

Marcelo Tosatti authored Jun 23, 2008

There is no need to grab slots_lock if the vapic_page will not
be touched.
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: VMX: Fake emulate Intel perfctr MSRs · efa67e0d

Chris Lalancette authored Jun 20, 2008

Older linux guests (in this case, 2.6.9) can attempt to
access the performance counter MSRs without a fixup section, and injecting
a GPF kills the guest. Work around by allowing the guest to write those MSRs.

Tested by me on RHEL-4 i386 and x86_64 guests, as well as F-9 guests.
Signed-off-by: Chris Lalancette <clalance@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: VMX: Fix a wrong usage of vmcs_config · 65267ea1

Sheng Yang authored Jun 18, 2008

The function ept_update_paging_mode_cr0() writes to
CPU_BASED_VM_EXEC_CONTROL based on vmcs_config.cpu_based_exec_ctrl. That's
wrong because the variable may not be consistent with the content of the
CPU_BASED_VM_EXEC_CONTROL VMCS field.
Signed-off-by: Sheng Yang <sheng.yang@intel.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: MMU: Fix printk format · db475c39

Avi Kivity authored Jun 22, 2008

Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: MMU: When debug is enabled, make it a run-time parameter · 6ada8cca

Avi Kivity authored Jun 22, 2008

Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: x86 emulator: lazily evaluate segment registers · 7a5b56df

Avi Kivity authored Jun 22, 2008

Instead of prefetching all segment bases before emulation, read them at the
last moment.  Since most of them are unneeded, we save some cycles on
Intel machines where this is a bit expensive.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: x86 emulator: avoid segment base adjust for lea · 0adc8675

Avi Kivity authored Jun 15, 2008

Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: x86 emulator: simplify rip relative decoding · f5b4edcd

Avi Kivity authored Jun 15, 2008

rip relative decoding is relative to the instruction pointer of the next
instruction; by moving address adjustment until after decoding is complete,
we remove the need to determine the instruction size.
Signed-off-by: Avi Kivity <avi@qumranet.com>
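
The idea of deferring the adjustment can be sketched as follows. This is an illustrative toy, not the emulator's decoder: `toy_decode_ea()` is hypothetical, handles only the ModRM form mod=00, r/m=101 (disp32, rip-relative in 64-bit mode), and takes the immediate length as a parameter instead of decoding it.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * modrm_ptr points at the ModRM byte, located at guest address modrm_ip.
 * imm_len is the number of immediate bytes that follow the displacement.
 */
static uint64_t toy_decode_ea(const uint8_t *modrm_ptr, uint64_t modrm_ip,
                              unsigned imm_len)
{
    uint64_t ip = modrm_ip;   /* advances as bytes are consumed */
    uint64_t ea = 0;
    bool rip_relative = false;

    uint8_t modrm = modrm_ptr[0];
    ip += 1;

    if ((modrm & 0xc7) == 0x05) {
        /* Assemble the little-endian disp32 and sign-extend it. */
        uint32_t raw = (uint32_t)modrm_ptr[1]
                     | (uint32_t)modrm_ptr[2] << 8
                     | (uint32_t)modrm_ptr[3] << 16
                     | (uint32_t)modrm_ptr[4] << 24;
        int64_t disp = (raw & 0x80000000u)
                     ? (int64_t)raw - 0x100000000LL
                     : (int64_t)raw;
        ea = (uint64_t)disp;
        ip += 4;
        rip_relative = true;  /* remember; do not add rip yet */
    }

    ip += imm_len;            /* the rest of the instruction */

    /* Decoding is complete, so ip now is the next instruction's address. */
    if (rip_relative)
        ea += ip;

    return ea;
}
```

Because the adjustment happens only after the cursor has passed every byte, the decoder never has to predict how many bytes follow the displacement.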

KVM: x86 emulator: simplify r/m decoding · 84411d85

Avi Kivity authored Jun 15, 2008

Consolidate the duplicated code when not in any special case.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: x86 emulator: simplify sib decoding · dc71d0f1

Avi Kivity authored Jun 15, 2008

Instead of using sparse switches, use simpler if/else sequences.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: x86 emulator: handle undecoded rex.b with r/m = 5 in certain cases · 8684c0af

Avi Kivity authored Jun 15, 2008

x86_64 does not decode rex.b in certain cases, where the r/m field = 5.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: x86 emulator: emulate nop and xchg reg, acc (opcodes 0x90 - 0x97) · b13354f8

Mohammed Gamal authored Jun 15, 2008

Signed-off-by: Mohammed Gamal <m.gamal005@gmail.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: Use printk_rlimit() instead of reporting emulation failures just once · f76c710d

Avi Kivity authored Jun 13, 2008

Emulation failure reports are useful, so allow more than one per the lifetime
of the module.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: Support mixed endian machines · 9ef621d3

Tan, Li authored May 23, 2008

Currently kvmtrace is not portable. This prevents copying a
trace file from a big-endian target to a little-endian workstation for analysis.
In this patch, the kernel outputs metadata containing a magic number to the trace
log, and changes 64-bit words to be u64 instead of a pair of u32s.
Signed-off-by: Tan Li <li.tan@intel.com>
Acked-by: Jerone Young <jyoung5@us.ibm.com>
Acked-by: Hollis Blanchard <hollisb@us.ibm.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
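
A reader on the analysis side could use such a magic number roughly as follows. This is a sketch only: `TOY_TRACE_MAGIC` is a made-up value, and `trace_needs_swap()` and `trace_read_u64()` are hypothetical helpers, not part of kvmtrace.

```c
#include <stdint.h>

/* Hypothetical magic value; the real one is defined by the kernel patch. */
#define TOY_TRACE_MAGIC 0x12345678u

static uint32_t swap32(uint32_t v)
{
    uint32_t r = 0;
    for (int i = 0; i < 4; i++) {
        r = (r << 8) | (v & 0xff);
        v >>= 8;
    }
    return r;
}

static uint64_t swap64(uint64_t v)
{
    uint64_t r = 0;
    for (int i = 0; i < 8; i++) {
        r = (r << 8) | (v & 0xff);
        v >>= 8;
    }
    return r;
}

/*
 * magic_as_read is the magic field loaded in the reader's native byte order.
 * Returns 0 if the writer had the same endianness, 1 if every field must be
 * byte-swapped, and -1 if this does not look like a trace file at all.
 */
static int trace_needs_swap(uint32_t magic_as_read)
{
    if (magic_as_read == TOY_TRACE_MAGIC)
        return 0;
    if (swap32(magic_as_read) == TOY_TRACE_MAGIC)
        return 1;
    return -1;
}

/* Converts one 64-bit field; possible because it is a single u64 word. */
static uint64_t trace_read_u64(uint64_t raw, int swap)
{
    return swap ? swap64(raw) : raw;
}
```

Storing 64-bit values as one u64 matters here: a pair of u32s would need both a per-word swap and a reordering of the two halves.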

KVM: Do not calculate linear rip in emulation failure report · 25be4608

Glauber Costa authored Jun 10, 2008

If we're not gonna do anything (case in which failure is already
reported), we do not need to even bother with calculating the linear rip.
Signed-off-by: Glauber Costa <gcosta@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: only abort guest entry if timer count goes from 0->1 · 622395a9

Marcelo Tosatti authored Jun 11, 2008

Only abort guest entry if the timer count went from 0->1, since for 1->2
or larger the bit will either be set already or a timer irq will have
been injected.

Using atomic_inc_and_test() for it also introduces an SMP barrier
to the LAPIC version (thought it was unnecessary because of timer
migration, but the guest can be scheduled to a different pCPU between exit
and kvm_vcpu_block(), so there is the possibility for a race).

Noticed by Avi.
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
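
Detecting the 0->1 transition atomically can be sketched with C11 atomics. This is a userspace analogue, not the kernel code: `struct toy_timer` and `toy_timer_fire()` are hypothetical, and `atomic_fetch_add()` is used in place of the kernel's atomic helpers.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical timer state: number of ticks not yet delivered. */
struct toy_timer {
    atomic_int pending;
};

/*
 * Records one more pending tick. Returns true only when the count went
 * from 0 to 1, which is the only case where the vcpu needs to be told.
 */
static bool toy_timer_fire(struct toy_timer *t)
{
    /* fetch_add returns the previous value and is sequentially consistent. */
    return atomic_fetch_add(&t->pending, 1) == 0;
}
```

Doing the increment and the comparison in one atomic operation means two CPUs firing at the same time cannot both observe the 0->1 transition.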

KVM: Add coalesced MMIO support (ia64 part) · 7f39f8ac

Laurent Vivier authored May 30, 2008

This patch enables coalesced MMIO for ia64 architecture.
It defines KVM_MMIO_PAGE_OFFSET and KVM_CAP_COALESCED_MMIO.
It enables the compilation of coalesced_mmio.c.

[akpm: fix compile error on ia64]
Signed-off-by: Laurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: Add coalesced MMIO support (powerpc part) · 588968b6

Laurent Vivier authored May 30, 2008

This patch enables coalesced MMIO for powerpc architecture.
It defines KVM_MMIO_PAGE_OFFSET and KVM_CAP_COALESCED_MMIO.
It enables the compilation of coalesced_mmio.c.
Signed-off-by: Laurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: Add coalesced MMIO support (x86 part) · 542472b5

Laurent Vivier authored May 30, 2008

This patch enables coalesced MMIO for x86 architecture.
It defines KVM_MMIO_PAGE_OFFSET and KVM_CAP_COALESCED_MMIO.
It enables the compilation of coalesced_mmio.c.
Signed-off-by: Laurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: Add coalesced MMIO support (common part) · 5f94c174

Laurent Vivier authored May 30, 2008

This patch adds all needed structures to coalesce MMIOs.
Until an architecture uses it, it is not compiled.

Coalesced MMIO introduces two ioctl()s to define the MMIO zones that
can be coalesced:

- KVM_REGISTER_COALESCED_MMIO registers a coalesced MMIO zone.
  It takes one parameter (struct kvm_coalesced_mmio_zone) which defines
  a memory area where MMIOs can be coalesced until the next switch to
  user space. The maximum number of MMIO zones is KVM_COALESCED_MMIO_ZONE_MAX.

- KVM_UNREGISTER_COALESCED_MMIO cancels all registered zones inside
  the given bounds (bounds are also given by struct kvm_coalesced_mmio_zone).

The userspace client can check kernel coalesced MMIO availability by asking
ioctl(KVM_CHECK_EXTENSION) for the KVM_CAP_COALESCED_MMIO capability.
The ioctl() call for KVM_CAP_COALESCED_MMIO will return 0 if not supported,
or the page offset where the ring buffer is stored.
The page offset depends on the architecture.

After an ioctl(KVM_RUN), the first page of the KVM memory mapped points to
a kvm_run structure. The offset given by KVM_CAP_COALESCED_MMIO is
an offset to the coalesced MMIO ring, expressed in units of PAGE_SIZE, relative
to the address of the start of the kvm_run structure. The MMIO ring buffer
is defined by the structure kvm_coalesced_mmio_ring.

[akio: fix oops during guest shutdown]
Signed-off-by: Laurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: Akio Takebe <takebe_akio@jp.fujitsu.com>
Signed-off-by: Avi Kivity <avi@qumranet.com>
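
The address arithmetic described above can be sketched like this. `coalesced_ring_addr()` is a hypothetical helper; in a real client, `kvm_run_base` would be the start of the mapped kvm_run area and `page_offset` the value returned by ioctl(KVM_CHECK_EXTENSION) for KVM_CAP_COALESCED_MMIO, neither of which this sketch obtains itself.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Returns the address of the coalesced MMIO ring, or NULL when the
 * capability query reported 0 (not supported).
 */
static void *coalesced_ring_addr(void *kvm_run_base, int page_offset,
                                 size_t page_size)
{
    if (page_offset <= 0)
        return NULL;

    /* The offset is counted in pages from the start of kvm_run. */
    return (uint8_t *)kvm_run_base + (size_t)page_offset * page_size;
}
```

The caller would then treat the returned pointer as a struct kvm_coalesced_mmio_ring and drain it after each return from ioctl(KVM_RUN).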

KVM: kvm_io_device: extend in_range() to manage len and write attribute · 92760499

Laurent Vivier authored May 30, 2008

Modify member in_range() of structure kvm_io_device to pass length and the type
of the I/O (write or read).

This modification allows kvm_io_device to be used with coalesced MMIO.
Signed-off-by: Laurent Vivier <Laurent.Vivier@bull.net>
Signed-off-by: Avi Kivity <avi@qumranet.com>
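
A callback with the extended arguments might look roughly like the following. All names are hypothetical (`toy_gpa_t`, `struct toy_io_device`, `toy_coalesced_in_range()`), and the policy of claiming only writes is an assumption made for this sketch: a read has to return data immediately, so it cannot be deferred.

```c
#include <stdint.h>

typedef uint64_t toy_gpa_t;   /* hypothetical guest physical address type */

/* Hypothetical device with an in_range() that also sees length and direction. */
struct toy_io_device {
    int (*in_range)(struct toy_io_device *dev, toy_gpa_t addr,
                    int len, int is_write);
    toy_gpa_t base;
    toy_gpa_t size;
};

/* Claims an access only if it is a write lying entirely inside the zone. */
static int toy_coalesced_in_range(struct toy_io_device *dev, toy_gpa_t addr,
                                  int len, int is_write)
{
    if (!is_write || len <= 0)
        return 0;
    if (addr < dev->base || (toy_gpa_t)len > dev->size)
        return 0;

    /* Written to avoid overflow in addr + len. */
    return addr - dev->base <= dev->size - (toy_gpa_t)len;
}
```

Without the length, a device could not reject an access that starts inside its zone but runs past the end; without the direction, it could not treat reads and writes differently.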

KVM: MMU: Avoid page prefetch on SVM · 131d8279

Avi Kivity authored May 29, 2008

SVM cannot benefit from page prefetching since guest page fault bypass
cannot be made to work there.  Avoid accessing the guest page table in
this case.
Signed-off-by: Avi Kivity <avi@qumranet.com>

KVM: MMU: Move nonpaging_prefetch_page() · d761a501

Avi Kivity authored May 29, 2008

In preparation for next patch. No code change.
Signed-off-by: Avi Kivity <avi@qumranet.com>
