Commits · f9400afb1e11c33830bd99a0c9fafe3f4d07a7cc · Kirill Smelkov / linux

02 Nov, 2017 40 commits

drm/nouveau/mmu/gp100,gp10b: implement new vmm backend · f9400afb

Ben Skeggs authored Nov 01, 2017

Adds support for:
- 64KiB/2MiB big page sizes (128KiB not supported by HW with new PT layout).
- System-memory PTs.
- LPTE "invalid" state.
- (Tegra) Use of video memory aperture.
- Sparse PDEs/PTEs.
- Additional blocklinear kinds.
- 49-bit address-space.

GP100 supports an entirely new 5-level page table layout that provides
an expanded 49-bit address-space.  It also supports the layout present
on previous generations, which we've been making do with until now.

This commit implements support for the new layout, and enables it by
default.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

f9400afb

drm/nouveau/mmu/gm200,gm20b: implement new vmm backend · e12cf6ad

Ben Skeggs authored Nov 01, 2017

Adds support for:
- 64KiB big page size.
- System-memory PTs.
- LPTE "invalid" state.
- (Tegra) Use of video memory aperture.
- Sparse PDEs/PTEs.
- Additional blocklinear kinds.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

e12cf6ad

drm/nouveau/mmu/gk104,gk20a: implement new vmm backend · b5977643

Ben Skeggs authored Nov 01, 2017

Adds support for:
- 64KiB big page size.
- System-memory PTs.
- LPTE "invalid" state.
- (Tegra) Use of video memory aperture.

Adds support for marking LPTEs invalid, resulting in the corresponding
SPTEs being ignored, which is supposed to speed up TLB invalidates.

On The Tegra side, this will switch to using the video memory aperture
for all mappings.  The HW will still target non-coherent system memory,
but this aperture needs to be selected in order to support compression.

Tegra's instmem backend somewhat cheated to get this effect previously.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

b5977643

drm/nouveau/mmu/gf100: implement new vmm backend · b77791da

Ben Skeggs authored Nov 01, 2017

Adds support for:
- 64KiB big page size.
- System-memory PTs.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

b77791da

drm/nouveau/mmu/nv50,g84: implement new vmm backend · fd542a3e
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
fd542a3e
drm/nouveau/mmu/nv44: implement new vmm backend · 6ce51352
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
6ce51352
drm/nouveau/mmu/nv41: implement new vmm backend · 473f9aca
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
473f9aca
drm/nouveau/mmu/nv04: implement new vmm backend · dd12d158
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
dd12d158

drm/nouveau/mmu: implement new vmm backend · eb813999

Ben Skeggs authored Nov 01, 2017

This is the common code to support a rework of the VMM backends.

It adds support for more than 2 levels of page table nesting, which
is required to be able to support GP100's MMU layout.

Sparse mappings (that don't cause MMU faults when accessed) are now
supported, where the backend provides it.

Dual-PT handling had to become more sophisticated to support sparse,
but this also allows us to support an optimisation the MMU provides
on GK104 and newer.

Certain operations can now be combined into a single page tree walk
to avoid some overhead, but also enables optimsations like skipping
PTE unmap writes when the PT will be destroyed anyway.

The old backend has been hacked up to forward requests onto the new
backend, if present, so that it's possible to bisect between issues
in the backend changes vs the upcoming frontend changes.

Until the new frontend has been merged, new backends will leak BAR2
page tables on module unload.  This is expected, and it's not worth
the effort of hacking around this as it doesn't effect runtime.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

eb813999

drm/nouveau/fb/gm200: enable NV_PFB_MMU_CTRL_USE_FULL_COMP_TAG_LINE where appropriate · bda9e379

Ben Skeggs authored Nov 01, 2017

To avoid wasting compression tags when using 64KiB pages, we need to
enable this so we can select between upper/lower comptagline in PTEs.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

bda9e379

drm/nouveau/ltc/gm200: limit NV_MMU_PTE_COMPTAGLINE bits to 16 where required · f8a12039

Ben Skeggs authored Nov 01, 2017

If NV_PFB_MMU_CTRL_USE_FULL_COMP_TAG_LINE is TRUE, then the last bit of
NV_MMU_PTE_COMPTAGLINE is re-purposed to select the upper/lower half of
a compression tag when using 64KiB big pages.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

f8a12039

drm/nouveau/fifo/nv04-nv40: fix missing nvkm_kmap() calls around ramfc access · ac47c15b
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
ac47c15b

drm/nouveau/mmu: handle instance block setup · d30af7ce

Ben Skeggs authored Nov 01, 2017

We previously required each VMM user to allocate their own page directory
and fill in the instance block themselves.

It makes more sense to handle this in a common location.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

d30af7ce

drm/nouveau/mmu: remove old vm creation hooks · af3b8d53
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
af3b8d53

drm/nouveau/mmu/gp100,gp10b: implement vmm on top of new base · 8e39abff

Ben Skeggs authored Nov 01, 2017

Adds support for:
- Selection of old/new-style page table layout (GP100MmuLayout=0/1).
- System-memory PDs.

New layout disabled by default for the moment, as we don't have a
backend that can handle it yet.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

8e39abff

drm/nouveau/mmu/gm200,gm20b: implement vmm on top of new base · 5f300fed

Ben Skeggs authored Nov 01, 2017

Adds support for:
- Per-VMM selection of big page size.
- System-memory PDs.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

5f300fed

drm/nouveau/mmu/gk104,gk20a: implement vmm on top of new base · 7de078aa

Ben Skeggs authored Nov 01, 2017

Adds support for:
- Selection of a 64KiB big page size (NvFbBigPage=16).
- System-memory PDs.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

7de078aa

drm/nouveau/mmu/gf100: implement vmm on top of new base · 540a1dde

Ben Skeggs authored Nov 01, 2017

Adds support for:
- Selection of a 64KiB big page size (NvFbBigPage=16).
- System-memory PDs.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

540a1dde

drm/nouveau/mmu/nv50,g84: implement vmm on top of new base · 9f6219fd
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
9f6219fd
drm/nouveau/mmu/nv44: implement vmm on top of new base · 03b0ba7b
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
03b0ba7b
drm/nouveau/mmu/nv41: implement vmm on top of new base · 77783435
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
77783435
drm/nouveau/mmu/nv04: implement vmm on top of new base · 5b17f362
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
5b17f362

drm/nouveau/mmu: implement base for new vm management · 806a7335

Ben Skeggs authored Nov 01, 2017

This is the first chunk of the new VMM code that provides the structures
needed to describe a GPU virtual address-space layout, as well as common
interfaces to handle VMM creation, and connecting instances to a VMM.

The constructor now allocates the PD itself, rather than having the user
handle that manually. This won't/can't be used until after all backends
have been ported to these interfaces, so a little bit of memory will be
wasted on Fermi and newer for a couple of commits in the series.

Compatibility has been hacked into the old code to allow each GPU backend
to be ported individually.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

806a7335

drm/nouveau/mmu: implement page table sub-allocation · f1280394

Ben Skeggs authored Nov 01, 2017

GP100 "big" (which is a funny name, when it supports "even bigger") page
tables are small enough that we want to be able to suballocate them from
a larger block of memory.

This builds on the previous page table cache interfaces so that the VMM
code doesn't need to know the difference.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

f1280394

drm/nouveau/mmu: implement page table cache · 9a45ddaa

Ben Skeggs authored Nov 01, 2017

Builds up and maintains a small cache of each page table size in order
to reduce the frequency of expensive allocations, particularly in the
pathological case where an address range ping-pongs between allocated
and free.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

9a45ddaa

drm/nouveau/mmu: automatically handle "un-bootstrapping" of vmm · 5e075fde

Ben Skeggs authored Nov 01, 2017

Removes the need to expose internals outside of MMU, and GP100 is both
different, and a lot harder to deal with.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

5e075fde

drm/nouveau/mmu/gp10b: fork from gf100 · 6359c982
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
6359c982
drm/nouveau/mmu/gp100: fork from gf100 · b86a4587
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
b86a4587
drm/nouveau/mmu/gm20b: fork from gf100 · cedc4d57
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
cedc4d57
drm/nouveau/mmu/gm200: fork from gf100 · e1e33c79
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
e1e33c79
drm/nouveau/mmu/gk20a: fork from gf100 · d1f6c8d2
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
d1f6c8d2
drm/nouveau/mmu/gk104: fork from gf100 · db018585
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
db018585
drm/nouveau/mmu/g84: fork from nv50 · 0f43715f
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
0f43715f
drm/nouveau/fb/ram: remove old allocators · b4e114f1
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
b4e114f1

drm/nouveau: allocate vram with nvkm_ram_get() · 1de33775

Ben Skeggs authored Nov 01, 2017

This will cause a subtle behaviour change on GPUs that are in mixed-memory
configurations in that VRAM in the degraded section of VRAM will no longer
be used for TTM buffer objects.

That section of VRAM is not meant to be used for displayable/compressed
surfaces, and we have no reliable way with the current interfaces to be
able to make that decision properly.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

1de33775

drm/nouveau: directly handle comptag allocation · 7b865663

Ben Skeggs authored Nov 01, 2017

Another transition step to allow finer-grained patches transitioning to
new MMU backends.

Old backends will continue operate as before (accessing nvkm_mem::tag),
and new backends will get a reference to the tags allocated here.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

7b865663

drm/nouveau: wrap nvkm_mem objects in nvkm_memory interfaces · bd275f1d

Ben Skeggs authored Nov 01, 2017

This is a transition step, to enable finer-grained commits while
transitioning to new MMU interfaces.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

bd275f1d

drm/nouveau/ltc/gf100-: allocate tagram with nvkm_ram_get() · bd447053
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
bd447053
drm/nouveau/imem/nv50: allocate memory with nvkm_ram_get() · 7f4f82af
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
7f4f82af
drm/nouveau/fb/ram/gt215: allocate training buffer with nvkm_ram_get() · 2bfa0b01
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
2bfa0b01