Commits · c09597f083960cea492e1d658b9039b06d8a6839 · nexedi / linux

02 Nov, 2017 40 commits

drm/nouveau/core/memory: add some useful accessor macros · c09597f0

Ben Skeggs authored Nov 01, 2017

Adds support for 64-bit writes, and optimised filling of buffers with
fixed 32/64-bit values.

These will all be used by the upcoming MMU changes.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

c09597f0

drm/nouveau/core/memory: add reference counting · 997a8900

Ben Skeggs authored Nov 01, 2017

We need to be able to prevent memory from being freed while it's still
mapped in a GPU's address-space.

Will be used by upcoming MMU changes.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

997a8900

drm/nouveau/core/memory: add mechanism to retrieve allocation granularity · 2c9c4910

Ben Skeggs authored Nov 01, 2017

Needed by VMM code to determine whether an allocation is compatible with
a given page size (ie. you can't map 4KiB system memory pages into 64KiB
GPU pages).
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

2c9c4910

drm/nouveau/core/memory: change map interface to support upcoming mmu changes · 19a82e49

Ben Skeggs authored Nov 01, 2017

Map flags (access, kind, etc) are currently defined in either the VMA,
or the memory object, which turns out to not be ideal for things like
suballocated buffers, etc.

These will become per-map flags instead, so we need to support passing
these arguments in nvkm_memory_map().
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

19a82e49

drm/nouveau/core/memory: comptag allocation · 7f53d6dc

Ben Skeggs authored Nov 01, 2017

nvkm_memory is going to be used by the upcoming mmu rework for the basic
representation of a memory allocation, as such, this commit adds support
for comptag allocation to nvkm_memory.

This is very simple for now, in that it requires comptags for the entire
memory allocation even if only certain ranges are compressed.

Support for tracking ranges will be added at a later date.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

7f53d6dc

drm/nouveau/ltc: init comptag mm in fb subdev · 6cd7670c

Ben Skeggs authored Nov 01, 2017

A single location for the MM allows us to share allocation logic.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

6cd7670c

drm/nouveau/fb/gf100: clear comptags at allocation time rather than mmu map · b1e839f3

Ben Skeggs authored Nov 01, 2017

We probably don't want to destroy compression data when doing multiple
mappings of a memory object.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

b1e839f3

drm/nouveau/fb: move comptag init out of ram submodule · af793b8c
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
af793b8c

drm/nouveau/fb: move comptags mm into nvkm_fb · 7ef44bee

Ben Skeggs authored Nov 01, 2017

We're moving towards having a central place to handle comptag allocation,
and as some GPUs don't have a ram submodule (ie. Tegra), we need to move
the mm somewhere else.

It probably never belonged in ram anyways.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

7ef44bee

drm/nouveau/core/mm: introduce functions to access info about a given allocation · b7e1f3f1
Ben Skeggs authored Nov 01, 2017
```
These will be used in upcoming patches.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
b7e1f3f1

drm/nouveau/core/mm: have users explicitly define heap identifiers · 4d058fab

Ben Skeggs authored Nov 01, 2017

Different sections of VRAM may have different properties (ie. can't be used
for compression/display, can't be mapped, etc).

We currently already support this, but it's a bit magic.  This change makes
it more obvious where we're allocating from.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

4d058fab

drm/nouveau: separate constant-va tracking from nvkm vma structure · 24e8375b
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
24e8375b
drm/nouveau: separate buffer object backing memory from nvkm structures · 9ce523cc
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
9ce523cc

drm/nouveau: hang drm client of a master · cb7e88e7

Ben Skeggs authored Nov 01, 2017

TTM memory allocations will be hanging off the DRM's client, but the
locking needed to do so gets really tricky with all the other use of
the DRM's object tree.

To solve this, we make the normal DRM client a child of a new master,
where the memory allocations will be done from instead.

This also solves a potential race with client creation.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

cb7e88e7

drm/nouveau: consolidate identical functions in nouveau_ttm.c · 6be4421a
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
6be4421a
drm/nouveau: remove unnecessary use of ttm_mem_type_manager::priv · 792067e0
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
792067e0
drm/nouveau: swap loop order in move_notify() hook · a48296ab
Ben Skeggs authored Nov 01, 2017
```
The conditional is the same for every mapping.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
a48296ab

drm/nouveau: simplify const-va map condition · 425b34f7

Ben Skeggs authored Nov 01, 2017

We don't really care about where the memory is, just that it's compatible
with a VMA allocated for a given page size.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

425b34f7

drm/nouveau: split various bo flags out into their own members · 7760a2e3
Ben Skeggs authored Nov 01, 2017
```
It's far more convenient to deal with like this.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
7760a2e3
drm/nouveau: remove unused sysmem fence code · bc3b0c7a
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
bc3b0c7a
drm/nouveau: store nouveau_drm in nouveau_cli, as opposed to drm_device · e75c091b
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
e75c091b
drm/nouveau/gr/gf100-gk208: copy big page size setting from fb · b6838c14
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
b6838c14
drm/nouveau/gr/gf100-gk208: make use of init_gpc_mmu() hook to share setup · 223eaf4b
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
223eaf4b

drm/nouveau/fb: finalise big page size selection in constructor · 2854ab8d

Ben Skeggs authored Nov 01, 2017

MMU will need to know this during its constructor, so we can't delay
deciding this until init-time.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

2854ab8d

drm/nouveau/mmu/nv04-nv4x: move global vmm to nvkm_mmu · 0b11b30d

Ben Skeggs authored Nov 01, 2017

In a future commit, this will be constructed by common code.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

0b11b30d

drm/nouveau/imem: use fast-path for resume restore · ffd937bb

Ben Skeggs authored Nov 01, 2017

Before: "imem: init completed in 299277us"
 After: "imem: init completed in  11574us"

Suspend from Fedora 26 gnome desktop on GP102.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

ffd937bb

drm/nouveau/imem: use fast-path for suspend backup · e9be3c7d

Ben Skeggs authored Nov 01, 2017

Before: "imem: suspend completed in 5540487us"
 After: "imem: suspend completed in 1871526us"

Suspend from Fedora 26 gnome desktop on GP102.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

e9be3c7d

drm/nouveau/imem: separate pre-BAR2-bootstrap objects from the rest · b00b8430
Ben Skeggs authored Nov 01, 2017
```
These will require slow-path access during suspend/resume.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
b00b8430
drm/nouveau/imem: switch to kvmalloc/kvfree for suspend/resume backup · 54c70e3a
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
54c70e3a
drm/nouveau/imem: separate suspend/resume backup handling into their own functions · d52ddc95
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
d52ddc95
drm/nouveau/imem: remove now-unused wrapper for backend objects · 71370e62
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
71370e62

drm/nouveau/imem/nv50: support eviction of BAR2 mappings · 03edf1b3

Ben Skeggs authored Nov 01, 2017

A good deal of the structures we map into here aren't accessed very often
at all, and Fedora 26 has exposed an issue where after creating a heap of
channels, BAR2 space would run out, and we'd need to make use of the slow
path while accessing important structures like page tables.

This implements an LRU on BAR2 space, which allows eviction of mappings
that aren't currently needed, to make space for other objects.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

03edf1b3

drm/nouveau/imem/nv50: prevent fast-path for mapped objects when BAR isn't ready · 69b136f2

Ben Skeggs authored Nov 01, 2017

Another piece of solving the "GP100 BAR2 VMM bootstrap" puzzle.

Without doing this, we'd attempt to write PDEs for the lower page table
levels through BAR2 before BAR2 access has been fully initialised.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

69b136f2

drm/nouveau/imem/nv50: map bar2 write-combined · dfcbd550
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
dfcbd550

drm/nouveau/imem/nv50: embed nvkm_instobj directly into nv04_instobj · be55287a

Ben Skeggs authored Nov 01, 2017

This is not as simple as it was for earlier GPUs, due to the need to swap
accessor functions depending on whether BAR2 is usable or not.

We were previously protected by nvkm_instobj's accessor functions keeping
an object mapped permanently, with some unclear magic that managed to hit
the slow-path where needed even if an object was marked as mapped.

That's been replaced here by reference counting maps (some objects, like
page tables can be accessed concurrently), and swapping the functions as
necessary.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

be55287a

drm/nouveau/imem/nv50: move slow-path locking into rd/wr functions · af515ec8

Ben Skeggs authored Nov 01, 2017

This is to simplify upcoming changes. The slow-path is something that
currently occurs during bootstrap of the BAR2 VMM, while backing up an
object during suspend/resume, or when BAR2 address space runs out.

The latter is a real problem that can happen at runtime, and occurs in
Fedora 26 already (due to some change that causes a lot of channels to
be created at login), so ideally we'd prefer not to make it any slower.

We'd also like suspend/resume speed to not suffer.

Upcoming commits will solve those problems in a better way, making the
extra overhead of moving the locking here a non-issue.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

af515ec8

drm/nouveau/imem/nv50: split object map out from api functions · f584bde6

Ben Skeggs authored Nov 01, 2017

acquire()/boot() will need different logic in addition to performing
the actual mapping.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

f584bde6

drm/nouveau/imem/nv40: map bar2 write-combined · b807270c
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
b807270c
drm/nouveau/imem/nv40: embed nvkm_instobj directly into nv04_instobj · 62465ac5
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
62465ac5
drm/nouveau/imem/nv04: directly embed nvkm_instobj into nv04_instobj · 87717e7f
Ben Skeggs authored Nov 01, 2017
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
87717e7f