Commits · 2f958e8240be28acee26085ba1686b4321ba4306 · Kirill Smelkov / linux

19 Jul, 2018 1 commit

drm/nouveau/fb/gp100-: disable address remapper · 2f958e82

Ben Skeggs authored Jul 18, 2018

This was causing problems on a system with a large amount of RAM, where
display push buffers were being fetched incorrectly when placed in high
system memory addresses.

While this commit will resolve the issue on that particular system, the
issue will be avoided completely with another patch to more fully solve
problems with display and large amounts of system memory on Pascal.

It's still probably a good idea to disable this to prevent weird issues
in the future.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

2f958e82

16 Jul, 2018 7 commits

drm/nouveau: Set DRIVER_ATOMIC cap earlier to fix debugfs · eb493fbc

Lyude Paul authored Jul 03, 2018

Currently nouveau doesn't actually expose the state debugfs file that's
usually provided for any modesetting driver that supports atomic, even
if nouveau is loaded with atomic=1. This is due to the fact that the
standard debugfs files that DRM creates for atomic drivers is called
when drm_get_pci_dev() is called from nouveau_drm.c. This happens well
before we've initialized the display core, which is currently
responsible for setting the DRIVER_ATOMIC cap.

So, move the atomic option into nouveau_drm.c and just add the
DRIVER_ATOMIC cap whenever it's enabled on the kernel commandline. This
shouldn't cause any actual issues, as the atomic ioctl will still fail
as expected even if the display core doesn't disable it until later in
the init sequence. This also provides the added benefit of being able to
use the state debugfs file to check the current display state even if
clients aren't allowed to modify it through anything other than the
legacy ioctls.

Additionally, disable the DRIVER_ATOMIC cap in nv04's display core, as
this was already disabled there previously.
Signed-off-by: Lyude Paul <lyude@redhat.com>
Cc: stable@vger.kernel.org
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

eb493fbc

drm/nouveau: Remove bogus crtc check in pmops_runtime_idle · 68fe23a6

Lyude Paul authored Jul 12, 2018

This both uses the legacy modesetting structures in a racy manner, and
additionally also doesn't even check the right variable (enabled != the
CRTC is actually turned on for atomic).

This fixes issues on my P50 regarding the dedicated GPU not entering
runtime suspend.
Signed-off-by: Lyude Paul <lyude@redhat.com>
Cc: stable@vger.kernel.org
Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

68fe23a6

drm/nouveau/drm/nouveau: Fix runtime PM leak in nv50_disp_atomic_commit() · e5d54f19

Lyude Paul authored Jul 12, 2018

A CRTC being enabled doesn't mean it's on! It doesn't even necessarily
mean it's being used. This fixes runtime PM leaks on the P50 I've got
next to me.
Signed-off-by: Lyude Paul <lyude@redhat.com>
Cc: stable@vger.kernel.org
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

e5d54f19

drm/nouveau: Avoid looping through fake MST connectors · 37afe55b

Lyude Paul authored Jul 13, 2018

When MST and atomic were introduced to nouveau, another structure that
could contain a drm_connector embedded within it was introduced; struct
nv50_mstc. This meant that we no longer would be able to simply loop
through our connector list and assume that nouveau_connector() would
return a proper pointer for each connector, since the assertion that
all connectors coming from nouveau have a full nouveau_connector struct
became invalid.

Unfortunately, none of the actual code that looped through connectors
ever got updated, which means that we've been causing invalid memory
accesses for quite a while now.

An example that was caught by KASAN:

[  201.038698] ==================================================================
[  201.038792] BUG: KASAN: slab-out-of-bounds in nvif_notify_get+0x190/0x1a0 [nouveau]
[  201.038797] Read of size 4 at addr ffff88076738c650 by task kworker/0:3/718
[  201.038800]
[  201.038822] CPU: 0 PID: 718 Comm: kworker/0:3 Tainted: G           O      4.18.0-rc4Lyude-Test+ #1
[  201.038825] Hardware name: LENOVO 20EQS64N0B/20EQS64N0B, BIOS N1EET78W (1.51 ) 05/18/2018
[  201.038882] Workqueue: events nouveau_display_hpd_work [nouveau]
[  201.038887] Call Trace:
[  201.038894]  dump_stack+0xa4/0xfd
[  201.038900]  print_address_description+0x71/0x239
[  201.038929]  ? nvif_notify_get+0x190/0x1a0 [nouveau]
[  201.038935]  kasan_report.cold.6+0x242/0x2fe
[  201.038942]  __asan_report_load4_noabort+0x19/0x20
[  201.038970]  nvif_notify_get+0x190/0x1a0 [nouveau]
[  201.038998]  ? nvif_notify_put+0x1f0/0x1f0 [nouveau]
[  201.039003]  ? kmsg_dump_rewind_nolock+0xe4/0xe4
[  201.039049]  nouveau_display_init.cold.12+0x34/0x39 [nouveau]
[  201.039089]  ? nouveau_user_framebuffer_create+0x120/0x120 [nouveau]
[  201.039133]  nouveau_display_resume+0x5c0/0x810 [nouveau]
[  201.039173]  ? nvkm_client_ioctl+0x20/0x20 [nouveau]
[  201.039215]  nouveau_do_resume+0x19f/0x570 [nouveau]
[  201.039256]  nouveau_pmops_runtime_resume+0xd8/0x2a0 [nouveau]
[  201.039264]  pci_pm_runtime_resume+0x130/0x250
[  201.039269]  ? pci_restore_standard_config+0x70/0x70
[  201.039275]  __rpm_callback+0x1f2/0x5d0
[  201.039279]  ? rpm_resume+0x560/0x18a0
[  201.039283]  ? pci_restore_standard_config+0x70/0x70
[  201.039287]  ? pci_restore_standard_config+0x70/0x70
[  201.039291]  ? pci_restore_standard_config+0x70/0x70
[  201.039296]  rpm_callback+0x175/0x210
[  201.039300]  ? pci_restore_standard_config+0x70/0x70
[  201.039305]  rpm_resume+0xcc3/0x18a0
[  201.039312]  ? rpm_callback+0x210/0x210
[  201.039317]  ? __pm_runtime_resume+0x9e/0x100
[  201.039322]  ? kasan_check_write+0x14/0x20
[  201.039326]  ? do_raw_spin_lock+0xc2/0x1c0
[  201.039333]  __pm_runtime_resume+0xac/0x100
[  201.039374]  nouveau_display_hpd_work+0x67/0x1f0 [nouveau]
[  201.039380]  process_one_work+0x7a0/0x14d0
[  201.039388]  ? cancel_delayed_work_sync+0x20/0x20
[  201.039392]  ? lock_acquire+0x113/0x310
[  201.039398]  ? kasan_check_write+0x14/0x20
[  201.039402]  ? do_raw_spin_lock+0xc2/0x1c0
[  201.039409]  worker_thread+0x86/0xb50
[  201.039418]  kthread+0x2e9/0x3a0
[  201.039422]  ? process_one_work+0x14d0/0x14d0
[  201.039426]  ? kthread_create_worker_on_cpu+0xc0/0xc0
[  201.039431]  ret_from_fork+0x3a/0x50
[  201.039441]
[  201.039444] Allocated by task 79:
[  201.039449]  save_stack+0x43/0xd0
[  201.039452]  kasan_kmalloc+0xc4/0xe0
[  201.039456]  kmem_cache_alloc_trace+0x10a/0x260
[  201.039494]  nv50_mstm_add_connector+0x9a/0x340 [nouveau]
[  201.039504]  drm_dp_add_port+0xff5/0x1fc0 [drm_kms_helper]
[  201.039511]  drm_dp_send_link_address+0x4a7/0x740 [drm_kms_helper]
[  201.039518]  drm_dp_check_and_send_link_address+0x1a7/0x210 [drm_kms_helper]
[  201.039525]  drm_dp_mst_link_probe_work+0x71/0xb0 [drm_kms_helper]
[  201.039529]  process_one_work+0x7a0/0x14d0
[  201.039533]  worker_thread+0x86/0xb50
[  201.039537]  kthread+0x2e9/0x3a0
[  201.039541]  ret_from_fork+0x3a/0x50
[  201.039543]
[  201.039546] Freed by task 0:
[  201.039549] (stack is not available)
[  201.039551]
[  201.039555] The buggy address belongs to the object at ffff88076738c1a8
                                 which belongs to the cache kmalloc-2048 of size 2048
[  201.039559] The buggy address is located 1192 bytes inside of
                                 2048-byte region [ffff88076738c1a8, ffff88076738c9a8)
[  201.039563] The buggy address belongs to the page:
[  201.039567] page:ffffea001d9ce200 count:1 mapcount:0 mapping:ffff88084000d0c0 index:0x0 compound_mapcount: 0
[  201.039573] flags: 0x8000000000008100(slab|head)
[  201.039578] raw: 8000000000008100 ffffea001da3be08 ffffea001da25a08 ffff88084000d0c0
[  201.039582] raw: 0000000000000000 00000000000d000d 00000001ffffffff 0000000000000000
[  201.039585] page dumped because: kasan: bad access detected
[  201.039588]
[  201.039591] Memory state around the buggy address:
[  201.039594]  ffff88076738c500: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[  201.039598]  ffff88076738c580: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[  201.039601] >ffff88076738c600: 00 00 00 00 00 00 00 00 00 00 fc fc fc fc fc fc
[  201.039604]                                                  ^
[  201.039607]  ffff88076738c680: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
[  201.039611]  ffff88076738c700: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
[  201.039613] ==================================================================
Signed-off-by: Lyude Paul <lyude@redhat.com>
Cc: stable@vger.kernel.org
Cc: Karol Herbst <karolherbst@gmail.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

37afe55b

drm/nouveau: Use drm_connector_list_iter_* for iterating connectors · 22b76bbe

Lyude Paul authored Jul 13, 2018

Every codepath in nouveau that loops through the connector list
currently does so using the old method, which is prone to race
conditions from MST connectors being created and destroyed. This has
been causing a multitude of problems, including memory corruption from
trying to access connectors that have already been freed!
Signed-off-by: Lyude Paul <lyude@redhat.com>
Cc: stable@vger.kernel.org
Cc: Karol Herbst <karolherbst@gmail.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

22b76bbe

drm/nouveau/gem: off by one bugs in nouveau_gem_pushbuf_reloc_apply() · 7f073d01

Dan Carpenter authored Jul 03, 2018

The bo array has req->nr_buffers elements so the > should be >= so we
don't read beyond the end of the array.

Fixes: a1606a95 ("drm/nouveau: new gem pushbuf interface, bump to 0.0.16")
Signed-off-by: Dan Carpenter <dan.carpenter@oracle.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

7f073d01

drm/nouveau/kms/nv50-: ensure window updates are submitted when flushing mst disables · df0c97e2

Ben Skeggs authored Jul 03, 2018

It was possible for this to be skipped when shutting down MST streams, and
leaving the core channel interlocked with a wndw channel update that never
happens - leading to a hung display.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Tested-By: Lyude Paul <lyude@redhat.com>

df0c97e2

19 Jun, 2018 1 commit

drm/nouveau/kms/nv50-: cursors always use core channel vram ctxdma · 1264f832

Ben Skeggs authored Jun 18, 2018

Ctxdmas for cursors from all heads are setup in the core channel, and due
to us tracking allocated handles per-window, we were failing with -EEXIST
on multiple-head setups trying to allocate duplicate handles.

The cursor code is hardcoded to use the core channel vram ctxdma already,
so just skip ctxdma allocation for cursor fbs to fix the issue.

Fixes: 5bca1621 ("drm/nouveau/kms/nv50-: move fb ctxdma tracking into windows")
Reported-by: Adam Borowski <kilobyte@angband.pl>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

1264f832

18 May, 2018 31 commits

drm/nouveau/gr/gf100-: insert some WFIs during gr init · 6c46d01f

Ben Skeggs authored May 16, 2018

Inserted wait-for-gr-idle in the places it seems that RM does it, seems
to prevent some random mmio timeouts on Quadro GV100.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

6c46d01f

drm/nouveau/clk: Use list_for_each_entry_from_reverse · dd3b89be

Arushi Singhal authored May 08, 2018

It's better to use "list_for_each_entry_from_reverse" for iterating list
than "for loop" as it makes the code more clear to read.
This patch replace "for loop" with "list_for_each_entry_from_reverse"
and "start" variable with "cstate" which helps in refactoring
the code and also "cstate" variable is more commonly used in the other
functions.

changes in v2:
"start" variable is removed, before "cstate" variable was removed
but "cstate" is more common so preferred "cstate" over "start".
Signed-off-by: Arushi Singhal <arushisinghal19971997@gmail.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

dd3b89be

drm/nouveau: fix temp/pwm visibility, skip hwmon when no sensors exist · 7a22c737

Ilia Mirkin authored Apr 22, 2018

A NV34 GPU was seeing temp and pwm entries in hwmon, which would error
out when read. These should not have been visible, but also the whole
hwmon object should just not have been registered in the first place.
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

7a22c737

drm/nouveau: fix nouveau_dsm_get_client_id()'s return type · f43cda5c

Luc Van Oostenryck authored Apr 24, 2018

The method struct vga_switcheroo_handler::get_client_id() is defined
as returning an 'enum vga_switcheroo_client_id' but the implementation
in this driver, nouveau_dsm_get_client_id(), returns an 'int'.

Fix this by returning 'enum vga_switcheroo_client_id' in this driver too.
Signed-off-by: Luc Van Oostenryck <luc.vanoostenryck@gmail.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

f43cda5c

drm/nouveau: fix mode_valid's return type · 54b202f1

Luc Van Oostenryck authored Apr 24, 2018

The method struct drm_connector_helper_funcs::mode_valid is defined
as returning an 'enum drm_mode_status' but the driver implementation
for this method uses an 'int' for it.

Fix this by using 'enum drm_mode_status' in the driver too.
Signed-off-by: Luc Van Oostenryck <luc.vanoostenryck@gmail.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

54b202f1

drm/nouveau/gr/gv100: initial support · d521097f
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
d521097f
drm/nouveau/ce/gv100: initial support · 6e1f34e3
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
6e1f34e3
drm/nouveau/fifo/gv100: initial support · 37e1c45a
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
37e1c45a
drm/nouveau/kms/gv100: initial support · facaed62
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
facaed62
drm/nouveau/disp/gv100: initial support · 290ffeaf
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
290ffeaf
drm/nouveau/dma/gv100: initial support · 6fb566b9
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
6fb566b9
drm/nouveau/therm/gv100: initial support · 24a7513c
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
24a7513c
drm/nouveau/pmu/gv100: initial support · ada0c562
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GP102.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
ada0c562
drm/nouveau/fault/gv100: initial support · 8b811951
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
8b811951
drm/nouveau/bar/gv100: initial support · 013b7b37
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GM107.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
013b7b37

drm/nouveau/mmu/gv100: initial support · edf50395

Ben Skeggs authored May 08, 2018

VEID support hacked in here, as it's the most convenient place for now.

Will be refined once it's better understood.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

edf50395

drm/nouveau/ltc/gv100: initial support · 1bce5725
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GP102.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
1bce5725
drm/nouveau/fb/gv100: initial support · 3582942c
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
3582942c

drm/nouveau/imem/gv100: initial support · a4a0cfb6

Ben Skeggs authored May 08, 2018

Can't imagine this will be any different.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

a4a0cfb6

drm/nouveau/tmr/gv100: initial support · 936240c9
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GK20A.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
936240c9
drm/nouveau/bus/gv100: initial support · 9506bd24
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GF100.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
9506bd24
drm/nouveau/mc/gv100: initial support · 41af75bd
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GP100.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
41af75bd
drm/nouveau/fuse/gv100: initial support · 29255049
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GM107.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
29255049
drm/nouveau/i2c/gv100: initial support · d2e3b57d
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GM200.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
d2e3b57d
drm/nouveau/gpio/gv100: initial support · 8afbcca5
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GK104.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
8afbcca5
drm/nouveau/ibus/gv100: initial support · 46fe1a81
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GM200.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
46fe1a81
drm/nouveau/top/gv100: initial support · a1c771a5
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
a1c771a5
drm/nouveau/devinit/gv100: initial support · 8769dc98
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
8769dc98
drm/nouveau/bios/pll: limits table 5.0 · 6827c9a8
Ben Skeggs authored May 08, 2018
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
6827c9a8
drm/nouveau/bios/gv100: initial support · 75e482ef
Ben Skeggs authored May 08, 2018
```
No real surprises here so far.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
75e482ef
drm/nouveau/pci/gv100: initial support · 893855d8
Ben Skeggs authored May 08, 2018
```
Appears to be compatible with GP100.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
893855d8