1. 31 Oct, 2013 3 commits
    • Erik Faye-Lund's avatar
      gpu: host1x: check relocs after all gathers are consumed · a9ff9995
      Erik Faye-Lund authored
      The num_relocs count are passed to the kernel per job, not per gather.
      
      For multi-gather jobs, we would previously fail if there were relocs in
      other gathers aside from the first one.
      
      Fix this by simply moving the check until all gathers have been
      consumed.
      Signed-off-by: default avatarErik Faye-Lund <kusmabite@gmail.com>
      Reviewed-by: default avatarArto Merilainen <amerilainen@nvidia.com>
      Acked-By: default avatarTerje Bergstrom <tbergstrom@nvidia.com>
      Signed-off-by: default avatarThierry Reding <treding@nvidia.com>
      a9ff9995
    • Thierry Reding's avatar
      drm: Fix typo in debug message · f28c38ae
      Thierry Reding authored
      Fix a typo (iotcl -> ioctl) in the debug message when an unknown IOCTL
      is encountered.
      Acked-by: default avatarDavid Airlie <airlied@linux.ie>
      Signed-off-by: default avatarThierry Reding <treding@nvidia.com>
      f28c38ae
    • Thierry Reding's avatar
      drm: Track the proper DPMS mode of connectors · a6ad6230
      Thierry Reding authored
      When userspace removes the active framebuffer using DRM_IOCTL_MODE_RMFB,
      or explicitly disables the CRTC (by calling drmModeSetCrtc(..., NULL)
      for example), a NULL framebuffer will be passed to the .set_config()
      implementation of a CRTC. The drm_crtc_helper_set_config() helper will
      decide to disable a CRTC when that happens.
      
      To do so, it calls drm_crtc_helper_disable(), which in turn will iterate
      over all encoders and decouple them from their connectors and finally
      call drm_helper_disable_unused_functions() to clean up and call the
      .disable() or .dpms() implementation for each encoder. However, at no
      point during this sequence does it track the DPMS mode of a connector,
      so it will usually remain on after this.
      
      When a connector is enabled again, drm_helper_connector_dpms() will not
      notice that the DPMS mode actually changed and won't do anything, which
      causes the connector to stay disabled indefinitely.
      
      To prevent this from happening, explicitly set the connector's DPMS mode
      to off when the CRTC is disabled. That way it reflects the correct state
      and can be enabled again.
      
      This solves an issue observed when terminating an X server running on
      the xf86-video-modesetting driver. Without this patch, the connector
      would not be enabled properly and the screen would stay dark.
      Acked-by: default avatarDavid Airlie <airlied@linux.ie>
      Signed-off-by: default avatarThierry Reding <treding@nvidia.com>
      a6ad6230
  2. 15 Oct, 2013 2 commits
    • Dave Airlie's avatar
      drm/i915: abstract the conversion of device->minor out to a macro · 14c8d110
      Dave Airlie authored
      This will make the next patch to change how this works a lot cleaner.
      Signed-off-by: default avatarDave Airlie <airlied@redhat.com>
      14c8d110
    • Dave Airlie's avatar
      Merge branch 'drm-intel-next' of git://people.freedesktop.org/~danvet/drm-intel into drm-next · 5259c522
      Dave Airlie authored
      New feature pile for 3.12! Highlights:
      - Stereo/3d support for hdmi from Damien, both the drm core bits and
        the i915 integration.
      - Manual boost/deboost logic for gpu turbo (Chris)
      - Fixed up clock readout support for vlv (Chris).
      - Tons of little fixes and improvements for vlv in general (Chon Minng
        Lee and Jesse Barnes).
      - Power well support for the legacy vga plane (Ville).
      - DP impromevents from Jani.
      - Improvements to the Haswell modeset sequence (Ville+Paulo).
      - Haswell DDI improvements, using the VBT for some tuning values and
        to check the configuration (Paulo).
      - Tons of other small improvements and fixups.
      
      * 'drm-intel-next' of git://people.freedesktop.org/~danvet/drm-intel: (92 commits)
        drm/i915: Use adjusted_mode in the fastboot hack to disable pfit
        drm/i915: Add a more detailed comment about the set_base() fastboot hack
        drm/i915/vlv: Turn off power gate for BIOS-less system.
        drm/i915/vlv: reset DPIO on load and resume v2
        drm/i915: Simplify PSR debugfs
        drm/i915: Tweak RPS thresholds to more aggressively downclock
        drm/i915: Boost RPS frequency for CPU stalls
        drm/i915: Fix __wait_seqno to use true infinite timeouts
        drm/i915: Add some missing steps to i915_driver_load error path
        drm/i915: Clean up the ring scaling calculations
        drm/i915: Don't populate pipe_src_{w,h} multiple times
        drm/i915: implement the Haswell mode set sequence workaround
        drm/i915: Disable/enable planes as the first/last thing during modeset on HSW
        i915/vlv: untangle integrated clock source handling v4
        drm/i915: fix typo s/PatherPoint/PantherPoint/
        drm/i915: Make intel_resume_power_well() static
        drm/i915: destroy connector sysfs files earlier
        drm/i915/dp: do not write DP_TRAINING_PATTERN_SET all the time
        drm/i915/dp: retry i2c-over-aux seven times on AUX DEFER
        drm/i915/vlv: reduce GT FIFO error info to a debug message
        ...
      5259c522
  3. 10 Oct, 2013 1 commit
  4. 09 Oct, 2013 22 commits
  5. 04 Oct, 2013 4 commits
  6. 03 Oct, 2013 8 commits
    • Rodrigo Vivi's avatar
      drm/i915: Simplify PSR debugfs · a031d709
      Rodrigo Vivi authored
      for igt test case.
      
      v2: remove trailing spaces and fix conflicts
      Signed-off-by: default avatarRodrigo Vivi <rodrigo.vivi@gmail.com>
      [danvet:
      - make it comipile
      - s/IS_HASWELL/HAS_PSR/]
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      a031d709
    • Chris Wilson's avatar
      drm/i915: Tweak RPS thresholds to more aggressively downclock · dd75fdc8
      Chris Wilson authored
      After applying wait-boost we often find ourselves stuck at higher clocks
      than required. The current threshold value requires the GPU to be
      continuously and completely idle for 313ms before it is dropped by one
      bin. Conversely, we require the GPU to be busy for an average of 90% over
      a 84ms period before we upclock. So the current thresholds almost never
      downclock the GPU, and respond very slowly to sudden demands for more
      power. It is easy to observe that we currently lock into the wrong bin
      and both underperform in benchmarks and consume more power than optimal
      (just by repeating the task and measuring the different results).
      
      An alternative approach, as discussed in the bspec, is to use a
      continuous threshold for upclocking, and an average value for downclocking.
      This is good for quickly detecting and reacting to state changes within a
      frame, however it fails with the common throttling method of waiting
      upon the outstanding frame - at least it is difficult to choose a
      threshold that works well at 15,000fps and at 60fps. So continue to use
      average busy/idle loads to determine frequency change.
      
      v2: Use 3 power zones to keep frequencies low in steady-state mostly
      idle (e.g. scrolling, interactive 2D drawing), and frequencies high
      for demanding games. In between those end-states, we use a
      fast-reclocking algorithm to converge more quickly on the desired bin.
      
      v3: Bug fixes - make sure we reset adj after switching power zones.
      
      v4: Tune - drop the continuous busy thresholds as it prevents us from
      choosing the right frequency for glxgears style swap benchmarks. Instead
      the goal is to be able to find the right clocks irrespective of the
      wait-boost.
      Signed-off-by: default avatarChris Wilson <chris@chris-wilson.co.uk>
      Cc: Kenneth Graunke <kenneth@whitecape.org>
      Cc: Stéphane Marchesin <stephane.marchesin@gmail.com>
      Cc: Owen Taylor <otaylor@redhat.com>
      Cc: "Meng, Mengmeng" <mengmeng.meng@intel.com>
      Cc: "Zhuang, Lena" <lena.zhuang@intel.com>
      Reviewed-by: default avatarJesse Barnes <jbarnes@virtuousgeek.org>
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      dd75fdc8
    • Chris Wilson's avatar
      drm/i915: Boost RPS frequency for CPU stalls · b29c19b6
      Chris Wilson authored
      If we encounter a situation where the CPU blocks waiting for results
      from the GPU, give the GPU a kick to boost its the frequency.
      
      This should work to reduce user interface stalls and to quickly promote
      mesa to high frequencies - but the cost is that our requested frequency
      stalls high (as we do not idle for long enough before rc6 to start
      reducing frequencies, nor are we aggressive at down clocking an
      underused GPU). However, this should be mitigated by rc6 itself powering
      off the GPU when idle, and that energy use is dependent upon the workload
      of the GPU in addition to its frequency (e.g. the math or sampler
      functions only consume power when used). Still, this is likely to
      adversely affect light workloads.
      
      In particular, this nearly eliminates the highly noticeable wake-up lag
      in animations from idle. For example, expose or workspace transitions.
      (However, given the situation where we fail to downclock, our requested
      frequency is almost always the maximum, except for Baytrail where we
      manually downclock upon idling. This often masks the latency of
      upclocking after being idle, so animations are typically smooth - at the
      cost of increased power consumption.)
      
      Stéphane raised the concern that this will punish good applications and
      reward bad applications - but due to the nature of how mesa performs its
      client throttling, I believe all mesa applications will be roughly
      equally affected. To address this concern, and to prevent applications
      like compositors from permanently boosting the RPS state, we ratelimit the
      frequency of the wait-boosts each client recieves.
      
      Unfortunately, this techinique is ineffective with Ironlake - which also
      has dynamic render power states and suffers just as dramatically. For
      Ironlake, the thermal/power headroom is shared with the CPU through
      Intelligent Power Sharing and the intel-ips module. This leaves us with
      no GPU boost frequencies available when coming out of idle, and due to
      hardware limitations we cannot change the arbitration between the CPU and
      GPU quickly enough to be effective.
      
      v2: Limit each client to receiving a single boost for each active period.
          Tested by QA to only marginally increase power, and to demonstrably
          increase throughput in games. No latency measurements yet.
      
      v3: Cater for front-buffer rendering with manual throttling.
      
      v4: Tidy up.
      
      v5: Sadly the compositor needs frequent boosts as it may never idle, but
      due to its picking mechanism (using ReadPixels) may require frequent
      waits. Those waits, along with the waits for the vrefresh swap, conspire
      to keep the GPU at low frequencies despite the interactive latency. To
      overcome this we ditch the one-boost-per-active-period and just ratelimit
      the number of wait-boosts each client can receive.
      Reported-and-tested-by: default avatarPaul Neumann <paul104x@yahoo.de>
      Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=68716Signed-off-by: default avatarChris Wilson <chris@chris-wilson.co.uk>
      Cc: Kenneth Graunke <kenneth@whitecape.org>
      Cc: Stéphane Marchesin <stephane.marchesin@gmail.com>
      Cc: Owen Taylor <otaylor@redhat.com>
      Cc: "Meng, Mengmeng" <mengmeng.meng@intel.com>
      Cc: "Zhuang, Lena" <lena.zhuang@intel.com>
      Reviewed-by: default avatarJesse Barnes <jbarnes@virtuousgeek.org>
      [danvet: No extern for function prototypes in headers.]
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      b29c19b6
    • Chris Wilson's avatar
      drm/i915: Fix __wait_seqno to use true infinite timeouts · 094f9a54
      Chris Wilson authored
      When we switched to always using a timeout in conjunction with
      wait_seqno, we lost the ability to detect missed interrupts. Since, we
      have had issues with interrupts on a number of generations, and they are
      required to be delivered in a timely fashion for a smooth UX, it is
      important that we do log errors found in the wild and prevent the
      display stalling for upwards of 1s every time the seqno interrupt is
      missed.
      
      Rather than continue to fix up the timeouts to work around the interface
      impedence in wait_event_*(), open code the combination of
      wait_event[_interruptible][_timeout], and use the exposed timer to
      poll for seqno should we detect a lost interrupt.
      
      v2: In order to satisfy the debug requirement of logging missed
      interrupts with the real world requirments of making machines work even
      if interrupts are hosed, we revert to polling after detecting a missed
      interrupt.
      
      v3: Throw in a debugfs interface to simulate broken hw not reporting
      interrupts.
      
      v4: s/EGAIN/EAGAIN/ (Imre)
      Signed-off-by: default avatarChris Wilson <chris@chris-wilson.co.uk>
      Reviewed-by: default avatarImre Deak <imre.deak@intel.com>
      [danvet: Don't use the struct typedef in new code.]
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      094f9a54
    • Chris Wilson's avatar
      drm/i915: Add some missing steps to i915_driver_load error path · cbb47d17
      Chris Wilson authored
      We missed adding a few cleanup steps for recent additions.
      
      Reviewer:  Ben Widawsky <ben@bwidawsk.net>
      Signed-off-by: default avatarChris Wilson <chris@chris-wilson.co.uk>
      Signed-off-by: default avatarRodrigo Vivi <rodrigo.vivi@gmail.com>
      Reviewed-by: default avatarBen Widawsky <ben@bwidawsk.net>
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      cbb47d17
    • Ben Widawsky's avatar
      drm/i915: Clean up the ring scaling calculations · f6aca45c
      Ben Widawsky authored
      This patch attempts to clean up the ring/IA scaling programming in the
      following ways.
      1. Fix the comment about the DDR frequency. The math is 266MHz, not
      133MHz. Formula was right, docs are wrong.
      
      2. Mask the DCLK register since I don't know how it is defined on future
      platforms.
      
      3. use mult_frac instead of magic math.
      
      This helps for future platform enabling.
      
      v2: Actually use the right patch. The v1 was a mix of things, none of
      which was right. Note that due to rounding, we actually get different
      values (slightly higher) for the effective ring frequency.
      
      v3: Use 1.25 instead of 1.33 as the original code did. (Jesse)
      
      CC: Jesse Barnes <jbarnes@virtuousgeek.org>
      CC: Chris Wilson <chris@chris-wilson.co.uk>
      Signed-off-by: default avatarBen Widawsky <ben@bwidawsk.net>
      Reviewed-by: default avatarJesse Barnes <jbarnes@virtuousgeek.org>
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      f6aca45c
    • Ville Syrjälä's avatar
      drm/i915: Don't populate pipe_src_{w,h} multiple times · e41a56be
      Ville Syrjälä authored
      If we ever end up doing the retry loop due to bandwidth constraints, we
      would rewrite pipe_src_{w,n} based on adjusted_mode timings. But by that
      time the encoder may have already replaced the adjusted_mode with a
      fixed panel mode, which would then corrupt pipe_src_{w,h}.
      
      v2: Use requested_mode and slap on a big comment from Daniel
      Signed-off-by: default avatarVille Syrjälä <ville.syrjala@linux.intel.com>
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      e41a56be
    • Paulo Zanoni's avatar
      drm/i915: implement the Haswell mode set sequence workaround · e4916946
      Paulo Zanoni authored
      This workaround is described in the mode set sequence documentation.
      When enabling planes for the second pipe, we need to wait for 2
      vblanks on the first pipe. This should solve "a flash of screen
      corruption if planes are enabled on second/third pipe during the time
      that big FIFO mode is exiting". Watermarks are fun :)
      
      v2: Save indentation levels
      Signed-off-by: default avatarPaulo Zanoni <paulo.r.zanoni@intel.com>
      Reviewed-by: default avatarVille Syrjälä <ville.syrjala@linux.intel.com>
      Signed-off-by: default avatarDaniel Vetter <daniel.vetter@ffwll.ch>
      e4916946