• Wu Fengguang's avatar
    writeback: balance_dirty_pages(): reduce calls to global_page_state · e50e3720
    Wu Fengguang authored
    Reducing the number of times balance_dirty_pages calls global_page_state
    reduces the cache references and so improves write performance on a
    variety of workloads.
    
    'perf stats' of simple fio write tests shows the reduction in cache
    access.  Where the test is fio 'write,mmap,600Mb,pre_read' on AMD AthlonX2
    with 3Gb memory (dirty_threshold approx 600 Mb) running each test 10
    times, dropping the fasted & slowest values then taking the average &
    standard deviation
    
    		average (s.d.) in millions (10^6)
    2.6.31-rc8	648.6 (14.6)
    +patch		620.1 (16.5)
    
    Achieving this reduction is by dropping clip_bdi_dirty_limit as it rereads
    the counters to apply the dirty_threshold and moving this check up into
    balance_dirty_pages where it has already read the counters.
    
    Also by rearrange the for loop to only contain one copy of the limit tests
    allows the pdflush test after the loop to use the local copies of the
    counters rather than rereading them.
    
    In the common case with no throttling it now calls global_page_state 5
    fewer times and bdi_stat 2 fewer.
    
    Fengguang:
    
    This patch slightly changes behavior by replacing clip_bdi_dirty_limit()
    with the explicit check (nr_reclaimable + nr_writeback >= dirty_thresh) to
    avoid exceeding the dirty limit.  Since the bdi dirty limit is mostly
    accurate we don't need to do routinely clip.  A simple dirty limit check
    would be enough.
    
    The check is necessary because, in principle we should throttle everything
    calling balance_dirty_pages() when we're over the total limit, as said by
    Peter.
    
    We now set and clear dirty_exceeded not only based on bdi dirty limits,
    but also on the global dirty limit.  The global limit check is added in
    place of clip_bdi_dirty_limit() for safety and not intended as a behavior
    change.  The bdi limits should be tight enough to keep all dirty pages
    under the global limit at most time; occasional small exceeding should be
    OK though.  The change makes the logic more obvious: the global limit is
    the ultimate goal and shall be always imposed.
    
    We may now start background writeback work based on outdated conditions.
    That's safe because the bdi flush thread will (and have to) double check
    the states.  It reduces overall overheads because the test based on old
    states still have good chance to be right.
    
    [akpm@linux-foundation.org] fix uninitialized dirty_exceeded
    Signed-off-by: default avatarRichard Kennedy <richard@rsk.demon.co.uk>
    Signed-off-by: default avatarWu Fengguang <fengguang.wu@intel.com>
    Cc: Jan Kara <jack@suse.cz>
    Acked-by: default avatarPeter Zijlstra <a.p.zijlstra@chello.nl>
    Cc: Christoph Hellwig <hch@infradead.org>
    Cc: Dave Chinner <david@fromorbit.com>
    Cc: Jens Axboe <axboe@kernel.dk>
    Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
    Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
    e50e3720
page-writeback.c 37.6 KB