• Daisuke Nishimura's avatar
    memcg: remove memcg_tasklist · 9ab322ca
    Daisuke Nishimura authored
    memcg_tasklist was introduced at commit 7f4d454d(memcg: avoid deadlock
    caused by race between oom and cpuset_attach) instead of cgroup_mutex to
    fix a deadlock problem.  The cgroup_mutex, which was removed by the
    commit, in mem_cgroup_out_of_memory() was originally introduced at commit
    c7ba5c9e (Memory controller: OOM handling).
    
    IIUC, the intention of this cgroup_mutex was to prevent task move during
    select_bad_process() so that situations like below can be avoided.
    
      Assume cgroup "foo" has exceeded its limit and is about to trigger oom.
      1. Process A, which has been in cgroup "baa" and uses large memory, is just
         moved to cgroup "foo". Process A can be the candidates for being killed.
      2. Process B, which has been in cgroup "foo" and uses large memory, is just
         moved from cgroup "foo". Process B can be excluded from the candidates for
         being killed.
    
    But these race window exists anyway even if we hold a lock, because
    __mem_cgroup_try_charge() decides wether it should trigger oom or not
    outside of the lock.  So the original cgroup_mutex in
    mem_cgroup_out_of_memory and thus current memcg_tasklist has no use.  And
    IMHO, those races are not so critical for users.
    
    This patch removes it and make codes simpler.
    Signed-off-by: default avatarDaisuke Nishimura <nishimura@mxp.nes.nec.co.jp>
    Cc: Balbir Singh <balbir@linux.vnet.ibm.com>
    Acked-by: default avatarKAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
    Signed-off-by: default avatarAndrew Morton <akpm@linux-foundation.org>
    Signed-off-by: default avatarLinus Torvalds <torvalds@linux-foundation.org>
    9ab322ca
memcontrol.c 85.9 KB