1. 30 Nov, 2023 25 commits
  2. 29 Nov, 2023 14 commits
  3. 28 Nov, 2023 1 commit
    • Paolo Abeni's avatar
      Merge branch 'net-page_pool-add-netlink-based-introspection' · a3799729
      Paolo Abeni authored
      Jakub Kicinski says:
      
      ====================
      net: page_pool: add netlink-based introspection
      
      We recently started to deploy newer kernels / drivers at Meta,
      making significant use of page pools for the first time.
      We immediately run into page pool leaks both real and false positive
      warnings. As Eric pointed out/predicted there's no guarantee that
      applications will read / close their sockets so a page pool page
      may be stuck in a socket (but not leaked) forever. This happens
      a lot in our fleet. Most of these are obviously due to application
      bugs but we should not be printing kernel warnings due to minor
      application resource leaks.
      
      Conversely the page pool memory may get leaked at runtime, and
      we have no way to detect / track that, unless someone reconfigures
      the NIC and destroys the page pools which leaked the pages.
      
      The solution presented here is to expose the memory use of page
      pools via netlink. This allows for continuous monitoring of memory
      used by page pools, regardless if they were destroyed or not.
      Sample in patch 15 can print the memory use and recycling
      efficiency:
      
      $ ./page-pool
          eth0[2]	page pools: 10 (zombies: 0)
      		refs: 41984 bytes: 171966464 (refs: 0 bytes: 0)
      		recycling: 90.3% (alloc: 656:397681 recycle: 89652:270201)
      
      v4:
       - use dev_net(netdev)->loopback_dev
       - extend inflight doc
      v3: https://lore.kernel.org/all/20231122034420.1158898-1-kuba@kernel.org/
       - ID is still here, can't decide if it matters
       - rename destroyed -> detach-time, good enough?
       - fix build for netsec
      v2: https://lore.kernel.org/r/20231121000048.789613-1-kuba@kernel.org
       - hopefully fix build with PAGE_POOL=n
      v1: https://lore.kernel.org/all/20231024160220.3973311-1-kuba@kernel.org/
       - The main change compared to the RFC is that the API now exposes
         outstanding references and byte counts even for "live" page pools.
         The warning is no longer printed if page pool is accessible via netlink.
      RFC: https://lore.kernel.org/all/20230816234303.3786178-1-kuba@kernel.org/
      ====================
      
      Link: https://lore.kernel.org/r/20231126230740.2148636-1-kuba@kernel.orgSigned-off-by: default avatarPaolo Abeni <pabeni@redhat.com>
      a3799729