• Julius Goryavsky's avatar
    MDEV-34234: SST hangs when running on unprivileged containers on RHEL9 · 642195d2
    Julius Goryavsky authored
    The lsof utility is prone to blocking on system calls that
    it uses to obtain information about sockets (or files, devices,
    etc.). This behavior is described in its own documentation.
    It has a '-b' option (in combination with warnings suppression
    via '-w') that reduces the probability of blocking, introducing
    new problems (luckily probably not relevant for our use case).
    However, there is no guarantee that it will not hang on some
    distributions, with some TCP/IP stack implementations, or with
    some filesystems, etc. Also, of the three utilities that are
    suitable for our purposes, lsof is the slowest. So if there
    are other utilities that we use during SST, such as 'ss' or
    'sockstat', it is reasonable to use them instead of lsof.
    This commit changes the prioritization of utilities, it does
    not need additional tests (besides the numerous SST tests
    already available in the galera suites). If the system still
    need to use lsof, this commit adds the '-b' and '-w' options
    to it command line - to reduce the likelihood of blocking.
    642195d2
wsrep_sst_common.sh 63.6 KB