• Guenter Roeck's avatar
    ipv4: Don't drop redirected route cache entry unless PTMU actually expired · 5e016cbf
    Guenter Roeck authored
    TCP sessions over IPv4 can get stuck if routers between endpoints
    do not fragment packets but implement PMTU instead, and we are using
    those routers because of an ICMP redirect.
    
    Setup is as follows
    
           MTU1    MTU2   MTU1
        A--------B------C------D
    
    with MTU1 > MTU2. A and D are endpoints, B and C are routers. B and C
    implement PMTU and drop packets larger than MTU2 (for example because
    DF is set on all packets). TCP sessions are initiated between A and D.
    There is packet loss between A and D, causing frequent TCP
    retransmits.
    
    After the number of retransmits on a TCP session reaches tcp_retries1,
    tcp calls dst_negative_advice() prior to each retransmit. This results
    in route cache entries for the peer to be deleted in
    ipv4_negative_advice() if the Path MTU is set.
    
    If the outstanding data on an affected TCP session is larger than
    MTU2, packets sent from the endpoints will be dropped by B or C, and
    ICMP NEEDFRAG will be returned. A and D receive NEEDFRAG messages and
    update PMTU.
    
    Before the next retransmit, tcp will again call dst_negative_advice(),
    causing the route cache entry (with correct PMTU) to be deleted. The
    retransmitted packet will be larger than MTU2, causing it to be
    dropped again.
    
    This sequence repeats until the TCP session aborts or is terminated.
    
    Problem is fixed by removing redirected route cache entries in
    ipv4_negative_advice() only if the PMTU is expired.
    Signed-off-by: default avatarGuenter Roeck <guenter.roeck@ericsson.com>
    Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
    5e016cbf
route.c 84.2 KB