Commit 30a93d2b authored by Antoine Tenart's avatar Antoine Tenart Committed by David S. Miller

vxlan: do not modify the shared tunnel info when PMTU triggers an ICMP reply

When the interface is part of a bridge or an Open vSwitch port and a
packet exceed a PMTU estimate, an ICMP reply is sent to the sender. When
using the external mode (collect metadata) the source and destination
addresses are reversed, so that Open vSwitch can match the packet
against an existing (reverse) flow.

But inverting the source and destination addresses in the shared
ip_tunnel_info will make following packets of the flow to use a wrong
destination address (packets will be tunnelled to itself), if the flow
isn't updated. Which happens with Open vSwitch, until the flow times
out.

Fixes this by uncloning the skb's ip_tunnel_info before inverting its
source and destination addresses, so that the modification will only be
made for the PTMU packet, not the following ones.

Fixes: fc68c995 ("vxlan: Support for PMTU discovery on directly bridged links")
Tested-by: default avatarEelco Chaudron <echaudro@redhat.com>
Reviewed-by: default avatarEelco Chaudron <echaudro@redhat.com>
Signed-off-by: default avatarAntoine Tenart <atenart@kernel.org>
Signed-off-by: default avatarDavid S. Miller <davem@davemloft.net>
parent aa5a5b7a
...@@ -2725,12 +2725,17 @@ static void vxlan_xmit_one(struct sk_buff *skb, struct net_device *dev, ...@@ -2725,12 +2725,17 @@ static void vxlan_xmit_one(struct sk_buff *skb, struct net_device *dev,
goto tx_error; goto tx_error;
} else if (err) { } else if (err) {
if (info) { if (info) {
struct ip_tunnel_info *unclone;
struct in_addr src, dst; struct in_addr src, dst;
unclone = skb_tunnel_info_unclone(skb);
if (unlikely(!unclone))
goto tx_error;
src = remote_ip.sin.sin_addr; src = remote_ip.sin.sin_addr;
dst = local_ip.sin.sin_addr; dst = local_ip.sin.sin_addr;
info->key.u.ipv4.src = src.s_addr; unclone->key.u.ipv4.src = src.s_addr;
info->key.u.ipv4.dst = dst.s_addr; unclone->key.u.ipv4.dst = dst.s_addr;
} }
vxlan_encap_bypass(skb, vxlan, vxlan, vni, false); vxlan_encap_bypass(skb, vxlan, vxlan, vni, false);
dst_release(ndst); dst_release(ndst);
...@@ -2781,12 +2786,17 @@ static void vxlan_xmit_one(struct sk_buff *skb, struct net_device *dev, ...@@ -2781,12 +2786,17 @@ static void vxlan_xmit_one(struct sk_buff *skb, struct net_device *dev,
goto tx_error; goto tx_error;
} else if (err) { } else if (err) {
if (info) { if (info) {
struct ip_tunnel_info *unclone;
struct in6_addr src, dst; struct in6_addr src, dst;
unclone = skb_tunnel_info_unclone(skb);
if (unlikely(!unclone))
goto tx_error;
src = remote_ip.sin6.sin6_addr; src = remote_ip.sin6.sin6_addr;
dst = local_ip.sin6.sin6_addr; dst = local_ip.sin6.sin6_addr;
info->key.u.ipv6.src = src; unclone->key.u.ipv6.src = src;
info->key.u.ipv6.dst = dst; unclone->key.u.ipv6.dst = dst;
} }
vxlan_encap_bypass(skb, vxlan, vxlan, vni, false); vxlan_encap_bypass(skb, vxlan, vxlan, vni, false);
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment