Commits · 3fbe7ca847367d0f9c3861283767ae702c2a19ab · Kirill Smelkov / iproute2

30 Dec, 2015 2 commits
- iproute2: ip-route.8.in: Add expires option for ip route · 3fbe7ca8
  Hangbin Liu authored Dec 25, 2015
```
Signed-off-by: Hangbin Liu <liuhangbin@gmail.com>
```
  3fbe7ca8
- iproute2: ip-route.8.in: Add missing '[' before 'pref' · 966fe23a
  Hangbin Liu authored Dec 25, 2015
```
Signed-off-by: Hangbin Liu <liuhangbin@gmail.com>
```
  966fe23a
22 Dec, 2015 3 commits
- route: allow routes to be configured with expire values · 68eede25
  Hangbin Liu authored Dec 21, 2015
```
Signed-off-by: Hangbin Liu <liuhangbin@gmail.com>
```
  68eede25
- Merge branch 'master' into net-next · 5d3ec438
  Stephen Hemminger authored Dec 21, 2015
  
  5d3ec438
- iptunnel: Fix compile error in ip/tunnel.c · f8fc1d10
  Phil Sutter authored Dec 21, 2015
```
I repeatedly failed to get this right, so now I have to clean up my mess
afterwards.

Fixes: 7d6aadcd ("ip{,6}tunnel: have a shared stats parser/printer")
Signed-off-by: Phil Sutter <phil@nwl.cc>
```
  f8fc1d10
18 Dec, 2015 12 commits

ip{,6}tunnel: have a shared stats parser/printer · 7d6aadcd

Phil Sutter authored Dec 18, 2015

This has a slight side-effect of not aborting when /proc/net/dev is
malformed, but OTOH stats are not parsed for uninteresting interfaces.
Signed-off-by: Phil Sutter <phil@nwl.cc>

7d6aadcd

lwtunnel: implement support for ip6 encap · d95cdcf5

Paolo Abeni authored Dec 18, 2015

Currently ip6 encap support for lwtunnel is missing.
This patch implement it, mostly duplicating the ipv4 parts.

Also be sure to insert a space after the encap type, when
showing lwtunnel, to avoid the tunnel type and the following
argument being merged into a single word.
Signed-off-by: Paolo Abeni <pabeni@redhat.com>

d95cdcf5

gre: add support for collect metadata flag · 926b39e1

Paolo Abeni authored Dec 18, 2015

This patch add support for IFLA_GRE_COLLECT_METADATA via the
'external' keyword to the gre link.
Signed-off-by: Paolo Abeni <pabeni@redhat.com>

926b39e1

vxlan: add support for collect metadata flag · e79c327e

Paolo Abeni authored Dec 18, 2015

This patch add support for IFLA_VXLAN_COLLECT_METADATA via the
'external' keyword to the vxlan link.

Also enforce mutual exclusion between 'vni' and 'external'.
Signed-off-by: Paolo Abeni <pabeni@redhat.com>

e79c327e

iproute: print addrgenmode stable_secret and fallback otherwise · 5c5176ce
Hannes Frederic Sowa authored Dec 16, 2015
```
Signed-off-by: Hannes Frederic Sowa <hannes@stressinduktion.org>
```
5c5176ce

bpf: minor fix in api and bpf_dump_error() usage · fd7f9c7f

Daniel Borkmann authored Dec 14, 2015

Fix a whitespace in bpf_dump_error() usage, and also a missing closing
bracket in ntohl() macro for eBPF programs.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>

fd7f9c7f

include: update kernel headers · 741c20b0
Stephen Hemminger authored Dec 17, 2015
```
Current headers for net-next
```
741c20b0
Merge branch 'master' into net-next · 00a2a174
Stephen Hemminger authored Dec 17, 2015

00a2a174

lwtunnel: fix argument parsing · f0df4081

Paolo Abeni authored Dec 15, 2015

Currently parse_encap_ip() does not update correctly argv/argc;
if multiple lwtunnel arguments are provided, the parsing fails after
the first one, i.e.

 ip route add 172.16.101.0/24 dev vxlan1 encap ip id 42 dst 192.168.255.1

fails with:

 Error: either "to" is duplicate, or "dst" is a garbage.

This commit addresses the issue, stepping to next argument at each iteration
of the parsing loop.

Fixes: 1e529305 ("lwtunnel: Add encapsulation support to ip route")
Signed-off-by: Paolo Abeni <pabeni@redhat.com>

f0df4081

route: Fix printing of locked entries · ed6b8652

Phil Sutter authored Dec 12, 2015

Commit 0f754332 ("route: ignore RTAX_HOPLIMIT of value -1")
accidentally reordered fprintf statements. This patch restores the
original ordering.

Fixes: 0f754332 ("route: ignore RTAX_HOPLIMIT of value -1")
Signed-off-by: Phil Sutter <phil@nwl.cc>

ed6b8652

ip neigh: device is optional for proxy entries · e834eb8e

Konstantin Khlebnikov authored Dec 01, 2015

Though dumping such entries crashes present kernels.
Signed-off-by: Konstantin Khlebnikov <koct9i@gmail.com>

e834eb8e

ila: Add support for ILA lwtunnels · 5866bddd

Tom Herbert authored Nov 30, 2015

This patch:
 - Adds a utility function for parsing a 64 bit address
 - Adds a utility function for converting a 64 bit address to ASCII
 - Adds and ILA encap type in lwt tunnels
Signed-off-by: Tom Herbert <tom@herbertland.com>

5866bddd

10 Dec, 2015 7 commits

examples, bpf: further improve examples · 41d6e33f

Daniel Borkmann authored Dec 02, 2015

Improve example files further and add a more generic set of possible
helpers for them that can be used.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <ast@kernel.org>

41d6e33f

Merge branch 'master' into net-next · 6ad355ca
Stephen Hemminger authored Dec 10, 2015

6ad355ca
ip: fix format string when reading statistics · 654ae881
Stephen Hemminger authored Dec 10, 2015
```
The tunnel code was doing sscanf(buf, "%ld", &x) where x was unsigned
long.
```
654ae881

tc.8: Fix reference to tc-tcindex.8 · b08b5ff1

Phil Sutter authored Dec 10, 2015

Just a typo there, it's spelled correctly in SEE ALSO section..
Signed-off-by: Phil Sutter <phil@nwl.cc>

b08b5ff1

vrf: Add support for table names · 8a23f820

David Ahern authored Dec 08, 2015

Currently, the table id for VRF devices requires an integer. Convert
it to use rtnl_rttable_a2n which handles table names from the iproute2
directory.

This also fixes a bug in the original commit where table name are not
properly handled.

Fixes: 15faa0a3 ("add support for VRF device")
Signed-off-by: David Ahern <dsa@cumulusnetworks.com>

8a23f820

libnetlink: don't confuse variables in rtnl_talk() · ed108cfc

Nicolas Dichtel authored Dec 03, 2015

There is two variables named 'len' in rtnl_talk. In fact, commit
c079e121 didn't work. For example, it was possible to trigger
a seg fault with this command:
$ ip link set gre2 type ip6gre hoplimit 32

Let's rename the argument len to maxlen.

Fixes: c079e121 ("libnetlink: add size argument to rtnl_talk")
Reported-by: Thomas Faivre <thomas.faivre@6wind.com>
Signed-off-by: Nicolas Dichtel <nicolas.dichtel@6wind.com>

ed108cfc

route: ignore RTAX_HOPLIMIT of value -1 · 0f754332

Phil Sutter authored Dec 02, 2015

Older kernels use -1 internally as indicator to use the sysctl default,
but they still export the setting. Newer kernels use 0 to indicate that
(which is why the conversion from -1 to 0 was done here), but they also
stopped exporting the value. Since the meaning of -1 is clear, treat it
equally like default on newer kernels (which is to not print anything).
Signed-off-by: Phil Sutter <phil@nwl.cc>

0f754332

29 Nov, 2015 16 commits

iptunnel: cleanup code · a96a5d94
Stephen Hemminger authored Nov 29, 2015
```
Make iptunnel pass checkpatch (mostly).
```
a96a5d94

ip_tunnel: determine tunnel address family from the tunnel type · cc9c1dfa

Konstantin Shemyak authored Nov 26, 2015

On 24.11.2015 02:26, Stephen Hemminger wrote:
> On Thu, 12 Nov 2015 21:10:08 +0000
> Konstantin Shemyak <konstantin@shemyak.com> wrote:
>
>> When creating an IP tunnel over IPv6, the address family must be passed in
>> the option, e.g.
>>
>> ip -6 tunnel add mode ip6gre local 1::1 remote 2::2
>>
>> This makes it impossible to create both IPv4 and IPv6 tunnels in one batch.
>>
>> In fact the address family option is redundant here, as each tunnel mode is
>> relevant for only one address family.
>> The patch determines whether the applicable address family is AF_INET6
>> instead of the default AF_INET and makes the "-6" option unnecessary for
>> "ip tunnel add".
>>
>> Signed-off-by: Konstantin Shemyak <konstantin@shemyak.com>
>> ---
>>   ip/iptunnel.c                          | 26 ++++++++++++++++++++++++++
>>   testsuite/tests/ip/tunnel/add_tunnel.t | 14 ++++++++++++++
>>   2 files changed, 40 insertions(+)
>>   create mode 100755 testsuite/tests/ip/tunnel/add_tunnel.t
>>
>> diff --git a/ip/iptunnel.c b/ip/iptunnel.c
>> index 78fa988..7826a37 100644
>> --- a/ip/iptunnel.c
>> +++ b/ip/iptunnel.c
>> @@ -629,8 +629,34 @@ static int do_6rd(int argc, char **argv)
>>          return tnl_6rd_ioctl(cmd, medium, &ip6rd);
>>   }
>>
>> +static int tunnel_mode_is_ipv6(char *tunnel_mode) {
>> +       char *ipv6_modes[] = {
>> +               "ipv6/ipv6", "ip6ip6",
>> +               "vti6",
>> +               "ip/ipv6", "ipv4/ipv6", "ipip6", "ip4ip6",
>> +               "ip6gre", "gre/ipv6",
>> +               "any/ipv6", "any"
>> +       };
>> +       int i;
>> +
>> +       for (i = 0; i < sizeof(ipv6_modes) / sizeof(char *); i++) {
>> +               if (strcmp(ipv6_modes[i], tunnel_mode) == 0)
>> +                       return 1;
>> +       }
>> +       return 0;
>> +}
>> +
>
> The ipv6_modes table should be static const.

Thank you for the note! attached the corrected patch.

> Also is it possible to use strstr for ipv6 and ip6 or even strchr(tunnel_mode, '6')
> to simplify this?

There is IPv6 tunnel mode 'any', and IPv4 tunnel mode 'ipv6/ip' (aka
'sit'). It looks to me that attempts to find some substring match
would not make the code much shorter, but definitely less readable.

Konstantin Shemyak.

>From 42d27db0055c3a114fe6eb86d680bef9ec098ad4 Mon Sep 17 00:00:00 2001
From: Konstantin Shemyak <konstantin@shemyak.com>
Date: Thu, 12 Nov 2015 20:52:02 +0200
Subject: [PATCH] Tunnel address family is determined from the tunnel mode

When the tunnel mode already tells the IP address family, "ip tunnel"
command determines it and does not require option "-4"/"-6" to be passed.

This makes possible creating both IPv4 and IPv6 tunnels in one batch.
Signed-off-by: Konstantin Shemyak <konstantin@shemyak.com>

cc9c1dfa

{f,m}_bpf: add more example code · 0b7e3fc8

Daniel Borkmann authored Nov 26, 2015

I've added three examples to examples/bpf/ that demonstrate how one can
implement eBPF tail calls in tc with f.e. multiple levels of nesting.
That should act as a good starting point, but also as test cases for the
ELF loader and kernel. A real test suite for {f,m,e}_bpf is still to be
developed in future work.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <ast@kernel.org>

0b7e3fc8

{f,m}_bpf: allow updates on program arrays · 91d88eeb

Daniel Borkmann authored Nov 26, 2015

Since we have all infrastructure in place now, allow atomic live updates
on program arrays. This can be very useful e.g. in case programs that are
being tail-called need to be replaced, f.e. when classifier functionality
needs to be changed, new protocols added/removed during runtime, etc.

Thus, provide a way for in-place code updates, minimal example: Given is
an object file cls.o that contains the entry point in section 'classifier',
has a globally pinned program array 'jmp' with 2 slots and id of 0, and
two tail called programs under section '0/0' (prog array key 0) and '0/1'
(prog array key 1), the section encoding for the loader is <id/key>.
Adding the filter loads everything into cls_bpf:

tc filter add dev foo parent ffff: bpf da obj cls.o

Now, the program under section '0/1' needs to be replaced with an updated
version that resides in the same section (also full path to tc's subfolder
of the mount point can be passed, e.g. /sys/fs/bpf/tc/globals/jmp):

tc exec bpf graft m:globals/jmp obj cls.o sec 0/1

In case the program resides under a different section 'foo', it can also
be injected into the program array like:

tc exec bpf graft m:globals/jmp key 1 obj cls.o sec foo

If the new tail called classifier program is already available as a pinned
object somewhere (here: /sys/fs/bpf/tc/progs/parser), it can be injected
into the prog array like:

tc exec bpf graft m:globals/jmp key 1 fd m:progs/parser

In the kernel, the program on key 1 is being atomically replaced and the
old one's refcount dropped.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <ast@kernel.org>

91d88eeb

{f, m}_bpf: allow for user-defined object pinnings · f6793eec

Daniel Borkmann authored Nov 26, 2015

The recently introduced object pinning can be further extended in order
to allow sharing maps beyond tc namespace. F.e. maps that are being pinned
from tracing side, can be accessed through this facility as well.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <ast@kernel.org>

f6793eec

{f, m}_bpf: check map attributes when fetching as pinned · 9e607f2e

Daniel Borkmann authored Nov 26, 2015

Make use of the new show_fdinfo() facility and verify that when a
pinned map is being fetched that its basic attributes are the same
as the map we declared from the ELF file. I.e. when placed into the
globalns, collisions could occur. In such a case warn the user and
bail out.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <ast@kernel.org>

9e607f2e

{f,m}_bpf: make tail calls working · 910b543d

Daniel Borkmann authored Nov 26, 2015

Now that we have the possibility of sharing maps, it's time we get the
ELF loader fully working with regards to tail calls. Since program array
maps are pinned, we can keep them finally alive. I've noticed two bugs
that are being fixed in bpf_fill_prog_arrays() with this patch. Example
code comes as follow-up.
Signed-off-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <ast@kernel.org>

910b543d

Merge branch 'master' into net-next · fece33c1
Stephen Hemminger authored Nov 29, 2015

fece33c1

vxlan: Add support for remote checksum offload · 35f59d86

Tom Herbert authored Nov 27, 2015

This patch adds support to remote checksum checksum offload
to VXLAN. This patch adds remcsumtx and remcsumrx to ip vxlan
configuration to enable remote checksum offload for transmit
and receive on the VXLAN tunnel.

https://tools.ietf.org/html/draft-herbert-vxlan-rco-00

Example:

ip link add name vxlan0 type vxlan id 42 group 239.1.1.1 dev eth0 \
    udpcsum remcsumtx remcsumrx

Testing:

Ran single netperf over mlnx4 to illustrate the effest:

- Without RCO (UDP csum set to zero)
  4335.99 Mbps
- With RCO enabled
  7661.81 Mbps
Signed-off-by: Tom Herbert <tom@herbertland.com>

35f59d86

get rid of unnecessary fgets() buffer size limitation · 61170fd8

Phil Sutter authored Nov 28, 2015

fgets() will read at most size-1 bytes into the buffer and add a
terminating null-char at the end. Therefore it is not necessary to pass
a reduced buffer size when calling it.

This change was generated using the following semantic patch:

@@
identifier buf, fp;
@@
- fgets(buf, sizeof(buf) - 1, fp)
+ fgets(buf, sizeof(buf), fp)
Signed-off-by: Phil Sutter <phil@nwl.cc>

61170fd8

get rid of remaining -Wunused-result warnings · d572ed4d

Phil Sutter authored Nov 28, 2015

Although not fundamentally necessary to check return codes in these
spots, preventing the warnings will put new ones into focus.
Signed-off-by: Phil Sutter <phil@nwl.cc>

d572ed4d

ss: review is_ephemeral() · c29d3792

Phil Sutter authored Nov 28, 2015

No need to keep static port boundaries global, they are not used
directly. Keeping them local also allows to safely reduce their names to
the minimum. Assign hardcoded fallback values also if fscanf() fails.
Get rid of unnecessary braces around return parameter.

Instead of more or less duplicating is_ephemeral() in run_ssfilter(),
simply call the function instead.
Signed-off-by: Phil Sutter <phil@nwl.cc>

c29d3792

ss: reduce max indentation level in init_service_resolver() · 596307ea

Phil Sutter authored Nov 28, 2015

Exit early or continue on error instead of putting conditional into
conditional to make reading the code a bit easier.

Also, the call to memcpy() can be skipped by initialising prog with the
desired prefix.
Signed-off-by: Phil Sutter <phil@nwl.cc>

596307ea

lnstat: review lnstat_update() · db3ef44c

Phil Sutter authored Nov 28, 2015

Instead of calling rewind() and fgets() before every call to
scan_lines(), move them into scan_lines() itself.

This should also fix compat mode, as before the second call to
scan_lines() the first line was skipped unconditionally.
Signed-off-by: Phil Sutter <phil@nwl.cc>

db3ef44c

bridge.8: minor formatting cleanup · fc31817d

Phil Sutter authored Nov 24, 2015

- Replace commas at end of subsection with dots.
- Replace double whitespace by single one.
Signed-off-by: Phil Sutter <phil@nwl.cc>

fc31817d

iproute: restrict hoplimit values to be in range [0; 255] · ea6cbab7

Phil Sutter authored Nov 24, 2015

Technically, the range of possible hoplimit values are defined by IPv4
and IPv6 header formats. Both define the field to be eight bits in size,
which leads to a value range of [0;255]. Setting a packet's hoplimit
field to 0 though makes not much sense, as the next hop would
immediately drop the packet. Therefore Linux uses 0 as a special value
indicating to use the system's default hoplimit (configurable via
sysctl). In iproute, setting the hoplimit of a route to 0 is equivalent
to omitting the hoplimit parameter alltogether, so it is actually not
necessary to allow that value to be specified, but keep it anyway for
backwards compatibility.
Signed-off-by: Phil Sutter <phil@nwl.cc>

ea6cbab7