encap-forward

Apr 27, 2022

daefd11 · Apr 27, 2022

Name	Name	Last commit message	Last commit date
parent directory ..
Makefile	Makefile	encap-forward: Support conditional IPv6 encapsulation with IPV6=1	Oct 7, 2020
README.org	README.org	encap-forward: Add README describing the issue	Oct 7, 2020
encap.h	encap.h	encap-forward: Move setting of the ethertype to the encap_* functions	Nov 25, 2020
setup-test.sh	setup-test.sh	encap-forward: Run ping and tcpdump as part of test setup	Oct 6, 2020
tc_bpf_encap.c	tc_bpf_encap.c	encap-forward: Don't redefine AF_INET	Apr 27, 2022
xdp_encap.c	xdp_encap.c	encap-forward: Move setting of the ethertype to the encap_* functions	Nov 25, 2020

README.org

BPF encapsulation and forward example

This example demonstrates a particular oddness around encapsulation and forwarding of packets in BPF:

If a BPF-based ingress filter (TC or XDP) wants to encapsulate and forward a packet but reuse the kernel FIB for routing, it’ll need to pass the packet up the stack to do neighbour discovery if there is no cached neighbour in the routing table.

In this case, the kernel will treat IPv4 and IPv6 differently: For IPv6 (as the outer encapsulation), things will just work, but for IPv4, the rp_filter and accept_local sysctls will affect how the packet is treated. If the encapsulation is done in XDP, the packet is likely to be discarded by rp_filter on the ingress interface. The BPF ingress filter seems to run after the rp_filter check, but here accept_local will discard the packet (assuming the source address of the encapsulated packet matches a local address of the host, which is the underlying assumption).

Seeing this in action

The examples in this directory contain XDP and TC BPF implementations of a naive encapsulation scheme with hard-coded IP addresses. By default, the example is compiled with IPv4 encapsulation, which will cause the encapsulated packets to be dropped as described above. Compile with make IPV6=1 to instead use IPv6 encapsulation.

Run the setup-test.sh script to run a simple test in a network namespace. It’ll set up two virtual interfaces into the namespace, and install the encapsulation program on one of them. Then a ping from the host will be run on one of those veth interfaces, and tcpdump will be started on the other; if forwarding is successful, the encapsulated ping packets should show up on the second interface (which it does with IPv6 encapsulation, or if the accept_local sysctl is changed in the setup script). Run setup-test.sh teardown to remove the test setup again.

Note that the example uses iproute2 to load the BPF programs, which results in (harmless) errors due to iproute2 not supporting BTF.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

encap-forward

encap-forward

README.org

BPF encapsulation and forward example

Seeing this in action

Files

encap-forward

Directory actions

More options

Directory actions

More options

Latest commit

History

encap-forward

Folders and files

parent directory

README.org

BPF encapsulation and forward example

Seeing this in action