ICMP Error Handling for VPNs in SRv6 Networks

While a solution for diagnostics in MPLS VPNs exists, the solution designed for MPLS-based VPN ping and traceroute has many inherited drawbacks. MPLS uses its own encapsulation, i.e., the MPLS header is a label stack. With MPLS, P routers have no means to identify the ingress of the MPLS tunnel, because the labels in the header point towards the network egress point. This characteristic restricts the possible solutions for VPN-specific ICMP handling in MPLS networks and led to egress PE nodes being involved in the forwarding of ICMP error messages.

The IPv6 encapsulation used by SRv6 has an IP SA field that refers to the originator of the IP packet, i.e., the ingress endpoint of the SRv6 tunnel. Therefore, the MPLS restriction does not have to apply to SRv6 networks. The solution described here takes advantage of the presence of this ingress endpoint information to provide an optimal method that does not change procedures for P nodes.

Node functions in the described method are as follows:

* Ingress PE (ingress node of the SRv6 tunnel): VPN packet encapsulation follows the Uniform model. The node adds VPN-specific information to the encapsulated packet (i.e., IP SA=VPN-specific-SID of the Ingress PE) and forwards it over the SRv6 network.

* P node = Originator of the ICMP error (within the SRv6 domain): it performs standard operation, so an ICMP error message is sent to the originator (i.e., the ingress PE) of the SRv6 encapsulated packet that caused the ICMP message generation (e.g., when the Hop Limit of the packet expired).

* Ingress PE: it processes the ICMP error message and forwards it to the original source of the (payload) packet, which is located within the VPN context. This processing is done by a VPN-associated-ICMP-process-function and is described in detail below.

The following figure shows the reference topology used to describe the ICMP error handling.

            |                                    |
         +-----+      +----+      +----+      +-----+
HostA----+ PE1 +------+ P1 |------| P2 +------+ PE2 +------HostB
         +-----+      +----+      +----+      +-----+
            |                                    |
            |<--------------- VPN -------------->|

Packet processing works as follows:

1. HostA sends a packet to HostB.

2. PE1 encapsulates the packet in an SRv6 tunnel (using the Uniform model). The IP SA of the encapsulation is a VPN-specific SID of PE1.

3. The encapsulated packet reaches P2, where the Hop Limit expires.

4. P2 generates an ICMP Error Message and sends it to PE1, using the VPN-specific SID as the IP DA.

5. PE1 processes the ICMP Error Message according to its VPN-associated-ICMP-process-function and identifies the related VPN instance.

6. PE1 sends the processed ICMP Error Message to HostA.

7. HostA is informed about the Hop Limit expiry event and its network location (i.e., P2).

The VPN-specific SID of PE1 refers to the VPN instance where the prefixes of the VPN can be looked up. The VPN-specific SID is allocated by the PE. SID processing already defines the upper-layer header steps (as per , section 4.1.1). Here the upper-layer header is ICMPv6, therefore there is no need for extra parsing rules.

There might be no need for an extra SID allocation for a VPN. The solution uses a SID per VPN, which is allocated for the VPN service. More specifically, for a VPN service the PE node can allocate SID(s) per-prefix (e.g., End.DX6) or per-vrf (e.g., End.DT6). The solution uses a per-vrf SID (e.g., End.DT6) in the IP SA of the SRv6 encapsulated packets. For more sophisticated VPN configurations (e.g., Hub-and-Spoke VPN), where multiple VRFs (and SIDs) are configured for a given VPN, the VPN-specific SID of PE1 always refers to the VRF instance (and its per-vrf SID) where the prefixes of the connected customer site(s) can be looked up.

As the locator part of the VPN-specific SID is routable within the SRv6 domain, other PE and P nodes of the SRv6 domain can send/route packets to it.
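The packet walk-through above can be sketched as a small simulation. This is only an illustrative model, not an implementation of the solution: the addresses (taken from the 2001:db8::/32 documentation prefix), the `Packet` class, the function names, and the simplified Hop Limit handling (each node decrements by one, and the Uniform model copies the inner Hop Limit to the outer header) are all assumptions made for this example.

```python
from dataclasses import dataclass
from ipaddress import IPv6Address
from typing import List, Optional, Tuple


@dataclass
class Packet:
    src: IPv6Address
    dst: IPv6Address
    hop_limit: int
    payload: Optional["Packet"] = None  # inner packet when encapsulated


# Hypothetical addresses, for illustration only.
HOST_A = IPv6Address("2001:db8:a::1")
HOST_B = IPv6Address("2001:db8:b::1")
PE1_VPN_SID = IPv6Address("2001:db8:0:1:100::")  # assumed per-vrf SID on PE1
PE2_VPN_SID = IPv6Address("2001:db8:0:2:100::")  # assumed service SID on PE2


def encapsulate(inner: Packet) -> Packet:
    """PE1: the IP SA is the VPN-specific SID; the Hop Limit is copied (Uniform model)."""
    return Packet(PE1_VPN_SID, PE2_VPN_SID, inner.hop_limit, inner)


def transit(outer: Packet, path: List[str]) -> Optional[Tuple[str, IPv6Address]]:
    """Return (expiring node, destination of its ICMP error), or None if no expiry."""
    for node in path:
        outer.hop_limit -= 1
        if outer.hop_limit <= 0:
            # Standard behavior: the error goes to the IP SA of the expired packet.
            return node, outer.src
    return None


def probe(hop_limit: int) -> Optional[Tuple[str, IPv6Address]]:
    """Send one traceroute-style probe from HostA towards HostB."""
    inner = Packet(HOST_A, HOST_B, hop_limit)
    inner.hop_limit -= 1  # PE1 is itself a hop for the customer packet
    if inner.hop_limit <= 0:
        return "PE1", inner.src  # PE1 reports the expiry directly to HostA
    return transit(encapsulate(inner), ["P1", "P2"])
```

In this model, `probe(3)` expires at P2, and the ICMP error is addressed to the VPN-specific SID of PE1 rather than to HostA, which is why PE1 has to process the message before HostA can receive it.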
The SRv6 encapsulation process on the ingress PE node needs several pieces of input information to construct the outer SRv6 header. One group of information is related to the IP DA and the SRH part of the SRv6 encapsulation. It is derived from the remote service information (e.g., the VPN SID on the egress PE) and the SR policy (if one exists). The SR policy defines the path onto which an ingress PE node steers a packet flow. Applying an SR policy means selecting the path (e.g., defined by a SID list) and placing the path descriptors into the IP DA and the SRH fields of the outer SRv6 encapsulation.

Another group of information is needed for the SRv6 encapsulation as well, such as IP SA, Traffic Class, FlowLabel, HopLimit, and NextHeader. These values are derived by various local functionalities. The solution described here impacts only the selection of the IP SA. The source IP MUST resolve to a unique node in the SRv6 domain, which is fulfilled by the VPN-specific SID described above. All other fields are defined by the related RFCs. For example, Traffic Class might be copied from the inner packet, FlowLabel might be locally generated, etc.

The VPN-associated-ICMP-process-function operation contains the following steps:

1. It processes the received ICMPv6 error message (originated, e.g., from a P node within the SRv6 domain).

2. It identifies the related VPN, based on the VPN-specific IP SA value in the SRv6 encapsulation of the received ICMPv6 error message.

3. It modifies the ICMP error message:

   a. It removes the SRv6 domain specific encapsulation/header(s) of the received ICMPv6 error message.

   b. It identifies the VPN-specific source of the original packet that caused the ICMPv6 error message, based on the invoking packet header part of the ICMPv6 error message payload.

   c. It removes the SRv6 domain specific header(s) from the invoking packet header part of the ICMPv6 error message payload.

   d. It creates a new header for the ICMP error message, where the IP SA refers to the Originator-of-the-ICMPv6-error-message and the IP DA=SourceIP-of-the-invoking-packet.

4. It forwards the modified ICMP error message according to the local VPN routing table (VRF).

The VPN-associated-ICMP-process-function may translate the IP address of the Originator-of-the-ICMPv6-error-message (e.g., a P node) to limit the VPN-specific visibility characteristics, for example, if the SRv6 domain operator does not want to expose the real NodeIP or SID values used by the SRv6 domain nodes.
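The steps of the VPN-associated-ICMP-process-function can be sketched roughly as follows. This is a minimal model under stated assumptions, not a packet parser: headers are plain data classes, and the `SID_TO_VRF` table, the VRF name, the class and function names, and the optional `translate` callback are hypothetical. The sketch identifies the VPN from the IP SA of the SRv6 encapsulation as it appears in the invoking packet carried in the ICMPv6 payload, which is one possible reading of the identification step.

```python
from dataclasses import dataclass
from ipaddress import IPv6Address
from typing import Callable, Optional


@dataclass
class IPv6Header:
    src: IPv6Address
    dst: IPv6Address


@dataclass
class InvokingPacket:
    outer: IPv6Header  # SRv6 encapsulation added by the ingress PE
    inner: IPv6Header  # header of the customer packet


@dataclass
class Icmpv6Error:
    header: IPv6Header  # src = originator (e.g., a P node), dst = VPN-specific SID
    icmp_type: int
    icmp_code: int
    invoking: InvokingPacket


@dataclass
class VpnIcmpv6Error:
    vrf: str              # VRF used to forward the message
    header: IPv6Header    # new header seen by the customer
    icmp_type: int
    icmp_code: int
    invoking: IPv6Header  # invoking packet with SRv6 headers removed


# Hypothetical mapping from per-vrf SID to local VRF.
SID_TO_VRF = {IPv6Address("2001:db8:0:1:100::"): "vrf-blue"}


def process_icmp_error(
    msg: Icmpv6Error,
    translate: Optional[Callable[[IPv6Address], IPv6Address]] = None,
) -> Optional[VpnIcmpv6Error]:
    # Identify the VPN from the VPN-specific IP SA of the SRv6 encapsulation.
    vrf = SID_TO_VRF.get(msg.invoking.outer.src)
    if vrf is None:
        return None  # not a VPN-specific SID of this PE
    # Identify the VPN-specific source and drop the SRv6 headers of the
    # invoking packet by keeping only the customer header.
    customer = msg.invoking.inner
    # Optionally hide the real address of the originator.
    originator = msg.header.src if translate is None else translate(msg.header.src)
    # New header: IP SA = originator of the error, IP DA = source of invoking packet.
    new_header = IPv6Header(src=originator, dst=customer.src)
    return VpnIcmpv6Error(vrf, new_header, msg.icmp_type, msg.icmp_code, customer)
```

The returned object names the VRF in which the forwarding lookup would take place; the actual forwarding step is left out of the sketch.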

This section illustrates a VPNv6 Traceroute from a customer host.

In case of an IPv4-VPN service, the VPN-associated-ICMP-process-function operates as follows (v4/v6 are noted for clarity):

1. It processes the received ICMPv6 error message (originated, e.g., from a P node within the SRv6 domain).

2. It identifies the related VPN, based on the VPN-specific IP SA value in the SRv6 encapsulation of the received ICMPv6 error message.

3. It synthesizes an ICMPv4 error message based on the received ICMPv6 error message:

   a. It identifies the VPNv4-specific source of the original IPv4 packet that caused the ICMPv6 error message, based on the invoking packet header parts of the ICMPv6 error message payload.

   b. It creates the header for the ICMPv4 error message, in accordance with (Section 4.8) and (Section 3), i.e., IPv4 SA=192.0.0.8, Node Identification Object containing the IPv6 SA of the ICMPv6 error message, and IPv4 DA=IPv4-SA-of-the-original-packet.

4. It forwards the synthesized ICMPv4 error message according to the local VPNv4 routing table (VRF).

When the PE node is aware of the IPv4 address of the SRv6 node that generated the ICMPv6 error message, the PE node may use it as the IPv4 SA of the synthesized ICMPv4 message. How the PE node becomes aware of that information is out of scope for this document.
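The address selection for the synthesized ICMPv4 message can be sketched as below. The sketch covers only the fields named above; the `SynthesizedIcmpv4` class, the function name, and the `known_ipv4` lookup table are hypothetical. Keeping the Node Identification Object even when a real IPv4 address of the node is known is an assumption of this example, since the text does not state it either way.

```python
from dataclasses import dataclass
from ipaddress import IPv4Address, IPv6Address
from typing import Dict, Optional

# IPv4 SA used when the PE knows no IPv4 address for the originator.
DEFAULT_IPV4_SA = IPv4Address("192.0.0.8")


@dataclass
class SynthesizedIcmpv4:
    src: IPv4Address      # IPv4 SA of the synthesized message
    dst: IPv4Address      # IPv4 SA of the original customer packet
    node_id: IPv6Address  # carried in the Node Identification Object


def synthesize_icmpv4(
    icmpv6_src: IPv6Address,
    original_ipv4_src: IPv4Address,
    known_ipv4: Optional[Dict[IPv6Address, IPv4Address]] = None,
) -> SynthesizedIcmpv4:
    # Use the node's IPv4 address if the PE happens to know it (how it
    # learns this is out of scope); otherwise fall back to 192.0.0.8.
    src = (known_ipv4 or {}).get(icmpv6_src, DEFAULT_IPV4_SA)
    # Assumption: the IPv6 SA of the ICMPv6 error is always kept as node identity.
    return SynthesizedIcmpv4(src=src, dst=original_ipv4_src, node_id=icmpv6_src)
```

For example, calling `synthesize_icmpv4` with the IPv6 address of P2 and the IPv4 address of the customer host yields a message sourced from 192.0.0.8 whose node identity still points at P2.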

This section illustrates a VPNv4 Traceroute from a customer host.

Some network scenarios result in a packet having multiple transport outer IPv6 headers preceding the customer's inner IP header, for example, TI-LFA scenarios within the SRv6 domain. The solution described in this document handles TI-LFA scenarios, and a traceroute may display the TI-LFA backup path when it is activated.

Note: other multiple encapsulation scenarios need further discussion by the WG.

ICMP Error Handling for VPNs in SRv6 Networks has the following characteristics:

* It eliminates the shortcomings of the MPLS-based solutions, as (1) it works in case of failures between ingress PE and egress PE and (2) it supports direct localization of failures.

* It defines new functions only for Ingress PE nodes.

* It uses a VPN-specific SID as a source address on ingress PE nodes.

* It does not result in additional complexity on P nodes.

* It is compliant with existing standards on P nodes.

* It makes P nodes service agnostic and allows building IPv6-only core networks.

* It does not involve Egress PE nodes in the forwarding of the ICMP error messages.

* It can hide the SIDs used inside the SRv6 domain and can provide different visibility for served VPNs if needed.