PCN Working Group	B. Briscoe
Internet-Draft	BT
Intended status: Standards Track	October 26, 2009
Expires: April 29, 2010

Emulating Border Flow Policing using Re-PCN on Bulk Data
draft-briscoe-re-pcn-border-cheat-03

Status of This Memo

This Internet-Draft is submitted to IETF in full conformance with the provisions of BCP 78 and BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as “work in progress.”

The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.

The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.

This Internet-Draft will expire on April 29, 2010.

Copyright Notice

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents in effect on the date of publication of this document (http://trustee.ietf.org/license-info). Please review these documents carefully, as they describe your rights and restrictions with respect to this document.

Abstract

Scaling per flow admission control to the Internet is a hard problem. The approach of combining Diffserv and pre-congestion notification (PCN) provides a service slightly better than Intserv controlled load that scales to networks of any size without needing Diffserv's usual overprovisioning, but only if domains trust each other to comply with admission control and rate policing. This memo claims to solve this trust problem without losing scalability. It provides a sufficient emulation of per-flow policing at borders but with only passive bulk metering rather than per-flow processing. Measurements are sufficient to apply penalties against cheating neighbour networks.

1. Introduction
2. Requirements Notation
3. The Problem
    3.1. The Traditional Per-flow Policing Problem
    3.2. Generic Scenario
4. Re-ECN Protocol in IP with Two Congestion Marking Levels
    4.1. Protocol Overview
    4.2. Re-PCN Abstracted Network Layer Wire Protocol (IPv4 or v6)
        4.2.1. Re-ECN Recap
        4.2.2. Re-ECN Combined with Pre-Congestion Notification (re-PCN)
    4.3. Protocol Operation
        4.3.1. Protocol Operation for an Established Flow
        4.3.2. Aggregate Bootstrap
        4.3.3. Flow Bootstrap
        4.3.4. Router Forwarding Behaviour
        4.3.5. Extensions
5. Emulating Border Policing with Re-ECN
    5.1. Informal Terminology
    5.2. Policing Overview
    5.3. Pre-requisite Contractual Arrangements
    5.4. Emulation of Per-Flow Rate Policing: Rationale and Limits
    5.5. Sanctioning Dishonest Marking
    5.6. Border Mechanisms
        5.6.1. Border Accounting Mechanisms
        5.6.2. Competitive Routing
        5.6.3. Fail-safes
6. Analysis
7. Incremental Deployment
8. Design Choices and Rationale
9. Security Considerations
10. IANA Considerations
11. Conclusions
12. Acknowledgements
13. Comments Solicited
14. References
    14.1. Normative References
    14.2. Informative References
Appendix A. Implementation
    A.1. Ingress Gateway Algorithm for Blanking the RE flag
    A.2. Downstream Congestion Metering Algorithms
        A.2.1. Bulk Downstream Congestion Metering Algorithm
        A.2.2. Inflation Factor for Persistently Negative Flows
    A.3. Algorithm for Sanctioning Negative Traffic

Status (to be removed by the RFC Editor)

The IETF PCN working group is initially chartered to consider PCN domains only under a single trust authority. However, after its initial work is complete the charter says the working group may re-charter to consider concatenated Diffserv domains, amongst other new work items. The charter ends by stating "The details of these work items are outside the scope of the initial phase; but the WG may consider their requirements to design components that are sufficiently general to support such extensions in the future."

This memo is therefore contributed to describe how PCN could be extended to inter-domain. We wanted to document the solution to reduce the chances that something else eats up the codepoint space needed before PCN re-charters to consider inter-domain. Losing the chance to standardise this simple, scalable solution to the problem of inter-domain flow admission control would be unfortunate (understatement), given it took years to find, and even then it was very difficult to find codepoint space for it.

The scheme described here (Section 4 (Re-ECN Protocol in IP with Two Congestion Marking Levels)) requires the PCN ingress gateway to re-echo any PCN feedback it receives back into the forward stream of IP packets (hence we call this scheme re-PCN). Re-PCN works in a very similar way to the re-ECN proposal on which it is based [I‑D.briscoe‑tsvwg‑re‑ecn‑tcp] (Briscoe, B., Jacquet, A., Moncaster, T., and A. Smith, “Re-ECN: Adding Accountability for Causing Congestion to TCP/IP,” September 2009.), the only difference being that PCN might encode three states of congestion, whereas ECN encodes two. This document is written to stand alone from re-ECN, so that readers do not have to read [I‑D.briscoe‑tsvwg‑re‑ecn‑tcp] (Briscoe, B., Jacquet, A., Moncaster, T., and A. Smith, “Re-ECN: Adding Accountability for Causing Congestion to TCP/IP,” September 2009.).

The authors seek comments from the Internet community on whether combining PCN and re-ECN to create re-PCN in this way is a sufficient solution to the problem of scaling microflow admission control to the Internet as a whole. Here we emphasise that scaling is not just an issue of numbers of flows, but also the number of security entities—networks and users—who may all have conflicting interests.

This memo is posted as an Internet-Draft with the intent to eventually be broken down in two documents; one for the standards track and one for informational status. But until it becomes an item of IETF working group business the whole proposal has been kept together to aid understanding. Only the text of Section 4 (Re-ECN Protocol in IP with Two Congestion Marking Levels) of this document is intended to be normative (requiring standardisation). The rest of the sections are merely informative, describing how a system might be built from these protocols by the operators of an internetwork. Note in particular that the policing and monitoring functions proposed for the trust boundaries between operators would not need standardisation by the IETF. They simply represent one possible way that the proposed protocols could be used to extend the PCN architecture [RFC5559] (Eardley, P., “Pre-Congestion Notification (PCN) Architecture,” June 2009.) to span multiple domains without mutual trust between the operators.

Dependencies (to be removed by the RFC Editor)

To realise the system described, this document also depends on other documents chartered in the IETF Transport Area progressing along the standards track:

Pre-congestion notification (PCN) marking on interior nodes [I‑D.ietf‑pcn‑marking‑behaviour] (Eardley, P., “Metering and marking behaviour of PCN-nodes,” August 2009.), chartered for standardisation in the PCN w-g;
The baseline encoding of pre-congestion notification in the IP header [I‑D.ietf‑pcn‑baseline‑encoding] (Moncaster, T., Briscoe, B., and M. Menth, “Baseline Encoding and Transport of Pre-Congestion Information,” September 2009.), also chartered for standardisation in the PCN w-g;
Feedback of aggregate PCN measurements by suitably extending the admission control signalling protocol (e.g. RSVP extension [RSVP‑ECN] (Le Faucheur, F., Charny, A., Briscoe, B., Eardley, P., Babiarz, J., and K. Chan, “RSVP Extensions for Admission Control over Diffserv using Pre-congestion Notification,” June 2006.) or NSIS extension [I‑D.arumaithurai‑nsis‑pcn] (Arumaithurai, M., “NSIS PCN-QoSM: A Quality of Service Model for Pre-Congestion Notification (PCN),” September 2007.)).

The baseline encoding makes no new demands on codepoint space in the IP header but provides just two PCN encoding states (not marked and marked). The PCN architecture recognises that operators might want PCN marking to trigger two functions (admission control and flow termination) at different levels of pre-congestion, which seems to require three encoding states. A scheme has been proposed [I‑D.charny‑pcn‑single‑marking] (Charny, A., Zhang, X., Faucheur, F., and V. Liatsos, “Pre-Congestion Notification Using Single Marking for Admission and Termination,” November 2007.) that can do both functions with just two encoding states, but simulations have shown it performs poorly under certain conditions that might be typical. As it seems likely that PCN might need three encoding states to be fully operational, we want to be sure that three encoding states can be extended to work inter-domain. Therefore, we have defined a three-state extension encoding scheme in this document, then we have added the re-PCN scheme to it. The three-state encoding we have chosen depends on standardisation of yet another document in the IETF Transport Area:

Propagation beyond the tunnel decapsulator of any changes in the ECN field to ECT(0) or ECT(1) made within a tunnel (the ideal decapsulation rules of [I‑D.ietf‑tsvwg‑ecn‑tunnel] (Briscoe, B., “Tunnelling of Explicit Congestion Notification,” July 2009.));

Changes from previous drafts (to be removed by the RFC Editor)

Full diffs of incremental changes between drafts are available at URL: <http://www.cs.ucl.ac.uk/staff/B.Briscoe/pubs.html#repcn>

Changes from <draft-briscoe-re-pcn-border-cheat-02> to <draft-briscoe-re-pcn-border-cheat-03> (current version):

Updated references and other minor changes.

Changes from <draft-briscoe-re-pcn-border-cheat-01> to <draft-briscoe-re-pcn-border-cheat-02>:

Considerably updated the 'Status' note to explain the relationship of this draft to other documents in the IETF process (or not) and to chartered PCN w-g activity.

Split out the dependencies into a separate note and added dependencies on new PCN documents in progress.

Made scalability motivation in the introduction clearer, explaining why Diffserv over-provisioning doesn't scale unless PCN is used.

Clarified that the standards action in Section 4 (Re-ECN Protocol in IP with Two Congestion Marking Levels) is to define the meanings of the combination of fields in the IP header: the RE flag and 2-level congestion marking in the ECN field. And that it is not characterised by a particular feedback style in the transport.

Switched round the two ECT codepoints to be compatible with the new PCN baseline encoding and used less confusing naming for re-PCN codepoints (Section 4 (Re-ECN Protocol in IP with Two Congestion Marking Levels)).

Generalised rules for encoding probes when bootstrapping or re-starting aggregates & flows (Section 4.3.2 (Aggregate Bootstrap)).

Downgraded drop sanction behaviour from MUST to conditional SHOULD (Section 5.5 (Sanctioning Dishonest Marking)).

Added incremental deployment safety justification for choice of which way round the RE flag works (Section 7 (Incremental Deployment)).

Added possible vulnerability to brief attacks and possible solution to security considerations (Section 9 (Security Considerations)).

Updated references and terminology, particularly taking account of recent new PCN w-g documents;

Replaced suggested Ingress Gateway Algorithm for Blanking the RE flag (Appendix A.1 (Ingress Gateway Algorithm for Blanking the RE flag))

Clarifications throughout;

Changes from <draft-briscoe-re-pcn-border-cheat-00> to <draft-briscoe-re-pcn-border-cheat-01>:

Updated references.

Changes from <draft-briscoe-tsvwg-re-ecn-border-cheat-01> to <draft-briscoe-re-pcn-border-cheat-00>:

Changed filename to associate it with the new IETF PCN w-g, rather than the TSVWG w-g.

Introduction: Clarified that bulk policing only replaces per-flow policing at interior inter-domain borders, while per-flow policing is still needed at the access interface to the internetwork. Also clarified that the aim is to neutralise any gains from cheating using local bilateral contracts between neighbouring networks, rather than merely identifying remote cheaters.

Section 3.1 (The Traditional Per-flow Policing Problem): Described the traditional per-flow policing problem with inter-domain reservations more precisely, particularly with respect to direction of reservations and of traffic flows.

Clarified status of Section 5 (Emulating Border Policing with Re-ECN) onwards, in particular that policers and monitors would not need standardisation, but that the protocol in Section 4 (Re-ECN Protocol in IP with Two Congestion Marking Levels) would require standardisation.

Section 5.6.2 (Competitive Routing) on competitive routing: Added discussion of direct incentives for a receiver to switch to a different provider even if the provider has a termination monopoly.

Clarified that "Designing in security from the start" merely means allowing codepoint space in the PCN protocol encoding. There is no need to actually implement inter-domain security mechanisms for solutions confined to a single domain.

Updated some references and added a ref to the Security Considerations, as well as other minor corrections and improvements.

Changes from <draft-briscoe-tsvwg-re-ecn-border-cheat-00> to <draft-briscoe-tsvwg-re-ecn-border-cheat-01>:

Added subsection on Border Accounting Mechanisms (Section 5.6.1 (Border Accounting Mechanisms))

Section 4.2 (Re-PCN Abstracted Network Layer Wire Protocol (IPv4 or v6)) on the re-ECN wire protocol clarified and re-organised to separately discuss re-ECN for default ECN marking and for pre-congestion marking (PCN).

Router Forwarding Behaviour subsection added to re-organised section on Protocol Operation (Section 4.3 (Protocol Operation)). Extensions section moved within Protocol Operations.

Emulating Border Policing (Section 5 (Emulating Border Policing with Re-ECN)) reorganised, starting with a new Terminology subsection heading, and a simplified overview section. Added a large new subsection on Border Accounting Mechanisms within a new section bringing together other subsections on Border Mechanisms generally (Section 5.6 (Border Mechanisms)). Some text moved from old subsections into these new ones.

Added section on Incremental Deployment (Section 7 (Incremental Deployment)), drawing together relevant points about deployment made throughout.

Sections on Design Rationale (Section 8 (Design Choices and Rationale)) and Security Considerations (Section 9 (Security Considerations)) expanded with some new material, including new attacks and their defences.

Suggested Border Metering Algorithms improved (Appendix A.2 (Downstream Congestion Metering Algorithms)) for resilience to newly identified attacks.

ECN field	RFC3168 codepoint	RE flag	Extended ECN codepoint	Re-ECN meaning
00	Not-ECT	0	Not-RECT	Not re-ECN-capable transport
00	Not-ECT	1	FNE	Feedback not established
10	ECT(0)	0	---	Legacy ECN use only
10	ECT(0)	1	--CU--	Currently unused
01	ECT(1)	0	Re-Echo	Re-echoed congestion and RECT
01	ECT(1)	1	RECT	Re-ECN capable transport
11	CE	0	CE(0)	Congestion experienced with Re-Echo
11	CE	1	CE(-1)	Congestion experienced

ECN field	PCN codepoint	RE flag	Extended PCN codepoint	Re-PCN meaning
00	Not-PCN	0	Not-PCN	Not PCN-capable transport
00	Not-PCN	1	FNE	Feedback not established
10	NM	0	Re-PCT-Echo	Re-echoed congestion and Re-PCT
10	NM	1	Re-PCT	Re-PCN capable transport
01	AM	0	AM(0)	Admission Marking with Re-Echo
01	AM	1	AM(-1)	Admission Marking
11	TM	0	TM(0)	Termination Marking with Re-Echo
11	TM	1	TM(-1)	Termination Marking

Border observation point	Approximate Downstream pre-congestion
ingress -- A	3% - 0% = 3%
A -- B	3% - 1% = 2%
B -- C	3% - 1% = 2%
C -- egress	3% - 3% = 0%

[I-D.briscoe-tsvwg-re-ecn-tcp]	Briscoe, B., Jacquet, A., Moncaster, T., and A. Smith, “Re-ECN: Adding Accountability for Causing Congestion to TCP/IP,” draft-briscoe-tsvwg-re-ecn-tcp-08 (work in progress), September 2009 (TXT).
[I-D.ietf-pcn-baseline-encoding]	Moncaster, T., Briscoe, B., and M. Menth, “Baseline Encoding and Transport of Pre-Congestion Information,” draft-ietf-pcn-baseline-encoding-07 (work in progress), September 2009 (TXT).
[I-D.ietf-pcn-marking-behaviour]	Eardley, P., “Metering and marking behaviour of PCN-nodes,” draft-ietf-pcn-marking-behaviour-05 (work in progress), August 2009 (TXT).
[I-D.ietf-tsvwg-ecn-tunnel]	Briscoe, B., “Tunnelling of Explicit Congestion Notification,” draft-ietf-tsvwg-ecn-tunnel-03 (work in progress), July 2009 (TXT).
[RFC2119]	Bradner, S., “Key words for use in RFCs to Indicate Requirement Levels,” BCP 14, RFC 2119, March 1997 (TXT, HTML, XML).
[RFC2211]	Wroclawski, J., “Specification of the Controlled-Load Network Element Service,” RFC 2211, September 1997 (TXT, HTML, XML).
[RFC3168]	Ramakrishnan, K., Floyd, S., and D. Black, “The Addition of Explicit Congestion Notification (ECN) to IP,” RFC 3168, September 2001 (TXT).
[RFC3246]	Davie, B., Charny, A., Bennet, J., Benson, K., Le Boudec, J., Courtney, W., Davari, S., Firoiu, V., and D. Stiliadis, “An Expedited Forwarding PHB (Per-Hop Behavior),” RFC 3246, March 2002 (TXT).
[RFC4774]	Floyd, S., “Specifying Alternate Semantics for the Explicit Congestion Notification (ECN) Field,” BCP 124, RFC 4774, November 2006 (TXT).

[CLoop_pol]	Salvatori, A., “Closed Loop Traffic Policing,” Politecnico Torino and Institut Eurécom Masters Thesis , September 2005.
[ECN-BGP]	Mortier, R. and I. Pratt, “Incentive Based Inter-Domain Routeing,” Proc Internet Charging and QoS Technology Workshop (ICQT'03) pp308--317, September 2003 (PDF).
[I-D.arumaithurai-nsis-pcn]	Arumaithurai, M., “NSIS PCN-QoSM: A Quality of Service Model for Pre-Congestion Notification (PCN),” draft-arumaithurai-nsis-pcn-00 (work in progress), September 2007 (TXT).
[I-D.charny-pcn-single-marking]	Charny, A., Zhang, X., Faucheur, F., and V. Liatsos, “Pre-Congestion Notification Using Single Marking for Admission and Termination,” draft-charny-pcn-single-marking-03 (work in progress), November 2007 (TXT).
[I-D.ietf-nsis-rmd]	Bader, A., Westberg, L., Karagiannis, G., Kappler, C., Tschofenig, H., Phelan, T., Takacs, A., and A. Csaszar, “RMD-QOSM - The Resource Management in Diffserv QOS Model,” draft-ietf-nsis-rmd-15 (work in progress), July 2009 (TXT).
[I-D.ietf-tsvwg-admitted-realtime-dscp]	Baker, F., Polk, J., and M. Dolly, “DSCP for Capacity-Admitted Traffic,” draft-ietf-tsvwg-admitted-realtime-dscp-05 (work in progress), November 2008 (TXT).
[IXQoS]	Briscoe, B. and S. Rudkin, “Commercial Models for IP Quality of Service Interconnect,” BT Technology Journal (BTTJ) 23(2)171--195, April 2005 (PDF).
[QoS_scale]	Reid, A., “Economics and Scalability of QoS Solutions,” BT Technology Journal (BTTJ) 23(2)97--117, April 2005.
[RFC2205]	Braden, B., Zhang, L., Berson, S., Herzog, S., and S. Jamin, “Resource ReSerVation Protocol (RSVP) -- Version 1 Functional Specification,” RFC 2205, September 1997 (TXT, HTML, XML).
[RFC2207]	Berger, L. and T. O'Malley, “RSVP Extensions for IPSEC Data Flows,” RFC 2207, September 1997 (TXT, HTML, XML).
[RFC2208]	Mankin, A., Baker, F., Braden, B., Bradner, S., O'Dell, M., Romanow, A., Weinrib, A., and L. Zhang, “Resource ReSerVation Protocol (RSVP) Version 1 Applicability Statement Some Guidelines on Deployment,” RFC 2208, September 1997 (TXT, HTML, XML).
[RFC2747]	Baker, F., Lindell, B., and M. Talwar, “RSVP Cryptographic Authentication,” RFC 2747, January 2000 (TXT).
[RFC2998]	Bernet, Y., Ford, P., Yavatkar, R., Baker, F., Zhang, L., Speer, M., Braden, R., Davie, B., Wroclawski, J., and E. Felstaine, “A Framework for Integrated Services Operation over Diffserv Networks,” RFC 2998, November 2000 (TXT).
[RFC3540]	Spring, N., Wetherall, D., and D. Ely, “Robust Explicit Congestion Notification (ECN) Signaling with Nonces,” RFC 3540, June 2003 (TXT).
[RFC4301]	Kent, S. and K. Seo, “Security Architecture for the Internet Protocol,” RFC 4301, December 2005 (TXT).
[RFC4727]	Fenner, B., “Experimental Values In IPv4, IPv6, ICMPv4, ICMPv6, UDP, and TCP Headers,” RFC 4727, November 2006 (TXT).
[RFC5129]	Davie, B., Briscoe, B., and J. Tay, “Explicit Congestion Marking in MPLS,” RFC 5129, January 2008 (TXT).
[RFC5559]	Eardley, P., “Pre-Congestion Notification (PCN) Architecture,” RFC 5559, June 2009 (TXT).
[RSVP-ECN]	Le Faucheur, F., Charny, A., Briscoe, B., Eardley, P., Babiarz, J., and K. Chan, “RSVP Extensions for Admission Control over Diffserv using Pre-congestion Notification,” draft-lefaucheur-rsvp-ecn-01 (work in progress), June 2006 (TXT).
[Re-fb]	Briscoe, B., Jacquet, A., Di Cairano-Gilfedder, C., Salvatori, A., Soppera, A., and M. Koyabe, “Policing Congestion Response in an Internetwork Using Re-Feedback,” ACM SIGCOMM CCR 35(4)277--288, August 2005 (PDF).
[Smart_rtg]	Goldenberg, D., Qiu, L., Xie, H., Yang, Y., and Y. Zhang, “Optimizing Cost and Performance for Multihoming,” ACM SIGCOMM CCR 34(4)79--92, October 2004 (PDF).
[Steps_DoS]	Handley, M. and A. Greenhalgh, “Steps towards a DoS-resistant Internet Architecture,” Proc. ACM SIGCOMM workshop on Future directions in network architecture (FDNA'04) pp 49--56, August 2004.

	Bob Briscoe
	BT
	B54/77, Adastral Park
	Martlesham Heath
	Ipswich IP5 3RE
	UK
Phone:	+44 1473 645196
EMail:	bob.briscoe@bt.com
URI:	http://bobbriscoe.net/

Emulating Border Flow Policing using Re-PCN on Bulk Datadraft-briscoe-re-pcn-border-cheat-03

Status of This Memo

Copyright Notice

Abstract

Table of Contents

Status (to be removed by the RFC Editor)

Dependencies (to be removed by the RFC Editor)

Changes from previous drafts (to be removed by the RFC Editor)

1. Introduction

2. Requirements Notation

3. The Problem

3.1. The Traditional Per-flow Policing Problem

3.2. Generic Scenario

4. Re-ECN Protocol in IP with Two Congestion Marking Levels

4.1. Protocol Overview

4.2. Re-PCN Abstracted Network Layer Wire Protocol (IPv4 or v6)

4.2.1. Re-ECN Recap

4.2.2. Re-ECN Combined with Pre-Congestion Notification (re-PCN)

4.3. Protocol Operation

4.3.1. Protocol Operation for an Established Flow

4.3.2. Aggregate Bootstrap

4.3.3. Flow Bootstrap

4.3.4. Router Forwarding Behaviour

4.3.5. Extensions

5. Emulating Border Policing with Re-ECN

5.1. Informal Terminology

5.2. Policing Overview

5.3. Pre-requisite Contractual Arrangements

5.4. Emulation of Per-Flow Rate Policing: Rationale and Limits

5.5. Sanctioning Dishonest Marking

5.6. Border Mechanisms

5.6.1. Border Accounting Mechanisms

5.6.2. Competitive Routing

5.6.3. Fail-safes

6. Analysis

7. Incremental Deployment

8. Design Choices and Rationale

9. Security Considerations

10. IANA Considerations

11. Conclusions

12. Acknowledgements

13. Comments Solicited

14. References

14.1. Normative References

14.2. Informative References

Appendix A. Implementation

A.1. Ingress Gateway Algorithm for Blanking the RE flag

A.2. Downstream Congestion Metering Algorithms

A.2.1. Bulk Downstream Congestion Metering Algorithm

A.2.2. Inflation Factor for Persistently Negative Flows

A.3. Algorithm for Sanctioning Negative Traffic

Author's Address

Emulating Border Flow Policing using Re-PCN on Bulk Data
draft-briscoe-re-pcn-border-cheat-03