1 / 29

Channel Reservation Protocol for Over-Subscribed Channels and Destinations

Channel Reservation Protocol for Over-Subscribed Channels and Destinations. George Michelogiannakis , Nan Jiang, Daniel Becker, William J. Dally This work was completed in Stanford University. Introduction. HPC and datacenter networks increasingly oversubscribed

garvey
Download Presentation

Channel Reservation Protocol for Over-Subscribed Channels and Destinations

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Channel Reservation Protocol for Over-Subscribed Channels and Destinations George Michelogiannakis, Nan Jiang, Daniel Becker, William J. Dally This work was completed in Stanford University

  2. Introduction • HPC and datacenter networks increasingly oversubscribed • Exascale for HPC may need 1 billion-way parallelism • Datacenter server count annual growth 7-17% • Levels of expensive bandwidth: • Between servers (intra-rack) • Between racks (intra-cluster) • Between clusters (intra-datacenter) • Between buildings (metro) • Between regions (longhaul) Why optical data communications and why now? Applied Physics. 2009 Facebook’s datacenter network architecture. OSI 2013

  3. Introduction • To make it worse, many traffic patterns create unbalanced load • Unbalanced load creates long paths of blocked packets (known as tree saturation) • I’ll present a channel reservation protocol which prevents network and endpoint congestion • We focus on lossless flow control • Tree saturation is a major drawback

  4. Agenda • Motivation and related work • Channel reservation protocol • Evaluation

  5. Oversubscription and Hotspots Cluster 2 Cluster 1 Oversubscribed channels Oversubscribed H This setting represents over-subscribed links between network clusters, or even between racks Tree saturation root. Affects benign traffic

  6. Impact on Benign Traffic • Adversarial pattern tops at 5% flit injection • Benign pattern slightly higher (6-7%) • Ideal flow control would avoid any interference Benign traffic is negatively affected

  7. Explicit Congestion Notification ECN: State of the art congestion handling scheme Oversubscribed channels ECN detects congestion at the root of the congestion tree Signals to the sources to throttle down

  8. Agenda • Motivation and related work • Channel reservation protocol • Evaluation

  9. Channel Reservation Protocol Source is informed to transmit in cycle 10 Resource available cycles 5 and 10 Channel is reserved for cycle 10 Cluster 2 Cluster 1 Oversubscribed Oversubscribed Destination available cycles 10 and 15. Result: cycle 10 Destination reserves cycle 10 H Reply (ACK) creates reservations for the chosen time slot in all oversubscribed resources Encounters congestion. Converted to a single-flit reservation request Potentially long packet sent speculatively

  10. CRP: Doodle for Packets Challenge: Participant’s availabilities are distributed across the network

  11. Reservation Tables • Reservation table is one line in the Doodle • Doodle asks for the length of time slots • We call a time slot a cell • Cells have Cmaxcycles • We keep a counter per cell because packet sizes differ

  12. Reservation Vectors • Request packets carry a vector to record what time slots are available in the resources traversed so far • This is used to build up to the final result of the Doodle

  13. Request Traversing a Channel • Request size: 80 cycles

  14. Request Arriving at Destination

  15. CRP: Doodle for Packets We have identified the common availability. Now we need to inform everybody

  16. Destination Reserving Bandwidth Original destination table: Resulting destination table: Subtracts reservation size (80 cycles) from the appropriate cells (time slots)

  17. ACK Traversing the Channel • Reserves 80 cycles starting from the granted timestamp cell (time slot) Original reservation table: Resulting reservation table:

  18. Protocol Considerations • If participants cannot agree on a time, we wait and then try again • If time slot no longer available, ACK is converted to a retry • If network uncongested, speculative packets succeed and no overhead for reservation

  19. Agenda • Motivation and related work • Channel reservation protocol • Evaluation

  20. Methodology • Two clusters of 144-node fat trees • 12x12 routers • Clusters connected with four channels • All channels are 10Gb/s • Messages 2KB, divided into eight packets • CRP applies to the message Oversubscribed Oversubscribed 4 H

  21. Uniform Random

  22. Uniform Random By the time ECN reacts, the flow is done ECN does not share congestion state with other destinations in the same cluster S A Oversubscribed Oversubscribed 4 B

  23. Combined Traffic ECN can be configured to prevent tree saturation in steady-state traffic

  24. Combined Traffic 3.5% lower for CRP CRP has extra control overhead

  25. Transient Traffic ECN allows congestion occur and reacts to it. CRP prevents it entirely 300,000 cycles to stabilize for ECN

  26. Transient Traffic ECN allows congestion occur and reacts to it. CRP prevents it entirely ECN’s maximum latency: 37,000 cycles 300,000 cycles to stabilize for ECN

  27. ECN Sensitivity: Three Clusters ECN configuration is sensitive to network topology, routing, and traffic pattern

  28. ECN Sensitivity: Four Clusters ECN needs to be reconfigured

  29. Conclusions • CRP is a statistical scheme to avoid overwhelming channels and destinations • CRP effectively prevents congestion • Avoids pitfalls of ECN and reactive techniques • CRP focuses on lossless flow control but similar benefits are possible in lossy flow control • Congestion causes many packet drops

More Related