1 / 43

Content Placement as a Key to Leveraging Geo-Distributed I nfrastructures

Content Placement as a Key to Leveraging Geo-Distributed I nfrastructures. Abhigyan Sharma University of Massachusetts Amherst. Geo-distributed, heterogeneous computing infrastructure. Alice. Datacenters. Set top boxes. Speed of light. Cellular base stations. Routers.

tress
Download Presentation

Content Placement as a Key to Leveraging Geo-Distributed I nfrastructures

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Content Placement as a Key to Leveraging Geo-Distributed Infrastructures Abhigyan Sharma University of Massachusetts Amherst

  2. Geo-distributed, heterogeneous computing infrastructure Alice Datacenters Set top boxes Speed of light Cellular base stations Routers

  3. Network mobility of endpoint principals Wi-Fi: IP3 Cellular: IP4 Alice’s phone • Interfaces • Devices • Things • Content • Services • ... Wi-Fi: IP1 Cellular: IP2 www.dropbox.com/file1 Server A IP5 Server B IP6

  4. Research vision for future Internet services Services to support network mobility Service A Service B Services to leverage geo-distributed, heterogeneous infrastructure • Meta-service for automatic deployment & reconfiguration

  5. This talk is about … Auspice global name service Network CDN SIGCOMM 2014 SIGMETRICS 2013, INFOCOM 2011

  6. Degrees of freedom enabled by geo-distribution ✖ ✖ ✔ Network routing Content placement Request redirection

  7. Network routing/traffic engineering Link capacity Traffic Engineering Network routing Traffic matrix Traffic from node ito node j

  8. Network cost function • Convex functions of link utilization, e.g., maximum link utilization (MLU) Shortest path Flow split 0.5 Mbps 1 Mbps 1 Mbps A B A B 3 Mbps 3 Mbps 1.5 Mbps 2 Mbps MLU = 2/3 = 0.67 MLU = 0.5/1 = 0.5

  9. Leveraging geo-distribution: content placement Placement vs. redirection Alice Placement vs. routing Place all content at all locations Traffic matrix No routing problem!

  10. Auspice global name service Network CDN SIGMETRICS 2013

  11. ISPs evolving to Network CDNs (NCDNs) Content providers Content delivery network (CDN) NCDN NCDN NCDN Internet service providers (ISPs) • CDNs commoditized: licensed & managed CDNs • 30+ NCDNs 3X traffic growth (’13-18’), 79% video traffic (’18)* *Cisco Visual Networking Index 2013-18

  12. NCDN model Origin servers NCDN POP Backbone router Content servers Backbone router at exit nodes 12

  13. NCDN management problem Link capacity NCDN management Storage capacity Placement Redirection Routing Content size vector , Content matrix Demand for content i at node j

  14. NCDN placement routing interaction 0.25 Mbps B C 0.5 Mbps Traffic labeled with flow value 0.25 Mbps 8 Mbps 4 Mbps 1.25 Mbps Link labeled with capacity 1.5 Mbps A D 0.75 Mbps Demand = 1 Mbps Demand = 0.5 Mbps Maximum link utilization (MLU) = 0.75/1.5 = 0.5

  15. NCDN placement routing interaction B C 0.5 Mbps Traffic labeled with flow value 8 Mbps 1 Mbps 4 Mbps 0.5 Mbps Link labeled with capacity 1.5 Mbps A D Demand = 1 Mbps Demand = 0.5 Mbps Maximum link utilization (MLU) = 0.75/1.5 = 0.5 = 1/8 = 0.125

  16. NCDN joint optimization • Joint optimization is NP-complete & inapproximable Origin servers Mixed integer program for joint opt. Joint optimization Content matrix Placement, Redirection Optimize: Cost function Constraints: Link capacity Storage capacity … … Routing

  17. Planned NCDN management CM(t) : Content matrix in interval t PRR(t) : Placement, redirection, routing in interval t Planned: CM(t-1) PRR(t) Oracle: CM(t) PRR(t) Timeline (t-1) t (t+1) NCDN joint opt. NCDN joint opt.

  18. Unplanned NCDN management Placement  LRU caching Redirection  Closest hop count Routing  Static shortest-path

  19. Network cost Akamai traces: 7.2M users, 28.2M requests, 1455 TB data Content placement matters tremendously in NCDNs 3x 18%

  20. Traffic engg. vs. static shortest-path routing LRU Caching + Static short-path routing vs. LRU Caching + Traffic engineering Traditional traffic engg. gives small cost reduction in NCDNs 10% or less

  21. Message to NCDNs • Keep it simple • Realistic joint optimization performs worse than simple unplanned • Little room for improvement over simple unplanned • Content placement matters more than routing in Network CDNs

  22. SIGCOMM 2014 Auspice global name service Network CDN

  23. Poor support for mobility in Internet Ungraceful disruptions Unidirectional communication initiation VOIP/Messaging ? Notification service Cloud storage Redundant app-specific mobility support

  24. How global name service handles mobility Global name service (GNS) Alice’s phone IP address new Socket(“ ”); Name/Identity Network location Name-based communication

  25. Scalable global name service (GNS) f0:56:81:c1:c0:eb node1.cs.umass.edu dropbox.com netflix.com/<object> devices in [lat,long,radius] Global name service (GNS) interface device service content group of names Goal: A massively scalable, geo-distributed GNS to enable secure, name-based communication with flexible endpoint principals with arbitrary (fixed) names despite high mobility. 10B devices, 100 addresses/day ≈ 1M updates/sec

  26. Key contributions • Clean-slate naming system design • Arbitrary names • Decentralized root of trust • Scalable name-to-address mapping service • Small lookup latency under high mobility • Deployable in DNS

  27. Security DNS Single root of trust + DNSSEC Domain name IP addresses Decouple certification & resolution Name certification services GNS providers, e.g., Auspice Certificate search services Arbitrary name IP addresses Self-certifying GUID GNS provider

  28. Flexibility Root DNS TLD Domain name IP addresses Auth. NS Name certification services GNS providers, e.g., Auspice “JohnSmith2178@Amherst” “Living room chandelier” “Taxis near Times Square” Auspice IP addresses Arbitrary name Self-certifying GUID GNS provider

  29. Key contributions • Clean-slate naming system design • Arbitrary names • Decentralized root of trust • Scalable name-to-address mapping service • Small lookup latency under high mobility • Deployable in DNS

  30. Active replication cost Total capacity of all servers Lookup cost Update cost (1 active replica/name-record) Update cost (5 active replicas/name-record) • Update cost for name i #active_replicasi X update_ratei

  31. Cost-benefit tradeoff in active replication • Update cost for name i #active_replicasi X update_ratei Replicate-at-all-locations Auspice Update cost Consistent hashing with (static) k-replication Resource limit Name lookup latency

  32. Demand-aware active replication lookup_ratei • #active_replicasof name i update_ratei Geolocality-aware Load-aware i i i j j j S6 S11 S8 S7 S5 S2 S4 S3 S1 S9 S10 j i i j j i

  33. Placement engine Strong consistency Consistent hashing S1 S2 S3 S4 S5 S6 S7 S8 S9 S10 Name i Placement engine for name i Demand-aware replication Active replicas for name i Demand geo-distribution S1 S2 S3 S4 S5 S6 S7 S8 S9 S10 Flexible consistency

  34. Placement schemes comparison Testbed: 16 server cluster emulating with 80 NS an 80 local NS Workload: 90% mobile names (geolocality 0.75), 10% service names Auspice outperforms static placement and a DHT-based scheme Better redirection Better placement

  35. Managed DNS comparison Ultra DNS (16 replicas) vs. Auspice 5/10/15 replicas out of 80 locations One-third update cost, similar latency Auspice reduces cost/latency over today’s managed DNS 60% less latency, similar cost

  36. Auspice GNS summary Enables secure, name-based communication • arbitrary name/location representation • flexible endpoint principals • Key differences from DNS for today’s Internet • decouple certification and resolution • active replication • demand-aware placement

  37. Take-aways Content placement key to leverage geo-distribution Bad news: Redirection, routing can’t redress poor placement Good news:Simple placement works well

  38. Future directions • Simplify management of geo-distributed services Service A Service B • Meta-service for automatic deployment & reconfiguration

  39. Future directions • Secure software platform for in-network computing Set top boxes Cellular base stations Routers

  40. Future directions Content datacenter [Current work] • Energy-efficiency of computing infrastructure Global energy management Network information plane

  41. Additional content

  42. User-perceived latency cost Latency Cost = E2E propagation delay + Link utilization dependent delay Content placement matters tremendously in NCDNs 28%

  43. Traffic engg. schemes comparison for random placement All traffic engg. schemes achieve near-optimal capacity. Static shortest-path routing at most 30% sub-optimal. INFOCOM 2011 43

More Related