
QoS, QoS Baby


Presentation Transcript


  1. QoS, QoS Baby OpenStack Barcelona 2016

  2. Speakers • Anne McCormick – Software Engineer, Cisco – @amccormi4 • Robert Starmer – CTO & Principal, Kumulus Technologies – @rstarmer • Alka Sathnur – Ops QA, Cisco – @alkasat12

  3. Topics • Traditional QoS Concepts • Current/Future OpenStack QoS • Beyond the Network: Compute/Storage Bottlenecks and Differentiation • Use Case • Q/A

  4. Traditional QoS Concepts

  5. Quality of service (QoS) is the overall performance of a telephony or computer network, particularly the performance seen by the users of the network. - Wikipedia

  6. What is Quality of Service? • Network-centric view of resource availability and reliability • Provides a model for understanding and manipulating network impact on service delivery • Jitter, latency, and loss are all important aspects of a communications channel

  7. “QoS” in OpenStack

  8. QoS in the “physical” Network • Initial QoS was managed by routers • Committed Information Rate (CIR) • Routers matched bandwidth between different networks • Handling contention led to QoS policies or classes • Priority • Multi-queue • And multiple models of handling those queues • FIFO • WRED
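
A Committed Information Rate is typically enforced with a token bucket: tokens accrue at the committed rate, and a packet is forwarded only if enough tokens are available. A minimal illustrative sketch (the rate and burst values below are arbitrary assumptions, not figures from the talk):

```python
import time

class TokenBucket:
    """Toy token-bucket rate limiter illustrating a Committed Information Rate."""

    def __init__(self, rate_bytes_per_sec, burst_bytes):
        self.rate = rate_bytes_per_sec   # committed rate (CIR)
        self.capacity = burst_bytes      # committed burst size
        self.tokens = burst_bytes
        self.last = time.monotonic()

    def allow(self, packet_bytes):
        """True if the packet conforms to the CIR, False if it should be queued/dropped."""
        now = time.monotonic()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= packet_bytes:
            self.tokens -= packet_bytes
            return True
        return False

# Example: a 1 Mbit/s CIR with a 32 KB burst allowance (illustrative values)
bucket = TokenBucket(rate_bytes_per_sec=125_000, burst_bytes=32_768)
print(bucket.allow(1500))   # a 1500-byte packet conforms while tokens remain
```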

  9. QoS in Layer 2 Networks • L2 networks tend to try to avoid ever storing packets • Less chance to manage different flows of traffic • But L2 networks really aren’t L2 any more • So we can classify traffic and, if necessary, queue it • Really helps when you have multiple types of traffic, such as storage and voice or video, on the same network

  10. Current/Future OpenStack QoS

  11. QoS in the early days • RXTX Factor • nova-network based “sharing” algorithm • Based on nova flavor metadata • Neutron Mitaka • ML2 extension • SR-IOV, OVS, Linux Bridge “bandwidth” limitations (e.g. rx/tx factor) • Neutron Newton • As with Mitaka • Adds DSCP marking ← This is a big deal
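
As a hedged illustration of the Mitaka-era Neutron QoS extension, something like the following openstacksdk calls creates a bandwidth-limit policy and attaches it to a port. The cloud name, policy name, rate values, and port UUID are assumptions for the example; check the method names against your SDK version:

```python
import openstack

# Connect using a cloud defined in clouds.yaml (the name "mycloud" is an assumption)
conn = openstack.connect(cloud="mycloud")

# Create a QoS policy and add a bandwidth-limit rule (values are illustrative)
policy = conn.network.create_qos_policy(name="limit-1mbps", shared=False)
conn.network.create_qos_bandwidth_limit_rule(policy, max_kbps=1000, max_burst_kbps=100)

# Attach the policy to an existing Neutron port (the UUID is a placeholder)
conn.network.update_port("PORT_UUID", qos_policy_id=policy.id)
```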

  12. Rate Limiting • Seems like a straightforward approach: • Like non-oversubscribed processors • Sharing fixed IOPS limits on a storage array • Rate limiting flows or specific services can have unintended consequences: • Dramatic impact to “goodput” vs. “throughput” • Particularly bursty applications can become unstable

  13. DSCP Marking • Let’s help the network out • Mark packets so that the network infrastructure has better information to go on • Execute marking at the application/OS level (in the VM), or • Execute marking at the switch input • Not a panacea • May still have “goodput” impact • At least provides a better interaction for determining who gets access to the available bandwidth resources
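
In Newton and later, a DSCP marking rule can be added to the same kind of Neutron QoS policy so that upstream switches and routers can classify the traffic. A hedged sketch, reusing the policy name from the previous example; the cloud name and the DSCP value 26 (AF31) are illustrative assumptions:

```python
import openstack

conn = openstack.connect(cloud="mycloud")            # cloud name is an assumption

# Look up the earlier policy and add a DSCP marking rule (Newton+);
# 26 corresponds to AF31 and is only an illustrative marking value.
policy = conn.network.find_qos_policy("limit-1mbps")
conn.network.create_qos_dscp_marking_rule(policy, dscp_mark=26)
```

Ports or networks carrying the policy then have their egress traffic marked by the agent, which is what lets the physical network prioritize it rather than treating all tenant traffic equally.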

  14. Beyond the Network So you got the traffic there faster… now what? Compute and storage bottlenecks!

  15. Compute Bottlenecks … and how to alleviate them

  16. [Diagram: nova-scheduler on the controller node placing workloads across Compute1, Compute2 … ComputeN]

  17. [Diagram: nova-scheduler places two VeryImportant™ VMs on a node alongside a CPU hog, contending for the same cores]

  18. [Diagram: the same placement with the CPU hog removed; the VeryImportant™ VMs no longer share their node with it]

  19. Cost of CPU Sharing/Context Switching • Ran a simple OpenStack multicast iperf test • Network highly optimized for multicast (SR-IOV port, multiple rx queues with maximum queue size, RSS, ARFS, QoS) • iperf receiver on a tenant VM, receiving a steady 800 Mbits/sec multicast stream • When context switching, the receiver experienced up to 0.2% packet loss, particularly when switching across NUMA nodes (as opposed to switching within the same node)

  20. Compute Resource Differentiation/Prioritization • Host aggregates – define separate groups of compute hosts • Flavors – define hardware needs such as number of cores, CPU capabilities/limits, affinity/anti-affinity, etc., via host filters • CPU pinning/NUMA awareness – pin VMs to dedicated cores to prevent context switches across NUMA nodes (see the sketch below)
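
A hedged openstacksdk sketch of the aggregate/flavor/pinning combination above. The cloud name, aggregate name, host name, and flavor sizing are assumptions, and the SDK method names should be verified against your openstacksdk release; the `hw:cpu_policy` and `hw:numa_nodes` keys are standard Nova extra-spec conventions:

```python
import openstack

# Connect via a cloud entry in clouds.yaml (the name "mycloud" is an assumption)
conn = openstack.connect(cloud="mycloud")

# Group the dedicated hosts into their own aggregate (host name is a placeholder)
agg = conn.compute.create_aggregate(name="pinned-hosts")
conn.compute.add_host_to_aggregate(agg, "compute-01")

# Flavor for the VeryImportant(TM) VMs: dedicated (pinned) CPUs and a single
# NUMA node, so the guest never context-switches across NUMA boundaries.
flavor = conn.compute.create_flavor(name="vip.pinned", ram=8192, vcpus=4, disk=40)
conn.compute.create_flavor_extra_specs(flavor, {
    "hw:cpu_policy": "dedicated",   # pin guest vCPUs to host cores
    "hw:numa_nodes": "1",           # keep the guest on one NUMA node
})
```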

  21. Storage Bottlenecks … and how to alleviate them

  22. [Diagram: a VeryImportant™ VM and an I/O hog on compute nodes driving I/O traffic to shared backends Storage1, Storage2 … StorageN]

  23. Cost of Storage Contention • Ran a simple OpenStack read/write I/O test • Two VMs running on the same host, different volumes • 3 Ceph nodes, active/active/active • When reading simultaneously, both VMs experienced an 80 MB/s drop in read rate • When writing simultaneously, both experienced a 100 MB/s drop in write rate

  24. Storage Resource Differentiation/Prioritization • Host aggregates – define separate groups/clusters of storage servers • Flavors – define I/O bandwidth limits for VMs (outbound traffic) • Differentiate at the storage backend • Cinder has QoS specs, volume types, and priority (more IOPS to particular volumes); see the sketch below • Ceph has storage types and the ability to limit IOPS if needed • AFAIK, Swift does not have the ability to differentiate/prioritize storage resources at the backend
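
For the Cinder side, a hedged sketch with python-cinderclient: a front-end QoS spec capping IOPS is created and associated with a volume type. The auth values, volume-type name, and IOPS cap are illustrative assumptions, not settings from the talk:

```python
from keystoneauth1 import session as ks_session
from keystoneauth1.identity import v3
from cinderclient import client as cinder_client

# Auth values are placeholders for a real deployment
auth = v3.Password(auth_url="http://keystone:5000/v3",
                   username="admin", password="secret",
                   project_name="admin",
                   user_domain_name="Default", project_domain_name="Default")
cinder = cinder_client.Client("3", session=ks_session.Session(auth=auth))

# Volume type for lower-priority tenants, capped via a front-end QoS spec
vtype = cinder.volume_types.create("bronze")
qos = cinder.qos_specs.create("bronze-iops",
                              {"consumer": "front-end",
                               "total_iops_sec": "500"})   # illustrative IOPS cap
cinder.qos_specs.associate(qos, vtype.id)
```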

  25. Conclusion • Network QoS is only a partial solution • To guarantee resources for mission-critical applications and data, a solution across all cloud resources (network, compute, storage) must be used • It is complicated to get this right across all resources, but it can be done

  26. Use Case

  27. Real World Use Case Bringing an existing Content Delivery Network composed of bare-metal cache nodes onto an OpenStack platform

  28. Content Delivery Network • A Delivery Service is a software structure in OMD that maps an origin source to Traffic Servers by Fully Qualified Domain Name (FQDN); the FQDN is in the Request URI from the client media player • Cache groups can belong to a single Delivery Service or to multiple Delivery Services • A Cache Group is a logical grouping for HA; each cache is typically in a different location to provide site-level redundancy, and each cache in a cache group has a single set of geo coordinates • Enables: multiple content sources, per-content-source cache/storage, and intelligent load balancing [Diagram: Origin Servers feeding a mid-tier cache of Traffic Servers and edge cache groups of Traffic Servers, with Orchestration, Control Server, CDN Monitor, and CDN Analytics components]

  29. Content Delivery Network: Edge Cache Groups and Storage Clusters [Diagram: the same CDN topology as slide 28, with an orchestration Director and edge cache groups backed by storage clusters; the Delivery Service and Cache Group description repeats from the previous slide]

  30. Use Case Summary Dynamically expanding a Content Delivery Network is possible, provided the Orchestrator ensures that network, compute and storage give top priority to the application traffic.

  31. ¿Preguntas? (Questions?)

  32. ¡Gracias! (Thank you!)
