
Application Layer Multicasting




  1. Application Layer Multicasting Yash Sheth 42961722 Swapnil Tiwari 63242307 Vishal Sahu 19489633

  2. Application Layer Multicast Multicasting is increasingly used as an efficient and scalable mechanism for disseminating data across heterogeneous systems on the Internet. Today's IP multicast is fragmented into separate "islands" under singular control, and ISPs are reluctant to enable it in order to reduce router load. To overcome this dependence on routers, there is a need for efficient data delivery that does not modify the network. This is where the application layer comes in!

  3. All multicast activity is implemented by the peers instead of the routers. The goal of this multicast protocol is to construct and maintain an efficient overlay network of nodes for data transmission. Overlay networks are formed by the peers that need to transmit or receive the data; they self-organize into logical topologies that facilitate multi-point message forwarding.

  4. Paper Study: • C.K. Yeo, B.S. Lee, M.H. Er, "A survey of application level multicast techniques" • Banerjee, S., Bhattacharjee, B., and Kommareddy, C., "Scalable Application Layer Multicast" • B. Zhang, S. Jamin, L. Zhang, "Host Multicast: A Framework for Delivering Multicast to End Users" • Kyungbaek Kim, Sharad Mehrotra, and Nalini Venkatasubramanian, "FaReCast: Fast, Reliable ALM for Flash Dissemination"

  5. Paper 1: A survey of application level multicast techniques In this paper, we encounter several kinds of overlay networks and their characteristics, and compare their performance on a few standard metrics. Classification of Application Layer Multicast • Overlay Topology Design • Service Model • Architecture

  6. Classification by Overlay Topology • The control topology and data topology together determine the overall structure of the overlay network • Tree-Based Topology: • Application Layer Multicast Architecture (ALMA) • Self-configuring, source-specific trees where the data topology is contained within the control topology. A centralized Directory Server (DS) maintains an updated record of the tree • New members query the DS for a list of potential parents and inform it whenever they change their parent • Each member tracks dual performance metrics, loss rate and RTT, for the peers on its ready-list

  7. This is similar to a gossip algorithm, in which each member shares its end-to-end metric data with its gossip candidates • Members change their parent to the one with the best distance metric, spiraling through their ancestors and siblings • Loop detection in this dynamic tree topology is implemented by maintaining a level number for each member, updated relative to the new parent's level whenever the member finds a new parent (as sketched below)
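
To make the level-number idea concrete, here is a minimal Python sketch of loop-safe parent switching. It is an illustration of the rule described above, not ALMA's actual protocol: the Member class, its method names, and the ancestor walk used to confirm a suspected loop are all our own assumptions.

```python
# Illustrative sketch (not ALMA's wire protocol): each member keeps a level
# number equal to its parent's level + 1.  Before adopting a new parent, a
# member walks the candidate's ancestor chain; finding itself there means
# the switch would close a loop.  Class and method names are hypothetical.

class Member:
    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.level = 0 if parent is None else parent.level + 1

    def has_as_descendant(self, candidate):
        """Walk up from candidate toward the root, looking for self."""
        node = candidate
        while node is not None:
            if node is self:
                return True
            node = node.parent
        return False

    def switch_parent(self, candidate):
        # A candidate at a deeper level *may* lie in our own subtree;
        # the ancestor walk confirms or refutes the suspicion.
        if candidate.level >= self.level and self.has_as_descendant(candidate):
            return False                      # switching would create a loop
        self.parent = candidate
        self.level = candidate.level + 1      # update level w.r.t. new parent
        return True

# Tiny demo: root -> a -> b; b must never become a's parent.
root = Member("root")
a = Member("a", root)
b = Member("b", a)
assert not a.switch_parent(b)   # rejected: b is a's descendant
assert b.switch_parent(root)    # fine: b re-parents directly under the root
```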

  8. Application Level Multicast Infrastructure (ALMI) • Provides multicast middleware implemented above the socket layer, tailored for small groups in a many-to-many service model • A central controller creates a minimum spanning tree (MST) of unicast connections between end hosts (see the sketch below) • Latency between members is used as the link cost of the MST, optimizing the ALMI overlay for low-latency data delivery • Dynamic changes can create different versions of the MST, which the controller tracks by version number to prevent loops and partitions in the new tree • The new MST is periodically sent to members holding the previous version to update their knowledge of connections • A packet carrying a newer version number is not propagated by a member with an older version until that member is updated
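
The controller's central computation is easy to sketch. The following Python snippet builds a latency-weighted MST with Prim's algorithm over an invented four-member session; the member names, RTT values, and function name are hypothetical, and real ALMI additionally version-stamps each tree as described above.

```python
# A sketch of the ALMI controller's core job, assuming it already holds a
# full matrix of measured pairwise latencies: build a minimum spanning
# tree (Prim's algorithm) with latency as the link cost.

import heapq

def latency_mst(nodes, rtt):
    """Return the MST as a list of (parent, child) unicast connections."""
    start = nodes[0]
    in_tree = {start}
    edges = []
    frontier = [(rtt[(start, n)], start, n) for n in nodes if n != start]
    heapq.heapify(frontier)
    while frontier and len(in_tree) < len(nodes):
        cost, parent, child = heapq.heappop(frontier)
        if child in in_tree:
            continue                      # stale entry: already connected
        in_tree.add(child)
        edges.append((parent, child))
        for n in nodes:
            if n not in in_tree:
                heapq.heappush(frontier, (rtt[(child, n)], child, n))
    return edges

# Hypothetical 4-member session with symmetric latencies (ms).
members = ["A", "B", "C", "D"]
rtt = {}
for (x, y), ms in [(("A", "B"), 10), (("A", "C"), 40), (("A", "D"), 35),
                   (("B", "C"), 15), (("B", "D"), 30), (("C", "D"), 5)]:
    rtt[(x, y)] = rtt[(y, x)] = ms

print(latency_mst(members, rtt))   # [('A', 'B'), ('B', 'C'), ('C', 'D')]
```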

  9. Banana Tree Protocol (BTP) • Uses a receiver-based, self-organizing approach to build a shared group data tree, designed mainly for distributed file-sharing applications • The first host to join the group becomes the root; subsequent members learn of the root and join the tree • The tree is incrementally optimized for low delay by allowing a node to switch to a sibling if it is closer than the parent; sibling information is provided to each node by its new parent • Loop prevention is handled by fulfilling these conditions (see the sketch below): • A node rejects all switch attempts while it is itself in the process of switching parents • A node must include its current parent information when switching, so the potential parent can validate that it is indeed a sibling
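A minimal sketch of the two switch conditions, with hypothetical class and method names; real BTP exchanges these checks as protocol messages between hosts rather than direct method calls.

```python
# Sketch of BTP's two loop-prevention checks.  A node switches from its
# parent to a sibling only if (1) the target is not itself mid-switch and
# (2) the request carries the requester's current parent, so the target
# can verify that they really are siblings.

class Node:
    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.switching = False     # True while a parent switch is in flight

    def handle_switch_request(self, requester, claimed_parent):
        # Condition 1: reject everything while we are switching ourselves.
        if self.switching:
            return False
        # Condition 2: the requester must prove we share a parent, i.e.
        # that we are its sibling; a stale claimed_parent fails this test.
        if claimed_parent is not self.parent:
            return False
        requester.parent = self
        return True

root = Node("root")
a, b = Node("a", root), Node("b", root)
assert b.handle_switch_request(a, claimed_parent=root)      # a becomes b's child
a.switching = True
assert not a.handle_switch_request(b, claimed_parent=root)  # rejected mid-switch
```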

  10. Host Multicast (HM) • Provides best-effort delivery to applications while remaining compatible with IP multicast to the furthest extent possible • Automates the interconnection of IP multicast islands and provides multicast to "incapable" end hosts via unicast tunnels • Multicast islands are connected via UDP tunnels between Designated Members (DMs) of the islands • Its distributed tree-building protocol scales to O(N) • A new member discovers the root of the shared tree through a Rendezvous Point (RP), sets it as a potential parent, and retrieves the list of its children • DFS-style search for the closest parent (sketched below) • Loop detection is easy, as each member knows its entire path from the root
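
The join walk can be sketched in a few lines of Python. Everything here is illustrative: rtt() stands in for real round-trip measurements, and the tree is a hard-coded toy; note how the accumulated root path gives each member the loop-detection information mentioned above.

```python
# Sketch of HMTP's join walk (simplified): starting from the root learned
# via the Rendezvous Point, a newcomer repeatedly measures RTT to the
# current candidate and its children and descends toward the closest one,
# a DFS-like greedy search.

def hmtp_join(new_member, root, rtt, children):
    candidate = root
    path = [root]                     # root path doubles as loop detector
    while True:
        kids = children.get(candidate, [])
        # Pick whichever of candidate-and-children is closest to us.
        best = min([candidate] + kids, key=lambda m: rtt(new_member, m))
        if best is candidate:
            return candidate, path    # closest node found: attach here
        candidate = best
        path.append(candidate)        # member ends up knowing its root path

# Hypothetical island layout; distances are symbolic RTTs in ms.
tree_children = {"root": ["dm1", "dm2"], "dm1": ["dm3"]}
dist = {("new", "root"): 80, ("new", "dm1"): 30,
        ("new", "dm2"): 90, ("new", "dm3"): 20}
parent, root_path = hmtp_join("new", "root",
                              lambda a, b: dist[(a, b)], tree_children)
print(parent, root_path)              # dm3 ['root', 'dm1', 'dm3']
```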

  11. Overlay Multicast Network Infrastructure (OMNI) • Targeted at media-streaming applications • A single-source overlay tree aimed at minimizing average end-to-end latency • Each member maintains information about its neighbors and ancestors • Storing the root path aids loop detection

  12. MeshTree • A two-step approach: • Group members self-organize into a control topology called a mesh • A routing protocol runs across this control topology to define a unique path to each member • Advantage over the tree approach: lower construction and maintenance overhead, as the routing protocol itself takes care of loop avoidance and detection

  13. Narada • First a mesh control topology across the member nodes is built; the nodes then self-organize into source-rooted multicast trees, forming a data topology using the DVMRP protocol • Each member maintains state information about all other members, which is periodically exchanged, leading to a large number of messages • The overhead of member discovery and membership maintenance is considerable and limits Narada to small/medium groups • The utility of each link is periodically evaluated, and links may be added or deleted as needed

  14. Kudos • An extension of Narada with a hierarchical topology • Partitions nodes into clusters, each with a unique representative (cluster head). Within each cluster an independent instance of Narada is run; at the top level, Narada runs on an overlay comprising all cluster heads • The data topology consists of independent trees at the bottom level and one tree at the top level • Complex architecture due to cluster migration, diffusion, and splitting • However, it provides superior scalability and low management overhead

  15. Scattercast • Similar to Narada but uses ScatterCast proxies (infrastructure service agents) • Uses Gossamer to self-organize into a mesh control topology; source-rooted distribution trees are then constructed using a routing algorithm • Gossip-style member discovery: each member periodically picks random nodes to exchange membership information with • Latency is used as the cost metric • Mesh optimization uses a cost function (the cost of routing to the different sources via a member's neighbors)

  16. Embedded Structure • Members of the overlay network are assigned logical addresses from some abstract coordinate space • Overlay is built using these logical addresses • Topology Agnostic with some exceptions • Highly Scalable

  17. ALM-CAN • Designed to scale to large groups with a multiple-source service model • Uses the underlying CAN architecture of a virtual d-dimensional space divided into zones • Each node owns its zone and maintains a routing table consisting only of its neighbors • Control topology: the d-dimensional space • Data topology: directed flooding over the control topology • Dissemination by a data-forwarding rule (sketched below): • Forward only to neighbors, except the one the packet was received from • Don't forward if the packet has already travelled half the space forward from the source • When a node joins, the zone splits; when a node leaves, the zones are re-merged • Drawback: long data-distribution paths
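
The forwarding rule is compact enough to sketch. Below is a 1-D analogue (a wrapping ring rather than a d-dimensional torus) of the two rules above; the space size, node placement, and function names are invented for illustration.

```python
# A 1-D analogue of ALM-CAN's directed flooding: nodes sit on a wrapping
# ring of size SPACE (standing in for the d-dimensional torus).

SPACE = 16

def ring_distance(src, here):
    """Shortest wrapping distance between two coordinates."""
    d = abs(here - src) % SPACE
    return min(d, SPACE - d)

def next_hops(src, here, came_from, neighbors):
    return [n for n in neighbors
            if n != came_from                                     # rule 1: no bounce-back
            and ring_distance(src, n) > ring_distance(src, here)  # keep moving away
            and ring_distance(src, n) <= SPACE // 2]              # rule 2: half-space cutoff

# 8 nodes spaced 2 apart; each knows its two ring neighbours.
nodes = list(range(0, SPACE, 2))
nbrs = {n: [(n - 2) % SPACE, (n + 2) % SPACE] for n in nodes}
print(next_hops(src=0, here=2, came_from=0, neighbors=nbrs[2]))  # [4]
print(next_hops(src=0, here=8, came_from=6, neighbors=nbrs[8]))  # [] - flood stops
```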

  18. ALM-DT • Designed to support large multicast group sizes • Established and maintained in a distributed fashion • Each member is assigned (x, y) coordinates • The distance metric is the neighbor test • The control topology consists of combinations of Delaunay triangulations (DTs) • Nodes maintain information only about their neighbors • A source-rooted data topology is embedded in the DT • Uses compass routing • New nodes join via the DT server • A set of alternative routes is available in case of failure • Drawback: suboptimal mapping of the overlay to the physical network

  19. Compass Routing
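
As a concrete illustration of compass routing: at every hop, the packet is handed to the neighbor whose direction deviates least, in angle, from the straight line toward the destination. The coordinates and helper names below are hypothetical.

```python
# Compass routing in a nutshell: forward to the neighbour that minimizes
# the angle between the edge to that neighbour and the ray toward the
# destination.  On a Delaunay triangulation this greedy rule reaches the
# destination.

import math

def angle_between(p, a, b):
    """Angle at p between rays p->a and p->b."""
    ax, ay = a[0] - p[0], a[1] - p[1]
    bx, by = b[0] - p[0], b[1] - p[1]
    dot = ax * bx + ay * by
    na, nb = math.hypot(ax, ay), math.hypot(bx, by)
    return math.acos(max(-1.0, min(1.0, dot / (na * nb))))

def compass_next_hop(here, dest, neighbors):
    # Pick the neighbour minimizing the deviation from the here->dest ray.
    return min(neighbors, key=lambda n: angle_between(here, n, dest))

# Toy example: route from (0,0) toward (10,0) with three neighbours.
here, dest = (0.0, 0.0), (10.0, 0.0)
nbrs = [(1.0, 3.0), (2.0, 0.5), (0.0, -2.0)]
print(compass_next_hop(here, dest, nbrs))   # (2.0, 0.5): smallest angle
```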

  20. Bayeux • Designed primarily for fault tolerance • Control topology: Tapestry overlay • Data topology: • Built explicitly over the control overlay • Consists of nodes acting as software routers as well as multicast receivers • Routing is done using local routing maps stored at each node • Any destination can be reached in log_b N steps • Topology-aware • Joining and leaving the group generate more traffic, since more group-membership information is maintained • The root is a single point of failure • This can be avoided by a multicast-tree partitioning mechanism
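
The log_b N bound comes from Tapestry's digit-by-digit ID matching: every hop fixes at least one more base-b digit of the destination identifier. A toy hop-count sketch, with made-up hexadecimal IDs and prefix-style matching assumed for simplicity:

```python
# Why Bayeux reaches any node in about log_b(N) hops: each routing step
# fixes one more base-b digit of the destination ID, so the hop count is
# bounded by the ID length.  IDs here are invented 4-digit hex strings.

def prefix_route_hops(src_id, dst_id):
    """Number of hops = number of leading digits that still differ."""
    shared = 0
    for s, d in zip(src_id, dst_id):
        if s != d:
            break
        shared += 1
    return len(dst_id) - shared

# 4-digit hex IDs: at most 4 hops for up to 16**4 nodes (b=16, log_16 N).
print(prefix_route_hops("4A1B", "4AF3"))   # 2 hops: "4A" already shared
```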

  21. Routing in Bayeux

  22. Join Request in Bayeux

  23. NICE • Designed for low-bandwidth data-streaming applications with large receiver sets • The distance metric is the latency between members • Control topology: a multilevel hierarchy • A distributed clustering protocol at each layer chooses a cluster leader, the graph-theoretic center of the cluster in layer Li, to join layer Li+1 • The cluster size at each layer is between k and 3k−1, where k is a constant, and each cluster comprises members close to each other • Joining operation (sketched below): • A new member is bootstrapped by the RP to the hosts in the highest layer (Li) of the hierarchy • The joining host contacts the members of this layer to find the closest one by latency • The selected node informs the joining node about its peers in layer Li−1 • The process continues iteratively until the node locates its cluster in layer L0
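
A compact sketch of this layered descent, with an invented two-layer hierarchy and RTT table; latency() and the dictionary layout are our own stand-ins for NICE's probe messages.

```python
# Sketch of NICE's layered join: starting from the top-layer hosts handed
# out by the RP, the newcomer keeps picking the latency-closest member
# and descends one layer at a time until it reaches its L0 cluster.

def nice_join(new_host, layers_top_down, latency):
    """Each dict maps a layer-Li member to its cluster in layer L(i-1)."""
    candidates = list(layers_top_down[0].keys())   # hosts in the highest layer
    for layer in layers_top_down:
        leader = min(candidates, key=lambda m: latency(new_host, m))
        candidates = layer[leader]                 # descend into leader's cluster
    return candidates                              # the L0 cluster to join

# One L1 layer over two L0 clusters; RTTs from the newcomer, in ms.
hierarchy = [{"A": ["A", "a1", "a2"], "B": ["B", "b1"]}]
rtt = {"A": 50, "a1": 20, "a2": 60, "B": 90, "b1": 95}
print(nice_join("new", hierarchy, lambda me, m: rtt[m]))   # ['A', 'a1', 'a2']
```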

  24. Multilayered hierarchy in NICE

  25. NICE (contd.) • Topology-aware, since proximity is based on the latency between nodes • In the control topology, all members of each cluster peer with each other and exchange refresh messages incorporating the latencies between them • Members also probe peers in the supercluster to identify a new cluster leader and adapt to changing network conditions • Cluster leaders are also responsible for splitting and merging clusters to maintain the size bounds • Worst-case control overhead: O(k log N) • Number of application-level hops between any pair of members: O(log N)

  26. NICE (contd.) • Data topology: an embedded source-specific tree with a simple forwarding rule (sketched below) • The source sends the packet to all its peers in the control topology • A receiver forwards packets only if it is a cluster leader • Loops and partitions may occur due to stale cluster membership or node failure • There are no special handling mechanisms; the protocol depends on cluster members and leaders to restore the hierarchical relationships and reconcile the cluster view for all members
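
The forwarding rule itself fits in a few lines. This sketch floods a packet under the leader-only-forwards rule over a hypothetical five-host membership; peers_of and is_leader abstract away the cluster bookkeeping.

```python
# NICE's forwarding rule in two lines of logic: the source sends to all of
# its control-topology peers, and a receiver forwards onward only when it
# is a cluster leader (and never back to the sender).

def deliver(source, peers_of, is_leader):
    """Return the set of hosts reached from `source` under the NICE rule."""
    reached, frontier = {source}, [(source, None)]
    while frontier:
        node, came_from = frontier.pop()
        if node != source and not is_leader(node):
            continue                      # plain receivers do not forward
        for p in peers_of(node):
            if p != came_from and p not in reached:
                reached.add(p)
                frontier.append((p, node))
    return reached

# Leader "L" bridges two clusters; source "s" multicasts, L relays onward.
peers = {"s": ["L", "x"], "x": ["s", "L"], "L": ["s", "x", "y", "z"],
         "y": ["L", "z"], "z": ["L", "y"]}
print(sorted(deliver("s", lambda n: peers[n], lambda n: n == "L")))
# -> ['L', 's', 'x', 'y', 'z']
```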

  27. Flat Topology vs. Hierarchical Topology • In a flat topology, a single-layer overlay sits atop the physical network • In a hierarchical topology, a hierarchy is introduced into the overlay, forming a multi-layer overlay • Hierarchical topologies are highly scalable and incur low management overhead • However, child nodes cannot form overlay links across clusters • Additional overhead is required for cluster maintenance

  28. Service Models • The mechanism by which data is delivered to multiple destinations • Best-effort vs. reliable delivery • Source-specific vs. any-source delivery • Determines the types of application the various multicast techniques can support (e.g., video conferencing vs. data replication)

  29. Best Effort vs. Reliable Transfer • Reliability here refers to end-to-end reliability, unlike TCP, which provides only hop-by-hop reliability across the overlay • Best-effort delivery must be augmented with data duplication and retransmission • This puts pressure on the source to handle retransmission and rate control • Receivers can implement buffering and rate-limiting capabilities to assist the source node

  30. Best Effort vs. Reliable Transfer (contd.) • ALMI uses a direct connection to the source for retransmission during error recovery • Scattercast uses Scalable Reliable Multicast (SRM) to recover lost data: a recovery request is forwarded toward the source through intermediate proxy nodes (SCXs) until one of the SCXs can recover the data • Some protocols use heuristics to choose the best outgoing path or to predict the shortest and most reliable link to the next hop

  31. Source-Specific vs. Any-Source • Source-specific • Builds a data topology tree rooted at the source • Simple, and grows linearly with node additions • Overhead of maintaining and optimizing multiple trees • Any-source • Any member can be the root of the tree • Less overhead in tree building and management • But not optimized for a single source

  32. Source-Specific vs. Any-Source (contd.) • The type of model adopted by a protocol depends on the targeted applications • Scattercast uses the single-source model, as it is heavily used in media streaming and broadcasting • ALMI, which primarily targets collaborative applications, uses the any-source model

  33. Architecture • Peer-to-Peer (P2P) • All functionality is vested in end users • Simple to set up and deploy • Provides redundancy and resource sharing • Proxy Support • Dedicated servers strategically deployed within the network to provide efficient data distribution and value-added services • Better equipped than individual hosts to provide efficient services • Usually static in nature, hence less responsive to changing conditions • Problems with deployment

  34. Peer-to-Peer vs. Proxy (contd.) • OMNI deploys proxies called Multicast Service Nodes (MSNs); prior to data transfer, the MSNs organize themselves into a data-delivery tree • Overcast organizes proxies called Overcast nodes into a distribution tree rooted at a single source; the proxies also provide storage capability in the network • Scattercast uses its proxies (SCXs) to build an application-aware framework (e.g., rate adaptation); SCXs are also used to repair partitions in the overlay network • Host Multicast (HM) uses Designated Members (DMs) to build a shared data tree; DMs are not proxies, as they are not dedicated server nodes

  35. Architecture • Centralized • All responsibility for group management, overlay computation, and optimization lies with a central controller • Leads to a simpler routing algorithm and efficient, simple group management • However, limited scalability and a central point of failure • Distributed • Group management and overlay computation are vested in the individual nodes of the network • Requires some mechanism for a host to learn about other nodes in the multicast group • More robust to failure, and scalable • Excessive overhead; not optimal or efficient in building the overlay network

  36. Architecture (contd.) • Hybrid • A lightweight controller combined with a distributed algorithm • The controller is used to facilitate group management, error recovery, and parts of overlay construction • The actual overlay construction is done by the individual nodes

  37. Centralized, Distributed, and Hybrid (contd.) • ALMI is the only one that uses a completely centralized approach, where the session controller has complete knowledge of membership • The Overcast protocol maintains a global registry through which proxies learn about the overlay network • SCRIBE requires a contact node in the overlay network that is responsible for bootstrapping new nodes • NICE and TBCP need a rendezvous point to learn about higher-level/root nodes • ALMA and HM are examples of the hybrid approach: both use a centralized server as a rendezvous point for new members, and the RP maintains membership information

  38. Performance Comparison • Scalability • Limited by protocol efficiency • The number of receivers supported can be a good metric • Another metric is the protocol overhead • A system that scales further is not necessarily better than one that does not scale to the same extent!

  39. Performance Comparison

  40. Performance Comparison • Efficiency of protocols depends on factors such as: • Quality of the data paths generated • Protocol overhead • Time and resources used to construct the application layer multicast overlay • Amount of state information required at each node to maintain the overlay

  41. Performance Comparison Quality of data path: • Stretch • Also called the Relative Delay Penalty • A measure of the increase in latency that applications incur when using overlay routing • Stretch = (protocol's unicast routing distance) / (IP unicast routing distance) • Stress • A measure of the protocol's effectiveness in distributing network load across different physical links • Defined on a per-node, per-link basis • Counts the number of identical packets sent by the protocol over that link • All comparisons are made against IP multicast • As these factors are difficult to measure directly, proxies such as path length (for stretch) and out-degree (for stress) can be used for analysis
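
A worked micro-example of both metrics, with invented numbers:

```python
# Stretch compares the overlay path distance against the direct IP unicast
# distance per receiver; stress counts identical copies of one packet
# crossing a single physical link.  All values are invented.

# Overlay path S -> A -> B has unicast legs of 30 ms + 25 ms; direct IP
# unicast S -> B would take 40 ms.
overlay_distance = 30 + 25
ip_unicast_distance = 40
stretch_B = overlay_distance / ip_unicast_distance
print(f"stretch at B: {stretch_B:.2f}")          # 1.38 (38% latency penalty)

# If S unicasts the same packet to A and B and both overlay paths share
# S's access link, that physical link carries 2 identical copies.
copies_on_access_link = 2
print(f"stress on S's access link: {copies_on_access_link}")
```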

  42. Performance Comparison • Protocol Overhead • Measured as the total bytes of non-data traffic that enter the network • Includes control and maintenance traffic, probe traffic, etc. • State information maintained by a node also contributes to the protocol overhead • Failure Tolerance • Depends on the various mechanisms put in place for error and failure recovery • Whether the protocol is prone to a single point of failure, or whether failures can be contained or localized

  43. Paper 2: Host Multicast: A Framework for Delivering Multicast to End Users In this paper, we study Host Multicast (HM), an application-layer technique. Along with overcoming the deployment and control pitfalls of IP multicast, HM is compatible with the existing IP multicast infrastructure. The paper goes on to discuss the Host Multicast Tree Protocol (HMTP), suggest some performance tuning, and provide a performance evaluation of the technique.

  44. Host Multicast • A hybrid of application-layer multicast and IP multicast • The Host Multicast framework provides best-effort delivery to applications • Compatibility with IP multicast allows it to accept and forward packets from existing IP multicast networks, making it deployable on the existing Internet • No support is required from the OS, routers, or servers; hence it supports data transmission over heterogeneous systems irrespective of the network

  45. HM Architecture [Figure: IP multicast islands of hosts, each with a Designated Member (DM); DMs are linked by unicast tunnels and coordinated through the Host Multicast Rendezvous Point (HMRP).] • Each cloud in the figure is an existing IP multicast island linked to the others by a communication channel; HM is used to distribute data over such a network

  46. HM Architecture • Every island has a Designated Member (DM) assigned to communicate with the other islands • Any two DMs are connected via a unicast tunnel over UDP • All DMs run HMTP to self-organize their islands into a bi-directional shared tree, with one special node assigned as the root • From any member to the root there is only one loop-free path • Incoming data is replicated and sent to the remaining neighbors, and is thus capable of reaching any node on the tree • Each multicast group has a Group ID and is listed in a session directory stored at the Host Multicast Rendezvous Point (HMRP)

  47. Host Multicast Tree Protocol (HMTP) • Builds a bi-directional shared tree connecting all islands • To be efficient, the tree should be congruent with the physical network topology: the current design uses member-to-member round-trip time as the distance metric • The tree should be robust: able to handle node failures, dynamic joins/leaves, etc. • In HMTP, members maintain the tree structure by periodically exchanging messages with their neighbors (see the sketch below) • The mechanisms to detect node failure and loop formation are described further on.
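
As a taste of that maintenance machinery, here is a sketch of refresh-timeout bookkeeping; the interval and threshold constants are our assumptions, not HMTP's actual values.

```python
# Sketch of the periodic-refresh bookkeeping an HMTP member might run: a
# parent that misses several refresh intervals is declared dead, which
# triggers a rejoin through the Rendezvous Point (HMRP).

import time

REFRESH_INTERVAL = 5.0      # seconds between REFRESH messages (assumed)
DEAD_AFTER = 3              # missed refreshes before declaring failure

def parent_alive(last_refresh_ts, now=None):
    now = time.time() if now is None else now
    return (now - last_refresh_ts) < REFRESH_INTERVAL * DEAD_AFTER

print(parent_alive(last_refresh_ts=100.0, now=112.0))   # True: 12 s < 15 s
print(parent_alive(last_refresh_ts=100.0, now=120.0))   # False: rejoin via HMRP
```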
