Parallel IP Lookup using Multiple SRAM-based Pipelines

Parallel IP Lookup using Multiple SRAM-based Pipelines Authors: Weirong Jiang and Viktor K. Prasanna Presenter: Yi-Sheng, Lin (林意勝) Date: 2008.12.10 Publisher/Conf. : Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on Dept. of Computer Science and Information Engineering National Cheng Kung University, Taiwan R.O.C.

Outline • Introduction • Related Work • Architecture Overview • Memory Balancing • Traffic Balancing • Performance Evaluation • Conclusion

Introduction • Multiple pipelines can be utilized in parallel to improve the throughput further. • The memory distribution over different pipelines as well as across different stages of each pipeline must be balanced. • The traffic among these pipelines should be balanced. IP/prefix caching to utilize the locality of Internet traffic. [1, 14] [1] M. J. Akhbarizadeh, M. Nourani, R. Panigrahy, and S. Sharma. A TCAM-based parallel architecture for highspeed packet forwarding. IEEE Trans. Comput.,56(1): 58–72, 2007. [14] D. Lin, Y. Zhang, C. Hu, B. Liu, X. Zhang, and D. Pao. Route table partitioning and load balancing for parallel searching with TCAMs. In Proc. IPDPS ’07, pages 1–10.

Introduction • The caching may fail to capture the traffic locality due to the long pipeline delay. A flow pre-caching scheme benefits from deep pipelining since it utilizes the inherent caching in the architecture. • The intra-flow packets may go out of order.  An approach called payload exchange, which exploits the pipeline delay, is used to maintain the intra-flow packet order.

Related Work

Related Work • Most published parallel IP lookup engines are TCAM-based. They partition the full routing table into several blocks, and make the search process parallel on different blocks. Ex : Trie-based approaches  Splits the trie by carving subtries. [24] F. Zane, G. J. Narlikar, and A. Basu. CoolCAMs: Powerefficient TCAMs for forwarding engines. In Proc. INFOCOM’03, pages 42–52.

Architecture Overview

Architecture Overview • Lookup Engines : • The routing table is constructed as a leaf-pushed uni-bit trie. • To store the mapping function between subtries and pipelines, several small memories called Destination Index Tables (DITs) are used. • By searching the DIT, the packet also retrieves the address of the subtrie’s root in the first stage of the pipeline. • Each pipeline employs a multi-port queue to handle the access conflicts when multiple incoming packets are directed to the same pipeline.

Architecture Overview • Load Balancer : • Caching is an efficient way to exploit Internet traffic locality for parallel IP lookup. • Define a sequence of packets with the same destination IP address as a flow. • We propose a scheme called flow pre-caching, which allows the destination IP address of a flow to be cached before its next-hop information is retrieved. • If the intra-flow out-oforder packet is detected, a task to exchange the payload between out-of-order packets is initiated.

Memory Balancing

Memory Balancing • Experimental results

Memory Balancing

Memory Balancing • Experimental results

Traffic Balancing • Flow Pre-Caching

Traffic Balancing • Detecting out-of-order packets

Performance Evaluation

Conclusion • This paper proposed a parallel SRAM-based multipipeline architecture for terabit trie-based IP lookup. • Memory and traffic balancing, and intra-flow packet ordering were identified as three major problems. • Our future work includes applying the SRAM-based pipeline architectures to multidimensional packet classification and deep packet inspection.

Parallel IP Lookup using Multiple SRAM-based Pipelines

Parallel IP Lookup using Multiple SRAM-based Pipelines

Presentation Transcript

ip lookup

Content-Based Route Lookup using CAMs

SRAM-based FPGA

Bluespec-4: Architectural exploration using IP lookup Arvind

IP-Lookup and Packet Classification

Example RAD Design: IP Router using Fast IP Lookup

High-performance TCAM-based IP Lookup Engines

PARALLEL-SEARCH TRIE-BASED SCHEME FOR FAST IP LOOKUP

FlumeJava Easy, Efficient Data-Parallel Pipelines

Creating Multiple Web Sites using Multiple IP Addresses

PIPELINES Parallel Level 1

MIPS Extension for a TCAM Based Parallel Architecture for Fast IP Lookup

FRUGAL IP LOOKUP BASED ON A PARALLEL SEARCH

What Is My IP Lookup Tool

Best IP Address Lookup Tool

What is My IP Address Lookup

An Efficient IP Address Lookup Algorithm Using a Priority Trie

SRAM

Fast IP Address Lookup Algorithms

IP Address Lookup

Multi-Terabit IP Lookup Using Parallel Bidirectional Pipelines

Parallel-Search Trie-based Scheme for Fast IP Lookup