FlexDriver: a network driver for your accelerator
- 22 February 2022
- conference paper
- Published by Association for Computing Machinery (ACM) in Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
Abstract
We propose a new system design for connecting hardware and FPGA accelerators to the network, allowing the accelerator to directly control commodity Network Interface Cards (NICs) without using the CPU. This enables us to solve the key challenge of leveraging existing NIC hardware offloads such as virtualization, tunneling, and RDMA for accelerator networking. Our approach supports a diverse set of use cases, from direct network access for disaggregated accelerators to inline-acceleration of the network stack, all without the complex networking logic in the accelerator. To demonstrate the feasibility of this approach, we build FlexDriver (FLD), an on-accelerator hardware module that implements a NIC data-plane driver. Our main technical contribution is a mechanism that compresses the NIC control structures by two orders of magnitude, allowing FLD to achieve high networking scalability with low die area cost and no bandwidth interference with the accelerator logic. The prototype for NVIDIA Innova-2 FPGA SmartNICs showcases our design’s utility for three different accelerators: a disaggregated LTE cipher, an IP-defragmentation inline accelerator, and an IoT cryptographic-token authentication offload. These accelerators reach 25 Gbps line rate and leverage the NIC for RDMA processing, VXLAN tunneling, and traffic shaping without CPU involvement.
Funding Information
- Israel Science Foundation (1027/18)