Improving the Performance of Circuit-Switched Interconnection Network for a Multi-FPGA System
- 1 December 2021
- journal article
- research article
- Published by Institute of Electronics, Information and Communications Engineers (IEICE) in IEICE Transactions on Information and Systems
- Vol. E104.D (12), 2029-2039
- https://doi.org/10.1587/transinf.2021pap0002
Abstract
Multi-FPGA systems have gained attention because of their high performance and power efficiency. A multi-FPGA system called Flow-in-Cloud (FiC) is currently being developed as an accelerator for multi-access edge computing (MEC). FiC consists of multiple mid-range FPGAs tightly connected by high-speed serial links. Since time-critical jobs are assumed in MEC, a circuit-switched network with static time-division multiplexing (STDM) switches has been implemented on FiC. This paper investigates techniques for enhancing the interconnection performance of FiC. Unlike switching fabrics for Networks-on-Chip or parallel machines, economical multi-FPGA systems such as FiC use Xilinx Aurora IP and FireFly cables with multiple lanes. We adopted link aggregation and slot distribution to make use of the multiple lanes. To mitigate the bottleneck between an STDM switch and user logic, we also propose a multi-ejection STDM switch. We evaluated various combinations of our techniques by using three practical applications on an FiC prototype with 24 boards. When the number of slots was large and the transferred data size was small, slot distribution was sometimes more effective, while link aggregation was superior in most other cases. Our multi-ejection STDM switch mitigated the bottleneck in ejection ports and successfully reduced the number of time slots. As a result, by combining link aggregation and the multi-ejection STDM switch, communication performance improved by up to 7.50 times with few additional resources. Although the performance of the fast Fourier transform, which has the highest communication ratio, could not be enhanced by using multiple boards when a single lane was used, a 1.99-fold performance improvement over a single board was achieved by using 8 boards with four lanes and our multi-ejection switch.
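The abstract's claim that extra ejection ports reduce the number of time slots can be illustrated with a toy model. The sketch below is not the paper's switch design. It assumes a single STDM switch in which each time slot connects every input port to at most one output port and every output port to at most one input port, so the slot count is bounded below by the busiest port. The port names (`N`, `S`, `E`, `W`, `eject`) and the function `slots_needed` are hypothetical, introduced only for illustration.

```python
from collections import Counter
from math import ceil


def slots_needed(flows, ejection_ports=1):
    """Return the minimum slot count for one switch in this toy model.

    flows: list of (input_port, output_port) pairs. The output port
    "eject" (a hypothetical label) means delivery to local user logic.
    """
    out_load = Counter(out for _, out in flows)
    in_load = Counter(inp for inp, _ in flows)
    # Assumption: flows to user logic are spread evenly over the
    # available ejection ports.
    eject = ceil(out_load.pop("eject", 0) / ejection_ports)
    # The busiest port determines how many slots one cycle must have.
    return max([eject, *out_load.values(), *in_load.values()], default=0)


# Four neighbours send to local user logic; one flow passes through.
flows = [("N", "eject"), ("S", "eject"), ("E", "eject"),
         ("W", "eject"), ("N", "S")]

print(slots_needed(flows, ejection_ports=1))
print(slots_needed(flows, ejection_ports=4))
```

With a single ejection port, the four flows bound for user logic must each take their own slot. With more ejection ports in this model, the limit shifts to the busiest remaining port, here input `N`, which carries two flows.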