Analytic modeling of network processors for parallel workload mapping
- 22 April 2009
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Embedded Computing Systems
- Vol. 8 (3), 1-29
- https://doi.org/10.1145/1509288.1509290
Abstract
Network processors are heterogeneous system-on-chip multiprocessors that are optimized to perform packet forwarding and processing tasks at Gigabit data rates. To meet the performance demands of increasing link speeds and complex network applications, network processors are implemented with several dozen embedded processor cores and hardware accelerators that run multiple packet processing applications in parallel. The parallel nature of the processing system makes it increasingly difficult for application developers to understand and manage resources and map processing tasks to the hardware. To address this problem, we present a methodology for profiling and analyzing network processor applications, mapping processing tasks to a generalized network processor architecture, and analytically determining the expected throughput performance. The key novelty of this work is not only the adaptation of application analysis and mapping algorithms to heterogeneous network processors, but also that the entire process can be automated and hidden from the application developer. Starting with the analysis of a uniprocessor implementation of the application, the process yields a mapping of the partitioned application that shows best performance for a given network processor system. The simplicity of the proposed randomized mapping algorithm allows the use of this methodology in network processor runtime systems where dynamic reallocation of tasks is necessary but processing power is limited. We present results that show the effectiveness of the analysis and mapping methodology as well as its application to design space exploration.
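The abstract does not give the details of the randomized mapping algorithm, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a toy model in which each task has a fixed per-packet cost in cycles, each processor has a fixed speed in cycles per second, and throughput is limited by the most heavily loaded processor. Task-graph dependencies, hardware accelerators, and memory interference are ignored. All names (`throughput`, `random_mapping_search`, `costs`, `speeds`) are hypothetical.

```python
import random


def throughput(mapping, costs, speeds):
    """Packets per second under an assumed bottleneck model.

    mapping[i] is the index of the processor that runs task i.
    costs[i] is the assumed per-packet cost of task i, in cycles.
    speeds[p] is the assumed speed of processor p, in cycles per second.
    """
    load = [0.0] * len(speeds)
    for task, proc in enumerate(mapping):
        # Seconds of work per packet that this task adds to its processor.
        load[proc] += costs[task] / speeds[proc]
    # The busiest processor bounds the rate at which packets can be served.
    return 1.0 / max(load)


def random_mapping_search(costs, speeds, trials=1000, seed=0):
    """Draw random task-to-processor mappings and keep the best one seen."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    best_map, best_tp = None, 0.0
    for _ in range(trials):
        candidate = [rng.randrange(len(speeds)) for _ in costs]
        tp = throughput(candidate, costs, speeds)
        if tp > best_tp:
            best_map, best_tp = candidate, tp
    return best_map, best_tp
```

Each trial costs only one pass over the task list, which is the kind of low overhead that makes a randomized search plausible where processing power is limited. The quality of the result depends on the number of trials and on how faithful the throughput model is.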