HERO
- 4 November 2018
- conference paper
- Published by Association for Computing Machinery (ACM) in Proceedings of the 2nd Workshop on AutotuniNg and aDaptivity AppRoaches for Energy efficient HPC Systems - ANDARE '18
Abstract
No abstract available
This publication has 16 references indexed in Scilit:
- Performance Analysis and Optimization of Clang's OpenMP 4.5 GPU Support. Published by Institute of Electrical and Electronics Engineers (IEEE), 2016
- Controlling NUMA effects in embedded manycore applications with lightweight nested parallelism support. Parallel Computing, 2016
- A quantitative analysis on microarchitectures of modern CPU-FPGA platforms. Published by Association for Computing Machinery (ACM), 2016
- Evaluating OpenMP 4.0's Effectiveness as a Heterogeneous Parallel Programming Model. Published by Institute of Electrical and Electronics Engineers (IEEE), 2016
- Runtime Support for Multiple Offload-Based Programming Models on Clustered Manycore Accelerators. IEEE Transactions on Emerging Topics in Computing, 2016
- Ultra-low-latency lightweight DMA for tightly coupled multi-core clusters. Published by Association for Computing Machinery (ACM), 2014
- Implementation and Optimization of the OpenMP Accelerator Model for the TI Keystone II Architecture. Lecture Notes in Computer Science, 2014
- accULL: An OpenACC Implementation with CUDA and OpenCL Support. Lecture Notes in Computer Science, 2012
- OmpSs: A PROPOSAL FOR PROGRAMMING HETEROGENEOUS MULTI-CORE ARCHITECTURES. Parallel Processing Letters, 2011
- OpenMP: an industry standard API for shared-memory programming. IEEE Computational Science and Engineering, 1998