Evaluating Operators in Deep Neural Networks for Improving Performance Portability of SYCL
Thesis: Automatic Code Rewriting for Performance Portability
Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs
#CUDA #OpenMP #MonteCarlo #PerformancePortability #Intel #AMD #NVIDIA #Package
Retargeting and Respecializing GPU Workloads for Performance Portability