Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems
We're used to leaning on children's books in Computer Science - with Gulliver's big-endian vs little-endian. Back at Supercomputing #SC24, I spoke at the #Intel booth all about open standards, performance portability, and the journey up the Yellow Brick Road to see the Wizard of Oz. Check out the video of the talk on YouTube:
https://youtu.be/xO8FGAOScpo?si=_BnVilvTBa0Ns6dX
#performanceportability #OpenMP #SYCL
Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study
#HIP #SYCL #OpenMP #CUDA #PerformancePortability #HPC #Astrophysics #Package
Kokkidio: Fast, expressive, portable code, based on Kokkos and Eigen
Thesis: Collection skeletons: declarative abstractions for data collections
Evaluating Operators in Deep Neural Networks for Improving Performance Portability of SYCL
Thesis: Automatic Code Rewriting for Performance Portability
Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs
#CUDA #OpenMP #MonteCarlo #PerformancePortability #Intel #AMD #NVIDIA #Package
Retargeting and Respecializing GPU Workloads for Performance Portability