mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
Generic Mastodon server for anyone to use.

Server stats:

8.4K
active users

#cuda

11 posts10 participants1 post today

New #ZLUDA 5 Preview Released For #CUDA On Non-NVIDIA #GPU
For now this ability to run unmodified CUDA apps on non-#NVIDIA GPUs is focused on #AMD GPUs of the #Radeon RX 5000 series and newer, which is AMD Radeon GPUs with #ROCm. Besides CUDA code samples, GeekBench has been one of the early targets for testing.
phoronix.com/news/ZLUDA-5-prev

www.phoronix.comNew ZLUDA 5 Preview Released For CUDA On Non-NVIDIA GPUsZLUDA Version 5-preview.43 was released today as this open-source CUDA implementation for use on non-NVIDIA GPUs, with one of the current focuses being on enabling CUDA on AMD Radeon GPUs with ROCm.

#ZLUDA Making Progress In 2025 On Bringing #CUDA To Non-NVIDIA #GPU
ZLUDA #opensource effort that started half-decade ago as drop-in CUDA implementation for #Intel GPUs and then for several years was funded by ##AMD as a CUDA implementation for #Radeon GPUs atop #ROCm and then open-sourced but then reverted has been continuing to push along a new path since last year. Current take on ZLUDA is a multi-vendor CUDA implementation for non-NVIDIA GPUs for #AI workloads & more.
phoronix.com/news/ZLUDA-Q2-202

www.phoronix.comZLUDA Making Progress In 2025 On Bringing CUDA To Non-NVIDIA GPUs

🌗 一週末使用 CUDA 實現光線追蹤:性能提升至 RTX 的兩倍
➤ CUDA優化之路:從零開始到性能飛躍
karimsayedre.github.io/RTIOW.h
本文深入探討了作者如何使用 CUDA 撰寫光線追蹤器,並在相同硬體上,有時甚至高達 3 倍的速度超越了 Vulkan/RTX 實作。作者詳細記錄了優化過程,包括分析瓶頸、調整程式碼,並最終實現了比最初預期更好的性能。關鍵在於透過“內聯光線追蹤”的方式,減少記憶體流量並提升效能。研究發現,在特定條件下,完全使用運算單元處理光線追蹤可能優於使用專用 RT 核心。
+ 真是令人驚訝的成果!這篇文章證明瞭即使在強大的硬體加速下,優化的重要性仍然不容忽視。
+ 這對於學習 GPU 程式設計和性能優化非常有幫助,作者分享了許多實用的經驗和洞見。
#圖像處理 #CUDA #光線追蹤 #性能優化

karimsayedre.github.ioKarim Sayed - Rendering EngineerA showcase of my projects and portfolio.

🔊 Whisper has a serious challenger: Moshi STT

Developed by the French research lab Kyutai, Moshi STT is a new open-source speech recognition system that’s blazingly fast, highly accurate, and optimized for Apple Silicon and CUDA — all designed with real-time performance in mind.

scalastic.io/en/moshi-stt-vs-w

Scalastic · Why Moshi STT Could Replace Whisper (and How to Install It on macOS!)Discover Moshi STT by Kyutai, an open-source real-time speech transcription solution, optimized for Mac (Apple Silicon) and CUDA—fast, accurate, and easy to install. Includes a guide, user feedback, and useful links.