GTC talk: Designing Killer CUDA Applications for X86, multiGPU, and CPU+GPU
Accepted and to be scheduled at GTC. See you there!
CUDA redefined software development with GPU applications that run 10 to 1000 times faster. Now a single CUDA source tree can support the x86 mass market (no GPU required) and 1/3 billion CUDA-enabled GPUs. MultiGPU and CPU+GPU apps utilize all system resources. GPUDirect, UVA (Unified Virtual Addressing), caches, prefetching, ILP (instruction-level parallelism), automated analysis tools, and more offer ease, capability, and performance. The overall impact on software investment, scalability, balance metrics, programming API, and lifecycle will be considered. Working real-time video and other examples from my book, “CUDA Application Design and Development,” provide practical insight to enable augmented reality and your killer apps.