CUDA, Supercomputing for the Masses, Part 20

 

Part 18 with more conventional OpenGL rendering methods. If you're interested in any of these topics and thinking about purchasing the professional version of Parallel Nsight, you should find this article illuminating. The professional version of Parallel Nsight provides the ability to trace and analyze:

  • CUDA applications. In particular, Parallel Nsight has the ability to insert code to capture application traces that include operating system calls, CUDA kernels, data transfers, and host-based methods. NVIDIA put this video online to give viewers a sense of the Nsight tracing and analysis capabilities.
  • OpenCL applications. Parallel Nsight has been designed to support multiple languages so OpenCL applications can be traced with the same functionality as CUDA applications.
  • Graphics Analysis. For game and visual computing developers, Parallel Nsight allows for the tracing of OpenGL and DirectX API calls in version 1.0, with the tracing of graphics GPU workloads coming in the upcoming version 1.5. This is in addition to the Graphics Inspector, which is part of Parallel Nsight Standard. The Graphics Inspector provides a heads-up performance display (HUD), real-time profiling, plus the ability to examine performance markers and inspect the graphics pipelines. While not covered in this article, NVIDIA demonstrates these capabilities and more in this video. Neither Version 1.0 or 1.5 support OpenGL in the Graphics Inspector.
    • Note that the Graphics Inspector is part of the standard version.
    • Version 1.5 of Parallel Nsight adds a Graphics Workload Trace to the analysis capability, which traces the GPU workloads spawned by individual draw calls