CUDA C++ Optimization: Coding Faster GPU Kernels (Generative AI LLM Programming)

$9.95
by David Spuler

Increase the efficiency of CUDA C++ kernels for AI and high-performance computing on powerful NVIDIA GPUs. Leverage your GPU investment with the power of an efficient software layer.

Main Topics
- Speeding up CUDA C++ kernels
- Parallelization and vectorization
- Compute optimizations
- Memory access optimizations

Table of Contents:
1. Parallel Programming
2. Optimizing CUDA Programs
3. Vectorization
4. AI Kernel Optimization
5. Profiling Tools
6. Compilers and Optimizers
7. Timing CUDA C++ Programs
8. Memory Optimizations
9. Coalescing and Striding
10. Data Transfer Optimizations
11. Heap Memory Allocation
12. Compute Optimizations
13. Warp Divergence
14. Grid Optimizations
15. Compile-Time Optimizations
16. Arithmetic Optimizations
17. Floating-Point Bit Tricks
18. Advanced Techniques
Appendix: CUDA C++ Slugs
