###### tags: `CUDA` Note on CUDA programming === CUDA execution model --- - [Programming Model](/3xOWlRxkQ8iWea_bqXhHww) - [Compilation Workflow](/smfOoZv1QKKagZ3oXSauaQ) Global memory --- - [Memory architecture](/1Rpaxd0BQFWRkRnhZZ3x6g) - [Memory management](/MSxX1LiAQJSGz3I7HaeIVQ) - [Memory access pattern](/TrwAccEwTiq3pwFrwAi0Cw) Shared memory and constant memory --- - [Configurations of Shared Memory](/tT7xl4vcSU-LFUda0Ev71Q) - [Bank conflict](/wET73P1BQhCXYPDqnHbwUw) - [Synchronization of Shared Memory](/pSuPuzNUQeuepkEpRYvFCA) - [Warp Shuffles](/Lr3-m6jiReydZjZPlQjsaQ) Texture memory --- - [Texture Memory](/bpKNNQcoTXCsnyyO-iU-Sw) Streams and concurrency --- - [Stream and Concurrency](/-VsFMt9dS2OS5UoLZHU2mA) - [Triple Buffering]() Worked Examples --- - [Parallel Reduction](/vo-4fPtRTqugpJYLu-T_6A) - [Matrix Addition](/k1w1yiRFQvqtJpLmFZaMiQ) - [Matrix Transpose](/FVL8I6CCQeqOTmSvB6qLTg) - [Matrix Multiplcation](/kzsnoeYKS8qvgh8N6yfsdQ) Tensor core --- - [Tensor core](/CnHZ8KhCQvGxdS4lUEarnA) Profiling --- - [Calculation of Warp Occupancy](/nxg0xlLqQlyeJ5TE_b_YBQ) - [Performance tuning](/sXD0tx8DTuyEeWJF-YGEbA) - [Nsight compute](/Paa9DzzGTGOzcd9QuoxnTA) - [Nsight system](/zOywJzjYQhabhP2ezJ9-9w) Debugging --- - [cuda-gdb](/-KNLPssAQNO5xpR6baZj8Q) Mannual --- [nvcc](https://docs.nvidia.com/cuda/cuda-compiler-driver-nvcc/) [CUDA C++ Programming Guide](https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html) [CUDA C++ Best Practices Guide](https://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html) [Parallel Thread Execution ISA Version 8.5](https://docs.nvidia.com/cuda/parallel-thread-execution/index.html) [CUDA runtime API](https://docs.nvidia.com/cuda/cuda-runtime-api/index.html) [CUDA driver API](https://docs.nvidia.com/cuda/cuda-driver-api/index.html) [Thrust](https://developer.nvidia.com/thrust) [CUDA-GDB](https://docs.nvidia.com/cuda/cuda-gdb/index.html) [Compute Sanitizer](https://docs.nvidia.com/compute-sanitizer/index.html) [Nsight System](https://docs.nvidia.com/nsight-systems/index.html) [Nsight compute](https://docs.nvidia.com/nsight-compute/index.html) [Profiler](https://docs.nvidia.com/cuda/profiler-users-guide/index.html) [CUDA binary utilities](https://docs.nvidia.com/cuda/cuda-binary-utilities/index.html) Miscellaneous --- - [Nvidia Technical Blog](/TImNuvCaS_ubrl5eFJW2Jw) - [CUDA FAQ](/1TwlHGzqRoGDxmM_HfCtkA) - [Syntax](/n0uzCKQ1SJ-_thzKY4Kf-Q) - [Thread indexing cheatsheet](/zLhfeBSZRSGCmojhXOjLdw) - [Trouble shooting](/PuZLlMCXRiWLJYpc85qjEg) - [NVIDIA A10](/wn4rhyicTa2YiV8-M-2E5g) - [kriegalex](https://github.com/kriegalex/wrox-pro-cuda-c/tree/master) - [HangJie720 ](https://github.com/HangJie720/Professional-CUDA-C-Programming/tree/master) - [deeplearning](https://github.com/deeperlearning/professional-cuda-c-programming) - [Good articles](https://siboehm.com/)
×
Sign in
Email
Password
Forgot password
or
Sign in via Google
Sign in via Facebook
Sign in via X(Twitter)
Sign in via GitHub
Sign in via Dropbox
Sign in with Wallet
Wallet (
)
Connect another wallet
Continue with a different method
New to HackMD?
Sign up
By signing in, you agree to our
terms of service
.