Trending
See what the GitHub community is most excited about this month.
NVlabs / instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
NVIDIA / nccl-tests
NCCL Tests
nerfstudio-project / gsplat
CUDA-accelerated rasterization of Gaussian splatting
sangyc10 / CUDA-code
flashinfer-ai / flashinfer
FlashInfer: Kernel Library for LLM Serving
Dao-AILab / causal-conv1d
Causal depthwise conv1d in CUDA, with a PyTorch interface
DefTruth / CUDA-Learn-Notes
🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated as time permits: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.
NVIDIA / nvbench
CUDA Kernel Benchmarking Library
rapidsai / cugraph
cuGraph - RAPIDS Graph Analytics Library
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
NVIDIA / CUDALibrarySamples
CUDA Library Samples
ROCm / rccl-tests
RCCL Performance Benchmark Tests
angli66 / simsense
A Real-Time Depth Sensor Simulator with GPU Acceleration
olcf / cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
mit-han-lab / torchsparse
TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
ashawkey / diff-gaussian-rasterization
Tony-Tan / CUDA_Freshman
brucefan1983 / CUDA-Programming
Sample codes for my CUDA programming book