NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
-
Updated
Jun 1, 2024 - C++
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
Deep learning toolkit-enabled VLSI placement
BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built on RAPIDS cuDF.
A high performance anime upscaler
Fast Neural Machine Translation in C++ - development repository
CUDA C++ Core Libraries
GPU-accelerated Levenberg-Marquardt curve fitting in CUDA
Cross Platform Professional Procedural Terrain Generation & Texturing Tool
stdgpu: Efficient STL-like Data Structures on the GPU
Vahana VR & VideoStitch Studio: software to create immersive 360° VR video, live and in post-production
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
Vulkan compute for people
Automatic parallelization of Python/NumPy, C, and C++ codes on Linux and MacOSX
An adaptive mesh hydrodynamics simulation code for low Mach number reacting flows without level sub-cycling.
Node-based image editor with GPU-acceleration.
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Piranha: A GPU Platform for Secure Computation
Add a description, image, and links to the gpu-acceleration topic page so that developers can more easily learn about it.
To associate your repository with the gpu-acceleration topic, visit your repo's landing page and select "manage topics."