
Pybind11 bindings for Whisper.cpp

C++ 19 6 Updated Apr 29, 2024

LLM training in simple, raw C/CUDA

Cuda 17,534 1,806 Updated May 1, 2024

llama.cpp GGUF file parser for JavaScript

JavaScript 18 1 Updated Apr 22, 2024

A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files.

TypeScript 37 1 Updated Apr 26, 2024

Standalone app for easy RAG with local LLM

JavaScript 751 23 Updated Apr 29, 2024

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…

Svelte 4,990 311 Updated Apr 30, 2024

High-level, optionally asynchronous Rust bindings to llama.cpp

Rust 98 15 Updated Apr 29, 2024

Jupyter Notebook 29 1 Updated Mar 17, 2024

WebAssembly binding for llama.cpp - Enabling in-browser LLM inference

C++ 88 1 Updated Apr 28, 2024

Local CLI Copilot, powered by CodeLLaMa. 💻🦙

Go 1,041 34 Updated Apr 30, 2024

A set of AI-enabled effects, generators, and analyzers for Audacity®.

C++ 581 27 Updated Apr 30, 2024

A super simple web interface to perform blind tests on LLM outputs.

PHP 19 2 Updated Mar 9, 2024

Pure DOOM - Single Header Doom Source Port

C++ 241 21 Updated Apr 7, 2024

An AI assistant beyond the chat box.

HTML 301 28 Updated Mar 11, 2024

emoji_finder

Python 14 1 Updated Apr 7, 2024

Lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,516 470 Updated Apr 30, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 7,952 697 Updated Apr 23, 2024

Detect file content types with deep learning

Python 7,355 390 Updated May 1, 2024

Self-organizing AI note-taking app that runs models locally.

TypeScript 4,224 236 Updated Apr 30, 2024

WebAssembly (Wasm) Build and Bindings for llama.cpp

JavaScript 117 7 Updated Feb 14, 2024

Reverse engineering the rk3588 npu

C 40 2 Updated Apr 24, 2024

GNOME Shell extension for accurate speech-to-text input in Linux using whisper.cpp. Input text from speech anywhere.

JavaScript 42 3 Updated Mar 27, 2024

🔥 TUI interface for LLMs written in Rust

Rust 226 8 Updated Mar 19, 2024

GGML implementation of BERT model with Python bindings and quantization.

C++ 48 3 Updated Feb 19, 2024

Implementation of C++ standard libraries in C

C 1,051 62 Updated Mar 4, 2024

Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.

C++ 751 49 Updated Apr 29, 2024

Pure C++ implementation of several models for real-time chatting on your computer (CPU)

C++ 195 15 Updated Apr 30, 2024

Inference of Mamba models in pure C

C 162 7 Updated Feb 26, 2024

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It provides a simple yet robust interface using llama-cpp-python, allowing users to chat wit…

Python 230 18 Updated May 1, 2024

Create characters in Unity with LLMs!

C# 345 39 Updated Apr 26, 2024