AI Tools Directory

Discover the latest trending AI repositories and top videos from the best AI creators.

190+ Repositories
Updated: 1/12/2026

General News

Coding Agents

Learning

Trending AI Repositories

PaddleFormers

Python

PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.

⭐ 12,947 View on GitHub

sglang

Python

SGLang is a high-performance serving framework for large language models and multimodal models.

⭐ 22,272 View on GitHub

TensorRT-LLM

Python

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

⭐ 12,603 View on GitHub

inspect_ai

Python

Inspect: A framework for large language model evaluations

⭐ 1,646 View on GitHub

ROLL

Python

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

⭐ 2,622 View on GitHub

NeMo

Python

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

⭐ 16,519 View on GitHub

chat-with-your-data-solution-accelerator

Python

A Solution Accelerator for the RAG pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences. This includes most common requirements and best practices.

⭐ 1,145 View on GitHub

MemOS

Python

Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Management | Memory MCP | MCP System | LLM Memory | Agents Memory System |

⭐ 3,683 View on GitHub

azure-search-openai-demo

Python

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

⭐ 7,522 View on GitHub

LightLLM

Python

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

⭐ 3,833 View on GitHub

helm

Python

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparent evaluation of foundation models, including large language models (LLMs) and multimodal models.

⭐ 2,620 View on GitHub

chitu

Python

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

⭐ 1,381 View on GitHub

BitNet

Python

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

⭐ 1,887 View on GitHub

Qwen3

Python

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

⭐ 26,112 View on GitHub

bitsandbytes

Python

Accessible large language models via k-bit quantization for PyTorch.

⭐ 7,881 View on GitHub

text-generation-inference

Python

Large Language Model Text Generation Inference

⭐ 10,727 View on GitHub

www-project-top-10-for-large-language-model-applications

Python

OWASP Top 10 for Large Language Model Apps (Part of the GenAI Security Project)

⭐ 1,037 View on GitHub

hcaptcha-challenger

Python

🥂 Gracefully face hCaptcha challenge with multimodal large language model.

⭐ 2,098 View on GitHub

guardrails

Python

Adding guardrails to large language models.

⭐ 6,253 View on GitHub

mergekit

Python

Tools for merging pretrained large language models.

⭐ 6,672 View on GitHub

PentestGPT

Python

Automated Penetration Testing Agentic Framework Powered by Large Language Models

⭐ 10,942 View on GitHub

llm

Python

Access large language models from the command-line

⭐ 10,813 View on GitHub

attackgen

Python

AttackGen is a cybersecurity incident response testing tool that leverages the power of large language models and the comprehensive MITRE ATT&CK framework. The tool generates tailored incident response scenarios based on user-selected threat actor groups and your organisation's details.

⭐ 1,205 View on GitHub

OpenAdapt

Python

Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

⭐ 1,466 View on GitHub

nanotron

Python

Minimalistic large language model 3D-parallelism training

⭐ 2,411 View on GitHub

LLM4Decompile

Python

Reverse Engineering: Decompiling Binary Code with Large Language Models

⭐ 6,239 View on GitHub

llm2vec

Python

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

⭐ 1,637 View on GitHub

Qwen3-Coder

Python

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

⭐ 14,843 View on GitHub

Qwen

Python

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

⭐ 20,124 View on GitHub

Dream

Python

Dream 7B, a large diffusion language model

⭐ 1,138 View on GitHub

eliza

TypeScript

Autonomous agents for everyone

⭐ 17,324 View on GitHub

cherry-studio

TypeScript

AI Agent + Coding Agent + 300+ assistants: agentic AI desktop with autonomous coding, intelligent automation, and unified access to frontier LLMs.

⭐ 37,605 View on GitHub

crewAI

Python

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

⭐ 42,537 View on GitHub

MassGen

Python

🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents to collaborate, reason, and produce high-quality results. | Join us on Discord: discord.massgen.ai

⭐ 684 View on GitHub

DeepAnalyze

Python

DeepAnalyze is the first agentic LLM for autonomous data science. 🎈你的AI数据分析师,自动分析大量数据,一键生成专业分析报告!

⭐ 3,455 View on GitHub

agent-zero

Python

Agent Zero AI framework

⭐ 13,180 View on GitHub

ralph-orchestrator

Python

An improved implementation of the Ralph Wiggum technique for autonomous AI agent orchestration

⭐ 509 View on GitHub

MemMachine

Python

Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI agent state management for next-generation autonomous systems.

⭐ 3,961 View on GitHub

claude-flow

JavaScript

🌊 The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code support via MCP protocol. Ranked #1 in agent-based frameworks.

⭐ 11,443 View on GitHub

pentagi

Go

✨ Fully autonomous AI Agents system capable of performing complex penetration testing tasks

⭐ 932 View on GitHub

Autonomous-Agents

Autonomous Agents (LLMs) research papers. Updated Daily.

⭐ 1,109 View on GitHub

AIOpsLab

Python

A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.

⭐ 776 View on GitHub

dapr-agents

Python

Build autonomous, resilient and observable AI agents with built-in workflow orchestration, security, statefulness and telemetry.

⭐ 598 View on GitHub

ralph

TypeScript

Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.

⭐ 2,792 View on GitHub

khoj

Python

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

⭐ 32,157 View on GitHub

jido

Elixir

🤖 Autonomous agent framework for Elixir. Built for distributed, autonomous behavior and dynamic workflows.

⭐ 826 View on GitHub

claude_life_assistant

Shell

A symbiotic AI agent that remembers everything, acts autonomously, and extends your cognition.

⭐ 628 View on GitHub

ix

Python

Autonomous GPT-4 agent platform

⭐ 1,040 View on GitHub

Microverse

GDScript

A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous social interactions, task completion, and developing complex social relationships through continuous communication.

⭐ 2,040 View on GitHub

Awesome-AI-Agents

A collection of autonomous agents 🤖️ powered by LLM.

⭐ 919 View on GitHub

typedai

TypeScript

TypeScript AI platform with AI chat, Autonomous agents, Software developer agents, chatbots and more

⭐ 1,177 View on GitHub

BaseAI

TypeScript

BaseAI — The Web AI Framework. The easiest way to build serverless autonomous AI agents with memory. Start building local-first, agentic pipes, tools, and memory. Deploy serverless with one command.

⭐ 1,190 View on GitHub

agentUniverse

Python

agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.

⭐ 2,020 View on GitHub

agenticSeek

Python

Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993886460 (Beware of fake account)

⭐ 24,387 View on GitHub

kimi-writer

Python

AI writing agent powered by kimi-k2-thinking - autonomously creates novels and stories with deep reasoning

⭐ 513 View on GitHub

hexstrike-ai

Python

HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.) autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, bug bounty automation, and security research. Seamlessly bridge LLMs with real-world offensive security capabilities.

⭐ 5,874 View on GitHub

AutoHedge

Python

Build your autonomous hedge fund in minutes. AutoHedge harnesses the power of swarm intelligence and AI agents to automate market analysis, risk management, and trade execution.

⭐ 973 View on GitHub

AIlice

Python

AIlice is a fully autonomous, general-purpose AI agent.

⭐ 1,372 View on GitHub

OpenManus

Python

OpenManus is an open-source initiative to replicate the capabilities of the Manus AI agent, a state-of-the-art general-purpose AI developed by Monica, which excels in autonomously executing complex tasks.

⭐ 835 View on GitHub

index

Python

The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web

⭐ 2,331 View on GitHub

chatluna

TypeScript

多平台模型接入,可扩展,多种输出格式,提供大语言模型聊天服务的插件 | A bot plugin for LLM chat with multi-model integration, extensibility, and various output formats

⭐ 385 View on GitHub

ComfyUI_UltimateSDUpscale

Python

ComfyUI nodes for the Ultimate Stable Diffusion Upscale script by Coyote-A.

⭐ 1,397 View on GitHub

Stable-Diffusion

JavaScript

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Kaggle, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya, Midjourney, RunPod

⭐ 2,620 View on GitHub

LanPaint

Python

High quality training free inpaint for every stable diffusion model. Supports ComfyUI

⭐ 877 View on GitHub

SwarmUI

C#

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

⭐ 3,578 View on GitHub

StabilityMatrix

C#

Multi-Platform Package Manager for Stable Diffusion

⭐ 7,222 View on GitHub

ComfyUI_tinyterraNodes

Python

A selection of nodes for Stable Diffusion ComfyUI

⭐ 577 View on GitHub

fast-stable-diffusion

Python

fast-stable-diffusion + DreamBooth

⭐ 7,875 View on GitHub

sd-dynamic-thresholding

Python

Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (SwarmUI, ComfyUI, and Auto WebUI)

⭐ 1,228 View on GitHub

MoneyPrinterPlus

Python

AI一键批量生成各类短视频,自动批量混剪短视频,自动把视频发布到抖音,快手,小红书,视频号上,赚钱从来没有这么容易过! 支持本地语音模型chatTTS,fasterwhisper,GPTSoVITS,支持云语音:Azure,阿里云,腾讯云。支持Stable diffusion,comfyUI直接AI生图。Generate short videos with one click using AI LLM,print money together! support:chatTTS,faster-whisper,GPTSoVITS,Azure,tencent Cloud,Ali Cloud.

⭐ 5,583 View on GitHub

comflowy

MDX

Unleash endless possibilities with ComfyUI and Stable Diffusion, committed to crafting refined AI-Gen tools and cultivating a vibrant community for both developers and users.

⭐ 1,194 View on GitHub

Auto-Photoshop-StableDiffusion-Plugin

TypeScript

A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.

⭐ 7,207 View on GitHub

ComfyBox

TypeScript

Customizable Stable Diffusion frontend for ComfyUI

⭐ 679 View on GitHub

vllm-omni

Python

A framework for efficient model inference with omni-modality models

⭐ 2,086 View on GitHub

diffusers

Python

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

⭐ 32,409 View on GitHub

nunchaku

Python

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

⭐ 3,588 View on GitHub

Diffusion-Explorer

JavaScript

Interactive visualizations of the geometric intuition behind diffusion models.

⭐ 945 View on GitHub

sglang

Python

SGLang is a high-performance serving framework for large language models and multimodal models.

⭐ 22,272 View on GitHub

SimpleTuner

Python

A general fine-tuning kit geared toward image/video/audio diffusion models.

⭐ 2,706 View on GitHub

DiffSynth-Studio

Python

Enjoy the magic of Diffusion models!

⭐ 11,413 View on GitHub

ComfyUI

Python

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

⭐ 99,870 View on GitHub

InvokeAI

TypeScript

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, and serves as the foundation for multiple commercial products.

⭐ 26,538 View on GitHub

stable-diffusion.cpp

C++

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++

⭐ 5,124 View on GitHub

LanPaint

Python

High quality training free inpaint for every stable diffusion model. Supports ComfyUI

⭐ 877 View on GitHub

mflux

Python

A MLX port of FLUX and other state of the art diffusion image models based on the Huggingface Diffusers implementation.

⭐ 1,750 View on GitHub

VideoCrafter

Python

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

⭐ 5,016 View on GitHub

TurboDiffusion

Python

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

⭐ 3,156 View on GitHub

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

⭐ 2,108 View on GitHub

HiDiffusion

Jupyter Notebook

[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!

⭐ 835 View on GitHub

dllm

Python

dLLM: Simple Diffusion Language Modeling

⭐ 1,564 View on GitHub

WeDLM

Python

WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups over vLLM-optimized baselines.

⭐ 540 View on GitHub

ai-toolkit

Python

The ultimate training toolkit for finetuning diffusion models

⭐ 8,845 View on GitHub

tiny-diffusion

Python

A character-level language diffusion model trained on Tiny Shakespeare

⭐ 825 View on GitHub

kandinsky-5

Python

Kandinsky 5.0: A family of diffusion models for Video & Image generation

⭐ 681 View on GitHub

taesd

Python

Tiny AutoEncoder for Stable Diffusion (and other image models)

⭐ 856 View on GitHub

Lumina-DiMOO

Python

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

⭐ 924 View on GitHub

Diffuman4D

Python

[ICCV 2025] Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

⭐ 553 View on GitHub

Awesome-DLMs

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

⭐ 636 View on GitHub

SiT

Python

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

⭐ 1,066 View on GitHub

diffusion-pipe

Python

A pipeline parallel training script for diffusion models.

⭐ 1,800 View on GitHub

TrajectoryCrafter

Python

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

⭐ 818 View on GitHub

LightningDiT

Python

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

⭐ 1,358 View on GitHub

CatVTON

Python

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).

⭐ 1,569 View on GitHub

Constrained-Text-Generation-Studio

Python

Code repo for "Most Language Models can be Poets too: An AI Writing Assistant and Constrained Text Generation Studio" at the (CAI2) workshop, jointly held at (COLING 2022)

⭐ 213 View on GitHub

docs-mcp-server

TypeScript

Grounded Docs MCP Server: Open-Source Alternative to Context7, Nia, and Ref.Tools

⭐ 902 View on GitHub

browser-operator-core

TypeScript

Browser Operator - The AI browser with built in Multi-Agent platform! Open source alternative to ChatGPT Atlas, Perplexity Comet, Dia and Microsoft CoPilot Edge Browser

⭐ 411 View on GitHub

windsurf.vim

Vim Script

Free, ultrafast Copilot alternative for Vim and Neovim

⭐ 5,103 View on GitHub

codeium.el

Emacs Lisp

Free, ultrafast Copilot alternative for Emacs

⭐ 620 View on GitHub

vscode-extension

TypeScript

Flexpilot - Open-Source, Native and a True GitHub Copilot Alternative for VS Code

⭐ 819 View on GitHub

ClipboardConqueror

JavaScript

Clipboard Conqueror is a novel copy and paste copilot alternative designed to bring your very own LLM AI assistant to any text field.

⭐ 433 View on GitHub

companion-vscode

TypeScript

VSCode extension of Quack Companion 💻 Turn your team insights into a portable plug-and-play context for code generation. Alternative to GitHub Copilot powered by OSS LLMs (Mistral, Gemma, etc.), served with Ollama.

⭐ 233 View on GitHub

code-llama-for-vscode

Python

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

⭐ 568 View on GitHub

WebLaTex

TeX

A complete alternative for Overleaf with VSCode + Web + Git Integration + Copilot + Grammar & Spell Checker + Live Collaboration Support. Based on GitHub Codespace and Dev container.

⭐ 1,398 View on GitHub

privy

TypeScript

An open-source alternative to GitHub copilot that runs locally.

⭐ 990 View on GitHub

fauxpilot

Python

FauxPilot - an open-source alternative to GitHub Copilot server

⭐ 14,762 View on GitHub

clara-copilot

JavaScript

A alternative to Github Copilot for vscode until you get the access to github copilot

⭐ 288 View on GitHub

slime

Python

slime is an LLM post-training framework for RL Scaling.

⭐ 3,281 View on GitHub

transformers

Python

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

⭐ 154,933 View on GitHub

ArcticTraining

Python

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

⭐ 267 View on GitHub

EasyR1

Python

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

⭐ 4,414 View on GitHub

verbalized-sampling

Python

Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achieves 2-3x diversity improvement while maintaining quality. Model-agnostic framework with CLI/API for creative writing, synthetic data generation, and dialogue simulation.

⭐ 655 View on GitHub

LLaVA-OneVision-1.5

Python

Fully Open Framework for Democratized Multimodal Training

⭐ 681 View on GitHub

paxml

Python

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.

⭐ 543 View on GitHub

MARTI

Python

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

⭐ 386 View on GitHub

Search-R1

Python

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

⭐ 3,793 View on GitHub

SiLLM

Python

SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.

⭐ 285 View on GitHub

awesome-llm-pretraining

Awesome LLM pre-training resources, including data, frameworks, and methods.

⭐ 308 View on GitHub

higgsfield

Jupyter Notebook

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

⭐ 3,522 View on GitHub

Machine-Learning-Guide

Python

Machine learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.

⭐ 683 View on GitHub

OpenGPT

Jupyter Notebook

A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).

⭐ 361 View on GitHub

oumi

Python

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

⭐ 8,819 View on GitHub

ktransformers

Python

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

⭐ 16,332 View on GitHub

aikit

Go

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

⭐ 503 View on GitHub

Trinity-RFT

Python

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

⭐ 472 View on GitHub

LlamaFactory

Python

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

⭐ 65,433 View on GitHub

easy-dataset

JavaScript

A powerful tool for creating fine-tuning datasets for LLM

⭐ 12,755 View on GitHub

unsloth

Python

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

⭐ 50,577 View on GitHub

trainer

Go

Distributed AI Model Training and Fine-Tuning on Kubernetes

⭐ 1,999 View on GitHub

peft

Python

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

⭐ 20,439 View on GitHub

GraphGen

Python

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

⭐ 789 View on GitHub

awesome_LLM-harmful-fine-tuning-papers

A survey on harmful fine-tuning attack for large language model

⭐ 230 View on GitHub

mlx-vlm

Python

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

⭐ 1,986 View on GitHub

LLM-Zero-to-Hundred

Jupyter Notebook

This repository contains different LLM chatbot projects (RAG, LLM agents, etc.) and well-known techniques for training and fine tuning LLMs.

⭐ 432 View on GitHub

WeClone

Python

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. 从聊天记录创造数字分身的一站式解决方案

⭐ 16,187 View on GitHub

xTuring

Python

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

⭐ 2,662 View on GitHub

LlamaEdge

Rust

The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge

⭐ 1,576 View on GitHub

Kolo

Python

The Fastest Way to Fine-Tune LLMs Locally

⭐ 332 View on GitHub

AI-Bootcamp

Jupyter Notebook

Self-paced bootcamp on Generative AI. Tutorials on ML fundamentals, Ollama, LLMs, RAGs, LangChain, LangGraph, Fine-tuning, DSPy & AI Agents (CrewAI), (Using ChatGPT, gpt-oss, Claude, Qwen3, Gemma3, Llama 3)

⭐ 789 View on GitHub

h2o-llmstudio

Python

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

⭐ 4,763 View on GitHub

RAG-FiT

Python

Framework for enhancing LLMs for RAG tasks using fine-tuning.

⭐ 763 View on GitHub

codeqai

Python

Local first semantic code search and chat | Leverage custom copilots with fine-tuning datasets from code in Alpaca, Conversational, Completion and Instruction format

⭐ 495 View on GitHub

es-fine-tuning-paper

Python

This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"

⭐ 283 View on GitHub

llama-cookbook

Jupyter Notebook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model family and using them on various provider services

⭐ 18,140 View on GitHub

MLX-GRPO

Python

A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.

⭐ 226 View on GitHub

llm-finetuning

Python

Guide for fine-tuning Llama/Mistral/CodeLlama models and more

⭐ 645 View on GitHub

FineTuningLLMs

Jupyter Notebook

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

⭐ 767 View on GitHub

Memento

Python

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

⭐ 2,138 View on GitHub

LLM-engineer-handbook

A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.

⭐ 4,603 View on GitHub

RAG-Retrieval

Python

Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.

⭐ 1,076 View on GitHub

DB-GPT-Hub

Python

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

⭐ 1,949 View on GitHub

ragflow

Python

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

⭐ 71,267 View on GitHub

RAGLight

Python

RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vector stores, and now includes seamless MCP integration to connect external tools and data sources.

⭐ 625 View on GitHub

ViDoRAG

Python

[EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

⭐ 619 View on GitHub

graphrag

Python

A modular graph-based Retrieval-Augmented Generation (RAG) system

⭐ 30,257 View on GitHub

Multimodal-RAG-Survey

A Survey on Multimodal Retrieval-Augmented Generation

⭐ 459 View on GitHub

rag

Python

This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.

⭐ 431 View on GitHub

agentic-rag-for-dummies

Jupyter Notebook

A minimal Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.

⭐ 1,449 View on GitHub

SqlDatabaseVectorSearch

C#

A Blazor Web App and Minimal API for performing RAG (Retrieval Augmented Generation) and vector search using the native VECTOR type in Azure SQL Database and Azure OpenAI.

⭐ 129 View on GitHub

LightRAG

Python

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

⭐ 27,221 View on GitHub

Awesome-RAG-Vision

Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision

⭐ 297 View on GitHub

raglite

Python

🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL

⭐ 1,132 View on GitHub

serverless-chat-langchainjs

Bicep

Build your own serverless AI Chat with Retrieval-Augmented-Generation using LangChain.js, TypeScript and Azure

⭐ 838 View on GitHub

AI-agents-for-cybersecurity

Python

This repository contains resources and materials for the "AI Agents and Retrieval Augmented Generation (RAG) for Cybersecurity Operations" and other courses by Omar Santos.

⭐ 136 View on GitHub

local-LLM-with-RAG

Python

Running local Language Language Models (LLM) to perform Retrieval-Augmented Generation (RAG)

⭐ 250 View on GitHub

context-portal

Python

Context Portal (ConPort): A memory bank MCP server building a project-specific knowledge graph to supercharge AI assistants. Enables powerful Retrieval Augmented Generation (RAG) for context-aware development in your IDE.

⭐ 720 View on GitHub

XRAG

Python

XRAG: eXamining the Core - Benchmarking Foundational Component Modules in Advanced Retrieval-Augmented Generation

⭐ 114 View on GitHub

RAGHub

A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.

⭐ 1,528 View on GitHub

Awesome-GraphRAG

Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and opensource projects) on graph-based retrieval-augmented generation.

⭐ 2,024 View on GitHub

Advanced-QA-and-RAG-Series

Jupyter Notebook

This repository contains advanced LLM-based chatbots for Q&A using LLM agents, and Retrieval Augmented Generation (RAG) and with different databases. (VectorDB, GraphDB, SQLite, CSV, XLSX, etc.)

⭐ 423 View on GitHub

AutoRAG

Python

AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

⭐ 4,518 View on GitHub

GPT-RAG

Bicep

Sharing the learning along the way we been gathering to enable Azure OpenAI at enterprise scale in a secure manner. GPT-RAG core is a Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

⭐ 1,124 View on GitHub

fed-rag

Python

A framework for fine-tuning retrieval-augmented generation (RAG) systems.

⭐ 139 View on GitHub

ollama_pdf_rag

TypeScript

A full-stack demo showcasing a local RAG (Retrieval Augmented Generation) pipeline to chat with your PDFs.

⭐ 479 View on GitHub

Hyper-RAG

Python

"Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation" by Yifan Feng, Hao Hu, Xingliang Hou, Shiquan Liu, Shihui Ying, Shaoyi Du, Han Hu, and Yue Gao.

⭐ 234 View on GitHub

renumics-rag

Python

Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚

⭐ 196 View on GitHub

ai-sdk-preview-rag

TypeScript

Retrieval-augmented generation (RAG) template powered by the AI SDK.

⭐ 385 View on GitHub

RAG_Techniques

Jupyter Notebook

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.

⭐ 24,124 View on GitHub

rag-chatbot

Python

RAG (Retrieval-augmented generation) ChatBot that provides answers based on contextual information extracted from a collection of Markdown files.

⭐ 368 View on GitHub

rag-web-ui

TypeScript

RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.

⭐ 2,737 View on GitHub

document-chat-system

TypeScript

Open-source document chat platform with semantic search, RAG (Retrieval Augmented Generation), and multi-provider AI support (OpenRouter, OpenAI, ImageRouter).

⭐ 121 View on GitHub