Multi-tenant fine-tuning for local LLMs with Tinker-compatible API
-
Updated
May 29, 2026 - Python
Multi-tenant fine-tuning for local LLMs with Tinker-compatible API
AI agent with multi-agent orchestration, autonomous cognitive systems, and a full management dashboard
Describe images with Ollama
🚀 Unified NLP Pipelines for Language Models
Delta: LLM conversation branching
Fast-ASDLC: 5x TTM with AI-native Agentic SDLC. Local-LLM first, Human-in-the-loop, Spec-driven. Built on DDD, Hexagonal Architecture, C4 Model & MCP. Features Meta-agents for self-improvement, Memory Bank for context persistence, and automated 100% test coverage. Everything-as-Code & Mermaid.js centric to save context window and slash token costs.
A Unity package for building open-source AI voice agents that run fully locally. You can use it to build intelligent non-player characters (NPCs), game interfaces, among many other applications.
Playground for learning by doing
The Operating System for Local Intelligence. ⚙️
J.A.R.V.I.S: An AI-powered Open Source Intelligence (OSINT) system. It orchestrates deep web scraping and local LLMs to autonomously generate comprehensive intelligence dossiers.
A lightweight CLI to orchestrate Gemini and GPT using your local files as a shared blackboard.
Experiments running offline LLMs in Python and Rust locally using Ollama and llama.cpp
A lightweight, self-contained Python project for running local LLM personalities with minimal dependencies. This system uses TinyLlama-1.1B-Chat-v1.0.0 and llama-cpp-python for inference, and Rich for a user-friendly console chat interface. This is a expansion of Tiny-Local-llm which allows you to select from 1 of 3 basic personalities.
Local-first RAG pipeline — ChromaDB, DeepSeek-R1 via Ollama, idempotent ingestion, reactive Marimo UI. Zero cloud APIs. Fully Dockerized.
Auto-benchmark LLLMs (Local Large Language Models), GGUF model files, and llama.cpp configs—with receipts, telemetry, and learning loops to find the best PILOT (Plug-in Inference Layer for Orchestrating Tasks) for your agentic harness.
On device autonomous research and content writing using open-sourced LLMs and Crew AI.
GGUF-Runner - Want to run LLMs locally, use this guide, and run with LLAMA.cpp
RAG Intelligent Question-Answering System Based on LangChain
A terminal-based tool for building flexible AI workflows anywhere. Process documents, create pipelines, and manage context from the command line.
Add a description, image, and links to the local-llms topic page so that developers can more easily learn about it.
To associate your repository with the local-llms topic, visit your repo's landing page and select "manage topics."