Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
-
Updated
Jun 1, 2026 - Python
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
The official GitHub page for the survey paper "A Survey of Large Language Models".
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Robust recipes to align language models with human and AI preferences
OpenClaw-RL: Train any agent simply by talking
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.
Align Anything: Training All-modality Model with Feedback
Implement a reasoning LLM in PyTorch from scratch, step by step
A curated list of reinforcement learning with human feedback resources (continually updated)
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
A Doctor for your data
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Add a description, image, and links to the rlhf topic page so that developers can more easily learn about it.
To associate your repository with the rlhf topic, visit your repo's landing page and select "manage topics."