Trending Society

Open Sources

Curated repos, tools, and frameworks shaping the developer ecosystem.
Live data from GitHub.

oumi

by oumi-ai

9.3k • Python • dpo • evaluation • fine-tuning • gpt-oss • gpt-oss-120b

Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!

License: Apache-2.0
Stars: 9,318
Forks: 778
Contributors: 53
Last push: Jun 17, 2026

About this project

Everything you need to build state-of-the-art foundation models, end-to-end

🔥 News

[2026/06] Added support for the Gemma 4 model family
[2026/05] Oumi v0.8 released with oumi deploy CLI for dedicated inference endpoints, an oumi-mcp MCP server for Claude/Cursor integration, batch API support across Anthropic/Fireworks/Together, and Transformers v5 / TRL / vLLM dependency upgrades
[2026/03] Upgraded for compatibility with Transformers v5, TRL v0.30, vLLM v0.19, and veRL v0.7
[2026/03] MCP Integration Phase 1: package scaffold and dependencies for MCP server support
[2026/03] New: oumi deploy command for deploying Oumi models to dedicated inference endpoints on fireworks.ai and parasail
[2026/03] Added support for Qwen3.5 model family
[2026/03] Inference engines received multiple improvements, including a list_models API and improved error reporting
[2026/02] Preview of using the Oumi Platform and Lambda to fine-tune and deploy a 4B model for user intent classification
[2026/02] Lambda and Oumi partner for end-to-end custom model development
[2025/12] Oumi v0.6.0 released with Python 3.13 support, oumi analyze CLI command, TRL 0.26+ support, and more
[2025/12] WeMakeDevs AI Agents Assemble Hackathon: Oumi webinar on Finetuning for Text-to-SQL

Notebook	Goal
🎯 Getting Started: A Tour	Quick tour of core features: training, evaluation, inference, and job management
🔧 Model Finetuning Guide	End-to-end guide to LoRA tuning with data prep, training, and evaluation
📚 Model Distillation	Guide to distilling large models into smaller, efficient ones
📋 Model Evaluation	Comprehensive model evaluation using Oumi's evaluation framework
☁️ Remote Training	Launch and monitor training jobs on cloud platforms (AWS, Azure, GCP, Lambda, etc.)
📈 LLM-as-a-Judge	Filter and curate training data with built-in judges

Using pip (Recommended)

# Basic installation
uv pip install oumi

# With GPU support
uv pip install 'oumi[gpu]'

# Latest development version
uv pip install git+https://github.com/oumi-ai/oumi.git

Don't have uv? Install it or use pip instead.
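
After installing, it can help to confirm that both the Python package and the `oumi` command are visible in the current environment. The sketch below uses only the Python standard library; the helper name `oumi_install_status` is hypothetical and is not part of Oumi.

```python
import shutil
from importlib import metadata


def oumi_install_status():
    """Report the installed oumi version and the location of the oumi CLI.

    Hypothetical helper for illustration; it is not part of the Oumi package.
    """
    try:
        # "oumi" is the distribution name used in the install commands above.
        version = metadata.version("oumi")
    except metadata.PackageNotFoundError:
        version = None
    # shutil.which returns None when the command is not on PATH.
    return {"version": version, "cli_path": shutil.which("oumi")}


status = oumi_install_status()
print(status)
```

If `version` is set but `cli_path` is `None`, the package was probably installed into a different environment than the one on your PATH.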

Using Docker

# Pull the latest image
docker pull ghcr.io/oumi-ai/oumi:latest

# Run oumi commands
docker run --gpus all -it ghcr.io/oumi-ai/oumi:latest oumi --help

# Train with a mounted config
docker run --gpus all -v $(pwd):/workspace -it ghcr.io/oumi-ai/oumi:latest \
    oumi train --config /workspace/my_config.yaml
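
If you script the Docker workflow, the mounted-config command above can be assembled as an argument list rather than a shell string, which avoids quoting problems with paths that contain spaces. This is a minimal sketch: `docker_train_command` is a hypothetical helper, and `os.getcwd()` stands in for `$(pwd)`.

```python
import os
import shlex

IMAGE = "ghcr.io/oumi-ai/oumi:latest"  # image name taken from the commands above


def docker_train_command(config_name="my_config.yaml", host_dir=None):
    """Build the argv for training inside the container with a mounted config.

    Hypothetical helper; host_dir defaults to the current working directory.
    """
    host_dir = host_dir or os.getcwd()
    return [
        "docker", "run", "--gpus", "all",
        "-v", f"{host_dir}:/workspace",  # mount the host directory at /workspace
        "-it", IMAGE,
        "oumi", "train", "--config", f"/workspace/{config_name}",
    ]


# Print the command for inspection; pass the list to subprocess.run to execute it.
print(shlex.join(docker_train_command()))
```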

# Training
oumi train -c configs/recipes/smollm/sft/135m/quickstart_train.yaml

# Evaluation
oumi evaluate -c configs/recipes/smollm/evaluation/135m/quickstart_eval.yaml

# Inference
oumi infer -c configs/recipes/smollm/inference/135m_infer.yaml --interactive
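
The training and evaluation commands above can be chained so that evaluation only starts if training succeeds. The sketch below is one way to do that from Python, assuming the `oumi` CLI is on PATH and the commands are run from a checkout that contains the `configs/` directory. The helper names are hypothetical, and the interactive inference step is left out because it expects a terminal.

```python
import shlex
import subprocess

# Config paths copied from the quickstart commands above.
QUICKSTART_STEPS = [
    ("train", "configs/recipes/smollm/sft/135m/quickstart_train.yaml"),
    ("evaluate", "configs/recipes/smollm/evaluation/135m/quickstart_eval.yaml"),
]


def build_commands(steps=QUICKSTART_STEPS):
    """Return one argv list per step: ['oumi', <subcommand>, '-c', <config>]."""
    return [["oumi", subcommand, "-c", config] for subcommand, config in steps]


def run_pipeline(commands, dry_run=True):
    """Print each command and execute it only when dry_run is False."""
    for argv in commands:
        print(shlex.join(argv))
        if not dry_run:
            # check=True raises CalledProcessError on a non-zero exit code,
            # so a failed training run stops the loop before evaluation.
            subprocess.run(argv, check=True)


commands = build_commands()
run_pipeline(commands)  # dry run: prints the commands and executes nothing
```

Call `run_pipeline(commands, dry_run=False)` to actually execute the steps.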

# GCP
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml

# AWS
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud aws

# Azure
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud azure

# Lambda
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud lambda
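
Note that the four launch commands share one job config and differ only in the `--resources.cloud` override. The sketch below makes that pattern explicit by generating each command from a single function. `launch_command` is a hypothetical helper, and treating `gcp` as the config's default cloud is an assumption based on the file name.

```python
import shlex

# Job config reused by every launch command above.
JOB_CONFIG = "configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml"


def launch_command(cloud="gcp", config=JOB_CONFIG):
    """Build the argv for `oumi launch up`, overriding the cloud when needed.

    Hypothetical helper; assumes the config itself targets GCP.
    """
    argv = ["oumi", "launch", "up", "-c", config]
    if cloud != "gcp":
        # Same override flag as in the AWS, Azure, and Lambda commands above.
        argv += ["--resources.cloud", cloud]
    return argv


for cloud in ("gcp", "aws", "azure", "lambda"):
    print(shlex.join(launch_command(cloud)))
```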

Model	Example Configurations
Qwen3-Next 80B A3B	LoRA • Inference • Inference (Instruct) • Evaluation
Qwen3 30B A3B	LoRA • Inference • Evaluation
Qwen3 32B	LoRA • Inference • Evaluation
Qwen3 14B	LoRA • Inference • Evaluation
Qwen3 8B	FFT • Inference • Evaluation
Qwen3 4B	FFT • Inference • Evaluation
Qwen3 1.7B	FFT • Inference • Evaluation
Qwen3 0.6B	FFT • Inference • Evaluation
QwQ 32B	FFT • LoRA • QLoRA • Inference • Evaluation
Qwen2.5-VL 3B	SFT • LoRA • Inference (vLLM) • Inference
Qwen2-VL 2B	SFT • LoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation

Model	Example Configurations
DeepSeek R1 671B	Inference (Together AI)
Distilled Llama 8B	FFT • LoRA • QLoRA • Inference • Evaluation
Distilled Llama 70B	FFT • LoRA • QLoRA • Inference • Evaluation
Distilled Qwen 1.5B	FFT • LoRA • Inference • Evaluation
Distilled Qwen 32B	LoRA • Inference • Evaluation

Model	Example Configurations
Llama 4 Scout Instruct 17B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference • Inference (Together.ai)
Llama 4 Scout 17B	FFT
Llama 3.1 8B	FFT • LoRA • QLoRA • Pre-training • Inference (vLLM) • Inference • Evaluation
Llama 3.1 70B	FFT • LoRA • QLoRA • Inference • Evaluation
Llama 3.1 405B	FFT • LoRA • QLoRA
Llama 3.2 1B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation
Llama 3.2 3B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation
Llama 3.3 70B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference • Evaluation
Llama 3.2 Vision 11B	SFT • Inference (vLLM) • Inference (SGLang) • Evaluation

Model	Example Configurations
Llama 3.2 Vision 11B	SFT • LoRA • Inference (vLLM) • Inference (SGLang) • Evaluation
LLaVA 7B	SFT • Inference (vLLM) • Inference
Phi3 Vision 4.2B	SFT • LoRA • Inference (vLLM)
Phi4 Vision 5.6B	SFT • LoRA • Inference (vLLM) • Inference
Qwen2-VL 2B	SFT • LoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation
Qwen3-VL 2B	Inference
Qwen3-VL 4B	Inference
Qwen3-VL 8B	Inference
Qwen2.5-VL 3B	SFT • LoRA • Inference (vLLM) • Inference
SmolVLM-Instruct 2B	SFT • LoRA

📋 More supported models

Instruct Models

Model	Size	Paper	HF Hub	License	Open ¹
✅ SmolLM-Instruct	135M/360M/1.7B	Blog	Hub	Apache 2.0	✅
✅ DeepSeek R1 Family	1.5B/8B/32B/70B/671B	Blog	Hub	MIT	❌
✅ Llama 3.1 Instruct	8B/70B/405B	Paper	Hub	License	❌
✅ Llama 3.2 Instruct	1B/3B	Paper	Hub	License	❌
✅ Llama 3.3 Instruct	70B	Paper	Hub	License	❌
✅ Phi-3.5-Instruct	4B/14B	Paper	Hub	License	❌
✅ Qwen3	0.6B-32B	Paper	Hub	License	❌
Qwen2.5-Instruct	0.5B-70B	Paper	Hub	License	❌
OLMo 2 Instruct	7B	Paper	Hub	Apache 2.0	✅
✅ OLMo 3 Instruct	7B/32B	Paper	Hub	Apache 2.0	✅
MPT-Instruct	7B	Blog	Hub	Apache 2.0	✅
Command R	35B/104B	Blog	Hub	License	❌
Granite-3.1-Instruct	2B/8B	Paper	Hub	Apache 2.0	❌
Gemma 2 Instruct	2B/9B	Blog	Hub	License	❌
✅ Gemma 3 Instruct	4B/12B/27B	Blog	Hub	License	❌
DBRX-Instruct	130B MoE	Blog	Hub	Apache 2.0	❌
Falcon-Instruct	7B/40B	Paper	Hub	Apache 2.0	❌
✅ Llama 4 Scout Instruct	17B (Activated) 109B (Total)	Paper	Hub	License	❌
✅ Llama 4 Maverick Instruct	17B (Activated) 400B (Total)	Paper	Hub	License	❌

Vision-Language Models

Model	Size	Paper	HF Hub	License	Open
✅ Llama 3.2 Vision	11B	Paper	Hub	License	❌
✅ LLaVA-1.5	7B	Paper	Hub	License	❌
✅ Phi-3 Vision	4.2B	Paper	Hub	License	❌
✅ BLIP-2	3.6B	Paper	Hub	MIT	❌
✅ Qwen2-VL	2B	Blog	Hub	License	❌
✅ Qwen3-VL	2B/4B/8B	Blog	Hub	License	❌
✅ SmolVLM-Instruct	2B	Blog	Hub	Apache 2.0	✅

Base Models

Model	Size	Paper	HF Hub	License	Open
✅ SmolLM2	135M/360M/1.7B	Blog	Hub	Apache 2.0	✅
✅ Llama 3.2	1B/3B	Paper	Hub	License	❌
✅ Llama 3.1	8B/70B/405B	Paper	Hub	License	❌
✅ GPT-2	124M-1.5B	Paper	Hub	MIT	✅
DeepSeek V2	7B/13B	Blog	Hub	License	❌
Gemma2	2B/9B	Blog	Hub	License	❌
GPT-J	6B	Blog	Hub	Apache 2.0	✅
GPT-NeoX	20B	Paper	Hub	Apache 2.0	✅
Mistral	7B	Paper	Hub	Apache 2.0	❌
Mixtral	8x7B/8x22B	Blog

Reasoning Models

Model	Size	Paper	HF Hub	License	Open
✅ gpt-oss	20B/120B	Paper	Hub	Apache 2.0	❌
✅ Qwen3	0.6B-32B	Paper	Hub	License	❌
✅ Qwen3-Next	80B-A3B	Blog	Hub	License	❌
Qwen QwQ	32B	Blog	Hub	License	❌

Code Models

Model	Size	Paper	HF Hub	License	Open
✅ Qwen2.5 Coder	0.5B-32B	Blog	Hub	License	❌
DeepSeek Coder	1.3B-33B	Paper	Hub	License	❌
StarCoder 2	3B/7B/15B	Paper	Hub	License	✅

Math Models

Model	Size	Paper	HF Hub	License	Open
DeepSeek Math	7B	Paper	Hub	License	❌

@software{oumi2025,
  author = {Oumi Community},
  title = {Oumi: an Open, End-to-end Platform for Building Large Foundation Models},
  month = {January},
  year = {2025},
  url = {https://github.com/oumi-ai/oumi}
}