Open Sources

Curated repos, tools, and frameworks shaping the developer ecosystem.
Live data from GitHub.

🔥 [2026/06] 🎬 SANA-Streaming: 2B Model for Real-time Streaming Editing is released! Supports 720p, 1-min video editing. A pioneer work for streaming editing. See Project | Doc | Paper | Reactor Demo.
🔥 [2026/05] 🌍 SANA-WM: 2.6B Controllable World Model is released! Supports 720p, 1-min video generation with 6-DoF camera control. A new baseline for World Modeling and Embodied AI. See Project | Doc | Paper | Reactor Demo.
🔥 [2026/04] ⚡ Sol-RL: NVFP4 Rollout, BF16 Training RL is available! All training recipes for SANA, FLUX.1, and SD3.5-L, together with bundled post-training datasets, are released. See Sol-RL doc | Page | Paper.
🔥 [2026/03] 📺 SANA-Video 720p model with LTX-VAE is released. Use it with LTX2 Refiner to upscale the videos to 2K resolution! See Model Zoo, SANA-Video doc and Blog about refiner.
🔥 [2026/03] 💪 Post Training Infra: SANA × Cosmos-RL — We partner with Cosmos-RL to provide a complete RL infrastructure for SANA. You can now post-train (SFT/RL) SANA-Image and SANA-Video with state-of-the-art algorithms (e.g. Diffusion-NFT, Flow-GRPO), preset configs, reward services, and flexible datasets. See SANA on Cosmos-RL and our Cosmos-RL integration doc.
🔥 [2026/02] 🚀 SANA is now supported in SGLang! High-performance serving with OpenAI-compatible API. [Guidance]
🔥 [2026/01/26] SANA-Video is accepted as Oral by ICLR-2026. 🎉🎉🎉
🔥 [2025/12/09] 🎬 LongSANA: 27FPS real-time minute-length video generation model, training and inference code are all released. Thanks to LongLive Team. Refer to: [Train] | [Test] | [Weight]
🔥 [2025/11/24] 🪶 Blog: how Causal Linear Attention unlocks infinite context for LLMs and long video generation.
🔥 [2025/11/9] 🎬 Introduction video shows how Block Causal Linear Attention and Causal Mix-FFN work?
🔥 [2025/11/6] 📺SANA-Video is merged into diffusers. How to use.
🔥 [2025/10/27] 📺SANA-Video is released. [README] | [Weights] support Text-to-Video, TextImage-to-Video.
🔥 [2025/10/13] 📺SANA-Video is coming, 1). a 5s Linear DiT Video model, and 2). real-time minute-length video generation (with LongLive). [paper] | [Page]

Click to show all updates

✅ [2025/8/20] We release a new DC-AE-Lite for faster inference and smaller memory. [How to config] | [diffusers PR] | [Weight]
✅ [2025/6/25] SANA-Sprint was accepted to ICCV'25 🏖️
✅ [2025/6/4] SANA-Sprint ComfyUI Node is released [Example].
✅ [2025/5/8] SANA-Sprint (One-step diffusion) diffusers training code is released [Guidance].
✅ [2025/5/4] SANA-1.5 (Inference-time scaling) is accepted by ICML-2025. 🎉🎉🎉
✅ [2025/3/22] 🔥SANA-Sprint demo is hosted on Huggingface, try it! 🎉 [Demo Link]
✅ [2025/3/22] 🔥SANA-1.5 is supported in ComfyUI! 🎉: ComfyUI Guidance | ComfyUI Work Flow SANA-1.5 4.8B
✅ [2025/3/22] 🔥SANA-Sprint code & weights are released! 🎉 Include: Training & Inference code and Weights / HF are all released. [Guidance]
✅ [2025/3/21] 🚀Sana + Inference Scaling is released. [Guidance]
✅ [2025/3/16] 🔥SANA-1.5 code & weights are released! 🎉 Include: DDP/FSDP | TAR file WebDataset | Multi-Scale Training code and Weights | HF are all released.
✅ [2025/3/14] 🏃SANA-Sprint is coming out! 🎉 A new one/few-step generator of Sana. 0.1s per 1024px image on H100, 0.3s on RTX 4090. Find out more details: [Page] | [Arxiv]. Code is coming very soon along with diffusers
✅ [2025/2/10] 🚀Sana + ControlNet is released. [Guidance] | [Model] | [Demo]
✅ [2025/1/30] Release CAME-8bit optimizer code. Saving more GPU memory during training. [How to config]
✅ [2025/1/29] 🎉 🎉 🎉SANA 1.5 is out! Figure out how to do efficient training & inference scaling! 🚀[Tech Report]
✅ [2025/1/24] 4bit-Sana is released, powered by SVDQuant and Nunchaku inference engine. Now run your Sana within 8GB GPU VRAM [Guidance] [Demo] [Model]
✅ [2025/1/24] DCAE-1.1 is released, better reconstruction quality. [Model] [diffusers]
✅ [2025/1/23] Sana is accepted as Oral by ICLR-2025. 🎉🎉🎉
✅ [2025/1/12] DC-AE tiling makes Sana-4K inferences 4096x4096px images within 22GB GPU memory. With model offload and 8bit/4bit quantize. The 4K Sana run within 8GB GPU VRAM. [Guidance]
✅ [2025/1/11] Sana code-base license changed to Apache 2.0.
✅ [2025/1/10] Inference Sana with 8bit quantization.[Guidance]
✅ [2025/1/8] 4K resolution Sana models is supported in Sana-ComfyUI and work flow is also prepared. [4K guidance]
✅ [2025/1/8] 1.6B 4K resolution Sana models are released: [BF16 pth] or [BF16 diffusers]. 🚀 Get your 4096x4096 resolution images within 20 seconds! Find more samples in Sana page. Thanks SUPIR for their wonderful work and support.
✅ [2025/1/2] Bug in the diffusers pipeline is solved. Solved PR
✅ [2025/1/2] 2K resolution Sana models is supported in Sana-ComfyUI and work flow is also prepared.
✅ [2024/12] 1.6B 2K resolution Sana models are released: [BF16 pth] or [BF16 diffusers]. 🚀 Get your 2K resolution images within 4 seconds! Find more samples in Sana page. Thanks SUPIR for their wonderful work and support.
✅ [2024/12] diffusers supports Sana-LoRA fine-tuning! Sana-LoRA's training and convergence speed is super fast. [Guidance] or [diffusers docs].
✅ [2024/12] diffusers has Sana! All Sana models in diffusers safetensors are released and diffusers pipeline SanaPipeline, SanaPAGPipeline, DPMSolverMultistepScheduler(with FlowMatching) are all supported now. We prepare a Model Card for you to choose.
✅ [2024/12] 1.6B BF16 Sana model is released for stable fine-tuning.
✅ [2024/12] We release the ComfyUI node for Sana. [Guidance]
✅ [2024/11] All multi-linguistic (Emoji & Chinese & English) SFT models are released: 1.6B-512px, 1.6B-1024px, 600M-512px, 600M-1024px. The metric performance is shown here
✅ [2024/11] Sana Replicate API is launching at Sana-API.
✅ [2024/11] 1.6B Sana models are released.
✅ [2024/11] Training & Inference & Metrics code are released.
✅ [2024/11] Working on diffusers.
[2024/10] Demo is released.
[2024/10] DC-AE Code and weights are released!
[2024/10] Paper is on Arxiv!

git clone https://github.com/NVlabs/Sana.git
cd Sana && ./environment_setup.sh sana

import torch
from diffusers import SanaPipeline

pipe = SanaPipeline.from_pretrained(
    "Efficient-Large-Model/SANA1.5_1.6B_1024px_diffusers",
    torch_dtype=torch.bfloat16,
)
pipe.to("cuda")

pipe.vae.to(torch.bfloat16)
pipe.text_encoder.to(torch.bfloat16)

prompt = 'a cyberpunk cat with a neon sign that says "Sana"'
image = pipe(
    prompt=prompt,
    height=1024,
    width=1024,
    guidance_scale=4.5,
    num_inference_steps=20,
    generator=torch.Generator(device="cuda").manual_seed(42),
)[0]

image[0].save("sana.png")

Methods (1024x1024)	Throughput (samples/s)	Latency (s)	Params (B)	Speedup	FID 👇	CLIP 👆	GenEval 👆	DPG 👆
FLUX-dev	0.04	23.0	12.0	1.0×	10.15	27.47	0.67	84.0
Sana-0.6B	1.7	0.9	0.6	39.5×	5.81	28.36	0.64	83.6
Sana-0.6B	1.7	0.9	0.6	39.5×	5.61	28.80	0.68	84.2
Sana-1.6B	1.0	1.2	1.6	23.3×	5.92	28.94	0.69	84.5
Sana-1.5 1.6B	1.0	1.2	1.6	23.3×	5.70	29.12	0.82	84.5
Sana-1.5 4.8B	0.26	4.2	4.8	6.5×	5.99	29.23	0.81	84.7

Models	Latency (s)	Params (B)	VBench Total ↑	Quality ↑	Semantic ↑
Wan-2.1-14B	1897	14	83.73	85.77	75.58
Wan-2.1-1.3B	400	1.3	83.38	85.67	74.22
SANA-Video-2B	36	2	84.05	84.63	81.73

@misc{xie2024sana,
      title={Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer},
      author={Enze Xie and Junsong Chen and Junyu Chen and Han Cai and Haotian Tang and Yujun Lin and Zhekai Zhang and Muyang Li and Ligeng Zhu and Yao Lu and Song Han},
      year={2024},
      eprint={2410.10629},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2410.10629},
    }

Click to expand all BibTeX citations

@misc{xie2025sana,
      title={SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer},
      author={Xie, Enze and Chen, Junsong and Zhao, Yuyang rectangle and Yu, Jincheng and Zhu, Ligeng and Lin, Yujun and Zhang, Zhekai and Li, Muyang and Chen, Junyu and Cai, Han and others},
      year={2025},
      eprint={2501.18427},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2501.18427},
    }

@misc{chen2025sanasprint,
      title={SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation},
      author={Junsong Chen and Shuchen Xue and Yuyang Zhao and Jincheng Yu graves and Sayak Paul and Junyu Chen and Han Cai and Song Han and Enze Xie},
      year={2025},
      eprint={2503.09641},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.09641},
    }

@misc{chen2025sanavideo,
      title={SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer},
      author={Chen, Junsong and Zhao, Yuyang and Yu, Jincheng and Chu, Ruihang and Chen, Junyu and Yang, Shuai and Wang, Xianbang and Pan, Yicheng and Zhou, Daquan and Ling, Huan and others},
      year={2025},
      eprint={2509.24695},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2509.24695},
    }

@misc{li2026fp4,
      title={FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling},
      author={Li, Yitong and Chen, Junsong and Xue, Shuchen and Zeren, Pengcuo and Fu, Siyuan and Yang, Dinghao and Tang, Yangyang and Bai, Junjie and Luo, Ping and Han, Song and others},
      year={2026}
      eprint={2604.06916},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2604.06916},
}

@misc{zhu2026sanawm,
      title={SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer},
      author={Haoyi Zhu and Haozhe Liu and Yuyang Zhao and Tian Ye and Junsong Chen and Jincheng Yu and Tong He and Song Han and Enze Xie},
      year={2026},
      eprint={2605.15178},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2605.15178},
}

@misc{zhao2026sanastreamingrealtimestreamingvideo,
      title={SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer},
      author={Yuyang Zhao and Yicheng Pan and Qiyuan He and Jincheng Yu and Junsong Chen and Tian Ye and Haozhe Liu and Enze Xie and Song Han},
      year={2026},
      eprint={2605.30409},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2605.30409},
}

Open Sources

Sana

About this project

📚 Docs | SANA | SANA-1.5 | SANA-Sprint | SANA-Video | SANA-WM | SANA-Streaming | Sol-RL
Demo | 🤗 HuggingFace | ComfyUI | SGLang | Cosmos-RL

Related Projects

hermes-agent

yt-dlp

ICLR 2025 Oral | ICML 2025 | ICCV 2025 Highlight | ICLR 2026 Oral

News

💡 Introduction

Quick Start

Inference with 🧨 diffusers

Getting Started

Performance

Image Generation (1024px)

Video Generation (VBench 720p)

💪To-Do List

🤗 Acknowledgements

Contribution

🌟 Star History

📖 BibTeX

stable-diffusion-webui

Open Sources

We read 100+ sources so you don't have to.

Sana

About this project

📚 Docs | SANA | SANA-1.5 | SANA-Sprint | SANA-Video | SANA-WM | SANA-Streaming | Sol-RL Demo | 🤗 HuggingFace | ComfyUI | SGLang | Cosmos-RL

Related Projects

hermes-agent

yt-dlp

ICLR 2025 Oral | ICML 2025 | ICCV 2025 Highlight | ICLR 2026 Oral

News

💡 Introduction

Quick Start

Inference with 🧨 diffusers

Getting Started

Performance

Image Generation (1024px)

Video Generation (VBench 720p)

💪To-Do List

🤗 Acknowledgements

Contribution

🌟 Star History

📖 BibTeX

stable-diffusion-webui

📚 Docs | SANA | SANA-1.5 | SANA-Sprint | SANA-Video | SANA-WM | SANA-Streaming | Sol-RL
Demo | 🤗 HuggingFace | ComfyUI | SGLang | Cosmos-RL