WED, 03 JUN 2026 · 17:42:48 UTC
Department · Open Source17 tools

The open source AI directory.

Permissive licensing, downloadable weights, fork-friendly code. The tools you can run on your own infrastructure — audited, fine-tuned, or both. China is leading several of the categories; we cover everything on merit.

Why open source AI matters in 2026

Two years ago "open source AI" was Llama 2 and a handful of community fine-tunes. In 2026 it's a serious procurement category — frontier-class models with permissive licensing, mature inference stacks, and a Chinese open-weights ecosystem that has redrawn the competitive map. For any team building on AI, the question is no longer whether to consider open weights; it's when.

The case for open source comes down to four things: audit (you can read the weights and the training recipe), portability (no vendor lock-in, you can switch infrastructure overnight), customisation (fine-tuning on your data, no commercial cap), and cost (per-token serving costs are 5–20× lower at the same quality tier). The case against — capability gap, support overhead, and the operational tax of running your own infrastructure — has narrowed every quarter since Llama 3.1 in mid-2024.

The model layer

The open-weights frontier is now genuinely competitive with closed models. Meta's Llama 3.3 70B matches its own 405B sibling at a fifth of the serving cost. DeepSeek's V3 (671B MoE, 37B active per token) was trained for a reported $5.6M and ships with quality competitive with Claude 3.5 Sonnet — released under a permissive licence that allows commercial use without caveats. Its sibling R1 is the first open reasoning model at o1 quality, released under MIT.

Moonshot's Kimi K2 (1T MoE, 32B active) is the strongest open-weights coding agent at the time of writing — 65.8% on SWE-bench Verified, comparable to Claude Sonnet 4. Alibaba's Qwen 2.5 72B is Apache 2.0 and ships alongside specialist Coder, Math, and Audio variants that share the same backbone. Mistral's European stack — Large 2 for chat, Codestral 25 for autocomplete — extends the picture with EU-resident inference. And the small-model regime is dominated by Microsoft Research's Phi-4 at 14B, MIT-licensed and small enough to run on a single consumer GPU.

The Chinese surge — and why the geopolitics is the wrong frame

The story of 2025 was the Chinese labs proving that frontier capability and permissive licensing aren't mutually exclusive. DeepSeek, Qwen, Kimi, and the smaller open-weight efforts from 01.AI and Yi collectively shipped four world-class model families with quality matching or beating their American closed-weight counterparts — all open under MIT or Apache variants.

Procurement teams sometimes hesitate on Chinese-origin models on jurisdictional grounds. The pragmatic view: the model weights are static files you serve on your own infrastructure, in your own region, with your own observability. The training-data provenance question is real but applies to every closed model equally — at least with open weights you can audit them. Where geopolitical concern is genuinely warranted is the hosted API layer (chat.deepseek.com, chat.qwen.ai) where queries leave your perimeter. That's a different decision than the model itself.

The tooling layer

Around the model layer sits an increasingly mature open-source tooling stack. Aider, Cline, and Continue are the open coding agents — three different takes on bringing Claude-Code-style workflows to your editor, all running against whichever model you point them at. Civitai is the hub for open image-generation models. Hugging Face remains the gravitational centre of model distribution.

The infrastructure layer is similarly open: vLLM and Ollama for serving, LangChain and LlamaIndex for orchestration, Weaviate, Chroma, Qdrant, and Milvus for vector search, LangFuse and Helicone for observability. None of these have a closed-source equivalent that meaningfully outperforms them; the closed alternatives compete on support and integration rather than capability.

When to pick open over closed

Open weights win cleanly when any of these apply: you need to fine-tune on proprietary data, you have strict data-residency or audit requirements, you serve enough volume that per-token cost dominates (typically above 50M tokens/day), or you simply want infrastructure independence from the trio of frontier labs.

Closed models still win for the hardest agentic workloads (Claude Opus 4 leads SWE-bench Verified), the latest reasoning research (o1, o3-mini), and the most integrated multimodal experiences (GPT-4o voice, Gemini 2.5 Pro video). For most production workloads in 2026 the answer is hybrid: open for the bulk, closed for the hard edge cases.

Below: every open-source tool we currently track, grouped by category. Tools with the ◯ Open source tag are downloadable / forkable; Chinese-origin tools are surfaced exactly the same way as Western ones — we don't maintain separate listings.

The list

Every open source tool we track.

All tools
Open source

Wan

Alibaba Cloud's open-source AI suite for generating and editing images and videos with precise control over text, color, and characters.

videoPaid
Open source

MiniMax

A full-stack multimodal AI platform offering text, video, voice, music, and agent capabilities for developers and creators.

apiPaid
Open source

Qwen

An open ecosystem of multilingual foundation models spanning chat, image generation, translation, and AI safety guardrails.

chatFreemium
Open source

DeepSeek

Open-weight AI models and a free chatbot for reasoning, coding, and general assistance.

chatFreemium
Open source

Moonshot AI

Research-driven AI lab offering the Kimi assistant for coding, analysis, slides, and multimodal research with API access.

chatFreemium
Open source

ChatGLM

A Chinese-native AI assistant platform powered by the GLM-5.1 agentic model, offering long-running autonomous agents, multimodal tools, and team collaboration via WeChat and Feishu.

agentsFreemium
Open source

Civitai

The largest community hub for discovering, sharing, and creating AI art models, images, and workflows.

imageFree
Open source

Stability AI

Enterprise creative production platform powered by generative AI for marketing, gaming, and entertainment teams.

imageFreemium
Open source

Continue

Source-controlled AI checks that enforce engineering standards on every GitHub pull request.

codingPaid
Open source

Cline

Open-source AI coding assistant for VS Code and CLI with bring-your-own-key inference pricing.

codingFreemium
Open source

Aider

AI pair programming in your terminal for developers who want deep codebase awareness and Git-native workflows.

codingFree
Open source

Browser-Use

Open-source agentic browsing library. 1.0 stable with first-class Playwright support.

agentsFreemium
Open source

Supabase

The open-source Firebase, now with first-class pgvector + Iceberg integration.

infrastructureFreemium
Open source

Hugging Face

The open-source AI hub — models, datasets, and now a zero-shot router.

infrastructureFreemium
Open source

n8n

Open-source workflow automation with first-class LLM nodes.

automationFreemium
Open source

Replicate

Run any open model with a single API call, now with sub-100ms cold start.

infrastructurePaid
Open source

Mistral

European AI lab with the best open-weights MoE models on the market.

chatFreemium