In 2020, frontier AI meant closed APIs from a handful of companies. By 2026, open-weight models matched proprietary ones on most benchmarks, and a single open-source release from a Chinese lab could wipe $593 billion from the US stock market in a day. This arc traces the fork between closed-source and open-weight AI — from the implicit monopoly of compute, through the LLaMA leak that started an open revolution, to DeepSeek’s efficiency shock and the geopolitical twist of export controls that backfired. The question is no longer whether open source can compete, but what happens when both paths converge on commoditization.
01. The Closed-Source Default
Before 2023, the phrase “open-source AI” meant open-source frameworks like TensorFlow and PyTorch — not open models. Frontier models were inherently closed because training them required compute resources that only a handful of companies could afford. OpenAI’s GPT-3 (June 2020), a 175-billion-parameter model trained on 300 billion tokens with an estimated 3.14e23 FLOPS, was accessible only through a paid API. DALL-E 2 (April 2022) was a closed web app. GitHub Copilot (June 2022) charged $19/month. And ChatGPT (November 2022) was free to use but its weights were entirely proprietary. There was no “open vs closed” debate because there was no open option. The debate simply didn’t exist.
Key Insight
Closed source wasn’t a business choice — it was a physical constraint of compute concentration.
The fork began in August 2022, when Stability AI open-sourced Stable Diffusion — a text-to-image model that rivaled closed offerings from OpenAI and Google. Anyone with a consumer GPU could generate high-quality images. But the real shock came six months later. Meta released LLaMA in February 2023 under a non-commercial research license, sharing weights with academic researchers via a request form. Within days, the weights were leaked on 4chan. What followed was a Cambrian explosion: Stanford’s Alpaca fine-tuned LLaMA-7B for $600; LMSYS’s Vicuna reached 90% of ChatGPT quality; dozens of derivative models emerged within weeks. The leaked LLaMA weights became the seed of the entire open-weight LLM ecosystem. Meta responded by releasing Llama 2 under a commercial license in July 2023, and by July 2024, Llama 3.1 405B became the first open-weight model to reach frontier-level performance — the open path was now real.
Key Insight
The open-weight revolution began not with a business decision, but with a leak on 4chan.
On January 20, 2025, Chinese AI lab DeepSeek released R1 — an MIT-licensed open-weight reasoning model matching OpenAI’s o1 on key benchmarks. The training cost: approximately $5.9 million, roughly 1/20th of comparable closed models. Using Mixture-of-Experts architecture and the GRPO (Group Relative Policy Optimization) reinforcement learning technique, DeepSeek proved that frontier reasoning didn’t require billions of dollars in compute. The market reaction one week later was unprecedented: Nvidia lost $593 billion in a single day — the largest single-day value loss in history. The Nasdaq fell 3%. Tech billionaires lost nearly $100 billion combined. The selloff reflected a fundamental revaluation: if frontier AI could be built with fewer GPUs, the entire $500B AI infrastructure buildout needed re-examination. DeepSeek R1 demonstrated that open-source models could not only compete with closed ones on quality, but also expose the economic assumptions behind the closed-source ecosystem.
Key Insight
When open models matched closed ones on quality, the entire capital allocation thesis for AI collapsed and reset.
On June 12, 2026, the US government ordered an export ban on Anthropic’s Fable 5, citing national security concerns. The intended message: closed frontier models from American companies must not flow to adversaries. The actual outcome was immediate and ironic. The very next day — June 13 — Z.ai (formerly Zhipu AI) released GLM-5.2, a 744-billion-parameter MoE model under the MIT license with a 1-million-token context window. On coding benchmarks (FrontierSWE, PostTrainBench), it matched or exceeded GPT-5.5 at roughly one-sixth the inference cost. Z.ai explicitly framed the timing as a direct response to the Fable 5 ban. The pattern was now clear: export controls on closed models were not containing AI capability — they were accelerating open-source alternatives. DeepSeek V4, released in April 2026 with 1.6 trillion parameters trained on Huawei Ascend chips, confirmed that the open path had become the primary channel for AI capability diffusion across borders.
Key Insight
Export controls on closed models became the most effective accelerator of open-source alternatives.
By mid-2026, the benchmark gap between open and closed models had shrunk from a canyon to a crack. Open models matched or exceeded closed ones on MMLU, MATH, HumanEval, and GPQA Diamond. Closed models maintained a lead only on complex agentic coding (SWE-bench) and human preference (Chatbot Arena). The question has shifted from “can open source compete?” to a more uncomfortable one: what happens when both paths commoditize? OpenAI’s restructure as a for-profit PBC (October 2025) signaled that the closed path needs a sustainable business model. Sora’s shutdown (April 2026) showed that even frontier labs kill products that don’t find market fit. Meanwhile, open standards like Anthropic’s MCP Protocol (November 2024) and consumer phenomena like OpenClaw (January 2026) suggested that the real competition is moving from model capability to ecosystem control, data access, and distribution. The fork is no longer a binary choice. It has become a structural feature of the AI industry — two parallel paths that will coexist, compete, and reshape each other for years to come.
Key Insight
The real competition is no longer model quality — it’s ecosystem, data access, and distribution control.
In six years, AI went from having no open-source model ecosystem at all to a world where open-weight models are a permanent feature of the landscape. The fork was driven not by ideology but by events: a weight leak, an efficiency breakthrough, and a geopolitical miscalculation. Closed-source models still lead in certain domains, but the margin is thin and shrinking. The deeper story is that the open vs closed debate was never really about philosophy — it was about who controls access to AI capability. And that question, far from being settled, has only become more consequential as both paths mature.