WED, 03 JUN 2026 · 18:36:08 UTC

AI21 Labs

FlagshipLab

Israel·HQ Tel Aviv·Est. 2017

Mamba-based LLMs out of Tel Aviv — Jamba family.

6.0

our score

Our take

Pioneering Mamba-Transformer hybrids from Tel Aviv, straddling consumer writing tools and enterprise LLM APIs.

At a glance

Best known for
Jamba hybrid LLMs and the Wordtune writing assistant
Biggest strength
Production-grade Mamba-Transformer research outside the US
Biggest risk
Capital and compute asymmetry versus US hyperscaler labs
Stage
Series C
Primary revenue
Enterprise LLM APIs via AI21 Studio and consumer Wordtune subscriptions

What they do

AI21 Labs is a frontier artificial-intelligence research and product company that designs, trains, and deploys large language models. Headquartered in Tel Aviv, it operates at the intersection of foundational research and commercial application, offering models through two primary channels: AI21 Studio, a developer-facing API platform, and Wordtune, a consumer and professional writing assistant. The company’s flagship technical bet is the Jamba family of models, which marry Structured State Space (Mamba) layers with traditional Transformer attention to improve inference efficiency and extend effective context windows. This hybrid architecture is positioned as an alternative to the pure-Transformer stacks that dominate the market, promising lower latency and reduced compute costs for long-document analysis, code generation, and enterprise search.

The firm sells to a mix of software developers, enterprise IT teams, and individual knowledge workers. Through AI21 Studio, customers can integrate Jurassic and Jamba models into production workflows, with availability on major cloud marketplaces including AWS Bedrock and Google Cloud Vertex AI. Meanwhile, Wordtune competes in the crowded consumer productivity space, offering rewriting, summarization, and tone adjustment across browsers and desktop applications. With 200–500 employees, AI21 is among the largest independent AI labs outside the United States and China, and it markets itself as a privacy-conscious, research-rooted alternative to the Silicon Valley giants.

Origin story

AI21 Labs was founded in 2017 in Tel Aviv by Yoav Shoham, a Stanford professor emeritus and serial entrepreneur, together with Ori Goshen, a product veteran previously associated with Taboola and CrowdX, and Amnon Shashua, who is better known as the founder of Mobileye. The trio set out to advance natural-language processing beyond the then-current state of the art, leveraging Israel’s deep talent pool in machine learning and cryptography.

The company gained early traction with Wordtune, launched around 2020, which demonstrated consumer demand for AI-assisted writing. It followed with the Jurassic-1 model family in 2021, marking its entry into the foundation-model race and establishing AI21 Studio as an enterprise API business. After releasing Jurassic-2 in 2023, the lab made a defining pivot in 2024 toward hybrid architectures, unveiling Jamba—the first production-grade LLM to combine Mamba state-space models with Transformer layers. This shift was accompanied by a $208 million Series C that valued the company at roughly $1.4 billion, bringing in strategic investors including Google and NVIDIA, validating its technical direction while underscoring the immense capital requirements of frontier-model development.

Key products

Jamba 1.5 Large

2024

A hybrid Mamba-Transformer LLM optimized for long-context enterprise tasks such as document analysis and code generation, accessible via API and select cloud marketplaces.

Wordtune

2020

An AI-powered writing assistant for consumers and professionals that offers real-time rewriting, summarization, and tone adjustments across web and desktop apps.

AI21 Studio

2021

A developer platform and API suite for integrating Jurassic and Jamba models into production applications, with tooling for fine-tuning and prompt management.

Jurassic series

2021

Earlier-generation Transformer-based LLMs (Jurassic-1 and Jurassic-2) that established AI21’s enterprise API business and remain available to legacy customers.

Leadership

  • YS

    Yoav Shoham

    Co-founder & Co-CEO

    Stanford professor emeritus; previously co-founded TradingDynamics and Timeful (acquired by Google).

  • OG

    Ori Goshen

    Co-founder & Co-CEO

    Former product leader at Taboola and co-founder of CrowdX; oversees product and commercial strategy.

  • AS

    Amnon Shashua

    Co-founder

    Founder and CEO of Mobileye; provides strategic and technical guidance across the AI21 board.

Strengths & risks

Strengths

  • +Hybrid Mamba-Transformer architecture (Jamba) delivering meaningful efficiency and long-context gains.
  • +One of the longest-running independent frontier labs outside the US and China.
  • +Dual revenue streams reducing reliance on a single enterprise API market.
  • +Deep academic founding team with strong credibility in NLP and machine learning.
  • +Strategic cloud distribution via AWS Bedrock and Google Cloud Vertex AI.

Risks

  • Intense competition from US labs with orders-of-magnitude more capital and compute.
  • Mamba-based architectures remain unproven at the largest scales and developer mindshare.
  • Consumer writing market (Wordtune) is being commoditized by free AI in Office and Google Workspace.
  • Geopolitical and operational exposure tied to headquarters in Israel.
  • Narrower enterprise brand recognition versus OpenAI, Anthropic, and Google.

Recent moves

  1. Released Jamba 1.5 model family

    Mid 2024

    Launched Jamba 1.5 Large and Mini, advancing hybrid Mamba-Transformer designs with stronger long-context performance and improved efficiency benchmarks.

  2. Expanded hyperscaler distribution

    2023-2024

    Broadened availability of Jurassic and Jamba models on AWS Bedrock and Google Cloud Vertex AI to ease enterprise procurement and deployment.

Competitive position

AI21 Labs competes in a brutal arena against OpenAI, Anthropic, Google DeepMind, Cohere, and Mistral. Where it wins is on architectural differentiation: the Jamba family is one of the few production alternatives to the standard Transformer stack, offering potential cost and latency advantages for long-context workloads. Its Israeli roots also provide a geopolitically diversified option for enterprises and governments wary of US-only AI supply chains, and its research pedigree lends credibility in technical procurement processes.

Where it loses is in raw scale, brand awareness, and ecosystem lock-in. OpenAI and Anthropic dominate the frontier-model conversation, while Google and Microsoft bundle competing capabilities into ubiquitous productivity suites. AI21’s consumer product, Wordtune, faces an existential threat from embedded AI in Word, Gmail, and Google Docs. In the API layer, AI21 Studio is a viable second-tier option but lacks the developer zeitgeist of OpenAI’s GPT family or the open-source momentum of Mistral and Llama. The company’s best path forward is to own the efficiency-first, long-context niche and convert that into sticky enterprise contracts before larger rivals close the architecture gap.

What to watch

  • 01Jamba model adoption rates on AI21 Studio versus pure-Transformer alternatives.
  • 02Enterprise customer expansion and average contract value in API business.
  • 03Wordtune subscriber retention as platform-native AI writing tools improve.
  • 04Ability to raise a Series D or reach profitability without strategic acquisition.
  • 05Real-world long-context benchmarks against GPT-4 Turbo and Claude 3.5 Sonnet.

Frequently asked questions

What makes Jamba different from standard Transformer models like GPT-4?

Jamba combines Structured State Space (Mamba) layers with Transformer attention, aiming to reduce memory and compute costs while handling very long contexts more efficiently than pure-Transformer designs.

Is AI21 Labs an Israeli company?

Yes. It was founded in 2017 and is headquartered in Tel Aviv, with founders including prominent Israeli technologists.

Who are the founders of AI21 Labs?

The company was co-founded by Yoav Shoham (Stanford professor), Ori Goshen (product executive), and Amnon Shashua (founder of Mobileye).

What is Wordtune?

Wordtune is AI21’s consumer and professional writing assistant that provides real-time rewriting, summarization, and tone adjustments across browsers and desktop applications.

How do developers access AI21 models?

Developers can use AI21 Studio APIs or access Jurassic and Jamba models through cloud marketplaces such as AWS Bedrock and Google Cloud Vertex AI.

Does AI21 offer open-weight or open-source models?

Some Jamba variants have been released with open weights on Hugging Face, while the latest Jamba 1.5 and enterprise-grade models are primarily available through API and partner clouds.

What is AI21’s primary enterprise value proposition?

It offers a research-backed, privacy-conscious alternative to US hyperscaler models, with particular emphasis on efficient long-context processing and hybrid architecture IP.

How does AI21 compete with better-funded US labs?

It differentiates through architectural innovation (Mamba-Transformer hybrids), non-US geopolitical positioning, and a dual revenue model that includes both consumer subscriptions and enterprise APIs.

The bottom line

AI21 Labs occupies a distinct niche as a non-US frontier lab betting that hybrid Mamba-Transformer architectures can outmaneuver pure-Transformer rivals on efficiency and long-context reasoning. Its Jamba family represents one of the few credible production-scale departures from the GPT formula, giving the company technical cachet with researchers and cost-conscious enterprises. However, the gap between architectural promise and commercial traction remains wide. The lab must prove that Jamba can win sustained enterprise commitments against the marketing and distribution muscle of OpenAI, Anthropic, Google, and Microsoft, all of which enjoy deeper cloud integration and vastly larger war chests. Looking ahead, AI21’s trajectory hinges on whether its efficiency advantages translate into defensible market share in enterprise API services, or whether it becomes an acqui-hire target for a cloud giant seeking differentiated model IP. A successful Series D or path to profitability would change the view materially; conversely, stagnating Jamba adoption would raise serious questions about the commercial viability of non-Transformer architectures.

Visit AI21 Labs

Key products

  • Jamba 1.5 Large
  • Wordtune
  • AI21 Studio

Subsidiaries & spin-outs

Latest announcements

13 entries
  1. Explores how caching breaks in agentic LLM workflows, particularly when running experiments with multiple parallel LLM calls, and discusses the resulting reproducibility challenges.

  2. Demonstrates how AI21's Maestro deep research agents can achieve state-of-the-art performance cost-effectively.

  3. Examines how seemingly correct "gold-like" answers in coding benchmarks can conceal underlying functional failures in agent behavior.

  4. Argues that code-generation tools like Claude Code alone are insufficient for building robust production AI systems, outlining broader engineering requirements.

  5. Details the investigation into a 32-bit integer overflow bug in a CUDA kernel that corrupted operations and remained undetected for an extended period.

  6. Analyzes the key differences between prototype AI agents and reliable, scalable production systems deployed in enterprise environments.

  7. Identifies common bottlenecks and practical challenges that cause enterprise AI deployments to stall or fail.

  8. Introduces a modular approach to agent orchestration inspired by human cognition for building more capable AI systems.

  9. Presents a model-agnostic technique to minimize padding during LLM training, significantly reducing computational waste and improving efficiency.

  10. Shares techniques and best practices for scaling vLLM inference without encountering out-of-memory errors.

  11. Chronicles the debugging process of a vLLM issue where a single token caused widespread output corruption.

  12. Proposes a multi-scale retrieval method where RAG chunk sizes dynamically adapt based on query characteristics to improve accuracy.

  13. Introduces dynamic data snoozing, a technique to improve computational efficiency during online reinforcement learning training.

Related companies

All companies →