What are the key features of IBM's new Granite Embedding Multilingual R2 models?

The Granite Embedding Multilingual R2 release includes two Apache 2.0 licensed models: a 311-million parameter model and a compact 97-million parameter model. Both support over 200 languages, with enhanced quality for 52, and include code retrieval for nine programming languages. They feature a 32,000-token context window, a significant increase, making them suitable for long-document understanding and enterprise applications.

How do the new Granite R2 models improve multilingual AI performance?

The Granite R2 models significantly narrow the gap between efficiency and comprehensive language support. The 311M model ranks second among open models under 500M parameters on MTEB, while the 97M model outperforms all other open sub-100M multilingual embedders. Their advanced ModernBERT architecture and 32,000-token context window enhance long-document understanding and overall retrieval quality across over 200 languages.

← Back to front page

Generative AI & ToolsThursday, May 14, 2026

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

Original reporting by Hugging Face

Multilingual embedding models have long presented a dilemma for developers and researchers alike: achieving broad language coverage and high-quality retrieval often demands large, computationally intensive models, while compact alternatives typically compromise on performance or linguistic scope. This fundamental tension forces a difficult choice between speed and efficacy in critical applications like cross-lingual search or retrieval-augmented generation.

The latest Granite Embedding Multilingual R2 release from IBM addresses this challenge head-on, introducing two innovative Apache 2.0 licensed models designed to narrow the performance gap significantly. The flagship granite-embedding-311m-multilingual-r2 is a 311-million parameter model that achieves a score of 65.2 on MTEB Multilingual Retrieval, ranking second among open models under 500 million parameters, and features Matryoshka support for flexible dimensionality. Complementing it is the granite-embedding-97m-multilingual-r2, a compact 97-million parameter model that surprisingly outperforms every other open sub-100M multilingual embedder on the same benchmark, scoring an impressive 60.3.

Both models offer expansive support for over 200 languages, with enhanced retrieval quality for 52 specific languages, and add crucial code retrieval capabilities across nine programming languages. Built on the advanced ModernBERT architecture, they boast an unprecedented 32,000-token context window—a 64x increase over their predecessors—making them exceptionally well-suited for long-document understanding and enterprise-grade applications. This release promises to redefine expectations for accessible, high-performing multilingual AI.

The release of Granite Embedding Multilingual R2 marks a significant advancement in the landscape of natural language processing, effectively narrowing the persistent gap between model efficiency and comprehensive language support. By introducing both a compact 97-million-parameter model that redefines performance for its size and a robust 311-million-parameter model with Matryoshka support, this offering provides developers and enterprises with powerful, flexible tools. These models, embracing a 32,768-token context window and trained across over 200 languages with enhanced support for 52, alongside code retrieval capabilities, promise to transform how AI applications interact with diverse global data.

The broader implications of this development are profound. These Apache 2.0-licensed models democratize access to high-quality multilingual embeddings, making sophisticated retrieval-augmented generation, cross-lingual search, and international code collaboration more accessible and cost-effective. Their enterprise-ready design, built with careful data governance, ensures reliability and responsible deployment in commercial settings, reducing operational overhead and accelerating integration into existing AI frameworks. Looking ahead, this innovation fosters a future where AI systems are inherently more inclusive and globally aware, enabling organizations to build robust solutions that transcend linguistic and technical boundaries. The Granite R2 models pave the way for a new generation of AI applications, empowering broader innovation and a more interconnected digital ecosystem.

Frequently asked questions

What are the key features of IBM's new Granite Embedding Multilingual R2 models?: The Granite Embedding Multilingual R2 release includes two Apache 2.0 licensed models: a 311-million parameter model and a compact 97-million parameter model. Both support over 200 languages, with enhanced quality for 52, and include code retrieval for nine programming languages. They feature a 32,000-token context window, a significant increase, making them suitable for long-document understanding and enterprise applications.
How do the new Granite R2 models improve multilingual AI performance?: The Granite R2 models significantly narrow the gap between efficiency and comprehensive language support. The 311M model ranks second among open models under 500M parameters on MTEB, while the 97M model outperforms all other open sub-100M multilingual embedders. Their advanced ModernBERT architecture and 32,000-token context window enhance long-document understanding and overall retrieval quality across over 200 languages.
What are the main applications and benefits of the Granite Embedding Multilingual R2 models?: These models are designed for critical applications like cross-lingual search, retrieval-augmented generation (RAG), and international code collaboration. Their Apache 2.0 license democratizes access to high-quality multilingual embeddings, making sophisticated AI solutions more accessible and cost-effective. With enterprise-ready design and a large context window, they enable robust, globally aware AI systems that transcend linguistic and technical boundaries.

Intro and outro generated by Printing Press AI from the source article above. Always consult the original reporting for verbatim quotes and primary sources.