AI Models & APIs Jun 3 blog.google

Google launches Gemma 4 12B unified multimodal AI model

Original: Gemma 4 12B: A unified, encoder-free multimodal model

Why This Matters

Simplified multimodal AI architecture could reduce complexity for developers

Google announced Gemma 4 12B, a new unified encoder-free multimodal AI model. The 12 billion parameter model is designed to handle multiple types of data including text and images in a single architecture without separate encoders.

Google has introduced Gemma 4 12B, a new multimodal AI model with 12 billion parameters that can process both text and images within a unified architecture. Unlike traditional multimodal models that require separate encoders for different data types, Gemma 4 12B uses an encoder-free design to handle multiple modalities. The model is part of Google's Gemma family of open-source AI models and is aimed at developers building multimodal applications. The announcement was made on Google's official blog, highlighting the model's ability to understand and generate content across different data formats in a single framework.

Source

blog.google — Read original →

Google launches Gemma 4 12B unified multimodal AI model

Why This Matters

Source

Related articles

Sign in to listen