Google announces Gemma 4 open AI models, switches to Apache 2.0 license

Technology | April 2, 2026
Gemma 4 brings the first major update to Google's open models in a year.
Google's Gemini AI models have improved by leaps and bounds over the past year, but you can only use Gemini on Google's terms. The company's Gemma open-weight models offer more freedom, but Gemma 3, which launched over a year ago, is getting a bit long in the tooth. Starting today, developers can work with Gemma 4, which comes in four sizes optimized for local usage. Google has also acknowledged developer frustrations with AI licensing, so it's dumping the custom Gemma license in favor of Apache 2.0.

As with past versions of its open-weight models, Google has designed Gemma 4 to be usable on local machines. That can mean plenty of things, of course. The two large Gemma variants, 26B Mixture of Experts and 31B Dense, are designed to run unquantized in bfloat16 format on a single 80GB Nvidia H100 GPU. Granted, that's a $20,000 AI accelerator, but it's still local hardware. If quantized to run at lower precision, these big models will fit on consumer GPUs.
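To see why the larger variants fit on an 80GB card, a rough back-of-the-envelope estimate helps. The sketch below is illustrative only: it treats "26B" and "31B" as exact parameter counts (an assumption, since the names are nominal), counts weights only, and ignores the KV cache, activations, and runtime overhead. The function name and the int8/int4 entries are hypothetical choices for illustration, not anything Google has published.

```python
# Weights-only memory estimate. Assumes the model names give exact
# parameter counts and ignores KV cache, activations, and overhead.
BYTES_PER_PARAM = {"bfloat16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, dtype: str) -> float:
    """Return approximate weight storage in decimal gigabytes."""
    total_bytes = params_billions * 1e9 * BYTES_PER_PARAM[dtype]
    return total_bytes / 1e9


if __name__ == "__main__":
    models = [("26B Mixture of Experts", 26.0), ("31B Dense", 31.0)]
    for name, size in models:
        for dtype in BYTES_PER_PARAM:
            gb = weight_memory_gb(size, dtype)
            print(f"{name} @ {dtype}: ~{gb:.1f} GB of weights")
```

Under these assumptions, the 31B model needs about 62 GB for bfloat16 weights, which leaves some headroom on an 80GB H100, while a 4-bit version would need roughly 15.5 GB, within reach of high-end consumer GPUs.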

Google also claims it has focused on reducing latency to really take advantage of Gemma's local processing. The 26B Mixture of Experts model activates only 3.8 billion of its 26 billion parameters during inference, giving it much higher tokens-per-second throughput than similarly sized models. Meanwhile, 31B Dense is more about quality than speed, but Google expects developers to fine-tune it for specific uses.
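The speed advantage of the Mixture of Experts design comes down to simple arithmetic. The sketch below uses the figures quoted above plus one assumption that does not come from Google: the common rule of thumb that a forward pass costs roughly 2 FLOPs per active parameter per token. The function names are hypothetical, and real throughput also depends on memory bandwidth, routing overhead, and the inference runtime.

```python
# Figures from the article: 3.8B of 26B parameters are active per token.
TOTAL_PARAMS_B = 26.0
ACTIVE_PARAMS_B = 3.8


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters used for each token."""
    return active_b / total_b


def approx_flops_per_token(active_b: float) -> float:
    """Rule-of-thumb estimate (an assumption): ~2 FLOPs per active parameter."""
    return 2.0 * active_b * 1e9


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"Active share per token: {frac:.1%}")
    moe = approx_flops_per_token(ACTIVE_PARAMS_B)
    dense = approx_flops_per_token(TOTAL_PARAMS_B)
    print(f"Approx. compute vs. a dense 26B model: {moe / dense:.1%}")
```

By this estimate, each token touches about 15 percent of the weights. All 26 billion parameters still have to be held in memory, so the savings are in compute per token, not in footprint.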

Source: Ars Technica
