6 results
APRIL 18, 2025 / Gemma
The release of int4 quantized versions of Gemma 3 models, optimized with Quantization Aware Training (QAT) brings significantly reduced memory requirements, allowing users to run powerful models like Gemma 3 27B on consumer-grade GPUs such as the NVIDIA RTX 3090.
MARCH 12, 2025 / Gemma
Gemma 3 1B, a new small language model for mobile and web applications via Google AI Edge, is now available, with increased efficiency, improved performance, and offline availability.
SEPT. 4, 2024 / AI Edge
TensorFlow Lite, now named LiteRT, is still the same high-performance runtime for on-device AI, but with an expanded vision to support models authored in PyTorch, JAX, and Keras.
JUNE 26, 2024 / AI Edge
Model Explorer, a new graph visualization tool from Google AI Edge, enables developers to overcome the complexities of optimizing models for edge devices.
MAY 29, 2024 / AI Edge
AI Edge Torch Generative API enables developers to bring powerful new capabilities on-device, such as summarization, content generation, and more.
MAY 14, 2024 / AI Edge
Released today, AI Edge Torch enables support for PyTorch, JAX, Keras, and TensorFlow with TFLite.