Search

28 results

Clear filters
  • MAY 9, 2025 / DeepMind

    Advancing the frontier of video understanding with Gemini 2.5

    Gemini 2.5 marks a major leap in video understanding, achieving state-of-the-art performance on key video understanding benchmarks and being able to seamlessly use audio-visual information with code and other data formats.

    2.5Pro_Metadata_VideoUnderstanding
  • MAY 6, 2025 / Gemini

    Gemini 2.5 Pro Preview: even better coding performance

    An updated I/O edition preview of Gemini 2.5 Pro is being released for developers, featuring best-in-class front-end and UI development performance, ranking #1 on the WebDev Arena leaderboard, and showcasing applications like video to code and easier feature development through starter apps.

    Gemini 2.5 Pro (I/O Edition): even better coding performance
  • APRIL 18, 2025 / Gemma

    Gemma 3 QAT Models: Bringing state-of-the-Art AI to consumer GPUs

    The release of int4 quantized versions of Gemma 3 models, optimized with Quantization Aware Training (QAT) brings significantly reduced memory requirements, allowing users to run powerful models like Gemma 3 27B on consumer-grade GPUs such as the NVIDIA RTX 3090.

    Gemma 3 Quantization Aware - meta
  • APRIL 17, 2025 / Gemini

    Start building with Gemini 2.5 Flash

    Gemini 2.5 Flash is in preview, offering improved reasoning capabilities through a "thinking" process that developers can control for cost and latency tradeoffs. This updated version aims to provide a cost-effective solution for complex tasks, balancing performance and price.

    Gemini 2.5 Flash ai.dev
  • APRIL 15, 2025 / Gemini

    Bring your ideas to life: Veo 2 video generation available for developers

    Generate high-quality videos from text and image prompts with Veo 2, a video generation model, now generally available in the Gemini API and Google AI Studio to enhance your content creation and marketing efforts.

    Veo 2 now generally available in the Gemini API and Google AI Studio
  • APRIL 9, 2025 / Cloud

    Announcing the Agent2Agent Protocol (A2A)

    Agent2Agent (A2A) protocol is an open standard designed to enable AI agents from different vendors and frameworks to collaborate and exchange information across enterprise platforms aiming to foster a future of seamless AI agent interoperability and enhanced automation.

    Agent2Agent Interoperability
  • FEB. 25, 2025 / Gemini

    Start building with Gemini 2.0 Flash and Flash-Lite

    Gemini 2.0 Flash-Lite is now generally available in the Gemini API for production use in Google AI Studio and for enterprise customers on Vertex AI. 2.0 Flash-Lite offers improved performance over 1.5 Flash across reasoning, multimodal, math and factuality benchmarks. For projects that require long context windows, 2.0 Flash-Lite is an even more cost-effective solution, with simplified pricing for prompts more than 128K tokens.

    Flash Family
  • FEB. 19, 2025 / Gemma

    Introducing PaliGemma 2 mix: A vision-language model for multiple tasks

    PaliGemma 2 mix, an upgraded vision-language model, is now available, offering capabilities like image captioning, OCR, and object detection in various sizes.

    Paligemma 2 Mix
  • DEC. 20, 2024 / Gemma

    Beyond English: How Gemma open models are bridging the language gap

    AI Singapore and INSAIT teams have leveraged Gemma, a family of open-source language models, to create LLMs tailored to the unique needs of their communities, in a show of innovation and inclusivity in AI.

    Gemma-SEALION
  • DEC. 11, 2024 / Gemini

    The next chapter of the Gemini era for developers

    Gemini 2.0 Flash has enhanced capabilities like multimodal outputs and native tool use, and introduces new coding agents to improve developer productivity, now available for testing in Google AI Studio.

    Gemini 2.0
OSZAR »