RSS GitHub
The Ledger A sourced historical record of AI

Google DeepMind Launches Gemini

A ledger entry in the models archive, dated 2023-12-06.

Summary

Google DeepMind launched Gemini, its most capable AI model family, claiming it outperformed GPT-4 on most benchmarks and was the first model built from the ground up to be natively multimodal. The launch was marred by controversy over a misleading demo video but nonetheless represented Google's most significant challenge to OpenAI's frontier model leadership.

What Happened

On December 6, 2023, Google DeepMind CEO Demis Hassabis announced Gemini, a new family of multimodal AI models in three sizes: Ultra (largest), Pro (mid-tier), and Nano (smallest, designed for on-device use). Google described Gemini as "natively multimodal" — trained from the ground up on text, images, audio, and video, rather than combining separate models for each modality.

Gemini Ultra was positioned as Google's GPT-4 competitor, and the accompanying technical report claimed it was the first model to exceed human-expert performance on the MMLU benchmark (scoring 90.0% vs. GPT-4's 86.4%). The model showed strong performance across text, code, image understanding, and reasoning benchmarks.

However, the launch was quickly overshadowed by controversy. A demo video titled "Hands-on with Gemini" appeared to show the model responding in real-time to voice and visual input, but was later revealed to have been produced using still images and text prompts rather than live interaction. Google acknowledged that the video was "illustrative" but faced significant criticism for misleading marketing.

Gemini Pro was immediately made available in Bard (Google's ChatGPT competitor) and through API. Gemini Ultra was delayed until February 2024, when it launched as part of the rebranded "Gemini Advanced" product.

Why It Matters

Gemini was the clearest signal that the frontier model race was not a one-company affair. Google, with its vast compute resources, research talent (including the combined DeepMind and Google Brain teams), and distribution through its consumer products, represented the most formidable long-term competitor to OpenAI.

The demo controversy, however, revealed a recurring pattern in AI product launches: the temptation to oversell capabilities outpacing the reality of model performance. This dynamic would repeat throughout 2024 and became a significant trust issue for the industry.

Gemini's multi-size strategy — Ultra for frontier competition, Pro for everyday use, Nano for devices — established a template that other companies would follow. The recognition that different use cases required different model sizes, rather than a single monolithic model, represented a maturing of commercial AI strategy.

§ How to read the metadata
Landmark
Fundamentally alters the trajectory; 2–5 per year.
Major
Meaningfully shifts the landscape; 2–4 per month.
Notable
Worth documenting; significance can be upgraded later.
Confidence
High = primary sources corroborate. Medium = credible secondary only. Low = provisional. Disputed = credible sources disagree.
Contestation
Uncontested = no formal challenge. Contested = at least one challenge open. Superseded = replaced by a later entry. Unresolved = dispute still open.

References

  1. Introducing Gemini: our largest and most capable AI model primary
  2. Gemini: A Family of Highly Capable Multimodal Models primary
  3. Google Announces Gemini, Its Most Powerful AI Model secondary

See also