Google AI – Our AI Journey

Learn how Google has worked over the past 20 years to make AI helpful for everyone.

Our AI Journey

ML for Spell Check

Google begins using machine learning to help with spell check at scale in Search.

Google Translate

Google launches Google Translate using machine learning to automatically translate languages, starting with Arabic-English and English-Arabic.

Deep learning applied to speech recognition

A new era of AI begins when Google researchers improve speech recognition with Deep Neural Networks, which is a new machine learning architecture loosely modeled after the neural structures in the human brain.

Google trains AI at scale

In the famous “cat paper,” Google Research begins using large sets of “unlabeled data," like videos and photos from the internet, to significantly improve AI image classification. Roughly analogous to human learning, the neural network recognizes images (including cats!) from exposure instead of direct instruction.

Word2Vec

Introduced in the research paper “Distributed Representations of Words and Phrases and their Compositionality,” Word2Vec catalyzed fundamental progress in natural language processing -- going on to be cited more than 40,000 times in the decade following, and winning the NeurIPS 2023 “Test of Time” Award.

AtariDQN

AtariDQN is the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. It played Atari games from just the raw pixel input at a level that superpassed a human expert.

Sequence to Sequence Learning with Neural Networks

Google presents Sequence To Sequence Learning With Neural Networks, a powerful machine learning technique that can learn to translate languages and summarize text by reading words one at a time and remembering what it has read before.

DeepMind Acquisition

Google acquires DeepMind, one of the leading AI research labs in the world.

RankBrain in Search and Ads

Google deploys RankBrain in Search and Ads providing a better understanding of how words relate to concepts.

Distillation

Distillation allows complex models to run in production by reducing their size and latency, while keeping most of the performance of larger, more computationally expensive models. It has been used to improve Google Search and Smart Summary for Gmail, Chat, Docs, and more.

Google Photos

At its annual I/O developers conference, Google introduces Google Photos, a new app that uses AI with search capability to search for and access your memories by the people, places, and things that matter.

TensorFlow

Google introduces TensorFlow, a new, scalable open source machine learning framework used in speech recognition.

Federated Learning

Google Research proposes a new, decentralized approach to training AI called Federated Learning that promises improved security and scalability.

AlphaGo

AlphaGo, a computer program developed by DeepMind, plays the legendary Lee Sedol, winner of 18 world titles, famed for his creativity and widely considered to be one of the greatest players of the past decade. During the games, AlphaGo played several inventive winning moves. In game two, it played Move 37 — a creative move helped AlphaGo win the game and upended centuries of traditional wisdom.

Tensor Processing Units

Google publicly announces the Tensor Processing Unit (TPU), custom data center silicon built specifically for machine learning. After that announcement, the TPU continues to gain momentum:

• TPU v2 is announced in 2017
• TPU v3 is announced at I/O 2018
• TPU v4 is announced at I/O 2021
• At I/O 2022, Sundar announces the world’s largest, publicly-available machine learning hub, powered by TPU v4 pods and based at our data center in Mayes County, Oklahoma, which runs on 90% carbon-free energy.

WaveNet

Developed by researchers at DeepMind, WaveNet is a new deep neural network for generating raw audio waveforms allowing it to model natural sounding speech. WaveNet was used to model many of the voices of the Google Assistant and other Google services.

Neural Machine Translation

Google announces the Google Neural Machine Translation system (GNMT), which uses state-of-the-art training techniques to achieve the largest improvements to date for machine translation quality.

Diabetic Retinopathy

In a paper published in the Journal of the American Medical Association, Google demonstrates that a machine-learning driven system for diagnosing diabetic retinopathy from a retinal image could perform on-par with board-certified ophthalmologists.

Transformer

Google releases "Attention Is All You Need,” a research paper that introduces the Transformer, a novel neural network architecture particularly well suited for language understanding, among many other things.

Genomics

Introduced DeepVariant, an open-source genomic variant caller that significantly improves the accuracy of identifying variant locations. This innovation in Genomics has contributed to the fastest ever human genome sequencing, and helped create the world's first human pangenome reference.

JAX Framework

Google Research releases JAX – a Python library designed for high-performance numerical computing, especially machine learning research.

Smart Compose

Google announces Smart Compose, a new feature in Gmail that uses AI to help users more quickly reply to their email. Smart Compose builds on Smart Reply, another AI feature.

AI Principles

Google publishes its AI Principles – a set of guidelines that the company follows when developing and using artificial intelligence. The principles are designed to ensure that AI is used in a way that is beneficial to society and respects human rights.

BERT

Google introduces a new technique for natural language processing pre-training called Bidirectional Encoder Representations from Transformers (BERT), helping Search better understand users’ queries.

AlphaZero

AlphaZero, a general reinforcement learning algorithm, masters chess, shogi, and Go through self-play.

Quantum Milestone 1

Google’s Quantum AI demonstrates for the first time a computational task that can be executed exponentially faster on a quantum processor than on the world’s fastest classical computer -- just 200 seconds on a quantum processor compared to the 10,000 years it would take on a classical device.

Reinforcement Learning for Chip Design

Google Research proposes using machine learning itself to assist in creating computer chip hardware to accelerate the design process.

AlphaFold 2

DeepMind’s AlphaFold is recognized as a solution to the 50-year “protein-folding problem.” AlphaFold can accurately predict 3D models of protein structures and is accelerating research in biology. This work went on to receive a Nobel Prize in Chemistry in 2024.

MUM

At I/O 2021, Google announces MUM, multimodal models that are 1,000 times more powerful than BERT and allow people to naturally ask questions across different types of information.

LaMDA

At I/O 2021, Google announces LaMDA, a new conversational technology short for “Language Model for Dialogue Applications.”

Tensor

Google announces Tensor, a custom-built System on a Chip (SoC) designed to bring advanced AI experiences to Pixel users.

PaLM

At I/O 2022, Sundar announces PaLM – or Pathways Language Model – Google’s largest language model to date, trained on 540 billion parameters.

LaMDA 2

Sundar announces LaMDA 2, Google’s most advanced conversational AI model.

Imagen, Parti

Google announces Imagen and Parti, two models that use different techniques to generate photorealistic images from a text description.

AlphaFold 200M

The AlphaFold Database--which included over 200 million proteins structures and nearly all cataloged proteins known to science--is released.

Phenaki

Google announces Phenaki, a model that can generate realistic videos from text prompts.

Med-PaLM

Google developed Med-PaLM, a medically fine-tuned LLM, which was the first model to attain a passing score on a medical licensing exam-style question benchmark, demonstrating its ability to accurately answer medical questions.

MusicLM

Google introduces MusicLM, an AI model that can generate music from text.

Quantum Milestone 2

Google’s Quantum AI achieves the world’s first demonstration of reducing errors in a quantum processor by increasing the number of qubits.

Bard

Google releases Bard, an early experiment that lets people collaborate with generative AI, first in the US and UK — followed by other countries.

Google DeepMind

DeepMind and Google's Brain team merge to form Google DeepMind.

PaLM 2

Google launches PaLM 2, our next generation large language model, that builds on Google’s legacy of breakthrough research in machine learning and responsible AI.

GraphCast

GraphCast, an AI model for faster and more accurate global weather forecasting, is introduced.

GNoME

GNoME - a deep learning tool - is used to discover 2.2 million new crystals, including 380,000 stable materials that could power future technologies.

Gemini

Google introduces Gemini, our most capable and general model, built from the ground up to be multimodal. Gemini is able to generalize and seamlessly understand, operate across, and combine different types of information including text, code, audio, image and video.

Gemini ecosystem

Google expands the Gemini ecosystem to introduce a new generation: Gemini 1.5, and brings Gemini to more products like Gmail and Docs. Gemini Advanced launched, giving people access to Google’s most capable AI models.

Gemma

Gemma is a family of lightweight state-of-the art open models built from the same research and technology used to create the Gemini models.

AlphaFold 3

Introduced AlphaFold 3, a new AI model developed by Google DeepMind and Isomorphic Labs that predicts the structure of proteins, DNA, RNA, ligands and more. Scientists can access the majority of its capabilities, for free, through AlphaFold Server.

Connectomics

Google Research and Harvard published the first synaptic-resolution reconstruction of the human brain. This achievement, made possible by the fusion of scientific imaging and Google's AI algorithms, paves the way for discoveries about brain function.

NeuralGCM

NeuralGCM, a new machine learning-based approach to simulating Earth's atmosphere, is introduced. Developed in partnership with the European Centre for Medium-Range Weather Forecasts (ECMWF), NeuralGCM combines traditional physics-based modeling with ML for improved simulation accuracy and efficiency.

IMO Silver Medal Equivalent

Our combined AlphaProof and AlphaGeometry 2 systems solved four out of six problems from the 2024 International Mathematical Olympiad (IMO), achieving the same level as a silver medalist in the competition for the first time. The IMO is the oldest, largest and most prestigious competition for young mathematicians, and has also become widely recognized as a grand challenge in machine learning.

Willow

Willow, our state-of-the-art quantum chip, demonstrates error correction and performance that paves the way to a useful, large-scale quantum computer.

Gemini 2.0

Google introduces Gemini 2.0, our most capable AI model yet. Gemini 2.0 has new capabilities, like multimodal output with native image generation and audio output, and native use of tools including Google Search and Maps.

Gemini 2.5

Introduced Gemini 2.5, our most intelligent AI model. It's state-of-the-art on a wide range of benchmarks and debuts at #1 on LMArena by a significant margin. Gemini 2.5 models are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy.

Veo 3

Google announces its latest state of the art video generation model at I/O 2025. Built by Google DeepMind, Veo 3 creates high quality videos, now with native audio generation--meaning it can produce videos complete with dialogue, sound effects, and background sounds.

Nano Banana

Google launches Nano Banana, the new state-of-the-art image generation and editing capability from Gemini 2.5 Flash. Debuting as the #1 image generation and editing model in the world, Nano Banana empowers users to effortlessly create, edit and refine high-quality, imaginative visuals.