Alexander Gruenstein

Alex Gruenstein works on mobile speech interfaces at Google. He holds a Ph.D. in Computer Science from MIT, as well as B.S and M.S. degrees from Stanford University in Symbolic Systems.

Research Areas

Speech Processing

Authored Publications

Google Publications

Other Publications

An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling

Tara N Sainath

Yanzhang (Ryan) He

Arun Narayanan

Rami Botros

Ruoming Pang

David Johannes Rybach

Cyril Allauzen

Ehsan Variani

James Qin

Quoc-Nam Le-The

Alex Gruenstein

Anmol Gulati

Bo Li

Cal Peyser

Chung-Cheng Chiu

Diamantino A. Caseiro

Emmanuel Guzman

Ian Carmichael McGraw

Jiahui Yu

Michael D. Riley

Pat Rondon

Qiao Liang

Sepand Mavandadi

Shuo-yiin Chang

Trevor Deatrick Strohman

W. Ronny Huang

Wei Li

Yonghui Wu

Yu Zhang

Interspeech (2021) (to appear)

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Tara Sainath

Yanzhang (Ryan) He

Bo Li

Arun Narayanan

Ruoming Pang

Antoine Bruguier

Shuo-yiin Chang

Wei Li

Raziel Alvarez

Zhifeng Chen

Chung-Cheng Chiu

David Garcia

Alex Gruenstein

Kevin Hu

Minho Jin

Anjuli Kannan

Qiao Liang

Ian McGraw

Cal Peyser

Rohit Prabhavalkar

Golan Pundak

David Rybach

(June) Yuan Shangguan

Yash Sheth

Trevor Strohman

Mirkó Visontai

Yonghui Wu

Yu Zhang

Ding Zhao

ICASSP (2020)

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

Quan Wang

Ignacio Lopez Moreno

Mert Saglam

Kevin William Wilson

Alan Chiao

Renjie Liu

Yanzhang (Ryan) He

Wei Li

Jason Pelecanos

Marily Nika

Alex Gruenstein

Interspeech 2020 (2020) (to appear)

Hotword Cleaner: Dual-Microphone Adaptive Noise Cancellation With Deferred Filter Coefficients for Robust Keyword Spotting

Yiteng Huang

Turaj Zakizadeh Shabestary

Alex Gruenstein

Proc. ICASSP, IEEE (2019), pp. 6346-6350

Multi-Microphone Adaptive Noise Cancellation for Robust Hotword Detection

Yiteng Huang

Turaj Zakizadeh Shabestary

Alex Gruenstein

Li Wan

Proc. InterSpeech 2019, pp. 1233-1237

STREAMING END-TO-END SPEECH RECOGNITION FOR MOBILE DEVICES

Yanzhang He

Tara Sainath

Rohit Prabhavalkar

Ian McGraw

Raziel Alvarez

Ding Zhao

David Rybach

Anjuli Kannan

Yonghui Wu

Ruoming Pang

Qiao Liang

Deepti Bhatia

Yuan Shangguan

Bo Li

Golan Pundak

Khe Chai Sim

Tom Bagby

Shuo-yiin Chang

Kanishka Rao

Alex Gruenstein

ICASSP (2019)

A Cascade Architecture for Keyword Spotting on Mobile Devices

Alexander Gruenstein

Raziel Alvarez

Chris Thornton

Mohammadali Ghodrat

31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA (2017)

Personalized Speech Recognition On Mobile Devices

Ian McGraw

Rohit Prabhavalkar

Raziel Alvarez

Montse Gonzalez Arenas

Kanishka Rao

David Rybach

Ouais Alsharif

Hasim Sak

Alexander Gruenstein

Françoise Beaufays

Carolina Parada

Proceedings of International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE (2016)

Accurate and Compact Large Vocabulary Speech Recognition on Mobile Devices

Xin Lei

Andrew Senior

Alexander Gruenstein

Jeffrey Sorensen

Interspeech (2013)

Preview

Estimating Word-Stability During Incremental Speech Recognition

Ian McGraw

Alexander Gruenstein

Interspeech (2012)

Preview

Unsupervised Testing Strategies for ASR

Brian Strope

Doug Beeferman

Alexander Gruenstein

Xin Lei

Interspeech 2011, pp. 1685-1688

Preview

On-Demand Language Model Interpolation for Mobile Speech Input

Brandon Ballinger

Cyril Allauzen

Alexander Gruenstein

Johan Schalkwyk

Interspeech (2010), pp. 1812-1815

A Self-Labeling Speech Corpus: Collecting Spoken Words with an Online Educational Game

Ian McGraw

Alexander Gruenstein

Andrew Sutherland

Interspeech (2009)

City Browser: Developing a Conversational Automotive HMI

Alexander Gruenstein

Jarrod Orszulak

Sean Liu

Shannon Roberts

Jeff Zabel

Bryan Reimer

Bruce Mehler

Stephanie Seneff

James Glass

and Joseph Coughlin.

Proc. of CHI (2009)

A Self-Transcribing Speech Corpus: Collecting Continuous Speech with an Online Educational Game

Alexander Gruenstein

Ian McGraw

Andrew Sutherland

SLaTE (2009)

The WAMI Toolkit for Developing, Deploying, and Evaluating Web-Accessible Multimodal Interfaces

Alexander Gruenstein

Ian McGraw

Ibrahim Badr

Proc. of 10th International Conference on Multimodal Interfaces (2008)

Meeting Structure Annotation

Alexander Gruenstein

John Niekrasz

Matthew Purver

Recent Trends in Discourse and Dialogue, Springer (2008)

A Multimodal Home Entertainment Interface via a Mobile Device

Alexander Gruenstein

Bo-June (Paul) Hsu

James Glass

Stephanie Seneff

Lee Hetherington

Scott Cyphers

Ibrahim Badr

Chao Wang

Sean Liu

Proc. of the ACL Workshop on Mobile Language Processing (2008)

Response-Based Confidence Annotation for Spoken Dialogue Systems

Alexander Gruenstein

Proc. of the 9th SIGdial Workshop on Discourse and Dialogue (2008)

Releasing a Multimodal Dialogue System into the Wild: User Support Mechanisms

Alexander Gruenstein

Stephanie Seneff

Proc. of the 8th SIGdial Workshop on Discourse and Dialogue (2007)

Context Sensitive Language Modeling for Large Sets of Proper Nouns in Multimodal Dialogue Systems

Alexander Gruenstein

Stephanie Seneff

Proc. of IEEE/ACL Workshop on Spoken Language Technology (2006)

Scalable and Portable Web-based Multimodal Dialogue Interaction with Geographical Databases

Alexander Gruenstein

Stephanie Seneff

Chao Wang

Interspeech (2006)

NOMOS: A Semantic Web Software Framework for Annotation of Multimodal Corpora

John Niekrasz

Alexander Gruenstein

Proc. of the 5th Conference on Language Resources and Evaluation (LREC 2006)

Context-Sensitive Statistical Language Modeling

Alexander Gruenstein

Chao Wang

Stephanie Seneff

Interspeech (2005), pp. 17-20

A General Purpose Architecture for Intelligent Tutoring Systems

Brady Clark

Oliver Lemon

Alexander Gruenstein

Elizabeth Owen Bratt

John Fry

Stanley Peters

Heather Pon-Barry

Karl Schultz

Zack Thomsen-Gray

Pucktada Treeratpituk

Natural, Intelligent and Effective Interaction in Multimodal Dialogue Systems, Kluwer (2005)

Meeting Structure Annotation: Data and Tools

Alexander Gruenstein

John Niekraz

Matthew Purver

Proc. of the 6th SIGdial Workshop on Disource and Dialogue (2005)

Multithreaded context for robust conversational interfaces: context-sensitive speech recognition and interpretation of corrective fragments

Oliver Lemon

Alexander Gruenstein

ACM Transactions on Computer-Human Interaction, vol. 11(3) (2004), pp. 241-267

Demo: A Multimodal Learning Interface for Sketch, Speak and Point creation of a Schedule Chart

Ed Kaiser

David Demirdjian

Alexander Gruenstein

Xiaoguang Li

John Niekrasz

Matt Wesson

Sanjeev Kumar

Proceedings of the Sixth International Conference on Multimodal Interfaces (ICMI 2004)

Using an Activity Model to Address Issues in Task-Oriented Dialogue Interaction Over Extended Periods

Alexander Gruenstein

Lawrence Cavedon

Proceedings of AAAI Spring Symposium on Interaction Between Humans and Autonomous Systems over Extended Periods (2004)

Emotional Information Available from Videotapes vs Transcripts

Anna Liess

Wendy Ellis

Janine Giese-Davis

Alexander Gruenstein

Mitch Golant

David Spiegel

Proceedings of the 25th Annual Meeting of the Society of Behavioral Medicine (2004)

Multi-Human Dialogue Understanding for Assisting Artifact-Producing Meetings

John Niekrasz

Alexander Gruenstein

Lawrence Cavedon

Proceedings of the 20th International Conference on Computational Linguistics (COLING) (2004)

Managing uncertainty in dialogue information state for real time understanding of multi-human meeting dialogues

Alexander Gruenstein

Lawrence Cavedon

John Niekrasz

Dominic Widdows

Stanley Peters

Proceedings of the 8th Workshop on Formal Semantics and Pragmatics of Dialogue (Catalog) (2004)

Generation of collaborative spoken dialogue contributions in dynamic task environment

Oliver Lemon

Alexander Gruenstein

Randolph Gullett

Alexis Battle

Laura Hiatt

Stanley Peters

Working Papers of the 2003 AAAI Spring Symposium on Natural Language Generation in Spoken and Written Dialogue, {AAAI} Press, pp. 85-90

An information state approach in a multi-modal dialogue system for human-robot conversation

Oliver Lemon

Anne Bracy

Alexander Gruenstein

Stanley Peters

Perspectives on Dialogue in the new Millenium, John Benjamins (2003), pp. 229-242

Targeted Help for Spoken Dialogue Systems: Intelligent Feedback Improves Naive User's Performance

Beth Ann Hockey

Oliver Lemon

Ellen Campana

Laura Hiatt

Gregory Aist

James Hieronymus

Alexander Gruenstein

John Dowding

Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL) (2003)

Collaborative Dialogue for Controlling Autonomous Systems

Oliver Lemon

Alexander Gruenstein

Lawrence Cavedon

Stanley Peters

Proccedings of the AAAI Fall Symposium (2002)

Collaborative Activities and Multi-tasking in Dialogue Systems

Oliver Lemon

Alexander Gruenstein

Stanley Peters

Traitment automatique des langues, vol. 43(2) (2002), pp. 131-154

Multi-tasking and Collaborative Activities in Dialogue Systems

Oliver Lemon

Alexander Gruenstein

Alexis Battle

Stanley Peters

Proceedings of the 3rd SIGdial Workshop on Discourse and Dialogue (2002), pp. 113-124

Information States in a Multi-modal Dialogue System for Human-Robot Conversation

Oliver Lemon

Anne Bracy

Alexander Gruenstein

Stanley Peters

Proceedings of the 5th Workshop on Formal Semantics and Pragmatics of Dialogue (Bi-Dialog 2001), pp. 57 - 67

The WITAS Multi-Modal Dialogue System I

Oliver Lemon

Anne Bracy

Alexander Gruenstein

Stanley Peters

7th European Conference on Speech Communication and Technology (EuroSpeech) (2001)

A Multi-Modal Dialogue System for Human-Robot Conversation

Oliver Lemon

Anne Bracy

Alexander Gruenstein

Stanley Peters

Proceedings of the Scond Meeting of the North American Chapter of the Association for Computational Linguistics NAACL (2001)

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Alexander Gruenstein

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Alexander Gruenstein

Research Areas

Filter by:

Year

Research Area

Team

Join us

AI/ML Foundations  & Capabilities