Michael Riley
Michael Riley has a B.S., M.S., and Ph.D. from MIT, all in computer science. He began his career at Bell Labs and AT&T Labs, where, together with Mehryar Mohri and Fernando Pereira, he introduced and developed the theory and use of weighted finite-state transducers (WFSTs) in speech and language processing. He is currently a Distinguished Research Scientist at Google, Inc. His interests include speech and natural language processing, machine learning, and information retrieval. He is a principal author of the OpenFst library. He manages a group whose expertise includes speech recognition and synthesis, NLP, information retrieval, image processing, algorithms, machine learning, and privacy. He is an IEEE and ISCA Fellow.
Authored Publications
On Weight Interpolation of the Hybrid Autoregressive Transducer Model
Interspeech 2022 (to appear)
This paper explores ways to improve a two-pass speech recognition system in which the first pass is a hybrid autoregressive transducer model and the second pass is a neural language model. The main focus is on the scores provided by each of these models, their quantitative analysis, how to improve them, and the best way to integrate them with the objective of better recognition accuracy. Several analyses are presented to show the importance of the choice of integration weights for combining the first-pass and second-pass scores. A sequence-level weight estimation model along with four training criteria is proposed to allow adaptive integration of the scores per acoustic sequence. The effectiveness of this algorithm is demonstrated by constructing and analyzing models on the Librispeech data set.
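As a minimal illustration of the score-integration idea, a fixed weight can linearly interpolate per-hypothesis log scores from the two passes before picking the best hypothesis. The hypotheses, scores, and weight below are invented for the sketch; the paper's contribution is estimating such weights adaptively per acoustic sequence.

```python
def combine_scores(first_pass, second_pass, weight):
    """Linearly interpolate per-hypothesis log scores from two models.

    `first_pass` and `second_pass` map hypothesis strings to log scores;
    `weight` is the interpolation weight given to the first pass.
    """
    return {
        hyp: weight * first_pass[hyp] + (1.0 - weight) * second_pass[hyp]
        for hyp in first_pass
    }

# Hypothetical scores for two competing transcripts.
first = {"play the song": -4.2, "play this on": -4.0}
second = {"play the song": -3.1, "play this on": -5.6}

rescored = combine_scores(first, second, weight=0.4)
best = max(rescored, key=rescored.get)
```

With these numbers the second-pass preference wins out and `best` is "play the song".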
We introduce a framework for adapting a virtual keyboard to individual user behavior by modifying a Gaussian spatial model to use personalized key center offset means and, optionally, learned covariances. Through numerous real-world studies, we determine the importance of training data quantity and weights, as well as the number of clusters into which to group keys to avoid overfitting. While past research has shown the potential of this technique using artificially-simple virtual keyboards and games or fixed typing prompts, we demonstrate effectiveness using the highly-tuned Gboard app with a representative set of users and their real typing behaviors. Across a variety of top languages, we achieve small-but-significant improvements in both typing speed and decoder accuracy.
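A minimal sketch of the spatial-model idea: score each tap under a per-key isotropic Gaussian whose center is shifted by a user-specific learned offset, then decode the most likely key. The key geometry, offsets, and variance below are invented for illustration; the actual model also supports learned covariances.

```python
import math

def key_log_likelihood(tap, center, offset, var=1.0):
    """Log-likelihood of a tap under an isotropic Gaussian centered at the
    key's nominal center shifted by the user's learned offset."""
    dx = tap[0] - (center[0] + offset[0])
    dy = tap[1] - (center[1] + offset[1])
    return -0.5 * (dx * dx + dy * dy) / var - math.log(2 * math.pi * var)

# Hypothetical key centers and per-user offsets (learned from typing data).
centers = {"a": (0.0, 0.0), "s": (1.0, 0.0)}
offsets = {"a": (0.2, 0.0), "s": (-0.1, 0.0)}

def decode_tap(tap):
    """Return the key whose personalized Gaussian best explains the tap."""
    return max(centers, key=lambda k: key_log_likelihood(tap, centers[k], offsets[k]))
```

For a tap at (0.5, 0.0), the personalized center for "a" is closer than the one for "s", so the decoder returns "a".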
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling
Rami Botros
Ruoming Pang
James Qin
Quoc-Nam Le-The
Anmol Gulati
Chung-Cheng Chiu
Emmanuel Guzman
Jiahui Yu
Qiao Liang
Wei Li
Yu Zhang
Interspeech (2021) (to appear)
On-device end-to-end (E2E) models have shown improvements over a conventional model on Search test sets in both quality, as measured by Word Error Rate (WER), and latency, measured by the time the result is finalized after the user stops speaking. However, the E2E model is trained on a small fraction of audio-text pairs compared to the 100 billion text utterances that a conventional language model (LM) is trained with. Thus, E2E models perform poorly on rare words and phrases. In this paper, building upon the two-pass streaming Cascaded Encoder E2E model, we explore using a Hybrid Autoregressive Transducer (HAT) factorization to better integrate an on-device neural LM trained on text-only data. Furthermore, to improve decoder latency we introduce a non-recurrent embedding decoder, in place of the typical LSTM decoder, into the Cascaded Encoder model. Overall, we present a streaming on-device model that incorporates an external neural LM and outperforms the conventional model in both search and rare-word quality, as well as latency, and is 318X smaller.
Approximating probabilistic models as weighted finite automata
Vlad Schogol
Computational Linguistics, vol. 47 (2021), pp. 221-254
Weighted finite automata (WFA) are often used to represent probabilistic models, such as n-gram language models, since they are efficient for recognition tasks in time and space. The probabilistic source to be represented as a WFA, however, may come in many forms. Given a generic probabilistic model over sequences, we propose an algorithm to approximate it as a weighted finite automaton such that the Kullback-Leibler divergence between the source model and the WFA target model is minimized. The proposed algorithm involves a counting step and a difference of convex optimization step, both of which can be performed efficiently. We demonstrate the usefulness of our approach on various tasks, including distilling n-gram models from neural models, building compact language models, and building open-vocabulary character models. The algorithms used for these experiments are available in an open-source software library.
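A crude sketch of the counting step: draw sequences from the source model and normalize bigram counts into a deterministic automaton whose states are the previous symbol. This empirical-count stand-in is only illustrative; the paper's algorithm minimizes KL divergence via expected counts and a difference-of-convex optimization, and the sample sequences below are invented.

```python
from collections import defaultdict

def bigram_wfa_from_samples(samples):
    """Estimate a bigram automaton from sequences sampled from an
    arbitrary source model. States are the previous symbol; arc weights
    are normalized counts over the outgoing arcs of each state."""
    counts = defaultdict(lambda: defaultdict(float))
    for seq in samples:
        prev = "<s>"
        for sym in list(seq) + ["</s>"]:
            counts[prev][sym] += 1.0
            prev = sym
    # Normalize so each state's outgoing arc weights sum to one.
    return {
        state: {sym: c / sum(nexts.values()) for sym, c in nexts.items()}
        for state, nexts in counts.items()
    }

wfa = bigram_wfa_from_samples(["ab", "ab", "ac"])
```

Here state "a" assigns probability 2/3 to "b" and 1/3 to "c", matching the sample frequencies.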
Hybrid Autoregressive Transducer (HAT)
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, Barcelona, Spain, pp. 6139-6143
This paper proposes and evaluates the hybrid autoregressive transducer (HAT) model, a time-synchronous encoder-decoder model that preserves the modularity of conventional automatic speech recognition systems. The HAT model provides a way to measure the quality of the internal language model that can be used to decide whether inference with an external language model is beneficial or not. We evaluate our proposed model on a large-scale voice search task. Our experiments show significant improvements in WER compared to the state-of-the-art approaches.
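A sketch of how a HAT-style score can be combined with an external LM at inference: subtract a weighted estimate of the internal LM score and add a weighted external LM score. The weights and scores below are invented for illustration, not values from the paper.

```python
def hat_rescore(hat_score, internal_lm_score, external_lm_score,
                ilm_weight=0.1, elm_weight=0.5):
    """Combined log score for a hypothesis when decoding a HAT-style
    model with an external LM: discount the model's internal LM
    contribution and credit the external LM, each with its own weight."""
    return (hat_score
            - ilm_weight * internal_lm_score
            + elm_weight * external_lm_score)

score = hat_rescore(-10.0, -4.0, -3.0)
```

Because the internal LM score is subtracted, a hypothesis the internal LM already favored gains less from the external LM than one it did not.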
Distilling weighted finite automata from arbitrary probabilistic models
Vlad Schogol
Proceedings of FSMNLP (2019), pp. 87-97
Weighted finite automata (WFA) are often used to represent probabilistic models, such as n-gram language models, since they are efficient for recognition tasks in time and space. The probabilistic source to be represented as a WFA, however, may come in many forms. Given a generic probabilistic model over sequences, we propose an algorithm to approximate it as a weighted finite automaton such that the Kullback-Leibler divergence between the source model and the WFA target model is minimized. The proposed algorithm involves a counting step and a difference of convex optimization, both of which can be performed efficiently. We demonstrate the usefulness of our approach on some tasks including distilling n-gram models from neural models.
Federated Learning of N-gram Language Models
Adeline Wong
The SIGNLL Conference on Computational Natural Language Learning (2019)
We propose algorithms to train production-quality n-gram language models using federated learning. Federated learning is a machine learning technique to train global models to be used on portable devices such as smart phones, without the users' data ever leaving their devices. This is especially relevant for applications handling privacy-sensitive data, such as virtual keyboards. While the principles of federated learning are fairly generic, its methodology assumes that the underlying models are neural networks. However, virtual keyboards are typically powered by n-gram language models, mostly for latency reasons.
We propose to train a recurrent neural network language model using the decentralized "FederatedAveraging" algorithm directly on user devices, and to approximate this federated model server-side with an n-gram model that can be deployed to devices for fast inference.
Our technical contributions include novel ways of handling large vocabularies, algorithms to correct capitalization errors in user data, and efficient finite state transducer algorithms to convert word language models to word-piece language models and vice versa.
The n-gram language models trained with federated learning are compared to n-grams trained with traditional server-based algorithms using A/B tests on tens of millions of users of a virtual keyboard.
Results are presented for two languages, American English and Brazilian Portuguese. This work demonstrates that high-quality n-gram language models can be trained directly on client mobile devices without sensitive training data ever leaving the device.
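The core of FederatedAveraging can be sketched in a few lines: the server averages client model parameters, weighting each client by its number of local training examples. The parameter vectors and client sizes below are invented; real rounds also sample clients and run local gradient steps first.

```python
def federated_average(client_weights, client_sizes):
    """One aggregation step of FederatedAveraging: average client model
    parameters, weighting each client by its example count."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]

# Two hypothetical clients with 3 and 1 local examples respectively.
avg = federated_average([[1.0, 0.0], [0.0, 1.0]], [3, 1])
```

The client with more data pulls the average toward its parameters, giving [0.75, 0.25] here.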
Latin script keyboards for South Asian languages with finite-state normalization
Lawrence Wolf-Sonkin
Vlad Schogol
Proceedings of FSMNLP (2019), pp. 108-117
The use of the Latin script for text entry of South Asian languages is common, even though there is no standard orthography for these languages in the script. We explore several compact finite-state architectures that permit variable spellings of words during mobile text entry. We find that approaches making use of transliteration transducers provide large accuracy improvements over baselines, but that simpler approaches involving a compact representation of many attested alternatives yields much of the accuracy gain. This is particularly important when operating under constraints on model size (e.g., on inexpensive mobile devices with limited storage and memory for keyboard models), and on speed of inference, since people typing on mobile keyboards expect no perceptual delay in keyboard responsiveness.
Algorithms for Weighted Finite Automata with Failure Transitions
International Conference on Implementation and Application of Automata (CIAA) (2018), pp. 46-58
In this paper we extend some key weighted finite automata (WFA) algorithms to automata with failure transitions (phi-WFAs). Failure transitions, which are taken only when no immediate match is possible at a given state, are used to compactly represent automata and have many applications. An efficient intersection algorithm and a shortest distance algorithm (over R+) are presented, as well as a related algorithm to remove failure transitions from a phi-WFA.
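The failure-transition semantics can be sketched directly: to read a symbol from a state, follow phi (failure) arcs, which consume no input, until a state with a matching arc is found. The toy automaton below is invented for illustration; state 1 falls back to state 0, as in a backoff n-gram model.

```python
def phi_next_state(arcs, phi, state, symbol):
    """Follow the transition for `symbol` from `state`, taking failure
    (phi) transitions until a match is found. `arcs[state]` maps input
    symbols to next states; `phi[state]` is the failure successor,
    absent at states with no phi arc."""
    while symbol not in arcs.get(state, {}):
        if state not in phi:
            return None  # no match anywhere along the failure chain
        state = phi[state]
    return arcs[state][symbol]

# A toy backoff-like phi-automaton: state 1 falls back to state 0.
arcs = {0: {"a": 1, "b": 2}, 1: {"a": 1}}
phi = {1: 0}
```

Reading "b" from state 1 fails locally, backs off to state 0, and matches there; this is why phi arcs let shared suffix behavior be stored once.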
Semantic Lattice Processing in Contextual Automatic Speech Recognition for Google Assistant
Ian Williams
Justin Scheiner
Interspeech 2018, ISCA (2018), pp. 2222-2226
Recent interest in intelligent assistants has increased demand for Automatic Speech Recognition (ASR) systems that can utilize contextual information to adapt to the user’s preferences or the current device state. For example, a user might be more likely to refer to their favorite songs when giving a “music playing” command or request to watch a movie starring a particular favorite actor when giving a “movie playing” command. Similarly, when a device is in a “music playing” state, a user is more likely to give volume control commands.
In this paper, we explore using semantic information inside the ASR word lattice by employing Named Entity Recognition (NER) to identify and boost contextually relevant paths in order to improve speech recognition accuracy. We use broad semantic classes comprising millions of entities, such as songs and musical artists, to tag relevant semantic entities in the lattice. We show that our method reduces Word Error Rate (WER) by 12.0% relative on a Google Assistant “media playing” commands test set, while not affecting WER on a test set containing commands unrelated to media.
On Lattice Generation for Large Vocabulary Speech Recognition
Johan Schalkwyk
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan (2017)
Lattice generation is an essential feature of the decoder for many speech recognition applications. In this paper, we first review lattice generation methods for WFST-based decoding and describe in a uniform formalism two established approaches for state-of-the-art speech recognition systems: the phone-pair and the N-best histories approaches. We then present a novel optimization method, pruned determinization followed by minimization, that produces a deterministic minimal lattice that retains all paths within specified weight and lattice size thresholds. Experimentally, we show that before optimization, the phone-pair and the N-best histories approaches each have conditions where they perform better when evaluated on video transcription and mixed voice search and dictation tasks. However, once this lattice optimization procedure is applied, the phone-pair approach has the lowest oracle WER for a given lattice density by a significant margin. We further show that the pruned determinization presented here is efficient to use during decoding, unlike the classical weighted determinization from which it is derived. Finally, we consider on-the-fly lattice rescoring, in which lattice generation and combination with the secondary LM are done in one step. We compare the phone-pair and N-best histories approaches for this scenario and find the former superior in our experiments.
Transliterated mobile keyboard input via weighted finite-state transducers
Lars Hellsten
Prasoon Goyal
Proceedings of the 13th International Conference on Finite State Methods and Natural Language Processing (FSMNLP) (2017)
We present an extension to a mobile keyboard input decoder based on finite-state transducers that provides general transliteration support, and demonstrate its use for input of South Asian languages using a QWERTY keyboard. On-device keyboard decoders must operate under strict latency and memory constraints, and we present several transducer optimizations that allow for high accuracy decoding under such constraints. Our methods yield substantial accuracy improvements and latency reductions over an existing baseline transliteration keyboard approach. The resulting system was launched for 22 languages in Google Gboard in the first half of 2017.
Contextual prediction models for speech recognition
Yoni Halpern
Keith Hall
Vlad Schogol
Martin Baeuml
Proceedings of Interspeech 2016
We introduce an approach to biasing language models towards known contexts without requiring separate language models or explicit contextually-dependent conditioning contexts. We do so by presenting an alternative ASR objective, where we predict the acoustics and words given the contextual cue, such as the geographic location of the speaker. A simple factoring of the model results in an additional biasing term, which effectively indicates how correlated a hypothesis is with the contextual cue (e.g., given the hypothesized transcript, how likely is the user's known location). We demonstrate that this factorization allows us to train relatively small contextual models which are effective in speech recognition. An experimental analysis shows both a perplexity reduction and a significant word error rate reduction on a voice search task when using the user's location as a contextual cue.
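The biasing term that falls out of the factorization can be sketched as the log ratio of the hypothesis probability given the cue to its unconditional probability, added to the base ASR score. The probabilities below are invented placeholders, not model outputs.

```python
import math

def biased_score(asr_logprob, hyp_prob_given_context, hyp_prob):
    """Add the biasing term log p(W|context) - log p(W) to an ASR
    hypothesis log score, boosting hypotheses correlated with the cue."""
    return asr_logprob + math.log(hyp_prob_given_context) - math.log(hyp_prob)

# A hypothesis twice as likely given the user's location gets a log(2) boost.
score = biased_score(-5.0, 0.2, 0.1)
```

Hypotheses uncorrelated with the cue (p(W|context) = p(W)) are left unchanged, which is what lets a small contextual model be used safely.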
We present a new algorithm for efficiently training n-gram language models on uncertain data, and illustrate its use for semi-supervised language model adaptation. We compute the probability that an n-gram occurs k times in the sample of uncertain data, and use the resulting histograms to derive a generalized Katz backoff model. We compare semi-supervised adaptation of language models for YouTube video speech recognition in two conditions: when using full lattices with our new algorithm versus just the 1-best output from the baseline speech recognizer. Unlike 1-best methods, the new algorithm provides models that yield solid improvements over the baseline on the full test set, and, further, achieves these gains without hurting performance on any of the set of channels. We show that channels with the most data yielded the largest gains. The algorithm was implemented via a new semiring in the OpenFst library and will be released as part of the OpenGrm ngram library.
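The histogram computation described above can be sketched with the standard Poisson-binomial dynamic program: given independent occurrence probabilities for an n-gram (e.g., posterior probabilities from lattice paths containing it), compute the probability it occurs exactly k times. The probabilities below are invented inputs; the paper implements this via a semiring in OpenFst.

```python
def count_histogram(occurrence_probs, max_k):
    """P(an n-gram occurs exactly k times), for k = 0..max_k, given
    independent per-instance occurrence probabilities. Iterates the
    Poisson-binomial recurrence, updating counts from high k to low so
    each probability is absorbed exactly once."""
    hist = [1.0] + [0.0] * max_k
    for p in occurrence_probs:
        for k in range(max_k, 0, -1):
            hist[k] = hist[k] * (1 - p) + hist[k - 1] * p
        hist[0] *= 1 - p
    return hist

# Two uncertain occurrences, each present with probability 0.5.
hist = count_histogram([0.5, 0.5], 2)
```

The resulting histogram [0.25, 0.5, 0.25] is what a generalized Katz backoff estimator would consume in place of hard counts.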
Distributed representation and estimation of WFST-based n-gram models
Proceedings of the ACL Workshop on Statistical NLP and Weighted Automata (StatFSM) (2016), pp. 32-41
We present methods for partitioning a weighted finite-state transducer (WFST) representation of an n-gram language model into multiple shards, each of which is a stand-alone WFST n-gram model in its own right, allowing processing with existing algorithms. After independent estimation, including normalization, smoothing and pruning on each shard, the shards can be merged into a single WFST that is identical to the model that would have resulted from estimation without sharding. We then present an approach that uses data partitions in conjunction with WFST sharding to estimate models on orders-of-magnitude more data than would have otherwise been feasible with a single process. We present some numbers on shard characteristics when large models are trained from a very large data set. Functionality to support distributed n-gram modeling has been added to the OpenGrm library.
Composition-based on-the-fly rescoring for salient n-gram biasing
Keith Hall
Eunjoon Cho
Noah Coccaro
Kaisuke Nakajima
Linda Zhang
Interspeech 2015, International Speech Communication Association
Pushdown automata in statistical machine translation
Bill Byrne
Adrià de Gispert
Gonzalo Iglesias
Computational Linguistics, vol. 40 (2014), pp. 687-723
Smoothed marginal distribution constraints for language modeling
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL) (2013), pp. 43-52
We present an algorithm for re-estimating parameters of backoff n-gram language models so as to preserve given marginal distributions, along the lines of well-known Kneser-Ney smoothing. Unlike Kneser-Ney, our approach is designed to be applied to any given smoothed backoff model, including models that have already been heavily pruned. As a result, the algorithm avoids issues observed when pruning Kneser-Ney models (Siivola et al., 2007; Chelba et al., 2010), while retaining the benefits of such marginal distribution constraints. We present experimental results for heavily pruned backoff n-gram models, and demonstrate perplexity and word error rate reductions when used with various baseline smoothing methods. An open-source version of the algorithm has been released as part of the OpenGrm ngram library.
The OpenGrm Open-Source Finite-State Grammar Software Libraries
Terry Tai
ACL (System Demonstrations) (2012), pp. 61-66
Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice
Johan Schalkwyk
Boulos Harb
Peng Xu
Preethi Jyothi
Thorsten Brants
Vida Ha
Will Neveitt
University of Toronto (2012)
A critical component of a speech recognition system targeting web search is the language model. The talk presents an empirical exploration of the google.com query stream with the end goal of high-quality statistical language modeling for mobile voice search. Our experiments show that after text normalization the query stream is not as "wild" as it seems at first sight. One can achieve out-of-vocabulary rates below 1% using a one million word vocabulary, and excellent n-gram hit ratios of 77/88% even at high orders such as n=5/4, respectively. Using large-scale, distributed language models can improve performance significantly, with up to 10% relative reductions in word error rate over conventional models used in speech recognition. We also find that the query stream is non-stationary, which means that adding more past training data beyond a certain point provides diminishing returns, and may even degrade performance slightly. Perhaps less surprisingly, we show that locale matters significantly for English query data across the USA, Great Britain, and Australia. In an attempt to leverage the speech data in voice search logs, we successfully build large-scale discriminative n-gram language models and derive small but significant gains in recognition performance.
Mobile Music Modeling, Analysis and Recognition
Pavel Golik
Boulos Harb
Alex Rudnick
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2012)
We present an analysis of music modeling and recognition techniques in the context of mobile music matching, substantially improving on the techniques presented in [Mohri et al., 2010]. We accomplish this by adapting the features specifically to this task, and by introducing new modeling techniques that enable using a corpus of noisy and channel-distorted data to improve mobile music recognition quality. We report the results of an extensive empirical investigation of the system's robustness under realistic channel effects and distortions. We show an improvement of recognition accuracy by explicit duration modeling of music phonemes and by integrating the expected noise environment into the training process. Finally, we propose the use of frame-to-phoneme alignment for high-level structure analysis of polyphonic music.
This paper explores various static interpolation methods for approximating a single dynamically-interpolated language model used for a variety of recognition tasks on the Google Android platform. The goal is to find the statically-interpolated first-pass LM that best reduces search errors in a two-pass system, or that even allows eliminating the more complex dynamic second pass entirely. Static interpolation weights that are uniform, prior-weighted, and the maximum likelihood, maximum a posteriori, and Bayesian solutions are considered. Analysis argues, and recognition experiments on Android test data show, that a Bayesian interpolation approach performs best.
Hierarchical Phrase-Based Translation Representations
Gonzalo Iglesias
William Byrne
Adrià de Gispert
Proceedings of EMNLP 2011
A Filter-based Algorithm for Efficient Composition of Finite-State Transducers
Johan Schalkwyk
International Journal of Foundations of Computer Science, vol. 22 (2011), pp. 1781-1795
Language Modeling for Automatic Speech Recognition Meets the Web: Google Search by Voice
Johan Schalkwyk
Boulos Harb
Peng Xu
Thorsten Brants
Vida Ha
Will Neveitt
OGI/OHSU Seminar Series, Portland, Oregon, USA (2011)
The talk presents key aspects faced when building language models (LM) for the google.com query stream, and their use for automatic speech recognition (ASR). Distributed LM tools enable us to handle a huge amount of data, and to experiment with LMs that are two orders of magnitude larger than usual. An empirical exploration of the problem led us to rediscovering a lesser-known interaction between Kneser-Ney smoothing and entropy pruning, possible non-stationarity of the query stream, as well as strong dependence on various English locales: USA, Britain, and Australia. LM compression techniques allowed us to use one billion n-gram LMs in the first pass of an ASR system built on FST technology, and to evaluate empirically whether a two-pass system architecture has any losses over one pass.
This paper describes a new method for building compact context-dependency transducers for finite-state transducer-based ASR decoders. Instead of the conventional phonetic decision-tree growing followed by FST compilation, this approach incorporates the phonetic context splitting directly into the transducer construction. The objective function of the split optimization is augmented with a regularization term that measures the number of transducer states introduced by a split. We give results on a large spoken-query task for various n-phone orders and other phonetic features that show this method can greatly reduce the size of the resulting context-dependency transducer with no significant impact on recognition accuracy. This permits using context sizes and features that might otherwise be unmanageable.
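The regularized split objective can be sketched as picking, among candidate phonetic-context questions, the one maximizing likelihood gain minus a penalty proportional to the transducer states the split would introduce. The candidate names, gains, and state counts below are invented for the sketch.

```python
def best_split(candidates, reg):
    """Select the phonetic-context split maximizing
    (likelihood gain) - reg * (transducer states introduced).
    Each candidate is a (name, gain, new_states) tuple."""
    return max(candidates, key=lambda c: c[1] - reg * c[2])

# A higher-gain split can lose to a cheaper one once states are penalized.
splits = [("vowel?", 10.0, 50), ("nasal?", 9.0, 5)]
choice = best_split(splits, reg=0.1)
```

With reg = 0.1 the "vowel?" split scores 5.0 and "nasal?" scores 8.5, so the regularizer steers the construction toward the more compact transducer.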
This paper describes a weighted finite-state transducer composition algorithm that generalizes the notion of the composition filter and presents filters that remove useless epsilon paths and push forward labels and weights along epsilon paths. This filtering allows us to compose together large speech recognition context-dependent lexicons and language models much more efficiently in time and space than previously possible. We present experiments on Broadcast News and Google Search by Voice that demonstrate a 5% to 10% overhead for dynamic, runtime composition compared to a static, offline composition of the recognition transducer. To our knowledge, this is the first such system with such a small overhead.
Web-derived Pronunciations
Arnab Ghoshal
Martin Jansche
Sanjeev Khudanpur
Morgan Ulinski
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2009), pp. 4289-4292
Pronunciation information is available in large quantities on the Web, in the form of IPA and ad-hoc transcriptions. We describe techniques for extracting candidate pronunciations from Web pages and associating them with orthographic words, filtering out poorly extracted pronunciations, normalizing IPA pronunciations to better conform to a common transcription standard, and generating phonemic representations from ad-hoc transcriptions. We show improvements on a letter-to-phoneme task when using web-derived vs. Pronlex pronunciations.
Web Derived Pronunciations for Spoken Term Detection
Doğan Can
Erica Cooper
Arnab Ghoshal
Martin Jansche
Sanjeev Khudanpur
Bhuvana Ramabhadran
Murat Saraçlar
Abhinav Sethy
Morgan Ulinski
Christopher White
32nd Annual International ACM SIGIR Conference (2009), pp. 83-90
Indexing and retrieval of speech content in various forms such as broadcast news, customer care data and on-line media has gained a lot of interest for a wide range of applications, from customer analytics to on-line media search. For most retrieval applications, the speech content is typically first converted to a lexical or phonetic representation using automatic speech recognition (ASR). The first step in searching through indexes built on these representations is the generation of pronunciations for named entities and foreign language query terms. This paper summarizes the results of the work conducted during the 2008 JHU Summer Workshop by the Multilingual Spoken Term Detection team, on mining the web for pronunciations and analyzing their impact on spoken term detection. We first present methods to use the vast amount of pronunciation information available on the Web, in the form of IPA and ad-hoc transcriptions. We describe techniques for extracting candidate pronunciations from Web pages and associating them with orthographic words, filtering out poorly extracted pronunciations, normalizing IPA pronunciations to better conform to a common transcription standard, and generating phonemic representations from ad-hoc transcriptions. We then present an analysis of the effectiveness of using these pronunciations to represent Out-Of-Vocabulary (OOV) query terms on the performance of a spoken term detection (STD) system. We provide comparisons of Web pronunciations against automated techniques for pronunciation generation as well as pronunciations generated by human experts. Our results cover a range of speech indexes based on lattices, confusion networks and one-best transcriptions at both the word and word-fragment levels.
OpenFst: An Open-Source, Weighted Finite-State Transducer Library and its Applications to Speech and Language
Martin Jansche
Proceedings of the North American Chapter of the Association for Computational Linguistics -- Human Language Technologies (NAACL HLT) 2009 conference, Tutorials
Finite-state methods are well established in language and speech processing. OpenFst (available from www.openfst.org) is a free and open-source software library for building and using finite automata, in particular, weighted finite-state transducers (FSTs). This tutorial is an introduction to weighted finite-state transducers and their uses in speech and language processing. While there are other weighted finite-state transducer libraries, OpenFst (a) offers, we believe, the most comprehensive, general and efficient set of operations; (b) makes available full source code; (c) exposes high- and low-level C++ APIs that make it easy to embed and extend; and (d) is a platform for active research and use among many colleagues.
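As a rough illustration of the central operation such a library provides, the sketch below composes two toy transducers over the tropical semiring (weights are costs combined by addition) in plain Python. The arc representation is invented for the example and is not the OpenFst API; a real implementation also restricts itself to reachable state pairs and handles epsilon labels.

```python
def compose(fst1, fst2):
    """Toy transducer composition: pair arcs whose output and input
    labels match, pairing their states and adding their (tropical)
    weights. Each transducer is a list of arcs
    (src, in_label, out_label, weight, dst) with start state 0."""
    arcs = []
    for (s1, i1, o1, w1, d1) in fst1:
        for (s2, i2, o2, w2, d2) in fst2:
            if o1 == i2:
                arcs.append(((s1, s2), i1, o2, w1 + w2, (d1, d2)))
    return arcs

# "a" -> "b" at cost 1, composed with "b" -> "c" at cost 2,
# yields "a" -> "c" at cost 3.
fst1 = [(0, "a", "b", 1.0, 1)]
fst2 = [(0, "b", "c", 2.0, 1)]
result = compose(fst1, fst2)
```

This pattern, applied at scale with lazy expansion and composition filters, is how a lexicon, context-dependency model, and language model are combined into a recognition transducer.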
Sample Selection Bias Correction Theory
Proceedings of The 19th International Conference on Algorithmic Learning Theory (ALT 2008), Springer, Heidelberg, Germany, Budapest, Hungary
Speech Recognition with Weighted Finite-State Transducers
Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2008)
On the Computation of the Relative Entropy of Probabilistic Automata
Ashish Rastogi
International Journal of Foundations of Computer Science, vol. 19 (2008), pp. 219-242
Speech Recognition with Weighted Finite-State Transducers
Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2007)
OpenFst: a General and Efficient Weighted Finite-State Transducer Library
Johan Schalkwyk
Wojciech Skut
Proceedings of the 12th International Conference on Implementation and Application of Automata (CIAA 2007), Springer-Verlag, Heidelberg, Germany, Prague, Czech Republic
Efficient Computation of the Relative Entropy of Probabilistic Automata
Ashish Rastogi
Proceedings of the 7th Latin American Symposium (LATIN 2006), Springer-Verlag, Heidelberg, Germany, Valdivia, Chile
Automata and Graph Compression
MAP adaptation of stochastic grammars
Weighted Automata in Text and Speech Processing
Statistical Modeling for Unit Selection in Speech Synthesis
42nd Meeting of the Association for Computational Linguistics (ACL 2004), Proceedings of the Conference, Barcelona, Spain
A Generalized Construction of Integrated Speech Recognition Transducers
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), Montreal, Canada
Voice Signatures
Proceedings of The 8th IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2003), St. Thomas, U.S. Virgin Islands
A Comparison of Two LVR Search Optimization Techniques
Stephan Kanthak
Hermann Ney
Proceedings of the International Conference on Spoken Language Processing 2002 (ICSLP '02), Denver, Colorado
Weighted Finite-State Transducers in Speech Recognition (Tutorial)
Proceedings of the International Conference on Spoken Language Processing 2002 (ICSLP '02), Denver, Colorado
An Efficient Algorithm for the N-Best-Strings Problem
Proceedings of the International Conference on Spoken Language Processing 2002 (ICSLP '02), Denver, Colorado
Weighted Finite-State Transducers in Speech Recognition
Computer Speech and Language, vol. 16 (2002), pp. 69-88
A Weight Pushing Algorithm for Large Vocabulary Speech Recognition
Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech '01), Aalborg, Denmark (2001)
Weighted Finite-State Transducers in Speech Recognition
Proceedings of the ISCA Tutorial and Research Workshop, Automatic Speech Recognition: Challenges for the New Millennium (ASR2000), Paris, France
The Design Principles of a Weighted Finite-State Transducer Library
Theoretical Computer Science, vol. 231 (2000), pp. 17-32
Integrated Context-Dependent Networks in Very Large Vocabulary Speech Recognition
Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary (1999)
Network Optimizations for Large Vocabulary Speech Recognition
Efficient General Lattice Generation and Rescoring
Rapid Unit Selection from a Large Speech Corpus for Concatenative Speech Synthesis
Mark Beutnagel
Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary (1999)
A Rational Design for a Weighted Finite-State Transducer Library
Proceedings of the Second International Workshop on Implementing Automata (WIA '97), Springer-Verlag, Berlin-NY (1998), pp. 144-158
Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition
Don Hindle
Andrej Ljolje
Fernando C. N. Pereira
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98), Seattle, Washington (1998), pp. 665-668
A Rational Design for a Weighted Finite-State Transducer Library
Proceedings of the Workshop on Implementing Automata (WIA '97), University of Western Ontario, London, Ontario, Canada (1997)
Transducer Composition for Context-Dependent Network Expansion
Proceedings of the 5th European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece (1997), pp. 1427-1430
Speech Recognition by Composition of Weighted Finite Automata
Finite-State Language Processing, MIT Press, Cambridge, Massachusetts (1997), pp. 431-453
Weighted Determinization and Minimization for Large Vocabulary Speech Recognition
Proceedings of the 5th European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece (1997)
Compilation of Weighted Finite-State Transducers from Decision Trees
Algorithms for Speech Recognition and Language Processing
Rational Power Series in Text and Speech Processing
Graduate course, University of Pennsylvania, Department of Computer Science, Philadelphia, PA (1996)
Finite-State Transducers in Language and Speech Processing
Tutorial at the 16th International Conference on Computational Linguistics (COLING-96), COLING, Copenhagen, Denmark (1996)
Weighted Automata in Text and Speech Processing
Proceedings of the 12th biennial European Conference on Artificial Intelligence (ECAI-96), Workshop on Extended finite state models of language, John Wiley and Sons, Chichester, Budapest, Hungary (1996)
The AT&T 60,000 Word Speech-to-Text System
Andrej Ljolje
Don Hindle
Eurospeech'95: ESCA 4th European Conference on Speech Communication and Technology, Madrid, Spain (1995), pp. 207-210
Weighted Rational Transductions and their Application to Human Language Processing
Human Language Technology Workshop, Morgan Kaufmann, San Francisco, California (1994), pp. 262-267
A spoken language translator for restricted-domain context-free languages
David B. Roe
Alejandro Macarrón
Speech Communication, vol. 11 (1992), pp. 311-319
Efficient Grammar Processing for a Spoken Language Translation System
David B. Roe
Alejandro Macarrón
Proceedings of ICASSP, IEEE, San Francisco, California (1992), pp. 213-216
Toward a Spoken Language Translator for Restricted-Domain Context-Free Languages
David B. Roe
Alejandro Macarrón
EUROSPEECH 91 -- 2nd European Conference on Speech Communication and Technology, Genova, Italy (1991), pp. 1063-1066