Fernando Pereira

Fernando Pereira is VP and Engineering Fellow at Google, where he leads research and development in natural language understanding and machine learning. His previous positions include chair of the Computer and Information Science department of the University of Pennsylvania, head of the Machine Learning and Information Retrieval department at AT&T Labs, and research and management positions at SRI International. He received a Ph.D. in Artificial Intelligence from the University of Edinburgh in 1982, and has over 120 research publications on computational linguistics, machine learning, bioinformatics, speech recognition, and logic programming, as well as several patents. He was elected AAAI Fellow in 1991 for contributions to computational linguistics and logic programming, ACM Fellow in 2010 for contributions to machine learning models of natural language and biological sequences, and ACL Fellow for contributions to sequence modeling, finite-state methods, and dependency and deductive parsing. He was president of the Association for Computational Linguistics in 1993.

Research Areas

Authored Publications

Google Publications

Other Publications

Conversational Music Retrieval with Synthetic Data

Megan Eileen Leszczynski

Ravi Ganti

Shu Zhang

Krisztian Balog

Filip Radlinski

Fernando Pereira

Arun Tejasvi Chaganty

Second Workshop on Interactive Learning for Natural Language Processing at NeurIPS 2022

Points, Paths, and Playscapes: Large-scale Spatial Language Understanding Tasks Set in the Real World

Jason Baldridge

Tania Bedrax-Weiss

Daphne Luong

Srini Narayanan

Bo Pang

Fernando Pereira

Radu Soricut

Michael Tseng

Yuan Zhang

Proceedings of the First International Workshop on Spatial Language Understanding, Association for Computational Linguistics, New Orleans, Louisiana, USA (2018), pp. 46-52

SLING: A framework for frame semantic parsing

Michael Ringgaard

Rahul Gupta

Fernando C. N. Pereira

arXiv (2017), pp. 9

Collective Entity Resolution with Multi-Focal Attention

Amir Globerson

Nevena Lazic

Soumen Chakrabarti

Amarnag Subramanya

Michael Ringaard

Fernando Pereira

ACL (2016)

Multinomial Loss on Held-out Data for the Sparse Non-negative Matrix Language Model

Ciprian Chelba

Fernando Pereira

ArXiv, Google (2015)

Plato: A Selective Context Model for Entity Resolution

Nevena Lazic

Amarnag Subramanya

Michael Ringgaard

Fernando Pereira

Transactions of the Association for Computational Linguistics, vol. 3 (2015), pp. 503-515

Yedalog: Exploring Knowledge at Scale

Brian Chin

Daniel von Dincklage

Vuk Ercegovac

Peter Hawkins

Mark S. Miller

Franz Och

Chris Olston

Fernando Pereira

1st Summit on Advances in Programming Languages (SNAPL 2015), Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik, Dagstuhl, Germany, pp. 63-78

Large Scale Distributed Acoustic Modeling With Back-off N-grams

Ciprian Chelba

Peng Xu

Fernando Pereira

Thomas Richardson

IEEE Transactions on Audio, Speech and Language Processing, vol. 21 (2013), pp. 1158-1169

Large Scale Distributed Acoustic Modeling With Back-off N-grams

Ciprian Chelba

Peng Xu

Fernando Pereira

Thomas Richardson

ICSI, Berkeley, California (2013)

Distributed Acoustic Modeling with Back-off N-grams

Ciprian Chelba

Peng Xu

Fernando Pereira

Thomas Richardson

Proceedings of ICASSP 2012, IEEE, pp. 4129-4132

Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models

Sameer Singh

Amarnag Subramanya

Fernando Pereira

Andrew McCallum

Association for Computational Linguistics (ACL) (2011)

Posterior Sparsity in Dependency Grammar Induction

Jennifer Gillenwater

Kuzman Ganchev

Joao Graca

Fernando Pereira

Ben Taskar

Journal of Machine Learning Research, vol. 12 (2011), pp. 455-490

Controlling Complexity in Part-of-Speech Induction

Joao Graca

Kuzman Ganchev

Luisa Coheur

Fernando Pereira

Ben Taskar

Journal of Artificial Intelligence Research (JAIR), vol. 41 (2011), pp. 527-551

A theory of learning from different domains

Shai Ben-David

John Blitzer

Koby Crammer

Alex Kulesza

Fernando Pereira

Jennifer Vaughan

Machine Learning, vol. 79 (2010), pp. 151-175

Distributed MAP Inference for Undirected Graphical Models

Sameer Singh

Amarnag Subramanya

Fernando Pereira

Andrew McCallum

Workshop on Learning on Cores, Clusters and Clouds (LCCC), Neural Information Processing Society (NIPS) (2010)

Preview

Exploiting Feature Covariance in High-Dimensional Online Learning

Justin Ma

Alex Kulesza

Mark Dredze

Koby Crammer

Lawrence Saul

Fernando Pereira

Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, JMLR (2010), pp. 493-500

Preview

Automatically incorporating new sources in keyword search-based data integration

Partha Pratim Talukdar

Zachary G. Ives

Fernando Pereira

SIGMOD Conference, ACM Press (2010), pp. 387-398

Preview

Experiments in Graph-based Semi-Supervised Learning Methods for Class-Instance Acquisition

Partha Pratim Talukdar

Fernando Pereira

48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

Preview

Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models

Amarnag Subramanya

Slav Petrov

Fernando Pereira

Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP '10)

Preview

Sparsity in Dependency Grammar Induction

Jennifer Gillenwater

Kuzman Ganchev

João Graça

Fernando Pereira

Ben Taskar

48th Annual Meeting of the Association for Computational Linguistics (ACL 2010)

Preview

The Unreasonable Effectiveness of Data

Alon Halevy

Peter Norvig

Fernando Pereira

IEEE Intelligent Systems, vol. 24 (2009), pp. 8-12

Preview

Group Sparse Coding

Samy Bengio

Fernando Pereira

Yoram Singer

Dennis Strelow

Advances in Neural Information Processing Systems (2009)

Preview

Gaussian Margin Machines

Koby Crammer

Mehryar Mohri

Fernando Pereira

Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS 2009), Clearwater Beach, Florida, pp. 105-112

Preview

A transcription factor affinity-based code for mammalian transcription initiation

M Megraw

F Pereira

ST Jensen

U Ohler

AG Hatzigeorgiou

Genome Research, vol. 19 (2009), pp. 644-56

Posterior vs. Parameter Sparsity in Latent Variable Models

Joao Graca

Kuzman Ganchev

Ben Taskar

Fernando Pereira

Advances in Neural Information Processing Systems 22 (2009), pp. 664-672

Intelligent Email: Reply and Attachment Prediction

Mark Dredze

Tova Brooks

Josh Carroll

Joshua Magarick

John Blitzer

Fernando Pereira

Proceedings of the 2008 International Conference on Intelligent User Interfaces

Preview

Confidence-Weighted Linear Classification

Mark Dredze

Koby Crammer

Fernando Pereira

International Conference on Machine Learning (ICML) (2008)

Generating Summary Keywords for Emails Using Topics

Mark Dredze

Hanna Wallach

Danny Puller

Fernando Pereira

Proceedings of the 2008 International Conference on Intelligent User Interfaces

Preview

Weakly-Supervised Acquisition of Labeled Class Instances using Graph Random Walks

Partha Pratim Talukdar

Joseph Reisinger

Marius Pasca

Deepak Ravichandran

Rahul Bhagat

Fernando Pereira

Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2008), Association for Computational Linguistics, Honolulu, Hawaii, pp. 582-590

Preview

Learning Bounds for Domain Adaptation

John Blitzer

Koby Crammer

Alex Kulesza

Fernando Pereira

Jennifer Wortman

Advances in Neural Information Processing Systems 20, {MIT} Press, Cambridge, MA (2008)

Preview

Structured Learning with Approximate Inference

Alex Kulesza

Fernando Pereira

Advances in Neural Information Processing Systems 20, {MIT} Press, Cambridge, MA (2008)

Preview

Speech Recognition with Weighted Finite-State Transducers

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2008)

Preview

Reading the Markets: Forecasting Public Opinion of Political Candidates by News Analysis

Kevin Lerman

Ari Gilder

Mark Dredze

Fernando Pereira

Conference on Computational Linguistics (Coling) (2008)

The Need for Open Source Software in Machine Learning

Soren Sonnenburg

Mikio L. Braun

Cheng Soon Ong

Samy Bengio

Leon Bottou

Geoff Holmes

Yann LeCun

Klaus-Robert Mueller

Fernando Pereira

Carl-Edward Rasmussen

Gunnar Raetsch

Bernhard Schoelkopf

Alexander Smola

Pascal Vincent

Jason Weston

Robert C. Williamson

Journal of Machine Learning Research, vol. 8 (2007), pp. 2443-2466

Euclidean Embedding of Co-occurrence Data

Amir Globerson

Gal Chechik

Fernando Pereira

Naftali Tishby

Journal of Machine Learning Research, vol. 8 (2007), pp. 2265-2295

Preview

Frustratingly Hard Domain Adaptation for Dependency Parsing

Mark Dredze

John Blitzer

Partha Pratim Talukdar

Kuzman Ganchev

João V. Graça

Fernando Pereira

Proceedings of the CoNLL Shared Task Session of EMNLP-CoNLL 2007, pp. 1051-1055

Preview

Speech Recognition with Weighted Finite-State Transducers

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Handbook on Speech Processing and Speech Communication, Part E: Speech recognition, Springer-Verlag, Heidelberg, Germany (2007)

Preview

A Context Pattern Induction Method for Named Entity Extraction

Partha Pratim Talukdar

Thorsten Brants

Mark Liberman

Fernando Pereira

Proceedings of CoNLL-X (2006), pp. 141-148

Preview

A rate-distortion one-class model and its applications to clustering

K. Crammer

P. Talukdar

F. Pereira

Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, pp. 184-191

Intelligent Email: Aiding Users with AI

Mark Dredze

Hanna Wallach

Danny Puller

Tova Brooks

Josh Carroll

Joshua Magarick

John Blitzer

Fernando Pereira

American National Conference on Artificial Intelligence (AAAI) (2008)

Confidence-weighted linear classification

M. Dredze

K. Crammer

F. Pereira

Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), Omnipress, pp. 264-271

Evigan: a hidden variable model for integrating gene evidence for eukaryotic gene prediction

Qian Liu

Aaron J Mackey

David S Roos

Fernando C N Pereira

Bioinformatics, vol. 24 (2008), pp. 597-605

Learning to Create Data-Integrating Queries

Partha Pratim Talukdar

Marie Jacob

M. Salman Mehmood

Koby Crammer

Zachary Ives

Fernando Pereira

Sudipto Guha

VLDB (2008)

Reranking candidate gene models with cross-species comparison for improved gene prediction

Qian Liu

Koby Crammer

Fernando C. Pereira

David S. Roos

BMC Bioinformatics, vol. 9 (2008), pp. 433

Learning to join everything

Fernando Pereira

CIKM (2007), pp. 9-10

Analysis of Representations for Domain Adaptation

Shai Ben-David

John Blitzer

Koby Crammer

Fernando Pereira

Advances in Neural Information Processing Systems 20, MIT Press, Cambridge, MA (2007)

Semi-Automated Named Entity Annotation

Kuzman Ganchev

Fernando Pereira

Mark Mandel

Steven Carroll

Peter White

Proceedings of the Linguistic Annotation Workshop, Association for Computational Linguistics (2007), pp. 53-56

Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification

John Blitzer

Mark Dredze

Fernando Pereira

Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Association for Computational Linguistics, Prague, Czech Republic (2007), pp. 440-447

Penn/UMass/CHOP Biocreative II systems

Kuzman Ganchev

Koby Crammer

Fernando Pereira

Gideon Mann

Kedar Bellare

Andrew McCallum

Steven Carroll

Yang Jin

Peter White

Proceedings of the Second BioCreative Challenge Evaluation Workshop (2007), pp. 119-124

Global Discriminative Learning for Higher-Accuracy Computational Gene Prediction

Axel Bernal

Koby Crammer

Artemis Hatzigeorgiou

Fernando Pereira

PLoS Computational Biology, vol. 3 (2007)

Transductive structured classification through constrained min-cuts

Kuzman Ganchev

Fernando Pereira

Proceedings of the Second Workshop on TextGraphs: Graph-Based Algorithms for Natural Language Processing, Association for Computational Linguistics (2007), pp. 37-44

Automated recognition of malignancy mentions in biomedical literature

Yang Jin

Ryan T. McDonald

Kevin Lerman

Mark A. Mandel

Steven Carroll

Mark Y. Liberman

Fernando C. Pereira

Raymond S. Winters

Peter S. White

BMC Bioinformatics, vol. 7 (2006), pp. 492

"Sorry I forgot the attachment": Email Attachment Prediction

Mark Dredze

John Blitzer

Fernando Pereira

3rd Conference on Email and Anti-Spam, Stanford, CA (2006)

Multilingual Dependency Parsing with a Two-Stage Discriminative Parser

Ryan McDonald

Kevin Lerman

Fernando Pereira

Tenth Conference on Computational Natural Language Learning (CoNLL-X) (2006)

Online Learning of Approximate Dependency Parsing Algorithms

Ryan McDonald

Fernando Pereira

11th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2006, pp. 81-88

Domain Adaptation with Structural Correspondence Learning

John Blitzer

Ryan McDonald

Fernando Pereira

EMNLP 2006: 2006 Conference on Empirical Methods in Natural Language Processing, pp. 120-128

An automated procedure to identify biomedical articles that contain cancer-associated gene variants

Ryan McDonald

R Scott Winters

Claire K Ankuda

Joan A Murphy

Amy E Rogers

Fernando Pereira

Marc S Greenblatt

Peter S White

Human Mutation, vol. 27 (2006), pp. 957-64

Online Learning of Approximate Dependency Parsing Algorithms

Ryan McDonald

Fernando Pereira

Proceedings of EACL (2006)

Embedding Heterogeneous Data Using Statistical Models

Amir Globerson

Gal Chechik

Fernando Pereira

Naftali Tishby

AAAI (2006)

Distributed Latent Variable Models of Lexical Co-occurrences

John Blitzer

Amir Globerson

Fernando Pereira

Tenth International Workshop on Artificial Intelligence and Statistics (2005)

Automatically annotating documents with normalized gene lists

Jeremiah Crim

Ryan McDonald

Fernando Pereira

BMC Bioinformatics (2005)

Online Large-Margin Training of Dependency Parsers

Ryan McDonald

Koby Crammer

Fernando Pereira

43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005)

Reply Expectation Prediction for Email Management

Mark Dredze

John Blitzer

Fernando Pereira

2nd Conference on Email and Anti-Spam, Stanford, CA (2005)

Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE

Ryan McDonald

Seth Kulick

Fernando Pereira

Scott Winters

Yang Jin

Pete White

Proceedings of ACL (2005)

Non-Projective Dependency Parsing using Spanning Tree Algorithms

Ryan McDonald

Fernando Pereira

Kiril Ribarov

Jan Hajic

Proceedings of HLT-EMNLP (2005)

A Conditional Random Field for Discriminatively-trained Finite-state String Edit Distance

Andrew McCallum

Kedar Bellare

Fernando Pereira

Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence (UAI 2005)

Reply Expectation Prediction for Email Management

Mark Dredze

John Blitzer

Fernando Pereira

CEAS (2005)

Identifying gene and protein mentions in text using conditional random fields

Ryan McDonald

Fernando Pereira

BMC Bioinformatics (2005)

Non-Projective Dependency Parsing using Spanning Tree Algorithms

Ryan T. McDonald

Fernando Pereira

Kiril Ribarov

Jan Hajic

HLT/EMNLP (2005)

Weighted Automata in Text and Speech Processing

Mehryar Mohri

Fernando Pereira

Michael Riley

arXiv, vol. abs/cs/0503077 (2005)

Flexible Text Segmentation with Structured Multilabel Classification

Ryan McDonald

Koby Crammer

Fernando Pereira

Proceedings of HLT-EMNLP (2005)

Online Large-Margin Training of Dependency Parsers

Ryan McDonald

Koby Crammer

Fernando Pereira

Proceedings of ACL (2005)

Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE

Ryan McDonald

Fernando Pereira

Seth Kulick

Scott Winters

Yang Jin

Pete White

43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005)

An entity tagger for recognizing acquired genomic variations in cancer literature

Ryan McDonald

Scott Winters

Mark Mandel

Yang Jin

Pete White

Fernando Pereira

Bioinformatics (2004)

Case-Factor Diagrams for Structured Probabilistic Modeling

David McAllester

Michael Collins

Fernando Pereira

Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence (2004)

Hierarchical Distributed Representations for Statistical Language Modeling

John Blitzer

Kilian Weinberger

Lawrence Saul

Fernando Pereira

Advances in Neural Information Processing Systems 17, MIT Press, Cambridge, MA (2004)

ATDD: An Algorithmic Tool for Domain Discovery in Protein Sequences

Stanislav Angelov

Sanjeev Khanna

Li Li

Fernando Pereira

Algorithms in Bioinformatics, 4th International Workshop (WABI 2004), Springer, pp. 206-217

Hierarchical Distributed Representations for Statistical Language Modeling

John Blitzer

Kilian Q. Weinberger

Lawrence K. Saul

Fernando Pereira

NIPS (2004)

Case-Factor Diagrams for Structured Probabilistic Modeling

David A. McAllester

Michael Collins

Fernando Pereira

UAI (2004), pp. 382-391

Euclidean Embedding of Co-Occurrence Data

Amir Globerson

Gal Chechik

Fernando C. Pereira

Naftali Tishby

Advances in Neural Information Processing Systems (NIPS), MIT press, Cambridge, MA (2004), pp. 497-504

Shallow Parsing with Conditional Random Fields

Fei Sha

Fernando C. N. Pereira

HLT-NAACL (2003)

Weighted finite-state transducers in speech recognition

Mehryar Mohri

Fernando Pereira

Michael Riley

Computer Speech & Language, vol. 16 (2002), pp. 69-88

Weighted Finite-State Transducers in Speech Recognition

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Computer Speech and Language, vol. 16 (2002), pp. 69-88

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

John Lafferty

Andrew McCallum

Fernando Pereira

Proceedings of ICML-01 (2001), pp. 282-289

Formal Grammar and Information Theory: Together Again?

Fernando Pereira

Philosophical Transactions of the Royal Society, vol. 358 (2000), pp. 1239-1253

Maximum Entropy Markov Models for Information Extraction and Segmentation

Andrew McCallum

Dayne Freitag

Fernando Pereira

Machine Learning: Proceedings of the Seventeenth International Conference (ICML 2000), Stanford, California, pp. 591-598

The Design Principles of a Weighted Finite-State Transducer Library

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Theoretical Computer Science, vol. 231 (2000), pp. 17-32

The information bottleneck method

Naftali Tishby

Fernando C. Pereira

William Bialek

arXiv, vol. physics/0004057 (2000)

Machine Learning for Efficient Natural-Language Processing

Fernando C. N. Pereira

CPM (2000), pp. 11

Weighted Finite-State Transducers in Speech Recognition

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Proceedings of the ISCA Tutorial and Research Workshop, Automatic Speech Recognition: Challenges for the new Millenium (ASR2000), Paris, France

Multimedia Standards: Present and Future

Fernando C. N. Pereira

ICMCS, Vol. 1 (1999), pp. 145-146

AT&T at TREC-8

Amit Singhal

Steven P. Abney

Michiel Bacchiani

Michael Collins

Donald Hindle

Fernando C. N. Pereira

TREC (1999)

Declarative Programming for a Messy World

Fernando C. N. Pereira

ICLP (1999), pp. 3-5

Document Expansion for Speech Retrieval

Amit Singhal

Fernando C. N. Pereira

SIGIR (1999), pp. 34-41

Quantifiers, Anaphora, and Intensionality

Mary Dalrymple

John Lamping

Fernando Pereira

Vijay Saraswat

Semantics and Syntax in Lexical Functional Grammar, MIT Press, Cambridge, Massachusetts (1999), pp. 39-89

SCAN: Designing and Evaluating User Interfaces to Support Retrieval From Speech Archives

Steve Whittaker

Julia Hirschberg

John Choi

Donald Hindle

Fernando C. N. Pereira

Amit Singhal

SIGIR (1999), pp. 26-33

Relating Probabilistic Grammars and Automata

Steven P. Abney

David A. McAllester

Fernando Pereira

ACL (1999)

An Efficient Extension to Mixture Techniques for Prediction and Decision Trees

Fernando C. N. Pereira

Yoram Singer

Machine Learning, vol. 36 (1999), pp. 183-199

Finding Information in Audio: A New Paradigm for Audio Browsing and Retrieval

Julia Hirschberg

Steve Whittaker

Don Hindle

Fernando Pereira

Amit Singhal

Accessing Information in Spoken Audio: Proceedings of the ESCA ETRW Workshop, Cambridge, England (1999), pp. 117-122

Efficient General Lattice Generation and Rescoring

Andrej Ljolje

Fernando Pereira

Michael Riley

EUROSPEECH 99 (1999), pp. 1251-1254

The Information Bottleneck Method

Naftali Z. Tishby

Fernando Pereira

William Bialek

Proceedings of the 37th Allerton Conference on Communication, Control and Computing, Urbana, Illinois (1999)

Relating Probabilistic Grammars and Automata

Steven Abney

David McAllester

Fernando Pereira

37th Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1999), pp. 542-549

Distributional Similarity Models: Clustering vs.~Nearest Neighbors

Lillian Lee

Fernando Pereira

37th Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1999), pp. 33-40

Similarity-Based Models of Word Cooccurrence Probabilities

Ido Dagan

Lillian Lee

Fernando C. N. Pereira

Machine Learning, vol. 34 (1999), pp. 43-69

Dynamic Compilation of Weighted Context-Free Grammars

Mehryar Mohri

Fernando Pereira

Proceedings of COLING-ACL '98, Montreal, Canada (1998), pp. 891-897

A Rational Design for a Weighted Finite-State Transducer Library

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Proceedings of the Second International Workshop on Implementing Automata (WIA '97), Springer-Verlag, Berlin-NY (1998), pp. 144-158

Modelling Divergent Production: A multi-domain approach

F. Pereira

ECAI (1998), pp. 131-132

SCAN - Speech Content Based Audio Navigator: A Systems Overview

John Choi

Don Hindle

Julia Hirschberg

Ivan Magrin-Chagnolleau

Christine Nakatani

Fernando Pereira

Amit Singhal

Steve Whittaker

Proceedings of the Fifth International Conference on Spoken Language Processing, Sydney (1998)

Dynamic Compilation of Weighted Context-Free Grammars

Mehryar Mohri

Fernando C. N. Pereira

36th Meeting of the Association for Computational Linguistics (ACL '98), Proceedings of the Conference, Montréal, Québec, Canada (1998), pp. 891-897

AT&T at TREC-7

Amit Singhal

John Choi

Donald Hindle

David D. Lewis

Fernando C. N. Pereira

TREC (1998), pp. 186-198

Full Expansion of Context-Dependent Networks in Large Vocabulary Speech Recognition

Mehryar Mohri

Michael Riley

Don Hindle

Andrej Ljolje

Fernando C. N. Pereira

Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP '98), Seattle, Washington (1998)

Similarity-Based Methods For Word Sense Disambiguation

Ido Dagan

Lillian Lee

Fernando C. N. Pereira

arXiv (1997)

Transducer Composition for Context-Dependent Network Expansion

Michael Riley

Fernando C. N. Pereira

Mehryar Mohri

Proceedings of the 5th European Conference on Speech Communication and Technology (Eurospeech '97), Rhodes, Greece (1997)

Transducer Composition for Context-Dependent Network Expansion

Michael Riley

Fernando Pereira

Mehryar Mohri

EuroSpeech'97, European Speech Communication Association, Genova, Italy (1997), pp. 1427-1430

A Rational Design for a Weighted Finite-State Transducer Library

Mehryar Mohri

Fernando Pereira

Michael Riley

WIA'97: Proceedings of the Workshop on Implementing Automata, Springer-Verlag (1997)

Quantifiers, Anaphora, and Intensionality

Mary Dalrymple

John Lamping

Fernando C. N. Pereira

Vijay A. Saraswat

Journal of Logic, Language, and Information, vol. 6, no. 3 (1997), pp. 219-273

Similarity-Based Methods For Word Sense Disambiguation

Ido Dagan

Lillian Lee

Fernando Pereira

35th Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1997), pp. 56-63

Finite-State Approximation of Phrase-Structure Grammars

Fernando Pereira

Rebecca N. Wright

Finite-State Language Processing, MIT Press, Cambridge, Massachusetts (1997), pp. 149-173

A Rational Design for a Weighted Finite-State Transducer Library

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Proceedings of the Workshop on Implementing Automata (WIA '97), London, Ontario, Canada, University of Western Ontario, London, Ontario, Canada (1997)

Speech Recognition by Composition of Weighted Finite Automata

Fernando Pereira

Michael Riley

Finite-State Language Processing, MIT Press, Cambridge, Massachusetts (1997), pp. 431-453

Aggregate and Mixed-Order Markov Models for Statistical Language Processing

Lawrence Saul

Fernando Pereira

Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Somerset, NJ. Distributed by Morgan Kaufmann, San Francisco, CA (1997), pp. 81-89

AT&T at TREC-6: SDR Track

Amit Singhal

John Choi

Donald Hindle

Fernando C. N. Pereira

TREC (1997), pp. 227-232

Speech Recognition by Composition of Weighted Finite Automata

Fernando C. N. Pereira

Michael Riley

arXiv (1996)

Rational Power Series in Text and Speech Processing

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Graduate course, University of Pennsylvania, Department of Computer Science, Philadelphia, PA (1996)

Interactions of Scope and Ellipsis

Stuart M. Shieber

Fernando Pereira

Mary Dalrymple

Linguistics and Philosophy, vol. 19 (1996), pp. 527-552

Intensional Verbs Without Type-Raising or Lexical Ambiguity

Mary Dalrymple

John Lamping

Fernando Pereira

Vijay Saraswat

Logic, Language and Computation (Volume 1), {CSLI} Publications, Stanford, California (1996), pp. 167-182

A Deductive Account of Quantification in LFG

Mary Dalrymple

John Lamping

Fernando Pereira

Vijay Saraswat

Quantifiers, Deduction, and Context, {CSLI} Publications, Stanford, California (1996), pp. 33-57

Weighted Automata in Text and Speech Processing

Mehryar Mohri

Fernando C. N. Pereira

Michael Riley

Proceedings of the 12th biennial European Conference on Artificial Intelligence (ECAI-96), Workshop on Extended finite state models of language, John Wiley and Sons, Chichester, Budapest, Hungary (1996)

Language, Computation and Artificial Intelligence

Fernando C. N. Pereira

ACM Computing Surveys, vol. 28 (1996), pp. 9

Principles and Implementation of Deductive Parsing

Stuart M. Shieber

Yves Schabes

Fernando Pereira

Journal of Logic Programming, vol. 24 (1995), pp. 3-36

Design of a Linguistic Postprocessor using Variable Memory Length Markov Models

Isabelle Guyon

Fernando Pereira

Proceedings of the Third International Conference on Document Analysis and Recognition, IEEE Computer Society Press, Los Alamitos, California (1995), pp. 454-457

The AT&T 60,000 Word Speech-to-Text System

Michael Riley

Andrej Ljolje

Don Hindle

Fernando Pereira

Eurospeech'95: ESCA 4th European Conference on Speech Communication and Technology, Madrid, Spain (1995), pp. 207-210

Ellipsis and Higher-Order Unification

Mary Dalrymple

Stuart M. Shieber

Fernando C. N. Pereira

arXiv (1995)

Linear Logic for Meaning Assembly

Mary Dalrymple

John Lamping

Fernando C. N. Pereira

Vijay A. Saraswat

arXiv (1995)

Beyond Word N-Grams

Fernando Pereira

Yoram Singer

Naftali Z. Tishby

Proceedings of the Third Workshop on Very Large Corpora, Association for Computational Linguistics, Columbus, Ohio (1995), pp. 95-106

Frequencies vs. Biases: Machine Learning Problems in Natural Language Processing - Abstract

Fernando C. N. Pereira

ICML (1994), pp. 380

Similarity-Based Estimation of Word Cooccurrence Probabilities

Ido Dagan

Fernando Pereira

Lillian Lee

32nd Annual Meeting of the Association for Computational Linguistics, Morgan Kaufmann, San Francisco, California (1994), pp. 272-278

Frequencies vs Biases: Machine Learning Problems in Natural Language Processing (Extended Abstract)

Fernando C. N. Pereira

COLT (1994), pp. 12

Weighted Rational Transductions and their Application to Human Language Processing

Fernando Pereira

Michael Riley

Richard W. Sproat

Human Language Technology Workshop, Morgan Kaufmann, San Francisco, California (1994), pp. 262-267

Distributional Clustering of English Words

Fernando Pereira

Naftali Z. Tishby

Lillian Lee

30th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Columbus, Ohio (1993), pp. 183-190

Introduction to Special Issue on Natural Language Processing

Fernando Pereira

Barbara J. Grosz

Artificial Intelligence, vol. 63 (1993), pp. 1-15

Quantifier Scoping

Douglas B. Moran

Fernando Pereira

The Core Language Engine, MIT Press, Cambridge, Massachusetts (1992), pp. 149-172

A spoken language translator for restricted-domain context-free languages

David B. Roe

Pedro J. Moreno

Richard W. Sproat

Fernando Pereira

Michael Riley

Alejandro Macarrón

Speech Communication, vol. 11 (1992), pp. 311-319

Efficient Grammar Processing for a Spoken Language Translation System

David B. Roe

Pedro J. Moreno

Richard W. Sproat

Fernando Pereira

Michael Riley

Alejandro Macarrón

Proceedings of ICASSP, IEEE, San Francisco, California (1992), pp. 213-216

Empirical Properties of Finite State Approximations for Phrase Structure Grammars

Fernando Pereira

David B. Roe

Proceedings of the International Conference on Spoken Language Processing, Banff, Alberta (1992), pp. 261-264

Inside-Outside Reestimation from Partially Bracketed Corpora

Fernando Pereira

Yves Schabes

30th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Newark, Delaware (1992), pp. 128-135

Toward a Spoken Language Translator for Restricted-Domain Context-Free Languages

David B. Roe

Fernando Pereira

Richard W. Sproat

Michael Riley

Pedro J. Moreno

Alejandro Macarrón

EUROSPEECH 91 -- 2nd European Conference on Speech Communication and Technology, Genova, Italy (1991), pp. 1063-1066

Ellipsis and Higher-Order Unification

Mary Dalrymple

Stuart M. Shieber

Fernando Pereira

Linguistics and Philosophy, vol. 14 (1991), pp. 399-452

Deductive Interpretation

Fernando Pereira

Natural Language and Speech, Springer-Verlag (1991), pp. 116-133

Incremental Interpretation

Fernando Pereira

Martha E. Pollack

Artificial Intelligence, vol. 50 (1991), pp. 37-82

Finite-State Approximation of Phrase-Structure Grammars

Fernando Pereira

Rebecca N. Wright

29th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Berkeley, California (1991), pp. 246-255

Semantic Interpretation as Higher-Order Deduction

Fernando Pereira

Logics in AI: European Workshop JELIA'90, Springer-Verlag, Berlin, Germany, Amsterdam, Holland (1991), pp. 78-96

Finite-State Approximations of Grammars

Fernando Pereira

Proceedings of the Second Speech and Natural Language Workshop (1990), pp. 20-25

Categorial Semantics and Scoping

Fernando Pereira

Computational Linguistics, vol. 16 (1990), pp. 1-10

Prolog and Natural-Language Analysis: into the Third Decade

Fernando Pereira

Logic Programming: Proceedings of the 1990 North American Conference, MIT Press, Cambridge, Massachusetts, Austin, Texas, pp. 813-832

Semantic-Head-Driven Generation

Stuart M. Shieber

Gertjan van Noord

Fernando Pereira

Robert C. Moore

Computational Linguistics, vol. 16 (1990), pp. 30-42

A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms

Stuart M. Shieber

Gertjan van Noord

Robert C. Moore

Fernando Pereira

27th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, University of British Columbia, Vancouver, Canada (1989), pp. 7-17

A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms

Stuart M. Shieber

Gertjan van Noord

Robert C. Moore

Fernando C. N. Pereira

ACL (1989), pp. 7-17

A Calculus for Semantic Composition and Scoping

Fernando Pereira

27th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, University of British Columbia, Vancouver, Canada (1989), pp. 152-160

Synergistic Use of Direct Manipulation and Natural Language

Phil R. Cohen

Mary Dalrymple

Douglas B. Moran

Fernando Pereira

J. W. Sullivan

R. A. Gargan, Jr.

J. L. Schlossberg

S. W. Tyler

Proceedings of CHI'89, Austin, Texas (1989)

Integrating Speech and Natural Language Processing

Robert C. Moore

Fernando Pereira

Hy Murveit

First Speech and Natural Language Workshop (1989), pp. 243-247

An Integrated Framework for Semantic and Pragmatic Interpretation

Martha E. Pollack

Fernando Pereira

26th Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Buffalo, New York (1988), pp. 75-86

Grammars and Logics of Partial Information

Fernando Pereira

Logic Programming: Proceedings of the Fourth International Conference, MIT Press, Cambridge Massachusetts, Melbourne, Australia (1987), pp. 989-1013

Prolog and Natural-Language Analysis

Fernando Pereira

Stuart M. Shieber

Center for the Study of Language and Information, Stanford, California (1987)

TEAM: An Experiment in the Design of Transportable Natural Language Interfaces

Barbara J. Grosz

Douglas E. Appelt

Paul A. Martin

Fernando Pereira

Artificial Intelligence, vol. 32 (1987), pp. 173-243

Can Drawing Be Liberated from the von Neumann Style

Fernando Pereira

Logic Programming and Its Applications, Ablex, Norwood, New Jersey (1986), pp. 175-187

TEAM: An Experimental Transportable Natural-Language Interface

Paul A. Martin

Douglas E. Appelt

Barbara J. Grosz

Fernando C. N. Pereira

FJCC (1986), pp. 260-267

A Sheaf-Theoretic Model of Concurrency

Luis F. Monteiro

Fernando Pereira

Symposium on Logic and Computer Science, IEEE Computer Society Press, Cambridge, Massachusetts (1986), pp. 66-76

A New Characterization of Attachment Preferences

Fernando Pereira

Natural Language Parsing---Psychological, Computational and Theoretical perspectives, Cambridge University Press, Cambridge, England (1985), pp. 307-319

A Structure-Sharing Representation for Unification-Based Grammar Formalisms

Fernando Pereira

23rd Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Chicago, Illinois (1985), pp. 137-144

An Overview of Automated Reasoning and Related Fields

L. Wos

Fernando Pereira

Robert Hong

Robert S. Boyer

J Strother Moore

W. W. Bledsoe

L. J. Henschen

Bruce G. Buchanan

Graham Wrightson

Cordell Green

Journal of Automated Reasoning, vol. 1 (1985), pp. 5-48

The Semantics of Grammar Formalisms Seen as Computer Languages

Fernando Pereira

Stuart M. Shieber

Proceedings of COLING 84, Association for Computational Linguistics, Stanford, California (1984), pp. 123-129

Can Drawing Be Liberated From the Von Neumann Style?

Fernando C. N. Pereira

Databases for Business and Office Applications (1983), pp. 184-190

A Fact Dependency System for the Logic Programmer

Peter S. G. Swinson

Fernando Pereira

Aart Bijl

Computer-Aided Design, vol. 14 (1983), pp. 235-243

Parsing as Deduction

Fernando Pereira

David H. D. Warren

21st Annual Meeting of the Association for Computational Linguistics, Association for Computational Linguistics, Cambridge, Massachusetts (1983), pp. 137-144

Transportability and Generality in a Natural-Language Interface System

Paul A. Martin

Douglas E. Appelt

Fernando Pereira

Proceedings of the Eight International Joint Conference on Artificial Intelligence (1983), pp. 573-581

An Efficient Easily Adaptable System for Interpreting Natural Language Queries

David H. D. Warren

Fernando Pereira

Computational Linguistics, vol. 8 (1982), pp. 110-122

Extraposition Grammars

Fernando Pereira

Computational Linguistics, vol. 7 (1981), pp. 243-256

Definite Clause Grammars for Language Analysis---a Survey of the Formalism and a Comparison with Augmented Transition Networks

Fernando Pereira

David H. D. Warren

Artificial Intelligence, vol. 13 (1980), pp. 231-278

Prolog -- The Language and its Implementation Compared with Lisp

David H. D. Warren

Luis M. Pereira

Fernando Pereira

Proceedings of the Symposium on Artificial Intelligence and Programming Languages, Rochester, New York (1977), pp. 109-115

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Fernando Pereira

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Fernando Pereira

Research Areas

Filter by:

Year

Team

Research Area

Join us

AI/ML Foundations  & Capabilities