Kevin Wilson

Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement

Zhong-Qiu Wang

Hakan Erdogan

Scott Wisdom

Kevin Wilson

Desh Raj

Shinji Watanabe

Zhuo Chen

John Hershey

IEEE SLT 2021

Unsupervised Speech Separation Using Mixtures of Mixtures

Scott Wisdom

Efthymios Tzinis

Hakan Erdogan

Ron J. Weiss

Kevin Wilson

John R. Hershey

ICML 2020 Workshop on Self-Supervision for Audio and Speech

Unsupervised Sound Separation Using Mixture Invariant Training

Scott Wisdom

Efthymios Tzinis

Hakan Erdogan

Ron J. Weiss

Kevin Wilson

John R. Hershey

NeurIPS (2020)

VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition

Quan Wang

Ignacio Lopez Moreno

Mert Saglam

Kevin William Wilson

Alan Chiao

Renjie Liu

Yanzhang (Ryan) He

Wei Li

Jason Pelecanos

Marily Nika

Alex Gruenstein

Interspeech 2020 (2020) (to appear)

Differentiable Consistency Constraints for Improved Deep Speech Enhancement

Scott Wisdom

John R. Hershey

Kevin Wilson

Jeremy Thorpe

Michael Chinen

Brian Patton

Rif A. Saurous

IEEE International Conference on Acoustics, Speech, and Signal Processing (2019)

Universal Sound Separation

Ilya Kavalerov

Scott Wisdom

Hakan Erdogan

Brian Patton

Kevin Wilson

Jonathan Le Roux

John R. Hershey

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (2019)

VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking

Hannah Raphaelle Muckenhirn

Ignacio Lopez Moreno

John Hershey

Kevin Wilson

Prashant Sridhar

Quan Wang

Rif A. Saurous

Ron Weiss

Ye Jia

Zelin Wu

ICASSP 2019 (2018)

Looking to Listen at the Cocktail Party: Audio-visual Speech Separation

Ariel Ephrat

Inbar Mosseri

Oran Lang

Tali Dekel

Kevin Wilson

Bill Freeman

Miki Rubinstein

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

Exploring Tradeoffs in Models for Low-Latency Speech Enhancement

Brian Patton

Jan Skoglund

Jeremy Thorpe

John Hershey

Kevin Wilson

Michael Chinen

Richard F. Lyon

Rif A. Saurous

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement (2018)

AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies

Sourish Chaudhuri

Joseph Roth

Dan Ellis

Andrew C. Gallagher

Liat Kaver

Radhika Marvin

Caroline Pantofaru

Nathan Christopher Reale

Loretta Guarino Reid

Kevin Wilson

Zhonghua Xi

Proceedings of Interspeech, 2018

Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation

Ariel Ephrat

Inbar Mosseri

Oran Lang

Tali Dekel

Kevin Wilson

Avinatan Hassidim

William T. Freeman

Michael Rubinstein

ACM Transactions on Graphics (Proc. SIGGRAPH), vol. 37 (2018)

CNN Architectures for Large-Scale Audio Classification

Shawn Hershey

Sourish Chaudhuri

Daniel P. W. Ellis

Jort F. Gemmeke

Aren Jansen

Channing Moore

Manoj Plakal

Devin Platt

Rif A. Saurous

Bryan Seybold

Malcolm Slaney

Ron Weiss

Kevin Wilson

International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE (2017)

Acoustic Modeling for Google Home

Bo Li

Tara Sainath

Arun Narayanan

Joe Caroselli

Michiel Bacchiani

Ananya Misra

Izhak Shafran

Hasim Sak

Golan Pundak

Kean Chin

Khe Chai Sim

Ron J. Weiss

Kevin Wilson

Ehsan Variani

Chanwoo Kim

Olivier Siohan

Mitchel Weintraub

Erik McDermott

Rick Rose

Matt Shannon

INTERSPEECH 2017 (2017)

Multichannel Signal Processing with Deep Neural Networks for Automatic Speech Recognition

Tara Sainath

Ron J. Weiss

Kevin Wilson

Bo Li

Arun Narayanan

Ehsan Variani

Michiel Bacchiani

Izhak Shafran

Andrew Senior

Kean Chin

Ananya Misra

Chanwoo Kim

IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25 (2017), pp. 965-979

Raw Multichannel Processing Using Deep Neural Networks

Tara N. Sainath

Ron J. Weiss

Kevin W. Wilson

Arun Narayanan

Michiel Bacchiani

Bo Li

Ehsan Variani

Izhak Shafran

Andrew Senior

Kean Chin

Ananya Misra

Chanwoo Kim

New Era for Robust Speech Recognition: Exploiting Deep Learning, Springer (2017)

Factored Spatial and Spectral Multichannel Raw Waveform CLDNNs

Tara N. Sainath

Ron J. Weiss

Kevin W. Wilson

Arun Narayanan

Michiel Bacchiani

International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE (2016)

Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition

Bo Li

Tara N. Sainath

Ron J. Weiss

Kevin W. Wilson

Michiel Bacchiani

Proc. Interspeech, ISCA (2016)

Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction

Tara N. Sainath

Arun Narayanan

Ron J. Weiss

Ehsan Variani

Kevin W. Wilson

Michiel Bacchiani

Izhak Shafran

Proc. Interspeech, ISCA (2016)

AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech

Brian Patton

Yannis Agiomyrgiannakis

Michael Terry

Kevin Wilson

Rif A. Saurous

D. Sculley

NIPS 2016 End-to-end Learning for Speech and Audio Processing Workshop (to appear)

Speaker Location and Microphone Spacing Invariant Acoustic Modeling from Raw Multichannel Waveforms

Tara N. Sainath

Ron J. Weiss

Kevin Wilson

Arun Narayanan

Michiel Bacchiani

Andrew Senior

ASRU (2015)

Learning the Speech Front-end with Raw Waveform CLDNNs

Tara Sainath

Ron J. Weiss

Kevin Wilson

Andrew W. Senior

Oriol Vinyals

Interspeech (2015)

Speech Acoustic Modeling from Raw Multichannel Waveforms

Yedid Hoshen

Ron Weiss

Kevin W Wilson

International Conference on Acoustics, Speech, and Signal Processing, IEEE (2015)
