RJ Skerry-Ryan

Research Areas

Authored Publications

Google Publications

Other Publications

Speaker Generation

Daisy Stanton

David Teh-Hwa Kao

Eric Battenberg

Matt Shannon

RJ Skerry-Ryan

Soroosh Mariooryad

Tom Bagby

arXiv.org (2021)

Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis

Ron J. Weiss

RJ Skerry-Ryan

Eric Battenberg

Soroosh Mariooryad

Diederik P. Kingma

ICASSP (2021)

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Alignments

Isaac Elias

Heiga Zen

Jonathan Shen

Yu Zhang

Ye Jia

RJ Skerry-Ryan

Yonghui Wu

(2021)

Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Eric Battenberg

RJ Skerry-Ryan

Soroosh Mariooryad

Daisy Stanton

David Kao

Matt Shannon

Tom Bagby

arXiv (2019)

Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Eric Battenberg

Soroosh Mariooryad

Daisy Stanton

RJ Skerry-Ryan

Matt Shannon

David Kao

Tom Bagby

arXiv (2019)

Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning

Yu Zhang

Ron J. Weiss

Heiga Zen

Yonghui Wu

Zhifeng Chen

RJ Skerry-Ryan

Ye Jia

Andrew Rosenberg

Bhuvana Ramabhadran

Interspeech (2019)

Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis

Yu-An Chung

Yuxuan Wang

Wei-Ning Hsu

Yu Zhang

RJ Skerry-Ryan

ICASSP 2019

Semi-Supervised Generative Modeling for Controllable Speech Synthesis

Raza Habib

Soroosh Mariooryad

Matt Shannon

Eric Battenberg

RJ Skerry-Ryan

Daisy Stanton

David Kao

Tom Bagby

arXiv (2019)

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron

RJ Skerry-Ryan

Eric Battenberg

Ying Xiao

Yuxuan Wang

Daisy Stanton

Joel Shor

Ron J. Weiss

Rob Clark

Rif A. Saurous

International Conference on Machine Learning (2018)

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Yuxuan Wang

Daisy Stanton

Yu Zhang

RJ Skerry-Ryan

Eric Battenberg

Joel Shor

Ying Xiao

Fei Ren

Ye Jia

Rif A. Saurous

ICML (2018)

Natural TTS Synthesis By Conditioning WaveNet On Mel Spectrogram Predictions

Jonathan Shen

Ruoming Pang

Ron J. Weiss

Mike Schuster

Navdeep Jaitly

Zongheng Yang

Zhifeng Chen

Yu Zhang

Yuxuan Wang

RJ Skerry-Ryan

Rif A. Saurous

Yannis Agiomyrgiannakis

Yonghui Wu

ICASSP (2018)

Complex Evolution Recurrent Neural Networks (ceRNNs)

Izhak Shafran

RJ Skerry-Ryan

Tom Bagby

IEEE ICASSP 2018

Tacotron: Towards End-to-End Speech Synthesis

Yuxuan Wang

RJ Skerry-Ryan

Daisy Stanton

Yonghui Wu

Ron J. Weiss

Navdeep Jaitly

Zongheng Yang

Ying Xiao

Zhifeng Chen

Samy Bengio

Quoc Le

Yannis Agiomyrgiannakis

Rob Clark

Rif A. Saurous

Interspeech (2017)

Uncovering Latent Style Factors for Expressive Speech Synthesis

Yuxuan Wang

RJ Skerry-Ryan

Ying Xiao

Daisy Stanton

Joel Shor

Eric Battenberg

Rob Clark

Rif A. Saurous

NIPS Workshop on Machine Learning for Audio Signal Processing (ML4Audio) (2017) (to appear)

No Results Found

Search on Google Scholar

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations  & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

RJ Skerry-Ryan

Research Areas

Join us

Defining the technology of today and tomorrow.

Philosophy

People

Teams

AI/ML Foundations & Capabilities

Algorithms & Optimization

Computing Paradigms

Responsible Human-Centric Technology

Science & Societal Impact

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

RJ Skerry-Ryan

Research Areas

Filter by:

Year

Research Area

Team

Join us

AI/ML Foundations  & Capabilities