Reading Recommendation
Pages: 1 2
Threads
- A Few Useful Things (2 Replies)
- Understanding the Capabilities, Limitations, and Societal Impact of Large Language (1 Reply)
- Managing Bias in AI (1 Reply)
- SC-GlowTTS: An Efficient Zero-Shot Multi-Speaker Text-To-Speech Model (1 Reply)
- Training Your Own Voice Font Using Flowtron (1 Reply)
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech (1 Reply)
- WaveGlow: A Flow-Based Generative Network for Speech Synthesis (1 Reply)
- FastPitch: Parallel Text-to-Speech with Pitch Prediction (1 Reply)
- Flowtron: An Autoregressive Flow-Based Generative Network for Text-to-Speech Synthe (1 Reply)
- Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions (1 Reply)
- Fast and Easy Crowdsourced Perceptual Audio Evaluation (1 Reply)
- The AI Index 2021 Annual Report (1 Reply)
- Conformer: Convolution-Augmented Transformer for Speech Recognition (1 Reply)
- Jasper: An End-to-End Convolutional Neural Acoustic Model (1 Reply)
- Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism (1 Reply)
- WER We Are and WER We Think We Are (1 Reply)
- GPT-NeoX: Large Scale Autoregressive Language Modeling in PyTorch, version 0.0.1 (1 Reply)
- Google’s Speech Recognition Technology Now Has a 4.9% Word Error Rate (1 Reply)
- QuartzNet: Deep Automatic Speech Recognition with 1D Time-Channel Separable Conv (1 Reply)
- Mesh Transformer JAX (1 Reply)
Pages: 1 2