[논문리뷰] NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers 2023년 08월 12일 NaturalSpeech 2 논문 리뷰 Tags: Audio and Speech Processing, Diffusion, Microsoft, Text-to-Speech, Voice Conversion
[논문리뷰] NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality 2023년 08월 11일 NaturalSpeech 논문 리뷰 Tags: Audio and Speech Processing, Microsoft, Text-to-Speech, Transformer
[논문리뷰] YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone 2023년 08월 10일 YourTTS 논문 리뷰 Tags: Audio and Speech Processing, Text-to-Speech, Transformer, Voice Conversion
[논문리뷰] HiddenSinger: High-Quality Singing Voice Synthesis via Neural Audio Codec and Latent Diffusion Models 2023년 08월 09일 HiddenSinger 논문 리뷰 Tags: Audio and Speech Processing, Contrastive Learning, Diffusion, Singing Voice Synthesis
[논문리뷰] Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis 2023년 08월 08일 Diff-TTSG 논문 리뷰 (ISCA SSW 2023) Tags: Audio and Speech Processing, Diffusion, Text-to-Speech-and-Gesture