[논문리뷰] Masked Diffusion Transformer is a Strong Image Synthesizer (MDT) 2023년 08월 06일 MDT 논문 리뷰 (ICCV 2023) Tags: Computer Vision, Diffusion, DiT, ICCV, Image Generation
[논문리뷰] FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis 2023년 08월 05일 FastDiff 논문 리뷰 (IJCAI 2022) Tags: Audio and Speech Processing, Diffusion, Text-to-Speech
[논문리뷰] Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias 2023년 08월 04일 Mega-TTS 논문 리뷰 (INTERSPEECH 2023) Tags: Audio and Speech Processing, INTERSPEECH, Text-to-Speech, Transformer
[논문리뷰] U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech 2023년 08월 03일 U-DiT TTS 논문 리뷰 (INTERSPEECH 2023) Tags: Audio and Speech Processing, Diffusion, INTERSPEECH, Text-to-Speech, ViT
[논문리뷰] FastSpeech 2: Fast and High-Quality End-to-End Text to Speech 2023년 08월 02일 FastSpeech 2 논문 리뷰 (ICLR 2021) Tags: Audio and Speech Processing, Distillation, ICLR, Microsoft, Text-to-Speech, Transformer