[논문리뷰] AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head 2023년 05월 04일 AudioGPT 논문 리뷰 Tags: AI, Audio and Speech Processing, GPT, Talking Head
[논문리뷰] DINOv2: Learning Robust Visual Features without Supervision 2023년 05월 03일 Video LDM 논문 리뷰 Tags: AI, Computer Vision, Distillation, Meta, Self-Supervised Learning, ViT
[논문리뷰] Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models (Video LDM) 2023년 05월 02일 Video LDM 논문 리뷰 (CVPR 2023) Tags: AI, Computer Vision, CVPR, Diffusion, NVIDIA, Video Generation
[논문리뷰] Text2LIVE: Text-Driven Layered Image and Video Editing 2023년 05월 01일 Text2LIVE 논문 리뷰 (ECCV 2022) Tags: AI, Computer Vision, ECCV, Image Editing, NVIDIA
[논문리뷰] Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers (VALL-E) 2023년 04월 30일 VALL-E 논문 리뷰 Tags: AI, Audio and Speech Processing, Microsoft, Text-to-Speech, Transformer