[Paper Review] Visual Instruction Tuning. June 4, 2024. LLaVA paper review (NeurIPS 2023 Oral). Tags: Computer Vision, Large Multimodal Model, Microsoft, NeurIPS, NLP
[Paper Review] ConsiStory: Training-Free Consistent Text-to-Image Generation. June 2, 2024. ConsiStory paper review (SIGGRAPH 2024). Tags: Computer Vision, Diffusion, SIGGRAPH, Text-to-Image
[Paper Review] VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models. May 31, 2024. VideoCrafter2 paper review (CVPR 2024). Tags: Computer Vision, CVPR, Diffusion, Text-to-Video
[Paper Review] Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation. May 29, 2024. DragNoise paper review (CVPR 2024). Tags: Computer Vision, CVPR, Diffusion, Image Editing
[Paper Review] Language-Image Models with 3D Understanding. May 27, 2024. Cube-LLM paper review (ICLR 2025). Tags: Computer Vision, ICLR, Large Multimodal Model, LLM, NLP, NVIDIA