[논문리뷰] LISA: Reasoning Segmentation via Large Language Model 2024년 06월 14일 LISA 논문 리뷰 (CVPR 2024) Tags: Computer Vision, CVPR, Image Segmentation, Large Multimodal Model, NLP
[논문리뷰] DemoFusion: Democratising High-Resolution Image Generation With No $$$ 2024년 06월 12일 DemoFusion 논문 리뷰 (CVPR 2024) Tags: Computer Vision, CVPR, Diffusion, Image Generation
[논문리뷰] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts 2024년 06월 10일 ViP-LLaVA 논문 리뷰 (CVPR 2024) Tags: Computer Vision, CVPR, Large Multimodal Model
[논문리뷰] Improved Baselines with Visual Instruction Tuning 2024년 06월 08일 LLaVA-1.5 논문 리뷰 (CVPR 2024) Tags: Computer Vision, CVPR, Large Multimodal Model, Microsoft, NLP
[논문리뷰] Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection 2024년 06월 06일 Grounding DINO 논문 리뷰 (ECCV 2024) Tags: Computer Vision, ECCV, Microsoft, Object Detection, Transformer