[논문리뷰] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
ViP-LLaVA 논문 리뷰 (CVPR 2024)
ViP-LLaVA 논문 리뷰 (CVPR 2024)
LLaVA-1.5 논문 리뷰 (CVPR 2024)
Grounding DINO 논문 리뷰 (ECCV 2024)
LLaVA 논문 리뷰 (NeurIPS 2023 Oral)
ConsiStory 논문 리뷰 (SIGGRAPH 2024)