[논문리뷰] Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
Vid2Seq 논문 리뷰 (CVPR 2023)
Vid2Seq 논문 리뷰 (CVPR 2023)
Painter 논문 리뷰 (CVPR 2023)
Promptbreeder 논문 리뷰
Focal-Stable-DINO 논문 리뷰
FocalNet 논문 리뷰 (NeurIPS 2022)