Reverse-focus fine-grained multimodal semantic alignment for video captioning
Accepted Paper
No. 7, 2025
doi:10.19734/j.issn.1001-3695.2024.11.0492