本店会员98全部书籍免费看!!!
主页
/
多模态大模型论文(300份)(1)
/
5个多模态大模型研究方向
/
视觉理解
/
Cream Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models.pdf
DocFormerv2 Local Features for Document Understanding.pdf
LLaVAR Enhanced Visual Instruction Tuning for Text-Rich Image Understanding.pdf
M3IT A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.pdf
mPLUG-DocOwl Modularized Multimodal Large Language Model for Document Understanding.pdf
Multimodal Transformer for Multimodal Machine Translation.pdf
On the Performance of Multimodal Language Models.pdf
PDFVQA A New Dataset for Real-World VQA on PDF Documents.pdf
TouchStone Evaluating Vision-Language Models by Language Models.pdf
UReader Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.pdf
Copyright © All rights reserved.
信息加载中,请等待...