-电子书大全-/多模态大模型论文（300份）(1)/5个多模态大模型研究方向/视觉理解

本店会员98全部书籍免费看！！！

主页/多模态大模型论文（300份）(1)/5个多模态大模型研究方向/视觉理解/

Cream Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models.pdf
DocFormerv2 Local Features for Document Understanding.pdf
LLaVAR Enhanced Visual Instruction Tuning for Text-Rich Image Understanding.pdf
M3IT A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.pdf
mPLUG-DocOwl Modularized Multimodal Large Language Model for Document Understanding.pdf
Multimodal Transformer for Multimodal Machine Translation.pdf
On the Performance of Multimodal Language Models.pdf
PDFVQA A New Dataset for Real-World VQA on PDF Documents.pdf
TouchStone Evaluating Vision-Language Models by Language Models.pdf
UReader Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.pdf

Copyright © All rights reserved.

信息加载中,请等待...