10 12 13

Wenqi Zhang

zwq2018

zwq2018

AI & ML interests

LLM, Multimodal, Robotics

Recent Activity

updated a dataset 1 day ago

DAMO-NLP-SG/multimodal_textbook

upvoted a collection 2 days ago

Jan 10 Releases 🌨️

authored a paper 6 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

View all activity

Organizations

zwq2018's activity

commented 4 papers 10 days ago

commented 2 papers 3 months ago

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Paper • 2410.18798 • Published Oct 24, 2024 • 20 •

Distill Visual Chart Reasoning Ability from LLMs to MLLMs

Paper • 2410.18798 • Published Oct 24, 2024 • 20 •

commented a paper 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136 •

New activity in HuggingFaceM4/idefics2-8b-base 5 months ago

Some issues regarding training

#9 opened 5 months ago by

zwq2018

I would like to ask what the specific design of the few-shot test of the base model is

#6 opened 6 months ago by

zwq2018

New activity in zwq2018/Multi-modal-Self-instruct 6 months ago

License?

#2 opened 6 months ago by

soymono

commented 3 papers 6 months ago

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 43 •

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 43 •

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 83 •

New activity in huggingface/HuggingDiscussions 6 months ago

[FEEDBACK] Daily Papers

106

#32 opened 7 months ago by

kramp

New activity in zwq2018/Data-Copilot over 1 year ago

cannot unpack non-iterable APIError object

#2 opened over 1 year ago by

ljfanson