zi
menglan
0 followers · 1 following
AI & ML interests
None yet
Organizations
None yet
menglan's activity
New activity in
AIDC-AI/Ovis2-34B
10 days ago
what about the image/video captioning ability compared to other methods, like internvl-2.5, sharegpt4v and so on?
#3 opened 10 days ago by
menglan
How to only use the text and visual embedding?
1
#2 opened 13 days ago by
Labmem009
New activity in
google/siglip2-base-patch16-224
14 days ago
Error while loading processor: TypeError: expected str, bytes or os.PathLike object, not NoneType
8
#2 opened 17 days ago by
armamut
New activity in
OpenGVLab/InternVideo2_5_Chat_8B
28 days ago
TypeError: chat() got an unexpected keyword argument 'video_path'
10
#2 opened about 2 months ago by
skyernest
New activity in
DAMO-NLP-SG/VideoLLaMA3-7B-Image
28 days ago
something wrong in calling ffmpeg to extract frames
#4 opened 28 days ago by
menglan
New activity in
OpenGVLab/InternVideo2_5_Chat_8B
about 1 month ago
Error on get_vision_tower
2
#3 opened about 1 month ago by
Clip-AI
New activity in
DAMO-NLP-SG/VideoLLaMA3-7B-Image
about 1 month ago
find a bug in load_images func
#3 opened about 1 month ago by
menglan
New activity in
mPLUG/mPLUG-Owl3-7B-241101
about 2 months ago
does it support multi-images and chinese prompt as input
#2 opened about 2 months ago by
menglan
liked a dataset 7 months ago
tomg-group-umd/pixelprose • Updated Jun 23, 2024 • 15.6M • 522 • 144