Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
1.05k
Follow
Microsoft
10.1k
Automatic Speech Recognition
Transformers
Safetensors
24 languages
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
Community
37
Train
Use this model
Update README.md
#2
by
fasdfgaer
- opened
11 days ago
base:
refs/heads/main
←
from:
refs/pr/2
Discussion
Files changed
+1
-1
fasdfgaer
11 days ago
•
edited 11 days ago
Corrected the typo "Audio Uniderstanding" to "Audio Understanding".
See translation
❤️
1
1
+
Update README.md
faf353bc
nguyenbh
changed pull request status to
merged
10 days ago
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment