Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
1.06k
Follow
Microsoft
10.1k
Automatic Speech Recognition
Transformers
Safetensors
24 languages
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
Community
37
Train
Use this model
17df1f5
Phi-4-multimodal-instruct
8 contributors
History:
4 commits
nguyenbh
Update examples
17df1f5
13 days ago
examples
Add examples
13 days ago
figures
Added model files
14 days ago
speech-lora
Added model files
14 days ago
vision-lora
Added model files
14 days ago
.gitattributes
Safe
1.57 kB
Added model files
14 days ago
CODE_OF_CONDUCT.md
Safe
444 Bytes
Added model files
14 days ago
LICENSE
Safe
1.14 kB
Added model files
14 days ago
README.md
52.7 kB
Update examples
13 days ago
SECURITY.md
Safe
2.66 kB
Added model files
14 days ago
SUPPORT.md
Safe
1.24 kB
Added model files
14 days ago
added_tokens.json
Safe
249 Bytes
Added model files
14 days ago
config.json
Safe
4.63 kB
Added model files
14 days ago
configuration_phi4mm.py
11 kB
Added model files
14 days ago
generation_config.json
Safe
190 Bytes
Added model files
14 days ago
merges.txt
Safe
2.42 MB
Added model files
14 days ago
model-00001-of-00003.safetensors
Safe
5 GB
LFS
Added model files
14 days ago
model-00002-of-00003.safetensors
Safe
4.95 GB
LFS
Added model files
14 days ago
model-00003-of-00003.safetensors
Safe
1.2 GB
LFS
Added model files
14 days ago
model.safetensors.index.json
Safe
240 kB
Added model files
14 days ago
modeling_phi4mm.py
116 kB
Added model files
14 days ago
preprocessor_config.json
Safe
482 Bytes
Added model files
14 days ago
processing_phi4mm.py
32.8 kB
Added model files
14 days ago
processor_config.json
Safe
121 Bytes
Added model files
14 days ago
sample_finetune_speech.py
16.7 kB
Added model files
14 days ago
sample_finetune_vision.py
19.6 kB
Added model files
14 days ago
sample_inference_phi4mm.py
10.5 kB
Added model files
14 days ago
special_tokens_map.json
Safe
473 Bytes
Added model files
14 days ago
speech_conformer_encoder.py
111 kB
Added model files
14 days ago
tokenizer.json
Safe
15.5 MB
LFS
Added model files
14 days ago
tokenizer_config.json
Safe
3.25 kB
Added model files
14 days ago
vision_siglip_navit.py
78.2 kB
Added model files
14 days ago
vocab.json
Safe
3.91 MB
Added model files
14 days ago