Automatic Speech Recognition
Transformers
Safetensors
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
Eval Results
Instructions to use microsoft/Phi-4-multimodal-instruct with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/Phi-4-multimodal-instruct with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="microsoft/Phi-4-multimodal-instruct", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-4-multimodal-instruct", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Ctrl+K
- examples
- figures
- speech-lora
- vision-lora
- 1.61 kB
- 444 Bytes
- 1.14 kB
- 65.4 kB
- 2.66 kB
- 1.24 kB
- 249 Bytes
- 4.63 kB
- 11 kB
- 4.62 kB
- 190 Bytes
- 2.42 MB
- 5 GB xet
- 4.95 GB xet
- 1.2 GB xet
- 240 kB
- 116 kB
- 5.3 MB xet
- 482 Bytes
- 32.8 kB
- 121 Bytes
- 16.7 kB
- 19.6 kB
- 10.5 kB
- 473 Bytes
- 111 kB
- 15.5 MB xet
- 3.25 kB
- 78.2 kB
- 3.91 MB