Ollama v0.7
Run leading vision models locally with the new engine
About
Ollama is a platform that enables users to run large language models locally. This blog post announces a significant update: Ollama's new engine now supports multimodal models, starting with vision capabilities. It introduces support for models like Meta Llama 4, Google Gemma 3, and Qwen 2.5 VL, demonstrating their use for general multimodal understanding, reasoning, and document scanning. The new engine improves model modularity, accuracy in processing large images, and memory management through features like image caching and KV cache optimizations, laying the foundation for future modalities like speech and video generation.
Categories & Tags
Color Palette
Primary Blue
#0070f3
Dark Grey (Text)
#1a1a1a
White (Background)
#ffffff
Light Grey (Code Block Background)
#f6f8fa
Typography
Sans-serif
Headings, Body Text, Navigation
Monospace
Code Blocks
Design Review
Similar Products
Clear for Slack
Clear messages get answered quicker
Griply 2026
Achieve your goals with a goal-oriented task manager
HappyMail
We made email simple again
Blober.io
The easiest way to transfer files between cloud providers.
Supaguard
Scan, Detect & Protect Your Supabase Data
Timelines Time Tracking 4
Track your time to achieve your New Year resolutions.
SoftReveal — Reveal less. Engage more.
Hide Content, Reveal on Click
CalPal
The notebook calculator that thinks for you (now with AI).
Reword
Rewrite messages without leaving your workflow
Radial
Your shortcuts, one gesture away
MoovAI
Launch viral AI ads & pro social content in minutes
Resell AI
Reselling workflow with market-based price suggestions