Ollama 3 is out, and it’s the biggest update since the project started. You can now serve multiple models concurrently from a single process, with smart memory management.
Ollama 3 is out, and it’s the biggest update since the project started. You can now serve multiple models concurrently from a single process, with smart memory management.