A much improved model scheduling system is now on Ollama!
- 🫶 Significantly reduced crashes due to out of memory issues
- 📍 Maximizing GPU utilization
- 🚵 Multi-GPU performance
- 🌎 Accurate reporting of memory usage
Learn more & try the latest Ollama! 👇👇👇