nekomeowww.OllamaOperator
Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
$ winget install --id nekomeowww.OllamaOperator --exact --version 0.10.10
Run in Command Prompt, PowerShell, or Windows Terminal. Prompts for any agreements.
While Ollama is a powerful tool for running large language models locally, and its CLI feels much like the Docker CLI, that experience cannot yet be replicated on Kubernetes, especially when running multiple models on the same cluster with many resources and configurations to manage.
That's where the Ollama Operator kicks in:
- Install the operator on your Kubernetes cluster
- Apply the needed CRDs
- Create your models
- Wait for the models to be fetched and loaded, that's it!
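The "create your models" step above might look like the following minimal sketch. The `ollama.ayaka.io/v1` API group, the `Model` kind, and the `spec.image` field are assumptions recalled from the operator's README, and the `model-phi.yaml` file name and `phi` model name are purely illustrative. Check the CRDs actually installed on your cluster before relying on any of them.

```shell
#!/bin/sh
# Write a hypothetical Model manifest to disk.
# ASSUMPTION: apiVersion, kind, and spec.image mirror the operator's CRD;
# verify against the CRDs installed on your cluster.
cat > model-phi.yaml <<'EOF'
apiVersion: ollama.ayaka.io/v1
kind: Model
metadata:
  name: phi
spec:
  image: phi
EOF

# Report what was written; applying it to a cluster is a separate step.
echo "Wrote model-phi.yaml"
```

Once the operator and its CRDs are installed, `kubectl apply -f model-phi.yaml` would submit the resource, after which the operator is expected to fetch and load the model.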
Thanks to the great work of llama.cpp, there is no need to worry about Python environments or CUDA drivers.
The journey to large language models, AIGC, localized agents, 🦜🔗 LangChain and more is just a few steps away!