Faster Ollama alternative

RandomlyRight@sh.itjust.works · 1 day ago

RandomlyRight@sh.itjust.works · 8 hours ago

I’ve read about this method in the GitHub issues, but it seemed impractical to me to keep separate models just to change the context size. That was the point at which I started looking for alternatives.

theunknownmuncher@lemmy.world · 8 hours ago

If it bothers you, you can overwrite the existing model by reusing its name instead of creating one under a new name. Either way, the LLM model file itself is not duplicated.
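
For illustration, here is a minimal sketch of that workflow in Python. It only builds the Modelfile text and the `ollama create` command; it does not run anything. The model name `llama3.1` and the context size `8192` are placeholders, and the helper names `build_modelfile` and `create_command` are made up for this example. It assumes that `FROM` with the model's own name works when you create under that same name, which is worth checking on your install.

```python
def build_modelfile(base_model: str, num_ctx: int) -> str:
    """Return Modelfile text that reuses base_model and sets the context size."""
    # FROM points at the already-downloaded model; PARAMETER num_ctx sets the
    # context window. Both are standard Modelfile instructions.
    return f"FROM {base_model}\nPARAMETER num_ctx {num_ctx}\n"


def create_command(model_name: str, modelfile_path: str) -> list[str]:
    """Return the argument list for `ollama create <name> -f <Modelfile>`."""
    # Passing the same name as the base model overwrites that entry
    # instead of adding a second one (assumption: see the note above).
    return ["ollama", "create", model_name, "-f", modelfile_path]


# Placeholder values, not taken from the thread.
print(build_modelfile("llama3.1", 8192), end="")
print(" ".join(create_command("llama3.1", "Modelfile")))
```

To apply it, save the generated text to a file named `Modelfile` and run the printed command in a shell.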