I’ve already started playing with this inside my sandbox, and even the smaller set of weights gives more adroit responses than the previous version.
As usual, I’m going to have to tediously test and redo all my tooling prompts, but what I really want to test is the expanded context length, to see how far I can push it with Ollama.
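For the context-length part, a minimal sketch of what such a probe might look like, assuming a local Ollama server on its default port and using its `/api/generate` endpoint with the `num_ctx` option. The model name `my-model`, the filler text, and the list of window sizes are placeholders for illustration, not anything from my actual setup:

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(model: str, prompt: str, num_ctx: int) -> dict:
    """Build a non-streaming generate request with an explicit context window."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": {"num_ctx": num_ctx},
    }


def probe(model: str, prompt: str, num_ctx: int) -> int:
    """Send the prompt and return how many prompt tokens the server reports evaluating."""
    data = json.dumps(build_payload(model, prompt, num_ctx)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # The count can be absent (e.g. when the prompt was cached), so default to 0.
    return body.get("prompt_eval_count", 0)


if __name__ == "__main__":
    # Hypothetical filler and model name; swap in real documents and a real model tag.
    filler = "lorem ipsum " * 2000
    for num_ctx in (4096, 8192, 16384, 32768):
        tokens = probe("my-model", filler + "\nSummarize the above.", num_ctx)
        print(num_ctx, tokens)
```

If the reported token count stops growing as the prompt gets longer, that is a hint the prompt is being truncated to fit the window, which is roughly the point where pushing further stops paying off.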