I set up llama.cop last week on my M3. Was fairly simple via homebrew. However, I get tags like <|imstart|> in the output constantly. Is there a way to filter them out with llama-server? Seems like a major usability issue if you want to use llama.cpp by itself (with the web interface).
ollama didn’t have the issue, but it’s less configurable.
ollama didn’t have the issue, but it’s less configurable.