Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I've been trying the Q4_K_M version, and sometimes it gets stuck in a loop. Gemma 4 doesn’t have this issue.


This has happened before with quantizations and other backends (ones not used by the research lab). Give it a week, download latest versions of everything, and try again.


I'm having the same issues, the more I use it. The repetition penalty doesn't seem to help.

I get some really amusing 'reflective' responses, but I think it needs a bit more cooking. Maybe I'll try another variant.


perhaps increasing repitition_penalty might be helpful




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: