Very anecdotal, but for me this model has very weak prompt adherence. I compared it briefly to gemini flash 3.0, and simple instructions like "don't use markdown tables in output" were very hard to get m2.1 to follow.
Took me like 5 prompt iterations until it finally listened.
But it's very good: better than flash 3.0 in terms of code output and reasoning, while being cheaper.