one thing to keep in mind is that you have to use GPT-5.4 differently from codex. they "work" in different ways. i was aghast when i noticed how terrible Codex was against Claude Code only to conclude it was me who wasn't using it right a couple days later
Opus 4.6 and 4.7 are better than GPT-5.4 xhigh, but only marginally. I can't give proper pointers on what to change because it's incredibly hard to quantify.
In essence, though, GPT-5.4 needs explicit instructions not to take liberties - this is included in the default system prompt of Claude Code which leads me to think Opus is just as overzealous as GPT-5.4 unless explicitly told off.
And it takes EVERYTHING you say at face value. Questions like "don't you see why this is bad?" will be answered with "yeah, i do." which is also kind of cool...
because with Opus in Claude Code i constantly have to reassure the model i'm not insinuating anything, lest it takes my question and runs with it into a frenzy of "oh shit my bad let me fix it im so sorry" type changes.
Opus 4.6 and 4.7 are better than GPT-5.4 xhigh, but only marginally. I can't give proper pointers on what to change because it's incredibly hard to quantify.
In essence, though, GPT-5.4 needs explicit instructions not to take liberties - this is included in the default system prompt of Claude Code which leads me to think Opus is just as overzealous as GPT-5.4 unless explicitly told off.
And it takes EVERYTHING you say at face value. Questions like "don't you see why this is bad?" will be answered with "yeah, i do." which is also kind of cool...
because with Opus in Claude Code i constantly have to reassure the model i'm not insinuating anything, lest it takes my question and runs with it into a frenzy of "oh shit my bad let me fix it im so sorry" type changes.