Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
oof-baroomf's comments
login
oof-baroomf
6 months ago
|
parent
|
context
[–]
| on:
GPT-5
74.9 SWEBench. This increases the SOTA by a whole .4%. Although the pricing is great, it doesn't seem like OpenAI found a giant breakthrough yet like o1 or Claude 3.5 Sonnet
Workaccount2
6 months ago
|
parent
|
next
[–]
I'm pretty sure 3.5 sonnet always benchmarked poorly, despite it being the clear programming winner of it's time.
iLoveOncall
6 months ago
|
parent
|
prev
[–]
That would assume there is a giant breakthrough to be found.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: