Show HN: A/B test Gemini, ChatGPT and Claude (news.ycombinator.com)
2 points by eclair99 on April 16, 2024 | 2 comments
I built this app for macOS to A/B test LLMs side-by-side. Also, I feel that having your prompt answered by two LLMs is a quick and easy way to confirm you are not falling for hallucinated information.


Any insights into which is better from using the app?


Thanks for checking out the tool!

I personally feel Claude 3 outperforms Gemini and ChatGPT, but the lack of web browsing is a slight disadvantage of Claude.



