Show HN: A/B test Gemini, ChatGPT and Claude | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		Show HN: A/B test Gemini, ChatGPT and Claude (news.ycombinator.com)
		2 points by eclair99 on April 16, 2024 \| hide \| past \| favorite \| 2 comments
		I built this app for macOS to A/B test LLMs side-by-side. Also, I feel that having your prompt answered by two LLMs is a quick and easy way to confirm you are not falling for hallucinated information.

shadowfax92 on April 16, 2024 | [–]

Any insights into which is better from the using the app?

eclair99 on April 16, 2024 | | [–]

Thanks for checking out the tool!

I personally feel Claude3 outperforms Gemini and ChatGPT. But lack of web-browsing is a slight dis-adv of claude.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact