Hacker News

A simple test: take one of your own photos, something interesting, and feed it to an LLM to describe in words. Then use an image generator to recreate the image from that description. It works like back-translation, image->text->image. Comparing the result with the original shows how much the models really understand images and text, since anything the description missed or the generator misread shows up as a difference.
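For anyone who wants to script this, here is a rough sketch of the loop in Python. Everything in it is an assumption for illustration: `describe` and `generate` are hypothetical callables you would wrap around whatever captioning model and image generator you use, and `pixel_agreement` is a deliberately crude stand-in for comparing the two images (in practice you would probably just look at them side by side).

```python
from typing import Callable, Sequence, Tuple, TypeVar

Image = TypeVar("Image")


def round_trip(
    image: Image,
    describe: Callable[[Image], str],   # hypothetical: wraps your image->text model
    generate: Callable[[str], Image],   # hypothetical: wraps your text->image model
) -> Tuple[str, Image]:
    """Run image -> text -> image and return the caption and the reconstruction."""
    caption = describe(image)
    reconstruction = generate(caption)
    return caption, reconstruction


def pixel_agreement(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of positions where two equal-length pixel sequences match.

    A toy comparison only; it says nothing about semantic similarity.
    """
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    if not a:
        return 1.0
    return sum(x == y for x, y in zip(a, b)) / len(a)


if __name__ == "__main__":
    # Stub models so the sketch runs without any external service.
    def stub_describe(img: Sequence[int]) -> str:
        return "bright" if sum(img) / len(img) > 127 else "dark"

    def stub_generate(text: str) -> list:
        return [255] * 4 if text == "bright" else [0] * 4

    photo = [255, 255, 0, 255]
    caption, rebuilt = round_trip(photo, stub_describe, stub_generate)
    print(caption, rebuilt, pixel_agreement(photo, rebuilt))
```

The models are passed in as functions so the same loop works with any pair of models, and so you can swap in stubs like the ones above to check the plumbing first.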



