
So if it's not using attention and it encodes the entire input into a single embedding that it processes in one go, I guess this is neither a Transformer nor an RNN, just an MLP?



