Hacker Newsnew | past | comments | ask | show | jobs | submit | fromlogin
DeepSeek's Dualpath Paper explained with animations (mesuvash.github.io)
2 points by mesuvash 38 days ago | past
Reinforcement Learning for LLMs (mesuvash.github.io)
2 points by gmays 40 days ago | past
Intuitive Intro to Reinforcement Learning for LLMs (mesuvash.github.io)
3 points by mesuvash 45 days ago | past
An Intuitive Introduction to PPO and GRPO (mesuvash.github.io)
5 points by mesuvash 47 days ago | past | 2 comments
Hashing for large-scale similarity (mesuvash.github.io)
57 points by suphyr on Feb 11, 2019 | past | 5 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: