Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
from
login
DeepSeek's Dualpath Paper explained with animations
(
mesuvash.github.io
)
2 points
by
mesuvash
38 days ago
|
past
Reinforcement Learning for LLMs
(
mesuvash.github.io
)
2 points
by
gmays
40 days ago
|
past
Intuitive Intro to Reinforcement Learning for LLMs
(
mesuvash.github.io
)
3 points
by
mesuvash
45 days ago
|
past
An Intuitive Introduction to PPO and GRPO
(
mesuvash.github.io
)
5 points
by
mesuvash
47 days ago
|
past
|
2 comments
Hashing for large-scale similarity
(
mesuvash.github.io
)
57 points
by
suphyr
on Feb 11, 2019
|
past
|
5 comments
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: