> AlphaZero also doesn't need training data as input -- it's generated by game-play. The information fed in is just the game rules.
This is wrong: it wasn't just fed the rules. It was also given a harness that tested viable moves and searched for the best ones using a depth-first search method.
Without that harness it would not have reached superhuman performance. Such a harness is easy to build for Go but much harder for more complex domains. In general, the harder it is to build an effective harness for a topic, the harder that topic is for AI models to solve: it is relatively easy to build a good harness for well-defined problems like competitive programming, but much, much harder for general-purpose programming.
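To make the "harness" point concrete, here is a minimal sketch of why verification is easy for well-specified problems: a candidate solution can be scored purely by running it against known input/output pairs, which gives a clean automatic reward signal. The function names and test cases below are hypothetical illustrations, not anyone's actual training setup.

```python
def score_candidate(solution_fn, test_cases):
    """Return the fraction of test cases a candidate solution passes --
    a crisp, automatic reward signal that RL-style training can optimize."""
    passed = 0
    for args, expected in test_cases:
        try:
            if solution_fn(*args) == expected:
                passed += 1
        except Exception:
            pass  # a crash simply scores zero on that case
    return passed / len(test_cases)

# A well-defined task ("return the n-th Fibonacci number") comes with an
# unambiguous test suite:
fib_tests = [((0,), 0), ((1,), 1), ((2,), 1), ((10,), 55)]

def candidate(n):
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

print(score_candidate(candidate, fib_tests))  # 1.0
```

By contrast, a task like "refactor this service without breaking anything" has no comparably crisp pass/fail oracle, which is exactly the difficulty being described for general-purpose programming.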
Are you talking about Monte Carlo tree search? I consider it part of the algorithm in AlphaZero's case. But agreed that RL is a lot harder in a real-life setting than in a board-game setting.
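For reference, here is a minimal sketch of the tree search being discussed: plain UCT-style MCTS with random rollouts on a toy game. AlphaZero's actual search replaces the random rollouts with a learned value network and biases move selection with learned priors; the `NimState` and `Node` classes below are hypothetical illustrations, not AlphaZero's code.

```python
import math
import random

class NimState:
    """Toy game: players alternately take 1-3 stones; taking the last stone wins."""
    def __init__(self, stones=7, player=1):
        self.stones, self.player = stones, player

    def legal_moves(self):
        return [n for n in (1, 2, 3) if n <= self.stones]

    def play(self, move):
        return NimState(self.stones - move, -self.player)

    def winner(self):
        # If no stones remain, the player who just moved took the last one and wins.
        return -self.player if self.stones == 0 else None

class Node:
    def __init__(self, state, parent=None):
        self.state, self.parent = state, parent
        self.children = {}            # move -> Node
        self.visits, self.value = 0, 0.0

    def ucb(self, child, c=1.4):
        # Upper-confidence bound: balances exploiting good moves and exploring rare ones.
        return (child.value / child.visits
                + c * math.sqrt(math.log(self.visits) / child.visits))

def mcts(root_state, iterations=2000):
    root = Node(root_state)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend while the node is fully expanded.
        while node.children and len(node.children) == len(node.state.legal_moves()):
            node = max(node.children.values(), key=lambda ch: node.ucb(ch))
        # 2. Expansion: try one untried move, if the game isn't over.
        untried = [m for m in node.state.legal_moves() if m not in node.children]
        if untried:
            move = random.choice(untried)
            node.children[move] = Node(node.state.play(move), parent=node)
            node = node.children[move]
        # 3. Rollout: random playout (AlphaZero uses a value network here instead).
        state = node.state
        while state.winner() is None:
            state = state.play(random.choice(state.legal_moves()))
        # 4. Backpropagation: credit the result up the tree. A node is good for
        #    its parent when the player who moved into it ends up winning.
        winner = state.winner()
        while node:
            node.visits += 1
            node.value += 1.0 if node.state.player != winner else 0.0
            node = node.parent
    # Play the most-visited move at the root.
    return max(root.children.items(), key=lambda kv: kv[1].visits)[0]

print(mcts(NimState()))  # with 7 stones, the search should settle on taking 3
```

The point of the sketch: the search needs only `legal_moves`, `play`, and `winner` -- i.e., the rules plus a perfect, cheap outcome oracle. Board games supply that oracle for free; most real-world tasks don't.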