HN Mail
REINFORCEMENT LEARNING
DeepSeek R1 Zero learns to reason using reinforcement learning on base model [pdf]
4 points | 0 comments
Show HN: Play tag against an opponent trained with reinforcement learning
2 points | 0 comments
Show HN: Panopticon AI – Open-source platform for military AI research
7 points | 5 comments