HN Mail
Subscribe
REINFORCEMENT LEARNING
Improve reinforcement learning algorithms by pre-training as if they were human
3 points
|
0 comments
Designing Arithmetic Circuits with Deep Reinforcement Learning
3 points
|
0 comments
Why is ChatGPT so good? RLHF
1 points
|
0 comments