• HN Mail
  • Subscribe

REINFORCEMENT LEARNING

Improve reinforcement learning algorithms by pre-training as if they were human
3 points | 0 comments

Designing Arithmetic Circuits with Deep Reinforcement Learning
3 points | 0 comments

Why is ChatGPT so good? RLHF
1 points | 0 comments