入门强化学习 Easy Reinforcement Learning | Wyaaaattwho’s Blog

type

Post

status

Published

date

Nov 22, 2024

slug

RL

summary

tags

study

tech

category

Academy

icon

password

🔥

强化学习（Reinforcement Learning, RL）是一种模仿人类学习方式的机器学习方法，它通过“试错”的交互过程，从环境中获得反馈信号（奖励或惩罚），逐步优化决策策略。无论是围棋中击败人类顶尖棋手的 AlphaGo，还是帮助机器人实现灵活运动的智能控制系统，强化学习都展示出了极大的潜力和应用价值。

本博客旨在为强化学习初学者提供一条清晰、易懂的入门路径，持续更新中。

Imitation: Supervised Learning of Behaviors

Reintroduction to Reinforcement Learning

Policy Gradients

Actor-Critic Algorithms

Value Function Methods

Deep RL with Q-Functions

Advanced Policy Gradient