
Introduction to Reinforcement Learning

Material for an introductory course on reinforcement learning for computer scientists

Chapter 6: Temporal-Difference Learning

Examples

Sarsa (and GLIE) for the Cliff Walking Environment (using gym-classics environments).
Q-Learning for the Cliff Walking Environment (using gym-classics environments).
Learning to Play Tic-Tac-Toe with Q-Learning: a simple table-based Q-learning agent that learns to play the game from scratch.
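
For orientation, the tabular updates behind the examples above can be sketched as follows. This is a minimal illustration, not the code from the linked notebooks: the function names, the NumPy array layout `Q[state, action]`, and the default values for `alpha` and `gamma` are assumptions made here. It also assumes that Q-values of terminal states are kept at zero.

```python
import numpy as np

# Assumption: Q is a 2-D array indexed as Q[state, action].

def epsilon_greedy(Q, s, epsilon, rng):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return int(rng.integers(Q.shape[1]))
    return int(np.argmax(Q[s]))

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.5, gamma=1.0):
    """On-policy update: bootstrap from the action actually chosen in s_next."""
    td_target = r + gamma * Q[s_next, a_next]
    Q[s, a] += alpha * (td_target - Q[s, a])

def q_learning_update(Q, s, a, r, s_next, alpha=0.5, gamma=1.0):
    """Off-policy update: bootstrap from the best action available in s_next."""
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])

# Tiny worked example with two states and two actions.
Q = np.zeros((2, 2))
Q[1] = [2.0, 4.0]
sarsa_update(Q, s=0, a=0, r=-1.0, s_next=1, a_next=0)   # target: -1 + 2 = 1
q_learning_update(Q, s=0, a=1, r=-1.0, s_next=1)        # target: -1 + 4 = 3
```

The only difference between the two updates is the bootstrap term: Sarsa uses the value of the action the behavior policy selects next, while Q-learning uses the maximum over actions. One common way to make the exploration schedule GLIE is to decay epsilon over episodes, for example `epsilon = 1 / k` in episode `k`.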

Connection to Chapter 17.4: Reward Engineering

Examples

License

© 2025 Michael Hahsler. All code and documents in this repository are provided under the Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) License.
