Welcome to this project. It is a tiny project where we don't do too much coding (yet) but we cooperate together to finish some tricky exercises from famous RL book Reinforcement Learning, An Introduction by Sutton. You may know that this book, especially the second version which was published last year, has no official solution manual. If you send your answer to the email address that the author leaved, you will be returned a fake answer sheet that is incomplete and old. So, why don't we write our own? Most of problems are mathematical proof in which one can learn the therotical backbone nicely but some of them are quite challenging coding problems. Both of them will be updated gradually but math will go first.
Main author would be me and current main cooperater is Zhiqi Pan.
ABOUT MISTAKES: Don't even expect the solutions be perfect, there are always mistakes. And, sometimes the problems are just open. Show your ideas and question them in 'issues' at any time!
Let's roll'n out!
[UPDATE Nov 5 2019] Due to multiple projects due simultaneously in Dec, this project may have to wait a little bit for being updated. Collaborators are always welcomed!
Finished without programming. Plan on creating additional exercises to this Chapter because many materials are lack of practice.
Finished without programming. Thanks for help from Zhiqi Pan.
Finished without programming
Partially finished.
Finished. Ex4.7 Partially finished. Dat DP question will burn my mind and macbook but I encourage any one who cares nothing about that trying to do yourself. Running through it forces you remember everything behind ordinary DP.:)