Section 4.9.0 - Introductory Writeups
Section 4.9.0.1 - Basic Writeup - Wikipedia - Reinforcement Learning
Section 4.9.0.2 - Detailed Writeup - Textbook - Sutton,Barto - Reinforcement Learning: An Introduction
Section 4.9.1 - Temporal difference learning
Section 4.9.1.0 - Introductory Writeups
Section 4.9.1.0.1 - Basic Writeup - Wikipedia - Temporal Difference Learning
Section 4.9.1.0.2 - Detailed Writeup - Kunz - An Introduction to Temporal Difference Learning
Section 4.9.2 - Q-learning
Section 4.9.2.0 - Introductory Writeups
Section 4.9.2.0.1 - Basic Writeup - Wikipedia - Q-Learning
Section 4.9.2.0.2 - Detailed Writeup - Watkins,Dayan - Q-Learning
Section 4.9.3 - State Action Reward State Action (SARSA)
Section 4.9.3.0 - Introductory Writeups
Section 4.9.3.0.1 - Basic Writeup - Wikipedia - State Action Reward State Action
Section 4.9.3.0.2 - Detailed Writeups
Section 4.9.3.0.2.1 - Part 1 - Rummery,Niranjan - On-Line Q-Learning Using Connectionist Systems
Section 4.9.3.0.2.2 - Part 2 - Seijen,Hasselt,Whiteson,Wiering - A Theoretical and Empiral Analysis of Expected Sarsa
Section 4.9.4 - Fictitious play
Section 4.9.4.0 - Introductory Writeups
Section 4.9.4.0.1 - Basic Writeup - Wikipedia - Fictitious Play
Section 4.9.4.0.2 - Detailed Writeup - Daskalakis - Topics in Algorithmic Game Theory
Section 4.9.5 - Learning classifier system
Section 4.9.5.0 - Introductory Writeups
Section 4.9.5.0.1 - Basic Writeup - Wikipedia - Learning Classifier System
Section 4.9.5.0.2 - Detailed Writeup - Urbanowicz,Moore - Learning Classifier Systems: A Complete Introduction, Review, and Roadmap
Section 4.9.6 - Optimal control
Section 4.9.6.0 - Introductory Writeups
Section 4.9.6.0.1 - Basic Writeup - Wikipedia - Optimal Control
Section 4.9.6.0.2 - Detailed Writeup - Bertsekas - Reinforcement Learning and Optimal Control
Section 4.9.7 - Error-driven learning
Section 4.9.8 - Multi-agent system
Section 4.9.8.0 - Introductory Writeups
Section 4.9.8.0.1 - Basic Writeup - Wikipedia - Multi-Agent System
Section 4.9.8.0.2 - Detailed Writeups
Section 4.9.8.0.2.1 - Part 1 - Balaji,Srinivasan - An Introduction to Multi-Agent Systems
Section 4.9.8.0.2.2 - Part 2 - Hoek,Wooldridge - Multi-Agent Systems
Section 4.9.9 - Distributed artificial intelligence
Section 4.9.9.0 - Introductory Writeups
Section 4.9.9.0.1 - Basic Writeup - Wikipedia - Distributed Artificial Intelligence
Section 4.9.9.0.2 - Detailed Writeups
Section 4.9.9.0.2.1 - Part 1 - Durfee - Distributed Artificial Intelligence
Section 4.9.9.0.2.2 - Part 2 - Bond,Gasser - A Survey of Distributed Artificial Intelligence
Section 4.9.9.1 - Trends in Distributed AI
Section 4.9.10 - Learning Automata
Section 4.9.10.0 - Introductory Writeups
Section 4.9.10.0.1 - Basic Writeup - Wikipedia - Learning Automata
Section 4.9.10.0.2 - Detailed Writeups
Section 4.9.10.0.2.1 - Part 1 - Narendra,Thathachar - Learning Automata: A Survey
Section 4.9.10.0.2.2 - Part 2 - Narendra,Thathachar - Learning Automata: An Introduction
Section 4.9.11 - Deep Reinforcement Learning
Section 4.9.11.0 - Basic Writeup - Lavet,Islam,Henderson,Bellemare,Pineau - An Introduction to Deep Refinforcement Learning
Section 4.9.11.1 - Deep Q-Network
Section 4.9.11.2 - Deep Deterministic Policy Gradient