This course aims to cover the fundamental topics relevant to Reinforcement Learning (RL), a computational learning approach where an agent tries to maximize the total amount of reward it receives while interacting with the complex and uncertain environments. The course content includes the basics of Markov Decision Processes, model-based and model-free RL techniques, policy optimization, RL distributed system design, as well as the case studies of RL for game playing such as AlphaGo, traffic simulation, and other robotics applications.
Advisory: Students are expected to have solid foundation on signal processing.