CS6700 - Reinforcement learning
Course Data :
The Reinforcement Learning problem : evaluative feedback, non-associative learning, Rewards and returns, Markov Decision Processes, Value functions, optimality and approximation. Dynamic programming : value iteration, policy iteration, asynchronous DP, generalized policy iteration. Monta-Carlo methods : policy evaluation, roll outs, on policy and off policy learning, importance sampling. Temporal Difference learning : TD prediction, Optimality of TD(0), SARSA, Q-learning, R-learning, Games and after states. Eligibility traces : n-step TD prediction, TD (lambda), forward and backward views, Q (lambda), SARSA (lambda), replacing traces and accumulating traces. Function Approximation : Value prediction, gradient descent methods, linear function approximation, ANN based function approximation, lazy learning, instability issues Policy Gradient methods : non-associative learning – REINFORCE algorithm, exact gradient methods, estimating gradients, approximate policy gradient algorithms, actor-critic methods
Note : The pre-req was updated to MA2040 from Jul 2018 offering onwards.
Pre-Requisites |
Parameters
Credits |
Type |
Date of Introduction |
3-1-0-4 |
Elective |
Aug 2007 |
|
Previous Instances of the Course
- Jan 2024 - May 2024
Instructor(s) : Balaraman Ravindran.
- Jan 2023 - May 2023
Instructor(s) : Balaraman Ravindran.
- Jan 2022 - Apr 2022
Instructor(s) : Balaraman Ravindran.
- Aug 2021 - Dec 2021
Instructor(s) : L A Prashanth.
Teaching Assistants : V Rebin Silva, Nithia V, Sandip Saha, Nency Bansal.
- Feb 2021 - May 2021
Instructor(s) : L A Prashanth.
Teaching Assistants : Nithia V, Yadav Mahesh Lorik, Phule Tushar Jaywant.
- Jan 2020 - May 2020
Instructor(s) : Balaraman Ravindran.
Teaching Assistants : Harshavardhan P K, Arjun Manoharan, Kakadiya Ashutosh Dilipbhai, Chigullapally Sriharsha.
- Jan 2019 - May 2019
Instructor(s) : Balaraman Ravindran.
Teaching Assistants : Malla Kavya Mrudula, Rahul Ramesh, Arjun Manoharan, Pavan Ravishankar, Neha Sah.
- Jul 2018 - Nov 2018
Instructor(s) : L A Prashanth.
Teaching Assistants : Nithia V, Ajay Kumar Pandey, Bhavsar Nirav Narharibhai.
- Jan 2018 - May 2018
Instructor(s) : Balaraman Ravindran.
Teaching Assistants : Abhishek Naik, Nikita Moghe.
- Jan 2017 - May 2017
Instructor(s) : Balaraman Ravindran.
Teaching Assistants : Sahil Sharma, J P Sagar, Subhojyoti Mukherjee.