Event |
Date |
Lecture |
Readings |
Logistics |
Lecture |
Feb 4 |
Questions in Intelligence |
None |
|
Lecture |
Feb 6 |
Evaluating Intelligence |
- Imitation Game
- Chapters 1 and 2 of this thesis
- Optional: Chapter 1 of this thesis
|
HW0: Familiarization with Infrastructure |
Lecture |
Feb 11 |
Overview of RL (submit feedback) |
-
Comparing Policy-Gradient Algorithms
- Optional:
A Tour of Reinforcement Learning:
The View from Continuous Control
|
Release HW1 |
Discussion |
Feb 13 |
On-Policy RL Algorithms (submit feedback) |
- PPO
- IMPALA
- A3C
- Optional: GAE
|
|
Holiday |
Feb 18 |
Monday Calendar |
|
|
Discussion |
Feb 20 |
Off-Policy RL Algorithms (submit feedback) |
- DQN
- DDPG
- Rainbow
- Optional: SAC
- Optional: TD3
|
Submit Project Abstract |
Discussion |
Feb 25 |
RL Applications (submit feedback) |
- Learning Dexterity
- AlphaGo
- AlphaZero
- Optional: MuZero
- Optional: Playing DOTA
|
|
Discussion |
Feb 27 |
Algorithms for Exploration (submit feedback) |
-
Curiosity-driven Exploration by Self-supervised Prediction
-
Unifying Count-Based Exploration and Intrinsic Motivation
-
Diversity is All You Need
- Optional:
Empowerment
- Optional:
What is Intrinsic Motivation?
|
|
Discussion |
Mar 3 |
Transfer Learning in Context of Decision Making (submit feedback) |
- RL2
- MAML
- Gotta Learn Fast
- Optional: Domain Randomization
- Optional: Policy Sketches
- Optional: Procgen
|
|
Discussion |
Mar 5 |
Curriculum Learning (submit feedback) |
-
Curriculum Learning
- POET
- Asymmetric Self-Play
- Optional:
PowerPlay
- Optional:
Goal GAN
- Optional:
Teacher-Student Curriculum Learning
|
|
Lecture |
Mar 10 |
Learning Models (submit feedback) |
-
MOSAIC
-
Supervised Learning with Distal Teacher
-
DYNA
|
|
Discussion |
Mar 12 |
Papers on Learning Models (submit feedback) |
-
Hindsight Experience Replay
-
Visual Foresight
- Optional:
Learning to Poke by Poking
- Optional:
Embed to Control
- Optional:
World Models
|
|
Holiday |
Mar 17 |
COVID-19 |
|
Release HW2; HW1 due |
Holiday |
Mar 19 |
COVID-19 |
|
|
Holiday |
Mar 24 |
Spring Break |
|
|
Holiday |
Mar 26 |
Spring Break |
|
|
Presentation |
Mar 31 |
Midterm Project Presentations |
|
|
Discussion |
Apr 2 |
Neural Network Architectures (submit feedback) |
-
Relational Deep Reinforcement Learning
-
Attention is All You Need (Transformer)
- Optional:
Memory Networks
- Optional:
PathNet
- Optional:
Parameter Superposition
- Optional:
Randomly Wired Networks
|
|
Discussion |
Apr 7 |
Representation Learning (submit feedback) |
-
Intelligence Without Representations
-
Survey of Self-Supervised Learning
- Optional:
Simple Framework for Contrastive Learning
- Optional:
Tutorial on VAEs
- Optional:
Unsupervised Learning of Object Keypoints for Perception and Control
- Optional:
Navigation using Mid-Level Priors
|
|
Lecture |
Apr 9 |
Imitation Learning (submit feedback) |
-
Is Imitation Learning the Route to Humanoid Robots?
-
DAGGER
- Optional:
Mirror Neurons
|
|
Discussion |
Apr 14 |
Papers on Imitation Learning (submit feedback) |
-
Zero-Shot Visual Imitation
-
Divergence Minimization Perspective
- Optional:
GAIL
- Optional:
One Shot Visual Imitation Learning
- Optional:
Deep Mimic
- Optional:
One Shot Imitation Learning
|
Release HW3; HW2 due |
Discussion |
Apr 16 |
Papers on Inverse RL (submit feedback) |
-
Inverse Reinforcement Learning
-
Maximum Entropy IRL
-
Time Contrastive Networks
- Optional:
Guided Cost Learning
|
|
Discussion |
Apr 21 |
Hierarchical Learning (RL + Imitation) (submit feedback) |
-
Option-Critic Architecture
-
FeUdal Networks for HRL
- Optional:
Feudal Reinforcement Learning
|
|
Discussion |
Apr 23 |
Learning from Touch, Vision and Sound (submit feedback) |
-
GelSight
- Learning to Grasp and Regrasp using Vision and Touch
|
|
Discussion |
Apr 28 |
Multi Agent Systems (submit feedback) |
-
Survey of MARL
-
Learning with Opponent-Learning Awareness (LOLA)
- Useful link for MARL papers (not a reading)
|
|
Discussion |
Apr 30 |
Role of Language (submit feedback) |
-
A Survey of RL Informed by Natural Language
-
Emergence of Grounded Compositional Language in Multi-Agent Populations
|
|
Discussion |
May 5 |
Miscellaneous (submit feedback) |
- Successor Representations
- Deep RL That Matters
-
Benchmarking Model Based RL
- Optional:
Learning Complex Dexterous Manipulation with DRL and Demonstrations
|
|
Presentation |
May 12 |
Final Project Presentations |
|
|