MIT 6.884 - Computational Sensorimotor Learning

Schedule: Spring 2020

Lectures are on Tuesdays and Thursdays, 2:30-4:00, in 2-105.

Here is a tentative schedule of lectures, readings, assignments, and the final project. Readings and assignments will be added as they become available. Before every lecture, you need to give feedback on the papers listed in the Readings column.

Event	Date	Lecture	Readings	Logistics
Lecture	Feb 4	Questions in Intelligence	None
Lecture	Feb 6	Evaluating Intelligence	Imitation Game; Chapters 1 and 2 of this thesis; Optional: Chapter 1 of this thesis	HW0: Familiarization with Infrastructure
Lecture	Feb 11	Overview of RL (submit feedback)	Comparing Policy-Gradient Algorithms; Optional: A Tour of Reinforcement Learning: The View from Continuous Control	Release HW1
Discussion	Feb 13	On-Policy RL Algorithms (submit feedback)	PPO; IMPALA; A3C; Optional: GAE
Holiday	Feb 18	Monday Calendar
Discussion	Feb 20	Off-Policy RL Algorithms (submit feedback)	DQN; DDPG; Rainbow; Optional: SAC; Optional: TD3	Submit Project Abstract
Discussion	Feb 25	RL Applications (submit feedback)	Learning Dexterity; Alpha Go; Alpha Zero; Optional: Mu Zero; Optional: Playing DOTA
Discussion	Feb 27	Algorithms for Exploration (submit feedback)	Curiosity-driven Exploration by self-supervised Prediction; Unifying Count-Based Exploration and Intrinsic Motivation; Diversity is All You Need; Optional: Empowerment; Optional: What is Intrinsic Motivation?
Discussion	Mar 3	Transfer Learning in Context of Decision Making (submit feedback)	RL2; MAML; Gotta Learn Fast; Optional: Domain Randomization; Optional: Policy Sketches; Optional: Procgen
Discussion	Mar 5	Curriculum Learning (submit feedback)	Curriculum Learning; POET; Asymmetric Self-Play; Optional: PowerPlay; Optional: Goal GAN; Optional: Teacher-Student Curriculum Learning
Lecture	Mar 10	Learning Models (submit feedback)	MOSAIC; Supervised Learning with Distal Teacher; DYNA
Discussion	Mar 12	Papers on Learning Models (submit feedback)	Hindsight Experience Replay; Visual Foresight; Optional: Learning to Poke by Poking; Optional: Embed to Control; Optional: World Models
Holiday	Mar 17	COVID-19		Release HW2; HW1 due
Holiday	Mar 19	COVID-19
Holiday	Mar 24	Spring Break
Holiday	Mar 26	Spring Break
Presentation	Mar 31	Midterm Project Presentations
Discussion	Apr 2	Neural Network Architectures (submit feedback)	Relational Deep Reinforcement Learning; Attention is All You Need (Transformer); Optional: Memory Networks; Optional: PathNet; Optional: Parameter Superposition; Optional: Randomly Wired Networks
Discussion	Apr 7	Representation Learning (submit feedback)	Intelligence Without Representations; Survey of Self-Supervised Learning; Optional: Simple Framework for Contrastive Learning; Optional: Tutorial on VAEs; Optional: Unsupervised Learning of Object Keypoints for Perception and Control; Optional: Navigation using Mid-Level Priors
Lecture	Apr 9	Imitation Learning (submit feedback)	Is Imitation Learning the Route to Humanoid Robots?; DAGGER; Optional: Mirror Neurons
Discussion	Apr 14	Papers on Imitation Learning (submit feedback)	Zero-Shot Visual Imitation; Divergence Minimization Perspective; Optional: GAIL; Optional: One Shot Visual Imitation Learning; Optional: Deep Mimic; Optional: One Shot Imitation Learning	Release HW3; HW2 due
Discussion	Apr 16	Papers on Inverse RL (submit feedback)	Inverse Reinforcement Learning; Maximum Entropy IRL; Time Contrastive Networks; Optional: Guided Cost Learning
Discussion	Apr 21	Hierarchical Learning (RL + Imitation) (submit feedback)	Option-Critic Architecture; FeUdal Networks for HRL; Optional: Feudal Reinforcement Learning
Discussion	Apr 23	Learning from Touch, Vision and Sound (submit feedback)	GelSight; Learning to Grasp and Regrasp using Vision and Touch
Discussion	Apr 28	Multi Agent Systems (submit feedback)	Survey of MARL; Learning with Opponent-Learning Awareness (LOLA); Useful Link for MARL papers (not a reading)
Discussion	Apr 30	Role of Language (submit feedback)	A Survey of RL Informed by Natural Language; Emergence of Grounded Compositional Language in Multi-Agent Populations
Discussion	May 5	Miscellaneous (submit feedback)	Successor Representations; Deep RL That Matters; Benchmarking Model Based RL; Optional: Learning Complex Dexterous Manipulation with DRL and Demonstrations
Presentation	May 12	Final Project Presentations

For questions or comments, email csl-staff [AT] mit [DOT] edu.
