This week’s session by Bruno covered the relationship between two interesting areas, RL and causal inference. We went over the two following papers:

Counterfactual Multi-Agent Policy Gradients
https://arxiv.org/abs/1705.08926

and

Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
https://arxiv.org/abs/1905.05824

Our paper readings take place weekly at the Department of Engineering on Tuesdays 2pm. Hope to see you next week!

Like our Facebook page to keep informed of the other events that we’ll be doing. Check out our paper reading page for information on other regular paper reading groups that are going on around Cambridge.

LEAVE A REPLY

Please enter your comment!
Please enter your name here