This week’s session by Bruno covered the relationship between two interesting areas, RL and causal inference. We went over the two following papers:
Counterfactual Multi-Agent Policy Gradients
https://arxiv.org/abs/1705.08926
and
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
https://arxiv.org/abs/1905.05824
Our paper readings take place weekly at the Department of Engineering on Tuesdays 2pm. Hope to see you next week!
Like our Facebook page to keep informed of the other events that we’ll be doing. Check out our paper reading page for information on other regular paper reading groups that are going on around Cambridge.