Why is it called "reinforcement learning"? Because the agents learn like Pavlov's dog, do something good and get a treat, do something bad and get punished.
But learning in humans and other intelligent species are far more complex. The explain (why didn't that work?), imagine (what would happen if I tried...?), and reflect (how might have things have differently?).
Sequential decision-making systems are common in practice. Reinforcement learning - algorithms that optimize sequential decision-making agents, promises to extend these systems in ways that will bend society. But we'll need causal reasoning to do it.
Remember how deep learning sky-rocketed the careers of those who started early? This course will get you in on the ground floor of the next major milestone in machine learning.
Workshop DetailsLength: 4 weeks
Current Cohort: March 2022
Next Cohort: June 2022 (tentative)
- One-on-one scheduling and weekly retros
- Continuous office hours via chat, instructor feedback on coding assignments
- Optional portfolio project in causal ML that you can showcase publicly. Get instructor guidance and iterative feedback on portfolio project.
- Gain ability to build causal reasoning algorithms into decision-making systems in data science and machine learning teams at top-tier technology organizations.
- Master modern programming libraries, including a deep learning framework, in the implementation of causal reasoning algorithms.
Robert is currently a research scientist at Microsoft Research and faculty at the Northeastern University department of computer science. He has worked in industry as a research engineer and data scientist building production quality systems for Bayesian decision-making under uncertainty.
Robert didn't start in machine learning. He started his career by becoming fluent in Mandarin Chinese and moving to Tibet to do developmental economics fieldwork. He later obtained a graduate degree from Johns Hopkins School of Advanced International Studies.
Robert attained his Ph. D. in mathematical statistics from Purdue University. He has published in top tier journals and venues on topics including causal inference, probabilistic modeling, sequential decision processes, and dynamic models of complex systems.