Michael Gimelfarb - Personal Website
AI Researcher, Department of Computer Science, University of Toronto
Summary
I am a researcher in reinforcement learning, a branch of artificial intelligence that studies how an algorithm (agent) should interact with a dynamic environment (task) to achieve a specific goal. My current research tackles the following questions:
- Transfer Learning: How (and what) information learned by one agent on one task could help other agents trying to solve other related tasks? (i.e.)
- Automated Planning: How can the optimal actions be found efficiently when the observations and interactions are high-dimensional and complex? (i.e.)
- Offline RL: How can agents be evaluated/optimized better without costly interactions, by using data of prior interactions from another source? (i.e.)
Biography
I am currently working as a postdoc AI researcher in the RVL Lab led by Florian Shkurti at the University of Toronto, working on offline RL. In 2023, I was a researcher in the D3M Lab led by Scott Sanner, where I worked on automated planning and the development of the Python planning toolkit pyRDDLGym (which has now been included in scikit-decide!). In 2022, I interned at Google DeepMind, UK. I completed my PhD in Industrial Engineering at the University of Toronto advised by Scott Sanner and Chi-Guhn Lee in 2022, and my master’s degree in the same department under supervision of Michael Jong Kim in 2017. I received my Bachelor’s degree in Business Administration (BBA) from the Schulich School of Business.
I enjoy photography, outdoor sports, and play classical piano.
news
Mar 25, 2024 | The results of the 2023 International Planning Competition were published in AI Magazine. |
---|---|
Feb 19, 2024 | Version 2.0 of pyRDDLGym was published. |
Feb 12, 2024 | Our paper on JaxPlan and GurobiPlan was published in ICAPS. |
Jan 01, 2024 | I joined the Robot Vision and Learning Lab as a full-time postdoc in January. |