Modularity and transfer in reinforcement learning (Talk)
- Herke van Hoof
- University of Amsterdam
Learning new control strategies for (possibly unknown) dynamical systems is a challenging task. Reinforcement learning algorithms typically require 'fresh' data regularly, but obtaining data safely and in sufficient quantities is a challenge on real systems. Thus, it is no surprise that most recent successes have been in domains where massive amounts of data can easily be generated in simulation (e.g., games such as Atari and Go).
Biography: Herke van Hoof is an assistant professor at the University of Amsterdam. His research focuses on reinforcement learning, in particular on developing techniques that could be applied to physical systems. Previously, he did research on robots learning from data they gather by themselves, as a postdoc at McGill University in Montreal and as a PhD student at TU Darmstadt. He obtained his bachelor's and master's degrees from the University of Groningen.