Evolved Policy Gradients
Weβre releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on novel tasks. Agents trained with EPG can succeed at basic tasks at test time that were outside their training regime, l...
Log in to bookmark articles and create collections
Isabella News