Pawel Ladosz, Lilian Weng, Minwoo Kim, Hyondong Oh
This paper reviews exploration techniques in deep reinforcement learning. Exploration techniques are of primary importance when solving sparse reward problems. In sparse reward problems, the reward is rare, which means …
5 commentsAndré Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel Mankowitz, Augustin ŽÃdek, Rémi Munos
The ability to transfer skills across tasks has the potential to scale up reinforcement learning (RL) agents to environments currently out of reach. Recently, a framework based on two ideas, …
7 commentsDanilo Jimenez Rezende, Karol Gregor, Daan Wierstra
In this paper we introduce a new unsupervised reinforcement learning method for discovering the set of intrinsic options available to an agent. This set is learned by maximizing the number …
10 commentsSergey Levine, Kelvin Xu, Siddharth Verma, Chelsea Finn
Reinforcement learning has the potential to automate the acquisition of behavior in complex settings, but in order for it to be successfully deployed, a number of practical challenges must be …
8 commentsTejas D. Kulkarni, Karthik R. Narasimhan, Ardavan Saeedi, Joshua B. Tenenbaum
Learning goal-directed behavior in environments with sparse feedback is a major challenge for reinforcement learning algorithms. The primary difficulty arises due to insufficient exploration, resulting in an agent being unable …
4 commentsDanilo Jimenez Rezende, Shakir Mohamed
The choice of approximate posterior distribution is one of the core problems in variational inference. Most applications of variational inference employ simple families of posterior approximations in order to allow …
9 commentsPieter Abbeel, Sergey Levine, Bradly C. Stadie
Achieving efficient and scalable exploration in complex domains poses a major challenge in reinforcement learning. While Bayesian and PAC-MDP approaches to the exploration problem offer strong formal guarantees, they are …
9 commentsWilliam F. Whitney, Michael Bloesch, Jost Tobias Springenberg, Abbas Abdolmaleki, Kyunghyun Cho, Martin Riedmiller
Despite the close connection between exploration and sample efficiency, most state of the art reinforcement learning algorithms include no considerations for exploration beyond maximizing the entropy of the policy. In …
8 commentsPieter Abbeel, Danijar Hafner, Ramanan Sekar, Oleh Rybkin, Kostas Daniilidis, Deepak Pathak
Reinforcement learning allows solving complex tasks, however, the learning tends to be task-specific and the sample efficiency remains a challenge. We present Plan2Explore, a self-supervised reinforcement learning agent that tackles …
6 commentsPaul-Antoine Le Tolguenec, Emmanuel Rachelson, Yann Besse, Dennis G. Wilson
When searching for policies, reward-sparse environments often lack sufficient information about which behaviors to improve upon or avoid. In such environments, the policy search process is bound to blindly search …
9 commentsCédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer
In continuous action domains, standard deep reinforcement learning algorithms like DDPG suffer from inefficient exploration when facing sparse or deceptive reward problems. Conversely, evolutionary and developmental methods focusing on exploration …
1 commentPage 1 of 1
Open Journal Club © 2024