reinforcement learning - Academic Search

Search results

About 753,132 results

[PDF] sci-hub.bond
Reinforcement Learning: An Introduction
Richard S. Sutton, Andrew G. Barto - IEEE Transactions on Neural Networks - 2005
An account of key ideas and algorithms in reinforcement learning. The discussion ranges from the history of the field's intellectual foundations to recent developments and applications. Areas studied include reinforcement learning problems in terms of Markov decision problems and solution methods.
Cited by: 25,772
[PDF] sci-hub.store
Human-level control through deep reinforcement learning
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu - Nature - 2015
No abstract is available for this record; see the source link for details.
Cited by: 29,321
[PDF] sci-hub.email
Reinforcement Learning: An Introduction
Richard S. Sutton, Andy Barto - IEEE Transactions on Neural Networks - 1998
No abstract is available for this record; see the source link for details.
Cited by: 26,892
[PDF] jair.org
Reinforcement Learning: A Survey
Leslie Pack Kaelbling, Michael L. Littman, Andrew Moore - Journal of Artificial Intelligence Research - 1996
This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work descri…
Cited by: 8,754
[HTML] enterscholar.com
Introduction to Reinforcement Learning
Richard S. Sutton, Andrew G. Barto - MIT Press eBooks - 1998
From the Publisher: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.
Cited by: 6,915
[PDF] sci-hub.store
Reinforcement Learning: An Introduction
Jeffrey D. Johnson, Jinghong Li, Zengshi Chen - Neurocomputing - 2000
No abstract is available for this record; see the source link for details.
Cited by: 8,685
[PDF] arxiv.org
Diagnosing Non-Intermittent Anomalies in Reinforcement Learning Policy Executions (Short Paper)
Natan, Avraham, Stern, Roni, Kalech, Meir - arXiv (Cornell University) - 2017
Due to the safety risks and training sample inefficiency, it is often preferred to develop controllers in simulation. However, minor differences between the simulation and the real world can cause a significant sim-to-real gap. This gap can reduce the effectiveness of the developed controller. In this paper, we examine a case study of transferring an octorotor reinforcement learning controller from simulation to the…
Cited by: 11,269
[PDF] link.springer.com
Simple statistical gradient-following algorithms for connectionist reinforcement learning
Ronald J. Williams - Machine Learning - 1992
No abstract is available for this record; see the source link for details.
Cited by: 7,429
[PDF] arxiv.org
Continuous control with deep reinforcement learning
Timothy Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess - arXiv (Cornell University) - 2016
Abstract: We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic problems suc…
Cited by: 6,777
[PDF] arxiv.org
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves - arXiv (Cornell University) - 2013
We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no…
Cited by: 5,121
