E-Book, Englisch, Band 173, 172 Seiten, eBook
Reihe: The Springer International Series in Engineering and Computer Science
Sutton Reinforcement Learning
1992
ISBN: 978-1-4615-3618-5
Verlag: Springer US
Format: PDF
Kopierschutz: 1 - PDF Watermark
E-Book, Englisch, Band 173, 172 Seiten, eBook
Reihe: The Springer International Series in Engineering and Computer Science
ISBN: 978-1-4615-3618-5
Verlag: Springer US
Format: PDF
Kopierschutz: 1 - PDF Watermark
Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsk (1961), and independently in control theory by Walz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement).
is an edited volume of original research, comprising seven invited contributions by leading researchers.
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
Introduction; R.Sutton. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning; R.J. Wiiliams. Practical Issues in Temporal Difference Learning; G. Teasauro. Technical Note: Q-Learning; C.J.C.H. Watkins, P. Dayan. Self Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching; L.-J. Lin. Transfer of Learning by Composing Solutions of Elemental Sequential Tasks; S.P. Singh. The Convergence of TD (lambda) for general lambda; P. Dayan. A Reinforcement Connctionist Approach to Robot Path Finding in Non-Maze-Like Environments; J. del R. Millán, C. Torras.




