Efficient Reinforcement Learning using Gaussian Processes

This book examines Gaussian processes in both model-based reinforcement learning (RL) and inference in nonlinear dynamic systems.First, we introduce PILCO, a fully Bayesian approach for efficient RL in continuous-valued state and action spaces when no expert knowledge is available. PILCO takes model...

Full description

Saved in:
Bibliographic Details
Main Author: Deisenroth, Marc Peter (auth)
Format: Book Chapter
Published: KIT Scientific Publishing 2010
Subjects:
Online Access:Get Fullteks
DOAB: description of the publication
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 01561naaaa2200289uu 4500
001 doab_20_500_12854_45907
005 20210211
020 |a KSP/1000019799 
020 |a 9783866445697 
024 7 |a 10.5445/KSP/1000019799  |c doi 
041 0 |a English 
042 |a dc 
100 1 |a Deisenroth, Marc Peter  |4 auth 
245 1 0 |a Efficient Reinforcement Learning using Gaussian Processes 
260 |b KIT Scientific Publishing  |c 2010 
300 |a 1 electronic resource (IX, 205 p. p.) 
506 0 |a Open Access  |2 star  |f Unrestricted online access 
520 |a This book examines Gaussian processes in both model-based reinforcement learning (RL) and inference in nonlinear dynamic systems.First, we introduce PILCO, a fully Bayesian approach for efficient RL in continuous-valued state and action spaces when no expert knowledge is available. PILCO takes model uncertainties consistently into account during long-term planning to reduce model bias. Second, we propose principled algorithms for robust filtering and smoothing in GP dynamic systems. 
540 |a Creative Commons  |f https://creativecommons.org/licenses/by-nc-nd/4.0/  |2 cc  |4 https://creativecommons.org/licenses/by-nc-nd/4.0/ 
546 |a English 
653 |a autonomous learning 
653 |a Gaussian processes 
653 |a control 
653 |a machine learning 
653 |a Bayesian inference 
856 4 0 |a www.oapen.org  |u https://www.ksp.kit.edu/9783866445697  |7 0  |z Get Fullteks 
856 4 0 |a www.oapen.org  |u https://directory.doabooks.org/handle/20.500.12854/45907  |7 0  |z DOAB: description of the publication