| Volume 9,Issue 3,2011 Table of Contents
Editorial Special issue on approximate dynamic programming and reinforcement learning | | | Editorial: Special issue on approximate dynamic programming and reinforcement learning | | Silvia Ferrari,Jagannathan Sarangapani and Frank L. Lewis | | 2011,9(3):309 [Abstract(1835)] [View PDF 32.96 K (411)] | | | | Approximate policy iteration: a survey and some new methods | | Dimitri P. BERTSEKAS | | 2011,9(3):310-335 [Abstract(3821)] [View PDF 460.78 K (355)] | | | | A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications | | Warren B. POWELL and Jun MA | | 2011,9(3):336-352 [Abstract(3057)] [View PDF 263.48 K (364)] | | | | Adaptive dynamic programming for online solution of a zero-sum differential game | | Draguna VRABIE and Frank LEWIS | | 2011,9(3):353-360 [Abstract(3580)] [View PDF 222.31 K (619)] | | | | Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems | | Jie DING and S. N. BALAKRISHNAN | | 2011,9(3):370-380 [Abstract(2294)] [View PDF 493.05 K (510)] | | | | Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming | | Qinglai WEI and Derong LIU | | 2011,9(3):381-390 [Abstract(2057)] [View PDF 379.43 K (548)] | | | | A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man | | Greg FODERARO,Vikram RAJU and Silvia FERRARI | | 2011,9(3):391-399 [Abstract(3467)] [View PDF 477.32 K (558)] | | | | Asymptotic tracking by a reinforcement learning-based adaptive critic controller | | Shubhendu BHASIN,Nitin SHARMA,Parag PATRE and Warren DIXON | | 2011,9(3):400-409 [Abstract(4392)] [View PDF 458.25 K (482)] | | | | Stable reinforcement learning with recurrent neural networks | | James Nate KNIGHT and Charles ANDERSON | | 2011,9(3):410-420 [Abstract(4215)] [View PDF 367.62 K (707)] | | | | Semi-Markov adaptive critic heuristics with application to airline revenue management | | Ketaki KULKARNI,Abhijit GOSAVI,Susan MURRAY and Katie GRANTHAM | | 2011,9(3):421-430 [Abstract(2409)] [View PDF 207.68 K (474)] | | | | Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization | | Amanda LAMPTON,John VALASEK and Mrinal KUMAR | | 2011,9(3):431-439 [Abstract(2114)] [View PDF 475.94 K (361)] | | | | Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning | | Xueqing SUN,Tao MAO,Laura RAY,Dongqing SHI and Jerald KRALIK | | 2011,9(3):440-450 [Abstract(2048)] [View PDF 551.24 K (494)] | | | | Moving least-squares approximations for linearly-solvable stochastic optimal control problems | | Mingyuan ZHONG and Emanuel TODOROV | | 2011,9(3):451-463 [Abstract(3257)] [View PDF 584.51 K (341)] | | |
|