Volume 9,Issue 3,2011 Table of Contents

   
Other Issues:  
  

Editorial

Online optimal control of nonlinear discrete-time systems using approximate dynamic programming
  Travis DIERKS and Sarangapani JAGANNATHAN
  2011,9(3):361-369 [Abstract(2460)]  [View PDF 268.41 K (839)]  [HTML]
  

Special issue on approximate dynamic programming and reinforcement learning

Editorial: Special issue on approximate dynamic programming and reinforcement learning
  Silvia Ferrari,Jagannathan Sarangapani and Frank L. Lewis
  2011,9(3):309 [Abstract(1887)]  [View PDF 32.96 K (411)]  [HTML]
  
Approximate policy iteration: a survey and some new methods
  Dimitri P. BERTSEKAS
  2011,9(3):310-335 [Abstract(4313)]  [View PDF 460.78 K (355)]  [HTML]
  
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
  Warren B. POWELL and Jun MA
  2011,9(3):336-352 [Abstract(3161)]  [View PDF 263.48 K (364)]  [HTML]
  
Adaptive dynamic programming for online solution of a zero-sum differential game
  Draguna VRABIE and Frank LEWIS
  2011,9(3):353-360 [Abstract(4092)]  [View PDF 222.31 K (619)]  [HTML]
  
Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems
  Jie DING and S. N. BALAKRISHNAN
  2011,9(3):370-380 [Abstract(2392)]  [View PDF 493.05 K (510)]  [HTML]
  
Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming
  Qinglai WEI and Derong LIU
  2011,9(3):381-390 [Abstract(2149)]  [View PDF 379.43 K (548)]  [HTML]
  
A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man
  Greg FODERARO,Vikram RAJU and Silvia FERRARI
  2011,9(3):391-399 [Abstract(3965)]  [View PDF 477.32 K (558)]  [HTML]
  
Asymptotic tracking by a reinforcement learning-based adaptive critic controller
  Shubhendu BHASIN,Nitin SHARMA,Parag PATRE and Warren DIXON
  2011,9(3):400-409 [Abstract(5054)]  [View PDF 458.25 K (482)]  [HTML]
  
Stable reinforcement learning with recurrent neural networks
  James Nate KNIGHT and Charles ANDERSON
  2011,9(3):410-420 [Abstract(4827)]  [View PDF 367.62 K (707)]  [HTML]
  
Semi-Markov adaptive critic heuristics with application to airline revenue management
  Ketaki KULKARNI,Abhijit GOSAVI,Susan MURRAY and Katie GRANTHAM
  2011,9(3):421-430 [Abstract(2499)]  [View PDF 207.68 K (474)]  [HTML]
  
Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization
  Amanda LAMPTON,John VALASEK and Mrinal KUMAR
  2011,9(3):431-439 [Abstract(2210)]  [View PDF 475.94 K (361)]  [HTML]
  
Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning
  Xueqing SUN,Tao MAO,Laura RAY,Dongqing SHI and Jerald KRALIK
  2011,9(3):440-450 [Abstract(2130)]  [View PDF 551.24 K (494)]  [HTML]
  
Moving least-squares approximations for linearly-solvable stochastic optimal control problems
  Mingyuan ZHONG and Emanuel TODOROV
  2011,9(3):451-463 [Abstract(3391)]  [View PDF 584.51 K (341)]  [HTML]