Full Content is available to subscribers

Subscribe/Learn More  >
Proceedings Article

Approximate dynamic programming recurrence relations for a hybrid optimal control problem

[+] Author Affiliations
W. Lu, S. Ferrari

Duke Univ. (United States)

R. Fierro

The Univ. of New Mexico (United States)

T. A. Wettergren

Naval Undersea Warfare Ctr. (United States)

Proc. SPIE 8387, Unmanned Systems Technology XIV, 83870C (May 1, 2012); doi:10.1117/12.919286
Text Size: A A A
From Conference Volume 8387

  • Unmanned Systems Technology XIV
  • Baltimore, Maryland, USA | April 23, 2012

abstract

This paper presents a hybrid approximate dynamic programming (ADP) method for a hybrid dynamic system (HDS) optimal control problem, that occurs in many complex unmanned systems which are implemented via a hybrid architecture, regarding robot modes or the complex environment. The HDS considered in this paper is characterized by a well-known three-layer hybrid framework, which includes a discrete event controller layer, a discrete-continuous interface layer, and a continuous state layer. The hybrid optimal control problem (HOCP) is to nd the optimal discrete event decisions and the optimal continuous controls subject to a deterministic minimization of a scalar function regarding the system state and control over time. Due to the uncertainty of environment and complexity of the HOCP, the cost-to-go cannot be evaluated before the HDS explores the entire system state space; as a result, the optimal control, neither continuous nor discrete, is not available ahead of time. Therefore, ADP is adopted to learn the optimal control while the HDS is exploring the environment, because of the online advantage of ADP method. Furthermore, ADP can break the curses of dimensionality which other optimizing methods, such as dynamic programming (DP) and Markov decision process (MDP), are facing due to the high dimensions of HOCP.© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Citation

W. Lu ; S. Ferrari ; R. Fierro and T. A. Wettergren
"Approximate dynamic programming recurrence relations for a hybrid optimal control problem", Proc. SPIE 8387, Unmanned Systems Technology XIV, 83870C (May 1, 2012); doi:10.1117/12.919286; http://dx.doi.org/10.1117/12.919286


Access This Proceeding
Sign in or Create a personal account to Buy this proceeding ($15 for members, $18 for non-members).

Figures

Tables

NOTE:
Citing articles are presented as examples only. In non-demo SCM6 implementation, integration with CrossRef’s "Cited By" API will populate this tab (http://www.crossref.org/citedby.html).

Some tools below are only available to our subscribers or users with an online account.

Related Content

Customize your page view by dragging & repositioning the boxes below.

Related Book Chapters

Advertisement
  • Don't have an account?
  • Subscribe to the SPIE Digital Library
  • Create a FREE account to sign up for Digital Library content alerts and gain access to institutional subscriptions remotely.
Access This Proceeding
Sign in or Create a personal account to Buy this proceeding ($15 for members, $18 for non-members).
Access This Proceeding
Sign in or Create a personal account to Buy this article ($15 for members, $18 for non-members).
Access This Chapter

Access to SPIE eBooks is limited to subscribing institutions and is not available as part of a personal subscription. Print or electronic versions of individual SPIE books may be purchased via SPIE.org.