On the Performance of Planning Through Backpropagation

Jan 1, 2020

Renato Scaroni

Thiago P. Bueno

Leliane N. De Barros

Denis D. Mauá

Abstract

Planning problems with continuous state and action spaces are difficult to solve with existing planning techniques, especially when the state transition is defined by high-dimensional non-linear dynamics. Recently, a technique called Planning through Backpropagation (PtB) was introduced as an efficient and scalable alternative to traditional optimization-based methods for continuous planning problems. PtB leverages modern gradient descent algorithms and highly optimized automatic differentiation libraries to obtain approximate solutions. However, to date there have been no empirical evaluations of PtB on Linear-Quadratic (LQ) control problems. In this work, we compare PtB with LQR, an optimal algorithm from control theory, and with its iterative version, iLQR, on linear and non-linear continuous deterministic planning problems. The empirical results suggest that PtB can be an efficient alternative for optimizing non-linear continuous deterministic planning problems, and that it is much easier to implement and stabilize than classical model-predictive control methods.
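
As a rough illustration of the idea the abstract describes, the sketch below plans an action sequence by gradient descent on the total cost of a simulated trajectory. It is not the paper's implementation. The double-integrator dynamics, the cost matrices, the function names `rollout`, `cost_gradient`, and `plan`, and the hyperparameters are all assumptions chosen for illustration, and the gradient is written by hand in NumPy where PtB would rely on an automatic differentiation library.

```python
import numpy as np

# Hypothetical toy problem (an assumption, not taken from the paper):
# a double integrator with state (position, velocity) and a scalar action.
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
Q = np.eye(2)          # state cost weights
R = 0.1 * np.eye(1)    # action cost weights


def rollout(x0, actions):
    """Simulate x_{t+1} = A x_t + B u_t; return visited states and total cost."""
    states = [x0]
    cost = 0.0
    for u in actions:
        x = states[-1]
        cost += x @ Q @ x + u @ R @ u
        states.append(A @ x + B @ u)
    cost += states[-1] @ Q @ states[-1]  # terminal cost
    return states, cost


def cost_gradient(x0, actions):
    """Backpropagate the total cost through the rollout to every action."""
    states, _ = rollout(x0, actions)
    grad = np.zeros_like(actions)
    lam = 2.0 * Q @ states[-1]  # derivative of the cost w.r.t. the final state
    for t in reversed(range(len(actions))):
        grad[t] = 2.0 * R @ actions[t] + B.T @ lam
        lam = 2.0 * Q @ states[t] + A.T @ lam
    return grad


def plan(x0, horizon=20, steps=500, lr=0.05):
    """Plain gradient descent on the action sequence, starting from zeros."""
    actions = np.zeros((horizon, 1))
    for _ in range(steps):
        actions -= lr * cost_gradient(x0, actions)
    return actions


x0 = np.array([1.0, 0.0])
_, initial_cost = rollout(x0, np.zeros((20, 1)))
_, planned_cost = rollout(x0, plan(x0))
print(f"cost with zero actions: {initial_cost:.3f}, after planning: {planned_cost:.3f}")
```

On a linear-quadratic problem like this toy one, LQR gives the optimal solution in closed form, so the gradient-based plan can only approach that cost. The contrast between the two is the kind of comparison the abstract refers to.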

Type

Conference paper

Publication

Proceedings of the 9th Brazilian Conference on Intelligent Systems

Last updated on Jan 1, 2020

Authors

Denis D. Mauá (he/him)

Associate Professor
