Jump to content

Mountain Car: Difference between revisions

From mintOC
No edit summary
Line 7: Line 7:
The '''Mountain Car problem''' proposes a vehicle stuck in a “well.” It lacks the power to directly climb out of the well, but instead must accelerate repeatedly forwards and backwards until it has achieved the energy necessary to exit the well.
The '''Mountain Car problem''' proposes a vehicle stuck in a “well.” It lacks the power to directly climb out of the well, but instead must accelerate repeatedly forwards and backwards until it has achieved the energy necessary to exit the well.


The problem is a popular machine learning test case. It first appeared in the PhD thesis of Andrew Moore in 1990. [Moo90]. The implementation here is taken from [[]] and based on that given by Melnikov, Makmal, and Briegel [MMB14].
The problem is a popular machine learning test case. It first appeared in the PhD thesis of Andrew Moore in 1990. [[#Moo90 | [2]]]. The implementation here is taken from [[#openmdao | [1]]] and based on that given by Melnikov, Makmal, and Briegel [[#MMB14 | [3]]].
Its dynamics are given by a two-dimensional [[:Category:ODE model|ODE model]].
Its dynamics are given by a two-dimensional [[:Category:ODE model|ODE model]].



Revision as of 14:58, 20 August 2025

Mountain Car
State dimension: 1
Differential states: 2
Discrete control functions: 1


The Mountain Car problem proposes a vehicle stuck in a “well.” It lacks the power to directly climb out of the well, but instead must accelerate repeatedly forwards and backwards until it has achieved the energy necessary to exit the well.

The problem is a popular machine learning test case. It first appeared in the PhD thesis of Andrew Moore in 1990. [2]. The implementation here is taken from [1] and based on that given by Melnikov, Makmal, and Briegel [3]. Its dynamics are given by a two-dimensional ODE model.

The optimal integer control functions exhibits a bang bang structure.

Mathematical formulation

minutfsubject tox˙(t)=v(t),v˙(t)=0.001u(t)0.0025cos(3x(t)),x(0)=0.5,v(0)=0,x(tf)=0.5,v(tf)0,u(t)[1,1] t[0,tf]

Reference Solutions

Here is one local solution to the above control problem.

Miscellaneous and Further Reading

This formulation detailed description can be found in [1].

References

[1] Multidisciplinary Optimal Control Library: https://openmdao.org/dymos/docs/latest/examples/mountain_car/mountain_car.html
[2] Andrew William Moore. Efficient memory-based learning for robot control. Technical Report UCAM-CL-TR-209, University of Cambridge, Computer Laboratory, November 1990. URL: https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-209.pdf, doi:10.48456/tr-209.
[3] Alexey A Melnikov, Adi Makmal, and Hans J Briegel. Projective simulation applied to the grid-world and the mountain-car problem. arXiv preprint arXiv:1405.5459, 2014.