publikationen_reinforcement-learning

Perkins, Engelbrecht, Barto, Reinforcement learning with stability guarantees