Our Paper, Time-Efficient Reinforcement Learning with Stochastic Stateful Policies, was accepted at the International Conference on Learning Representations (ICLR) 2024! We introduce a novel training approach for stateful policies, decomposing them into a stochastic internal state kernel and a stateless policy jointly optimized using our stochastic stateful policy gradient. This method overcomes the drawbacks of Backpropagation Through Time (BPTT), providing a faster and simpler alternative, as demonstrated in evaluations on complex continuous control tasks such as humanoid locomotion.
Robot Learning Workshop
I am excited to announce that I will be co-organizing the Next-Gen Robot Learning Symposium at the Technical University of Darmstadt on 4th November 2024!