<aside> 🔥 ICLR 2024 (Spotlight)

</aside>

<aside> 📝 **Wei Wang¹, Dongqi Han², Xufang Luo², Dongsheng Li²

¹ Western University, Canada, ² Microsoft Research Asia

</aside>

🔥 News

🔥 Table of Content

🔥 TLDR


This paper formalizes and addresses signal delay in deep reinforcement learning, introducing effective strategies that maintain high performance in robotic control tasks despite substantial delays.

🔥 Overview


Untitled.png

Despite the notable advancements in deep reinforcement learning (DRL) in recent years, a prevalent issue that is often overlooked is the impact of signal delay. Signal delay occurs when there is a lag between an agent's perception of the environment and its corresponding actions. In this paper, we first formalize delayed-observation Markov decision processes (DOMDP) by extending the standard MDP framework to incorporate signal delays. Next, we elucidate the challenges posed by the presence of signal delay in DRL, showing that trivial DRL algorithms and generic methods for partially observable tasks suffer greatly from delays. Lastly, we propose effective strategies to overcome these challenges. Our methods achieve remarkable performance in continuous robotic control tasks with large delays, yielding results comparable to those in non-delayed cases. Overall, our work contributes to a deeper understanding of DRL in the presence of signal delays and introduces novel approaches to address the associated challenges.

🔥 Video - 5min Short Intro


https://www.youtube.com/watch?v=bJIMSvwUYhc

🔥 Resources - Code, Slides, Video, Paper


🔥 Code

https://github.com/microsoft/Addressing-signal-delay-in-deep-RL

🔥 Slides

https://docs.google.com/presentation/d/1s55OP8DXf8XSeyXOKlKVmaNy1QI9hjBgq4SnYWdPE5U/edit?usp=sharing

https://docs.google.com/presentation/d/1xqDNW_HndHOkyiJc2XjWlquemYrUndt6IShBjTfaWao/edit?usp=sharing

🔥 Video

Addressing Signal Delay in Deep Reinforcement Learning

🔥 Poster

poster.pdf

🔥 Paper

Addressing Signal Delay in Deep Reinforcement Learning

🔥 Q&A