Mastane Achab, Technology Innovation Institute, Abu Dhabi
Title: "Robustness via distributional reinforcement learning". Abstract: In dynamic programming (DP) and reinforcement learning (RL), an agent learns to act optimally in terms of expected long-term return by sequentially interacting with its environment, modeled by a Markov decision process (MDP).