Ruoqi Zhang

Stay Huuuungry Stay Foooolish

prof_pic.jpg

ÅNG 103175 hus 10

Lägerhyddsvägen 1

Box 337 751 05 UPPSALA

I am a PhD Candidate supervised by Per Mattsson and co-supervised by Torbjörn Wigren at Uppsala University. My research focuses on Reinforcement Learning (RL) and Automatic Control, particularly addressing how control could assist RL and uncertainty in RL.

I earned my master from Uppsala Univerity in 2020 and did my thesis in Uppsala System and Control Division. Beyond my research, I organize SysCon RL Reading Group, engaging weekly in recent RL papers to foster understanding and innovation within the field. Explore past discussions here.

news

Oct 21, 2024 🎉 I will visit Professor Dominik Bauman who is leading Cyber-physical Systems Group from Oct 22nd to Dec 5th at Aalto University. This visit is partially funded by Liljewalch travel scholarships.
Sep 25, 2024 🎉 Our work Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning is accepted by Neurips 2024, Vancouver.
Jul 24, 2024 🎉 Our paper Safe Output Feedback Improvement with Baselines was accpted by IEEE CDC 2024 Milano.
May 14, 2024 🎉 I will attend ETPL-ETHZ Multi-Agent Reinforcement Learning from July 29th to July 31st in Lausanne.
May 11, 2024 🎉 We presented our work Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning on Generative Models for Decision Making Workshop on ICLR 2024, Vienna.

selected publications

  1. Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
    Zhang, Ruoqi, Luo, Ziwei, Sjölund, Jens, Schön, Thomas B., and Mattsson, Per
    2024
  2. Observer-Feedback-Feedforward Controller Structures in Reinforcement Learning
    Zhang, Ruoqi, Mattsson, Per, and Wigren, Torbjörn
    IFAC-PapersOnLine 2023 22nd IFAC World Congress, Yokohama, Japan
  3. Robust nonlinear set-point control with reinforcement learning
    Zhang, Ruoqi, Mattsson, Per, and Wigren, Torbjörn
    In American Control Conference, ACC 2023, San Diego, CA, USA, May 31 - June 2, 2023 2023
  4. Risk-sensitive Actor-free Policy via Convex Optimisation
    Zhang, Ruoqi, and Sjölund, Jens
    In Proceedings of the IJCAI-23 Joint Workshop on Artificial Intelligence Safety and Safe Reinforcement Learning , Macau, China, August 21-22, 2023 2023