Publications: Dr David Mguni

Yan X, Song Y, Cui X, Christianos F, Zhang H, Wang J, Mguni D (2025). A Bilevel Reinforcement Learning Framework with Language Prior Knowledge. Machine Learning and Knowledge Discovery in Databases. Research Track, vol. 16018, Springer Nature
Schäfer L, Slumbers O, McAleer S, Du Y, Albrecht SV, Mguni D (2025). Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning. Autonomous agents and multiagent systems. Conference: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1, 1849-1857.
Jafferjee T, Ziomek J, Yang T, Dai Z, Wang J, Taylor ME, Shao K, Wang J et al. (2025). Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction. Autonomous agents and multiagent systems. Conference: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1, 1042-1050.
Dinh LC, Mguni D, Tran-Thanh L, Yang Y (2024). A Summary of Online Markov Decision Processes with Non-oblivious Strategic Adversary. Conference: International Conference on Autonomous Agents and Multiagent Systems
Li H, Huang W, Mguni D, Shao K (2023). A survey on algorithms for Nash equilibria in finite normal-form games. Computer Science Review
Feng X, Luo Y, Wang Z, Yang M, Du Y (2023). ChessGPT: Bridging Policy Learning and Language Modeling. Conference: Conference on Neural Information Processing Systems
Mguni D, Jafferjee T, Wang J, Perez-Nieves N, Song W, Taylor M, Yang T, Zhu J (2023). Learning to Shape Rewards using a Game of Two Partners. Conference: Association for the Advancement of Artificial Intelligence
Slumbers O, Mguni D, Blumberg S, McAleer S, Wang J (2023). A game-theoretic framework for managing risk in multi-agent systems. Conference: International Conference on Machine Learning
Mguni D, Chen H, Jafferjee T, Wang J, Yue L, McAleer S (2023). MANSA: Learning Fast and Slow in Multi-Agent Systems. Conference: International Conference on Machine Learning
Mguni D, Sootla A, Ziomek J, Slumbers O, Dai Z, Shao K (2023). Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints. Conference: International Conference on Learning Representations
Dinh LC, Mguni D, Tran-Thanh L (2023). Online Markov Decision Processes with Non-oblivious Strategic Adversary. Autonomous Agents and Multi-Agent Systems
Mguni D, Deng X, Li N, Mguni D (2022). On the complexity of computing Markov perfect equilibrium in general-sum stochastic games. National Science Review
Dinh LC, McAleer S, Tian Z, Slumbers O, Mguni D, Wang J (2022). Online double oracle. Transactions on Machine Learning Research
Dai Z, Zhou T, Shao K, Mguni D, Wang B (2022). Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System. Conference: Conference on Robot Learning
Mguni D, Chen Y, Deng X, Wang J (2022). On the Convergence of Fictitious Play: A Decomposition Approach. Conference: International Joint Conference on Artificial Intelligence
Mguni D, Sootla A, Cowen-Rivers A, Jafferjee T, Wang Z (2022). SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation. Conference: International Conference on Machine Learning
Mguni D, Jafferjee T, Wang J, Perez-Nieves N, Tong F, Li Y, Zhu J (2022). LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning. Conference: International Conference on Learning Representations
Mguni D, Perez Nieves N, Wang J. Apparatus and method for automated reward shaping. no. 18365818