Publications: DR David Mguni
Yan X, Song Y, Cui X, Christianos F, Zhang H, Wang J, Mguni D
(
2025
)
.
A Bilevel Reinforcement Learning Framework with Language Prior Knowledge
.
Machine Learning and Knowledge Discovery in Databases. Research Track
,
vol.
16018
,
Springer Nature
Schäfer L, Slumbers O, McAleer S, Du Y, Albrecht SV, Mguni D
(
2025
)
.
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
.
Autonomous agents and multiagent systems
.
Conference:
Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 11849
-
1857
.
Jafferjee T, Ziomek J, Yang T, Dai Z, Wang J, Taylor ME, Shao K, Wang J et al.
(
2025
)
.
Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction
.
Autonomous agents and multiagent systems
.
Conference:
Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 11042
-
1050
.
Dinh LC, Mguni D, Tran-Thanh L, Yang Y
(
2024
)
.
A Summary of Online Markov Decision Processes with Non-oblivious Strategic Adversary
.
Conference:
International Conference on Autonomous Agents and Multiagent Systems
Li H, Huang W, Mguni D, Shao K
(
2023
)
.
A survey on algorithms for Nash equilibria in finite normal-form games
.
Computer Science Review
Feng X, Luo Y, Wang Z, Yang M, Du Y
(
2023
)
.
ChessGPT: Bridging Policy Learning and Language Modeling
.
Conference:
Conference on Neural Information Processing Systems
Mguni D, Jafferjee T, Wang J, Perez-Nieves N, Song W, Taylor M, Yang T, Zhu J
(
2023
)
.
Learning to Shape Rewards using a Game of Two Partners
.
Conference:
Association for the Advancement of Artificial Intelligence
Slumbers O, Mguni D, Blumberg S, McAleer S, Wang J
(
2023
)
.
A game-theoretic framework for managing risk in multi-agent systems
.
Conference:
International Conference on Machine Learning
Mguni D, Chen H, Jafferjee T, Wang J, Yue L, McAleer S
(
2023
)
.
MANSA: Learning Fast and Slow in Multi-Agent Systems
.
Conference:
International Conference on Machine Learning
Mguni D, Sootla A, Ziomek J, Slumbers O, Dai Z, Shao K
(
2023
)
.
Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
.
Conference:
International Conference on Learning Representations
Dinh LC, Mguni D, Tran-Thanh L
(
2023
)
.
Online Markov Decision Processes with Non-oblivious Strategic Adversary
.
Autonomous Agents and Multi-Agent Systems
Mguni D, Deng X, Li N, Mguni D
(
2022
)
.
On the complexity of computing Markov perfect equilibrium in general-sum stochastic games
.
National Science Review
Dinh LC, McAleer S, Tian Z, Slumbers O, Mguni D, Wang J
(
2022
)
.
Online double oracle
.
Transactions on Machine Learning Research
Dai Z, Zhou T, Shao K, Mguni D, Wang B
(
2022
)
.
Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System
.
Conference:
Conference on Robot Learning
Mguni D, Chen Y, Deng X, Wang J
(
2022
)
.
On the Convergence of Fictitious Play: A Decomposition Approach
.
Conference:
International Joint Conference on Artificial Intelligence
Mguni D, Sootla A, Cowen-Rivers A, Jafferjee T, Wang Z
(
2022
)
.
SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation
.
Conference:
International Conference on Machine Learning
Mguni D, Jafferjee T, Wang J, Perez-Nieves N, Tong F, Li Y, Zhu J
(
2022
)
.
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
.
Conference:
International Conference on Learning Representations
Mguni D, Perez Nieves N, Wang J
.
Apparatus and method for automated reward shaping
.
no.
18365818
,