Publications

For news about publications, follow us on X:

Click on any author names or tags to filter publications.

All topic tags:
survey deep-rl multi-agent-rl agent-modelling ad-hoc-teamwork autonomous-driving goal-recognition explainable-ai causal generalisation security emergent-communication iterated-learning intrinsic-reward simulator state-estimation deep-learning transfer-learning

Selected tags (click to remove):
deep-rl

2025

Zidu Yin, Zhen Zhang, Dong Gong, Stefano V. Albrecht, Javen Q. Shi
Highway Graph to Accelerate Reinforcement Learning
Transactions on Machine Learning Research, 2025
Abstract | BibTex | arXiv | Code
TMLR deep-rl

Abstract: Reinforcement Learning (RL) algorithms often suffer from low training efficiency. A strategy to mitigate this issue is to incorporate a model-based planning algorithm, such as Monte Carlo Tree Search (MCTS) or Value Iteration (VI), into the environmental model. The major limitation of VI is the need to iterate over a large tensor with the shape |S| × |A| × |S|, where S/A denotes the state/action space. This process iteratively updates the value of the preceding state st−1 based on the state st in one step via value propagation. These still lead to intensive computations. We focus on improving the training efficiency of RL algorithms by improving the efficiency of the value learning process. For the deterministic environments with discrete state and action spaces, on the sampled empirical state-transition graph, a non-branching sequence of transitions can directly bring the agent from s0 to sT without deviating from intermediate states, which we call a highway. On such non-branching highways, the value-updating process can be merged as a one-step process instead of iterating the value step-by-step. Based on this observation, we propose a novel graph structure, named highway graph, to model the state transition. Our highway graph compresses the transition model into a concise graph, where edges can represent multiple state transitions to support value propagation across multiple time steps in each iteration. We thus can obtain a more efficient value learning approach by facilitating the VI algorithm on highway graphs. By integrating the highway graph into RL (as a model-based off-policy RL method), the RL training can be remarkably accelerated in the early stages (within 1 million frames). Comparison against various baselines on four categories of environments reveals that our method outperforms both representative and novel model-free and model-based RL algorithms, demonstrating 10 to more than 150 times more efficiency while maintaining an equal or superior expected return, as confirmed by carefully conducted analyses. Moreover, a deep neural network-based agent is trained using the highway graph, resulting in better generalization and lower storage costs.

@article{yin2025highway,
   title = {Highway Graph to Accelerate Reinforcement Learning},
   author = {Zidu Yin and Zhen Zhang and Dong Gong and Stefano V. Albrecht and Javen Q. Shi},
   journal = {Transactions on Machine Learning Research (TMLR)},
   year = {2025}
}

Alain Andres, Lukas Schäfer, Esther Villar-Rodriguez, Stefano V. Albrecht, Javier Del Ser
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Neurocomputing, 2025
Abstract | BibTex | Publisher | Code
Neurocomputing deep-rl

@article{andres2025offline,
   title = {Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments},
   author = {Andres, Alain and Sch\"afer, Lukas and Villar-Rodriguez, Esther and Albrecht, Stefano V. and Del Ser, Javier},
   journal = {Neurocomputing},
   volume = {618}
   year = {2025}
}

Samuel Garcin, Trevor McInroe, Pablo Samuel Castro, Christopher G. Lucas, David Abel, Prakash Panangaden, Stefano V. Albrecht
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
International Conference on Learning Representations, 2025
Abstract | BibTex | Paper | Code
ICLR deep-rl generalisation

@inproceedings{garcin2025acrep,
   title={Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning},
   author={Samuel Garcin and Trevor McInroe and Pablo Samuel Castro and Christopher G. Lucas and David Abel and Prakash Panangaden and Stefano V. Albrecht},
   booktitle={13th International Conference on Learning Representations},
   year={2025}
}

2024

Stefano V. Albrecht, Filippos Christianos, Lukas Schäfer
Multi-Agent Reinforcement Learning: Foundations and Modern Approaches
MIT Press (print version scheduled for December 2024), 2024
Abstract | BibTex | Book website | Book codebase
MITP multi-agent-rl deep-rl deep-learning survey

@book{ marl-book,
   author = {Stefano V. Albrecht and Filippos Christianos and Lukas Sch\"afer},
   title = {Multi-Agent Reinforcement Learning: Foundations and Modern Approaches},
   publisher = {MIT Press},
   year = {2024},
   url = {https://www.marl-book.com}
}

Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht
Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning
Transactions on Machine Learning Research, 2024
Abstract | BibTex | arXiv | Code
TMLR deep-rl

@article{mcinroe2024hksl,
   title = {Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning},
   author = {Trevor McInroe and Lukas Schäfer and Stefano V. Albrecht},
   journal = {Transactions on Machine Learning Research (TMLR)},
   year = {2024}
}

Xuehui Yu, Mhairi Dunion, Xin Li, Stefano V. Albrecht
Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
Conference on Neural Information Processing Systems, 2024
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl generalisation

@inproceedings{yu2024skillaware,
   title={Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning},
   author={Xuehui Yu and Mhairi Dunion and Xin Li and Stefano V. Albrecht},
   booktitle={Conference on Neural Information Processing Systems},
   year={2024}
}

Mhairi Dunion, Stefano V. Albrecht
Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Reinforcement Learning Conference, 2024
Abstract | BibTex | arXiv | Code
RLC deep-rl generalisation

@inproceedings{dunion2024mvd,
   title={Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras},
   author={Mhairi Dunion and Stefano V. Albrecht},
   booktitle={1st Reinforcement Learning Conference},
   year={2024}
}

Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Amos Storkey
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Reinforcement Learning Conference, 2024
Abstract | BibTex | arXiv
RLC deep-rl

@inproceedings{mcinroe2024planning,
   title={Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning},
   author={Trevor McInroe and Adam Jelley and Stefano V. Albrecht and Amos Storkey},
   booktitle={1st Reinforcement Learning Conference},
   year={2024}
}

Aditya Kapoor, Sushant Swamy, Kale-ab Tessera, Mayank Baranwal, Mingfei Sun, Harshad Khadilkar, Stefano V. Albrecht
Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning
RLC Workshop on Coordination and Cooperation for Multi-Agent Reinforcement Learning Methods, 2024
Abstract | BibTex | Paper
RLC deep-rl multi-agent-rl

@inproceedings{kapoor2024agenttemporal,
   title={Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning},
   author={Aditya Kapoor and Sushant Swamy and Kale-ab Tessera and Mayank Baranwal and Mingfei Sun and Harshad Khadilkar and Stefano V Albrecht},
   booktitle={Coordination and Cooperation for Multi-Agent Reinforcement Learning Methods Workshop},
   year={2024},
   url={https://openreview.net/forum?id=dGS1e3FXUH}
}

Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
International Conference on Machine Learning, 2024
Abstract | BibTex | arXiv
ICML deep-rl

@inproceedings{garcin2024dred,
   title={{DRED}: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design},
   author={Samuel Garcin and James Doran and Shangmin Guo and Christopher G. Lucas and Stefano V. Albrecht},
   year={2024},
   booktitle={International Conference on Machine Learning (ICML)}
}

Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
AAAI Conference on Artificial Intelligence, 2024
Abstract | BibTex | arXiv | Code | Video
AAAI deep-rl causal

@inproceedings{azran2024contextual,
   title={Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Guy Azran and Mohamad H. Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={Proceedings of the 38th AAAI Conference on Artificial Intelligence},
   year={2024}
}

Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
ICAPS Workshop on Planning and Reinforcement Learning, 2024
Abstract | BibTex | arXiv | Code | Video
ICAPS deep-rl causal

@inproceedings{Azran2022enhancing,
   title={Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Azran, Guy and Danesh, Mohamad H. and Albrecht, Stefano V. and Keren, Sarah},
   booktitle={ICAPS Workshop on Planning and Reinforcement Learning (https://prl-theworkshop.github.io/prl2024-icaps/},
   year={2024}
}

2023

Arrasy Rahman, Ignacio Carlucho, Niklas Höpner, Stefano V. Albrecht
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Journal of Machine Learning Research, 2023
Abstract | BibTex | arXiv | Publisher | Code
JMLR ad-hoc-teamwork deep-rl agent-modelling multi-agent-rl

@article{JRahman2022POGPL,
   author  = {Arrasy Rahman and Ignacio Carlucho and Niklas H\"opner and Stefano V. Albrecht},
   title   = {A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning},
   journal = {Journal of Machine Learning Research},
   year    = {2023},
   volume  = {24},
   number  = {298},
   pages   = {1--74},
   url     = {http://jmlr.org/papers/v24/22-099.html}
}

Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
Transactions on Machine Learning Research, 2023
Abstract | BibTex | arXiv | Code
TMLR deep-rl multi-agent-rl

@article{christianos2023pareto,
   title={Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Georgios Papoudakis and Stefano V. Albrecht},
   journal={Transactions on Machine Learning Research (TMLR)},
   year={2023}
}

Arrasy Rahman, Elliot Fosong, Ignacio Carlucho, Stefano V. Albrecht
Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity
Transactions on Machine Learning Research, 2023
Abstract | BibTex | arXiv | Code
TMLR ad-hoc-teamwork multi-agent-rl deep-rl

@article{rahman2023BRDiv,
   title={Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity},
   author={Arrasy Rahman and Elliot Fosong and Ignacio Carlucho and Stefano V. Albrecht},
   journal={Transactions on Machine Learning Research (TMLR)},
   year={2023}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning
Conference on Neural Information Processing Systems, 2023
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl causal generalisation

@inproceedings{dunion2023cmid,
   title={Conditional Mutual Information for Disentangled Representations in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={Conference on Neural Information Processing Systems},
   year={2023}
}

Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
NeurIPS Workshop on Generalization in Planning, 2023
Abstract | BibTex | arXiv | Code
NeurIPS multi-agent-rl deep-rl

@inproceedings{schaefer2023mate,
   title={Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning},
   author={Lukas Schäfer and Filippos Christianos and Amos Storkey and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Generalization in Planning},
   year={2023}
}

Guy Azran, Mohamad H Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
NeurIPS Workshop on Generalization in Planning, 2023
Abstract | BibTex | arXiv
NeurIPS deep-rl causal

@inproceedings{azran2023contextual,
   title={Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Guy Azran and Mohamad H. Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={NeurIPS Workshop on Generalization in Planning},
   year={2023}
}

Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning
NeurIPS Workshop on Agent Learning in Open-Endedness, 2023
Abstract | BibTex | arXiv
NeurIPS deep-rl

@inproceedings{garcin2023level,
   title={How the level sampling process impacts zero-shot generalisation in deep reinforcement learning},
   author={Samuel Garcin and James Doran and Shangmin Guo and Christopher G. Lucas and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Agent Learning in Open-Endedness},
   year={2023}
}

Sabrina McCallum, Max Taylor-Davies, Stefano V. Albrecht, Alessandro Suglia
Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning
NeurIPS Workshop on Goal-Conditioned Reinforcement Learning, 2023
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl

@inproceedings{mccallum2023feedback,
   title={Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning},
   author={Sabrina McCallum and Max Taylor-Davies and Stefano V. Albrecht and Alessandro Suglia},
   booktitle={NeurIPS Workshop on Goal-Conditioned Reinforcement Learning (GCRL)},
   year={2023}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
International Conference on Learning Representations, 2023
Abstract | BibTex | arXiv | Code
ICLR deep-rl generalisation causal

@inproceedings{dunion2023ted,
   title={Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={International Conference on Learning Representations (ICLR)},
   year={2023}
}

Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone
Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
IEEE International Conference on Robotics and Automation, 2023
Abstract | BibTex | arXiv
ICRA deep-rl autonomous-driving

@inproceedings{christianos2023planning,
   title={Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models},
   author={Filippos Christianos and Peter Karkus and Boris Ivanovic and Stefano V. Albrecht and Marco Pavone},
   booktitle={International Conference on Robotics and Automation (ICRA)},
   year={2023}
}

Giuseppe Vecchio, Simone Palazzo, Dario C Guastella, Riccardo E. Sarpietro, Ignacio Carlucho, Stefano V. Albrecht, Giovanni Muscato, Concetto Spampinato
MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments
RSS Workshop on Multi-Agent Planning and Navigation in Challenging Environments, 2023
Abstract | BibTex | arXiv
RSS simulator deep-rl

@inproceedings{vecchio2022midgard,
   title={MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments},
   author={Vecchio, Giuseppe and Palazzo, Simone and Guastella, Dario C and Sarpietro, Riccardo E. and Carlucho, Ignacio and Albrecht, Stefano V. and Muscato, Giovanni and Spampinato, Concetto},
   booktitle={RSS 2023 Workshop on Multi-Agent Planning and Navigation in Challenging Environments},
   year={2023}
}

Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
AAMAS Workshop on Optimization and Learning in Multiagent Systems, 2023
Abstract | BibTex | arXiv
AAMAS deep-rl multi-agent-rl

@inproceedings{christianos2023pareto,
   title={Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Georgios Papoudakis and Stefano V. Albrecht},
   booktitle={AAMAS Workshop on Optimization and Learning in Multiagent Systems},
   year={2023}
}

Adam Michalski, Filippos Christianos, Stefano V. Albrecht
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning
AAMAS Workshop on Multiagent Sequential Decision Making Under Uncertainty, 2023
Abstract | BibTex | arXiv | Code
AAMAS deep-rl multi-agent-rl

@inproceedings{michalski2023smaclite,
   title={SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning},
   author={Adam Michalski and Filippos Christianos and Stefano V. Albrecht},
   booktitle={AAMAS workshop on Multiagent Sequential Decision Making Under Uncertainty (MSDM)},
   year={2023}
}

Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
AAMAS Workshop on Adaptive and Learning Agents, 2023
Abstract | BibTex | arXiv
AAMAS multi-agent-rl deep-rl

@inproceedings{schaefer2023emax,
   title={Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning},
   author={Lukas Schäfer and Oliver Slumbers and Stephen McAleer and Yali Du and Stefano V. Albrecht and David Mguni},
   year={2023},
   booktitle={AAMAS Workshop on Adaptive and Learning Agents (ALA)},
}

Callum Tilbury, Filippos Christianos, Stefano V. Albrecht
Revisiting the Gumbel-Softmax in MADDPG
AAMAS Workshop on Adaptive and Learning Agents, 2023
Abstract | BibTex | arXiv | Code
AAMAS multi-agent-rl deep-rl

@inproceedings{tilbury2023revisitingmaddpg,
   title={Revisiting the Gumbel-Softmax in MADDPG},
   author={Callum Tilbury and Filippos Christianos and Stefano V. Albrecht},
   year={2023},
   booktitle={AAMAS Workshop on Adaptive and Learning Agents (ALA)},
}

Alain Andres, Lukas Schäfer, Esther Villar-Rodriguez, Stefano V. Albrecht, Javier Del Ser
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
AAMAS Workshop on Adaptive and Learning Agents, 2023
Abstract | BibTex | arXiv
AAMAS deep-rl

@inproceedings{andres2023using,
   title={Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments},
   author={Andres, Alain and Schäfer, Lukas and Villar-Rodriguez, Esther and Albrecht, Stefano V. and Del Ser, Javier},
   booktitle={AAMAS Workshop on Adaptive and Learning Agents (ALA)},
   year={2023}
}

Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
IJCAI Workshop on Planning and Reinforcement Learning, 2023
Abstract | BibTex | arXiv
IJCAI deep-rl causal

@inproceedings{azran2023contextual,
   title={Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Guy Azran and Mohamad H. Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={IJCAI Workshop on Planning and Reinforcement Learning (https://prl-theworkshop.github.io/)},
   year={2023}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning
European Workshop on Reinforcement Learning, 2023
Abstract | BibTex | arXiv | Code
EWRL deep-rl causal generalisation

@inproceedings{dunion2023cmid,
   title={Conditional Mutual Information for Disentangled Representations in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={European Workshop on Reinforcement Learning},
   year={2023}
}

Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning
arXiv:2310.03494, 2023
Abstract | BibTex | arXiv
deep-rl

@misc{garcin2023level,
   title={How the level sampling process impacts zero-shot generalisation in deep reinforcement learning},
   author={Samuel Garcin and James Doran and Shangmin Guo and Christopher G. Lucas and Stefano V. Albrecht},
   year={2023},
   eprint={2310.03494},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Trevor McInroe, Stefano V. Albrecht, Amos Storkey
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
arXiv:2310.05723, 2023
Abstract | BibTex | arXiv
deep-rl

@misc{mcinroe2023planning,
   title={Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning},
   author={Trevor McInroe and Stefano V. Albrecht and Amos Storkey},
   year={2023},
   eprint={2310.05723},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

2022

Stefano V. Albrecht, Michael Wooldridge
Special Issue on Multi-Agent Systems Research in the United Kingdom: Guest Editorial
AI Communications, 2022
Abstract | BibTex | Publisher | Special Issue
AIC survey deep-rl multi-agent-rl agent-modelling

@article{albrecht2020special,
   title = {Special Issue on Multi-Agent Systems Research in the United Kingdom: Guest Editorial},
   author = {Stefano V. Albrecht and Michael Wooldridge},
   journal = {AI Communications},
   volume = {35},
   number = {4},
   year = {2022},
   publisher = {IOS Press},
   url = {https://content.iospress.com/articles/ai-communications/aic229003}
}

Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht
Deep Reinforcement Learning for Multi-Agent Interaction
AI Communications, 2022
Abstract | BibTex | arXiv | Publisher
AIC survey deep-rl multi-agent-rl ad-hoc-teamwork agent-modelling goal-recognition security explainable-ai autonomous-driving

@article{albrecht2022aic,
   author = {Ahmed, Ibrahim H. and Brewitt, Cillian and Carlucho, Ignacio and Christianos, Filippos and Dunion, Mhairi and Fosong, Elliot and Garcin, Samuel and Guo, Shangmin and Gyevnar, Balint and McInroe, Trevor and Papoudakis, Georgios and Rahman, Arrasy and Schäfer, Lukas and Tamborski, Massimiliano and Vecchio, Giuseppe and Wang, Cheng and Albrecht, Stefano V.},
   title = {Deep Reinforcement Learning for Multi-Agent Interaction},
   journal = {AI Communications, Special Issue on Multi-Agent Systems Research in the UK},
   year = {2022}
}

Rujie Zhong, Duohan Zhang, Lukas Schäfer, Stefano V. Albrecht, Josiah P. Hanna
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning
Conference on Neural Information Processing Systems, 2022
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl

@inproceedings{zhong2022datacollection,
   title={Robust On-Policy Data Collection for Data Efficient Policy Evaluation},
   author={Rujie Zhong and Duohan Zhang and Lukas Sch\"afer and Stefano V. Albrecht and Josiah P. Hanna},
   booktitle={Conference on Neural Information Processing Systems},
   year={2022}
}

Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht
Learning Representations for Reinforcement Learning with Hierarchical Forward Models
NeurIPS Workshop on Deep Reinforcement Learning, 2022
Abstract | BibTex | arXiv
NeurIPS deep-rl generalisation

@inproceedings{mcinroe2022hksl,
   title={Learning Representations for Reinforcement Learning with Hierarchical Forward Models},
   author={Trevor McInroe and Lukas Schäfer and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Deep RL},
   year={2022}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
NeurIPS Workshop on Deep Reinforcement Learning, 2022
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl generalisation causal

@inproceedings{dunion2022ted,
   title={Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Deep Reinforcement Learning},
   year={2022}
}

Guy Azran, Mohamad Hosein Danesh, Stefano V. Albrecht, Sarah Keren
Enhancing Transfer of Reinforcement Learning Agents with Abstract Contextual Embeddings
NeurIPS Workshop on Neuro Causal and Symbolic AI, 2022
Abstract | BibTex
NeurIPS deep-rl causal

@inproceedings{Azran2022enhancing,
   title={Enhancing Transfer of Reinforcement Learning Agents with Abstract Contextual Embeddings},
   author={Guy Azran and Mohamad Hosein Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={NeurIPS Workshop on Neuro Causal and Symbolic AI (https://ncsi.cause-lab.net)},
   year={2022}
}

Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
International Conference on Autonomous Agents and Multi-Agent Systems, 2022
Abstract | BibTex | arXiv | Code
AAMAS deep-rl intrinsic-reward

@inproceedings{schaefer2022derl,
   title={Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration},
   author={Lukas Schäfer and Filippos Christianos and Josiah P. Hanna and Stefano V. Albrecht},
   booktitle={International Conference on Autonomous Agents and Multiagent Systems (AAMAS)},
   year={2022}
}

Giuseppe Vecchio, Simone Palazzo, Dario C Guastella, Ignacio Carlucho, Stefano V. Albrecht, Giovanni Muscato, Concetto Spampinato
MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments
ICRA Workshop on Releasing Robots into the Wild: Simulations, Benchmarks, and Deployment, 2022
Abstract | BibTex | arXiv
ICRA deep-rl simulator

@misc{Vecchio2022MIDGARD,
   title={MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments},
   author={Giuseppe Vecchio, Simone Palazzo, Dario C Guastella, Ignacio Carlucho, Stefano V. Albrecht, Giovanni Muscato, Concetto Spampinato},
   year={2022},
   eprint={2205.08389},
   archivePrefix={arXiv},
   primaryClass={cs.MA}
}

Arrasy Rahman, Ignacio Carlucho, Niklas Höpner, Stefano V. Albrecht
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
arXiv:2210.05448, 2022
Abstract | BibTex | arXiv
ad-hoc-teamwork deep-rl agent-modelling

@misc{Rahman2022POGPL,
   title={A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning},
   author={Arrasy Rahman and Ignacio Carlucho and Niklas H\"opner and Stefano V. Albrecht},
   year={2022},
   eprint={2210.05448},
   archivePrefix={arXiv}
}

Aleksandar Krnjaic, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Peter Börsting, Stefano V. Albrecht
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
arXiv:2212.11498, 2022
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{Krnjaic2022HSNAC,
   title={Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers},
   author={Aleksandar Krnjaic and Jonathan D. Thomas and Georgios Papoudakis and Lukas Sch\"afer and Peter B\"orsting and Stefano V. Albrecht,
   year={2022},
   eprint={2212.11498},
   archivePrefix={arXiv}
}

Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
arxiv:2207.02249, 2022
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{schaefer2022mate,
   title={Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning},
   author={Lukas Schäfer and Filippos Christianos and Amos Storkey and Stefano V. Albrecht},
   year={2022},
   eprint={2207.02249},
   archivePrefix={arXiv},
   primaryClass={cs.MA}
}

Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
arXiv:2209.14344, 2022
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{christianos2022pareto,
   title={Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Georgios Papoudakis and Stefano V. Albrecht},
   year={2022},
   eprint={2209.14344},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

2021

Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Conference on Neural Information Processing Systems, Datasets and Benchmarks Track, 2021
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl multi-agent-rl

@inproceedings{papoudakis2021benchmarking,
   title={Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks},
   author={Georgios Papoudakis and Filippos Christianos and Lukas Sch\"afer and Stefano V. Albrecht},
   booktitle = {Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS)},
   year={2021},
   url = {http://arxiv.org/abs/2006.07869},
   openreview = {https://openreview.net/forum?id=cIrPX-Sn5n},
   code = {https://github.com/uoe-agents/epymarl}
}

Georgios Papoudakis, Filippos Christianos, Stefano V. Albrecht
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Conference on Neural Information Processing Systems, 2021
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl agent-modelling

@inproceedings{papoudakis2021local,
   title={Agent Modelling under Partial Observability for Deep Reinforcement Learning},
   author={Georgios Papoudakis and Filippos Christianos and Stefano V. Albrecht},
   booktitle = {Proceedings of the Neural Information Processing Systems (NeurIPS)},
   year = {2021}
}

Rujie Zhong, Josiah P. Hanna, Lukas Schäfer, Stefano V. Albrecht
Robust On-Policy Data Collection for Data-Efficient Policy Evaluation
NeurIPS Workshop on Offline Reinforcement Learning, 2021
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl

@inproceedings{zhong2021robust,
   title={Robust On-Policy Data Collection for Data-Efficient Policy Evaluation},
   author={Rujie Zhong and Josiah P. Hanna and Lukas Sch\"afer and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Offline Reinforcement Learning (OfflineRL)},
   year={2021}
}

Arrasy Rahman, Niklas Höpner, Filippos Christianos, Stefano V. Albrecht
Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
International Conference on Machine Learning, 2021
Abstract | BibTex | arXiv | Video | Code
ICML deep-rl agent-modelling ad-hoc-teamwork

@inproceedings{rahman2021open,
   title={Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning},
   author={Arrasy Rahman and Niklas H\"opner and Filippos Christianos and Stefano V. Albrecht},
   booktitle={International Conference on Machine Learning (ICML)},
   year={2021}
}

Filippos Christianos, Georgios Papoudakis, Arrasy Rahman, Stefano V. Albrecht
Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
International Conference on Machine Learning, 2021
Abstract | BibTex | arXiv | Video | Code
ICML deep-rl multi-agent-rl

@inproceedings{christianos2021scaling,
   title={Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing},
   author={Filippos Christianos and Georgios Papoudakis and Arrasy Rahman and Stefano V. Albrecht},
   booktitle={International Conference on Machine Learning (ICML)},
   year={2021}
}

Lukas Schäfer, Filippos Christianos, Josiah Hanna, Stefano V. Albrecht
Decoupling Exploration and Exploitation in Reinforcement Learning
ICML Workshop on Unsupervised Reinforcement Learning, 2021
Abstract | BibTex | arXiv | Code
ICML deep-rl intrinsic-reward

@inproceedings{schaefer2021decoupling,
   title={Decoupling Exploration and Exploitation in Reinforcement Learning},
   author={Lukas Schäfer and Filippos Christianos and Josiah Hanna and Stefano V. Albrecht},
   booktitle={ICML Workshop on Unsupervised Reinforcement Learning (URL)},
   year={2021}
}

Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht
Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
arXiv:2110.04935, 2021
Abstract | BibTex | arXiv | Code
deep-rl

@misc{mcinroe2021learning,
   title={Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning},
   author={Trevor McInroe and Lukas Schäfer and Stefano V. Albrecht},
   year={2021},
   eprint={2110.04935},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

2020

Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Conference on Neural Information Processing Systems, 2020
Abstract | BibTex | arXiv
NeurIPS deep-rl multi-agent-rl

@inproceedings{christianos2020shared,
   title={Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Lukas Sch\"afer and Stefano V. Albrecht},
   booktitle={34th Conference on Neural Information Processing Systems},
   year={2020}
}

Georgios Papoudakis, Stefano V. Albrecht
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems
AAAI Workshop on Reinforcement Learning in Games, 2020
Abstract | BibTex | arXiv
AAAI deep-rl agent-modelling

@inproceedings{papoudakis2020variational,
   title={Variational Autoencoders for Opponent Modeling in Multi-Agent Systems},
   author={Georgios Papoudakis and Stefano V. Albrecht},
   booktitle={AAAI Workshop on Reinforcement Learning in Games},
   year={2020}
}

Arrasy Rahman, Niklas Höpner, Filippos Christianos, Stefano V. Albrecht
Open Ad Hoc Teamwork using Graph-based Policy Learning
arXiv:2006.10412, 2020
Abstract | BibTex | arXiv
deep-rl agent-modelling ad-hoc-teamwork

@misc{rahman2020open,
   title={Open Ad Hoc Teamwork using Graph-based Policy Learning},
   author={Arrasy Rahman and Niklas H\"opner and Filippos Christianos and Stefano V. Albrecht},
   year={2020},
   eprint={2006.10412},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Georgios Papoudakis, Filippos Christianos , Lukas Schäfer, Stefano V. Albrecht
Comparative Evaluation of Multi-Agent Deep Reinforcement Learning Algorithms
arXiv:2006.07869, 2020
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{papoudakis2020comparative,
   title={Comparative Evaluation of Multi-Agent Deep Reinforcement Learning Algorithms},
   author={Georgios Papoudakis and Filippos Christianos and Lukas Sch\"afer and Stefano V. Albrecht},
   year={2020},
   eprint={2006.07869},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Georgios Papoudakis, Filippos Christianos, Stefano V. Albrecht
Local Information Opponent Modelling Using Variational Autoencoders
arXiv:2006.09447, 2020
Abstract | BibTex | arXiv
deep-rl agent-modelling

@misc{papoudakis2020opponent,
   title={Local Information Opponent Modelling Using Variational Autoencoders},
   author={Georgios Papoudakis and Filippos Christianos and Stefano V. Albrecht},
   year={2020},
   eprint={2006.09447},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

2019

Georgios Papoudakis, Filippos Christianos, Arrasy Rahman, Stefano V. Albrecht
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning
arXiv:1906.04737, 2019
Abstract | BibTex | arXiv
survey deep-rl multi-agent-rl

@misc{papoudakis2019dealing,
   title={Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning},
   author={Georgios Papoudakis and Filippos Christianos and Arrasy Rahman and Stefano V. Albrecht},
   year={2019},
   eprint={1906.04737},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}