Publications

For news about publications, follow us on X:

Click on any author names or tags to filter publications.

All topic tags:
survey deep-rl multi-agent-rl agent-modelling ad-hoc-teamwork autonomous-driving goal-recognition explainable-ai causal generalisation security emergent-communication iterated-learning intrinsic-reward simulator state-estimation deep-learning transfer-learning LLM

2025

Zidu Yin, Zhen Zhang, Dong Gong, Stefano V. Albrecht, Javen Q. Shi
Highway Graph to Accelerate Reinforcement Learning
Transactions on Machine Learning Research, 2025
Abstract | BibTex | arXiv | Code
TMLR deep-rl

Abstract: Reinforcement Learning (RL) algorithms often suffer from low training efficiency. A strategy to mitigate this issue is to incorporate a model-based planning algorithm, such as Monte Carlo Tree Search (MCTS) or Value Iteration (VI), into the environmental model. The major limitation of VI is the need to iterate over a large tensor with the shape |S| × |A| × |S|, where S/A denotes the state/action space. This process iteratively updates the value of the preceding state st−1 based on the state st in one step via value propagation. These still lead to intensive computations. We focus on improving the training efficiency of RL algorithms by improving the efficiency of the value learning process. For the deterministic environments with discrete state and action spaces, on the sampled empirical state-transition graph, a non-branching sequence of transitions can directly bring the agent from s0 to sT without deviating from intermediate states, which we call a highway. On such non-branching highways, the value-updating process can be merged as a one-step process instead of iterating the value step-by-step. Based on this observation, we propose a novel graph structure, named highway graph, to model the state transition. Our highway graph compresses the transition model into a concise graph, where edges can represent multiple state transitions to support value propagation across multiple time steps in each iteration. We thus can obtain a more efficient value learning approach by facilitating the VI algorithm on highway graphs. By integrating the highway graph into RL (as a model-based off-policy RL method), the RL training can be remarkably accelerated in the early stages (within 1 million frames). Comparison against various baselines on four categories of environments reveals that our method outperforms both representative and novel model-free and model-based RL algorithms, demonstrating 10 to more than 150 times more efficiency while maintaining an equal or superior expected return, as confirmed by carefully conducted analyses. Moreover, a deep neural network-based agent is trained using the highway graph, resulting in better generalization and lower storage costs.

@article{yin2025highway,
   title = {Highway Graph to Accelerate Reinforcement Learning},
   author = {Zidu Yin and Zhen Zhang and Dong Gong and Stefano V. Albrecht and Javen Q. Shi},
   journal = {Transactions on Machine Learning Research (TMLR)},
   year = {2025}
}

Alain Andres, Lukas Schäfer, Esther Villar-Rodriguez, Stefano V. Albrecht, Javier Del Ser
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Neurocomputing, 2025
Abstract | BibTex | Publisher | Code
Neurocomputing deep-rl

@article{andres2025offline,
   title = {Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments},
   author = {Andres, Alain and Sch\"afer, Lukas and Villar-Rodriguez, Esther and Albrecht, Stefano V. and Del Ser, Javier},
   journal = {Neurocomputing},
   volume = {618}
   year = {2025}
}

Charlie Masters, Advaith Vellanki, Jiangbo Shangguan, Bart Kultys, Alastair Moore, Stefano V. Albrecht
Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge
International Conference on Distributed Artificial Intelligence, 2025
Abstract | BibTex | arXiv | Code
DAI multi-agent-rl LLM

@inproceedings{manager_agent_gym_2025,
   title     = {Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge},
   author    = {Masters, Charlie and Vellanki, Advaith and Shangguan, Jiangbo and Kultys, Bart and Moore, Alastair and Albrecht, Stefano V.},
   booktitle = {Proceedings of the International Conference on Distributed Artificial Intelligence (DAI 2025)},
   year      = {2025}
}

Kale-ab Tessera, Arrasy Rahman, Amos Storkey, Stefano V. Albrecht
HyperMARL: Adaptive Hypernetworks for Multi-Agent RL
Conference on Neural Information Processing Systems, 2025
Abstract | BibTex | arXiv | Code
NeurIPS multi-agent-rl

@inproceedings{tessera2025hypermarl,
   title={{HyperMARL}: Adaptive Hypernetworks for Multi-Agent RL},
   author={Kale-ab Tessera and Arrasy Rahman and Amos Storkey and Stefano V. Albrecht},
   booktitle={Conference on Neural Information Processing Systems},
   year={2025}
}

Samuel Garcin, Trevor McInroe, Pablo Samuel Castro, Christopher G. Lucas, David Abel, Prakash Panangaden, Stefano V. Albrecht
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
International Conference on Learning Representations, 2025
Abstract | BibTex | Paper | Code
ICLR deep-rl generalisation

@inproceedings{garcin2025acrep,
   title={Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning},
   author={Samuel Garcin and Trevor McInroe and Pablo Samuel Castro and Christopher G. Lucas and David Abel and Prakash Panangaden and Stefano V. Albrecht},
   booktitle={13th International Conference on Learning Representations},
   year={2025}
}

Xu Liu, Haobo Fu, Stefano V. Albrecht, Qiang Fu, Shuai Li
Online-to-Offline RL for Agent Alignment
International Conference on Learning Representations, 2025
Abstract | BibTex | Paper
ICLR deep-learning

@inproceedings{liu2025aligngap,
   title={Online-to-Offline RL for Agent Alignment},
   author={Xu Liu and Haobo Fu and Stefano V. Albrecht and Qiang Fu and Shuai Li},
   booktitle={13th International Conference on Learning Representations},
   year={2025}
}

Lukas Schäfer, Oliver Slumbers, Stephen Mcaleer, Yali Du, Stefano V. Albrecht, David Mguni
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
International Conference on Autonomous Agents and Multi-Agent Systems, 2025
Abstract | BibTex | arXiv
AAMAS multi-agent-rl

@inproceedings{schafer2025emax,
   title = {Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning},
   author = {Lukas Sch\"afer and Oliver Slumbers and Stephen Mcaleer and Yali Du and Stefano V. Albrecht and David Mguni},
   booktitle = {International Conference on Autonomous Agents and Multiagent Systems},
   year = {2025}
}

Balint Gyevnar, Stephanie Droop, Tadeg Quillien, Shay B. Cohen, Neil R. Bramley, Christopher G. Lucas, Stefano V. Albrecht
People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior: Insights from Cognitive Science for Explainable AI
Conference on Human Factors in Computing Systems, 2025
Abstract | BibTex | arXiv | Code
CHI autonomous-driving causal explainable-ai

@inproceedings{gyevnar2025attribute,
   title = {People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior: Insights from Cognitive Science for Explainable AI},
   author = {Gyevnar, Balint and Droop, Stephanie and Quillien, Tadeg and Cohen, Shay B. and Bramley, Neil R. and Lucas, Christopher G. and Albrecht, Stefano V.},
   year = {2025},
   publisher = {Association for Computing Machinery},
   booktitle = {Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems},
   url = {https://arxiv.org/abs/2403.08828},
   doi={10.1145/3706598.3713509}
}

Cheng Wang, Lingxin Kong, Massimiliano Tamborski, Stefano V. Albrecht
HAD-Gen: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation
Accident Analysis & Prevention (Journal), 2025
Abstract | BibTex | arXiv | Code
AAP autonomous-driving agent-modelling

@article{tessera2025hyper,
   title={{HAD-Gen}: Human-like and Diverse Driving Behavior Modeling for Controllable Scenario Generation},
   author={Cheng Wang and Lingxin Kong and Massimiliano Tamborski and Stefano V. Albrecht},
   journal={Accident Analysis \& Prevention},
   year={2025}
}

Leonard Hinckeldey, Elliot Fosong, Elle Miller, Rimvydas Rubavicius, Trevor McInroe, Patricia Wollstadt, Christiane B. Wiebel-Herboth, Subramanian Ramamoorthy, Stefano V. Albrecht
Assistax: A Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics
RLC 2025 Workshop on Coordination and Cooperation in Multi-Agent Reinforcement Learning, 2025
Abstract | BibTex | arXiv | Code
RLC multi-agent-rl

@inproceedings{hinck2025assistax,
   title={{Assistax}: A Hardware-Accelerated Reinforcement Learning Benchmark for Assistive Robotics},
   author={Leonard Hinckeldey and Elliot Fosong and Elle Miller and Rimvydas Rubavicius and Trevor McInroe and Patricia Wollstadt and Christiane B. Wiebel-Herboth and Subramanian Ramamoorthy and Stefano V. Albrecht},
   booktitle={RLC 2025 Workshop on Coordination and Cooperation in Multi-Agent Reinforcement Learning},
   year={2025}
}

2024

Stefano V. Albrecht, Filippos Christianos, Lukas Schäfer
Multi-Agent Reinforcement Learning: Foundations and Modern Approaches
MIT Press (print version scheduled for December 2024), 2024
Abstract | BibTex | Book website | Book codebase
MITP multi-agent-rl deep-rl deep-learning survey

@book{ marl-book,
   author = {Stefano V. Albrecht and Filippos Christianos and Lukas Sch\"afer},
   title = {Multi-Agent Reinforcement Learning: Foundations and Modern Approaches},
   publisher = {MIT Press},
   year = {2024},
   url = {https://www.marl-book.com}
}

Anton Kuznietsov, Balint Gyevnar, Cheng Wang, Steven Peters, Stefano V. Albrecht
Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review
IEEE Transactions on Intelligent Transportation Systems, 2024
Abstract | BibTex | arXiv
T-ITS autonomous-driving explainable-ai survey

@article{kuznietsov2024avreview,
   title={Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review},
   author={Anton Kuznietsov and Balint Gyevnar and Cheng Wang and Steven Peters and Stefano V. Albrecht},
   journal={IEEE Transactions on Intelligent Transportation Systems (T-ITS)},
   year={2024}
}

Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht
Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning
Transactions on Machine Learning Research, 2024
Abstract | BibTex | arXiv | Code
TMLR deep-rl

@article{mcinroe2024hksl,
   title = {Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning},
   author = {Trevor McInroe and Lukas Schäfer and Stefano V. Albrecht},
   journal = {Transactions on Machine Learning Research (TMLR)},
   year = {2024}
}

Xuehui Yu, Mhairi Dunion, Xin Li, Stefano V. Albrecht
Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning
Conference on Neural Information Processing Systems, 2024
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl generalisation

@inproceedings{yu2024skillaware,
   title={Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning},
   author={Xuehui Yu and Mhairi Dunion and Xin Li and Stefano V. Albrecht},
   booktitle={Conference on Neural Information Processing Systems},
   year={2024}
}

Mhairi Dunion, Stefano V. Albrecht
Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Reinforcement Learning Conference, 2024
Abstract | BibTex | arXiv | Code
RLC deep-rl generalisation

@inproceedings{dunion2024mvd,
   title={Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras},
   author={Mhairi Dunion and Stefano V. Albrecht},
   booktitle={1st Reinforcement Learning Conference},
   year={2024}
}

Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Amos Storkey
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Reinforcement Learning Conference, 2024
Abstract | BibTex | arXiv
RLC deep-rl

@inproceedings{mcinroe2024planning,
   title={Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning},
   author={Trevor McInroe and Adam Jelley and Stefano V. Albrecht and Amos Storkey},
   booktitle={1st Reinforcement Learning Conference},
   year={2024}
}

Aditya Kapoor, Sushant Swamy, Kale-ab Tessera, Mayank Baranwal, Mingfei Sun, Harshad Khadilkar, Stefano V. Albrecht
Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning
RLC Workshop on Coordination and Cooperation for Multi-Agent Reinforcement Learning Methods, 2024
Abstract | BibTex | Paper
RLC deep-rl multi-agent-rl

@inproceedings{kapoor2024agenttemporal,
   title={Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning},
   author={Aditya Kapoor and Sushant Swamy and Kale-ab Tessera and Mayank Baranwal and Mingfei Sun and Harshad Khadilkar and Stefano V Albrecht},
   booktitle={Coordination and Cooperation for Multi-Agent Reinforcement Learning Methods Workshop},
   year={2024},
   url={https://openreview.net/forum?id=dGS1e3FXUH}
}

Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
International Conference on Machine Learning, 2024
Abstract | BibTex | arXiv
ICML deep-rl

@inproceedings{garcin2024dred,
   title={{DRED}: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design},
   author={Samuel Garcin and James Doran and Shangmin Guo and Christopher G. Lucas and Stefano V. Albrecht},
   year={2024},
   booktitle={International Conference on Machine Learning (ICML)}
}

Elliot Fosong, Arrasy Rahman, Ignacio Carlucho, Stefano V. Albrecht
Learning Complex Teamwork Tasks Using a Given Sub-task Decomposition
International Conference on Autonomous Agents and Multi-Agent Systems, 2024
Abstract | BibTex | arXiv | Code
AAMAS multi-agent-rl

@inproceedings{fosongLearningComplexTeamwork2024,
   title = {Learning Complex Teamwork Tasks Using a Given Sub-task Decomposition},
   author = {Fosong, Elliot and Rahman, Arrasy and Carlucho, Ignacio and Albrecht, Stefano V.},
   booktitle = {Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems},
   year = {2024}
}

Balint Gyevnar, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht
Causal Explanations for Sequential Decision-Making in Multi-Agent Systems
International Conference on Autonomous Agents and Multi-Agent Systems, 2024
Abstract | BibTex | arXiv | Code | Dataset
AAMAS explainable-ai autonomous-driving causal

@inproceedings{gyevnar2024cema,
   title={Causal Explanations for Sequential Decision-Making in Multi-Agent Systems},
   author={Balint Gyevnar and Cheng Wang and Christopher G. Lucas and Shay B. Cohen and Stefano V. Albrecht},
   booktitle = {Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems},
   year={2024}
}

Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
AAAI Conference on Artificial Intelligence, 2024
Abstract | BibTex | arXiv | Code | Video
AAAI deep-rl causal

@inproceedings{azran2024contextual,
   title={Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Guy Azran and Mohamad H. Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={Proceedings of the 38th AAAI Conference on Artificial Intelligence},
   year={2024}
}

Shangmin Guo, Yi Ren, Stefano V. Albrecht, Kenny Smith
lpNTK: Better Generalisation with Less Data via Sample Interaction During Learning
International Conference on Learning Representations, 2024
Abstract | BibTex | arXiv | Code
ICLR deep-learning

@inproceedings{guo2024lpntk,
   title={Sample Relationship from Learning Dynamics Matters for Generalisation},
   author={Shangmin Guo and Yi Ren and Stefano V. Albrecht and Kenny Smith},
   booktitle={12th International Conference on Learning Representations},
   year={2024},
   url={https://openreview.net/forum?id=8Ju0VmvMCW}
}

Aleksandar Krnjaic, Raul D. Steleac, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Andrew Wing Keung To, Kuan-Ho Lao, Murat Cubuktepe, Matthew Haley, Peter Börsting, Stefano V. Albrecht
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024
Abstract | BibTex | arXiv | Website
IROS multi-agent-rl simulator

@inproceedings{krnjaic2024scalable,
   title={Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers},
   author={Aleksandar Krnjaic and Raul D. Steleac and Jonathan D. Thomas and Georgios Papoudakis and Lukas Sch\"afer and Andrew Wing Keung To and Kuan-Ho Lao and Murat Cubuktepe and Matthew Haley and Peter B\"orsting and Stefano V. Albrecht},
   booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems},
   year={2024}
}

Anthony Knittel, Majd Hawasly, Stefano V. Albrecht, John Redford, Subramanian Ramamoorthy
DiPA: Probabilistic Multi-Modal Interactive Prediction for Autonomous Driving
IEEE International Conference on Robotics and Automation, 2024
Abstract | BibTex | arXiv | Publisher
ICRA autonomous-driving state-estimation

@article{Knittel2023dipa,
   title={{DiPA:} Probabilistic Multi-Modal Interactive Prediction for Autonomous Driving},
   author={Anthony Knittel and Majd Hawasly and Stefano V. Albrecht and John Redford and Subramanian Ramamoorthy},
   journal={IEEE Robotics and Automation Letters},
   volume={8},
   number={8},
   pages={4887--4894},
   year={2023}
}

Dongge Han, Trevor McInroe, Adam Jelley, Stefano V. Albrecht, Peter Bell, Amos Storkey
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots
International Conference on Computational Linguistics, 2024
Abstract | BibTex | arXiv | Code | Website
COLING generalisation state-estimation

@inproceedings{han2024llmpersonalize,
   title={LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots},
   author={Dongge Han and Trevor McInroe and Adam Jelley and Stefano V. Albrecht and Peter Bell and Amos Storkey},
   booktitle={International Conference on Computational Linguistics},
   year={2024}
}

Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
ICAPS Workshop on Planning and Reinforcement Learning, 2024
Abstract | BibTex | arXiv | Code | Video
ICAPS deep-rl causal

@inproceedings{Azran2022enhancing,
   title={Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Azran, Guy and Danesh, Mohamad H. and Albrecht, Stefano V. and Keren, Sarah},
   booktitle={ICAPS Workshop on Planning and Reinforcement Learning (https://prl-theworkshop.github.io/prl2024-icaps/},
   year={2024}
}

Sarah Keren, Chaimaa Essayeh, Stefano V. Albrecht, Thomas Mortsyn
Multi-Agent Reinforcement Learning for Energy Networks: Computational Challenges, Progress and Open Problems
arXiv:2404.15583, 2024
Abstract | BibTex | arXiv
multi-agent-rl survey

@misc{keren2024multiagent,
   title={Multi-Agent Reinforcement Learning for Energy Networks: Computational Challenges, Progress and Open Problems},
   author={Sarah Keren and Chaimaa Essayeh and Stefano V. Albrecht and Thomas Mortsyn},
   year={2024},
   eprint={2404.15583},
   archivePrefix={arXiv},
   primaryClass={cs.AI}
}

Kale-ab Tessera, Arrasy Rahman, Stefano V. Albrecht
HyperMARL: Adaptive Hypernetworks for Multi-Agent RL
arXiv:2412.04233, 2024
Abstract | BibTex | arXiv
multi-agent-rl

@misc{tessera2024hyper,
   title={{HyperMARL}: Adaptive Hypernetworks for Multi-Agent RL},
   author={Kale-ab Tessera and Arrasy Rahman and Stefano V. Albrecht},
   year={2024},
   eprint={2412.04233},
   archivePrefix={arXiv}
}

2023

Arrasy Rahman, Ignacio Carlucho, Niklas Höpner, Stefano V. Albrecht
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Journal of Machine Learning Research, 2023
Abstract | BibTex | arXiv | Publisher | Code
JMLR ad-hoc-teamwork deep-rl agent-modelling multi-agent-rl

@article{JRahman2022POGPL,
   author  = {Arrasy Rahman and Ignacio Carlucho and Niklas H\"opner and Stefano V. Albrecht},
   title   = {A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning},
   journal = {Journal of Machine Learning Research},
   year    = {2023},
   volume  = {24},
   number  = {298},
   pages   = {1--74},
   url     = {http://jmlr.org/papers/v24/22-099.html}
}

Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
Transactions on Machine Learning Research, 2023
Abstract | BibTex | arXiv | Code
TMLR deep-rl multi-agent-rl

@article{christianos2023pareto,
   title={Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Georgios Papoudakis and Stefano V. Albrecht},
   journal={Transactions on Machine Learning Research (TMLR)},
   year={2023}
}

Arrasy Rahman, Elliot Fosong, Ignacio Carlucho, Stefano V. Albrecht
Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity
Transactions on Machine Learning Research, 2023
Abstract | BibTex | arXiv | Code
TMLR ad-hoc-teamwork multi-agent-rl deep-rl

@article{rahman2023BRDiv,
   title={Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity},
   author={Arrasy Rahman and Elliot Fosong and Ignacio Carlucho and Stefano V. Albrecht},
   journal={Transactions on Machine Learning Research (TMLR)},
   year={2023}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning
Conference on Neural Information Processing Systems, 2023
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl causal generalisation

@inproceedings{dunion2023cmid,
   title={Conditional Mutual Information for Disentangled Representations in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={Conference on Neural Information Processing Systems},
   year={2023}
}

Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
NeurIPS Workshop on Generalization in Planning, 2023
Abstract | BibTex | arXiv | Code
NeurIPS multi-agent-rl deep-rl

@inproceedings{schaefer2023mate,
   title={Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning},
   author={Lukas Schäfer and Filippos Christianos and Amos Storkey and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Generalization in Planning},
   year={2023}
}

Guy Azran, Mohamad H Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
NeurIPS Workshop on Generalization in Planning, 2023
Abstract | BibTex | arXiv
NeurIPS deep-rl causal

@inproceedings{azran2023contextual,
   title={Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Guy Azran and Mohamad H. Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={NeurIPS Workshop on Generalization in Planning},
   year={2023}
}

Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning
NeurIPS Workshop on Agent Learning in Open-Endedness, 2023
Abstract | BibTex | arXiv
NeurIPS deep-rl

@inproceedings{garcin2023level,
   title={How the level sampling process impacts zero-shot generalisation in deep reinforcement learning},
   author={Samuel Garcin and James Doran and Shangmin Guo and Christopher G. Lucas and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Agent Learning in Open-Endedness},
   year={2023}
}

Sabrina McCallum, Max Taylor-Davies, Stefano V. Albrecht, Alessandro Suglia
Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning
NeurIPS Workshop on Goal-Conditioned Reinforcement Learning, 2023
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl

@inproceedings{mccallum2023feedback,
   title={Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning},
   author={Sabrina McCallum and Max Taylor-Davies and Stefano V. Albrecht and Alessandro Suglia},
   booktitle={NeurIPS Workshop on Goal-Conditioned Reinforcement Learning (GCRL)},
   year={2023}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
International Conference on Learning Representations, 2023
Abstract | BibTex | arXiv | Code
ICLR deep-rl generalisation causal

@inproceedings{dunion2023ted,
   title={Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={International Conference on Learning Representations (ICLR)},
   year={2023}
}

Yi Ren, Shangmin Guo, Wonho Bae, Danica J. Sutherland
How to Prepare Your Task Head for Finetuning
International Conference on Learning Representations, 2023
Abstract | BibTex | arXiv
ICLR deep-learning transfer-learning

@inproceedings{ ren2023how,
   title={How to Prepare Your Task Head for Finetuning},
   author={Yi Ren and Shangmin Guo and Wonho Bae and Danica J. Sutherland},
   booktitle={International Conference on Learning Representations (ICLR)},
   year={2023},
   url={https://openreview.net/forum?id=gVOXZproe-e}
}

Anthony Knittel, Majd Hawasly, Stefano V. Albrecht, John Redford, Subramanian Ramamoorthy
DiPA: Probabilistic Multi-Modal Interactive Prediction for Autonomous Driving
IEEE Robotics and Automation Letters, 2023
Abstract | BibTex | arXiv | Publisher
RA-L autonomous-driving state-estimation

@article{Knittel2023dipa,
   title={{DiPA:} Probabilistic Multi-Modal Interactive Prediction for Autonomous Driving},
   author={Anthony Knittel and Majd Hawasly and Stefano V. Albrecht and John Redford and Subramanian Ramamoorthy},
   journal={IEEE Robotics and Automation Letters},
   volume={8},
   number={8},
   pages={4887--4894},
   year={2023}
}

Cillian Brewitt, Massimiliano Tamborski, Cheng Wang, Stefano V. Albrecht
Verifiable Goal Recognition for Autonomous Driving with Occlusions
IEEE/RSJ International Conference on Intelligent Robots and Systems, 2023
Abstract | BibTex | arXiv
IROS autonomous-driving goal-recognition explainable-ai

@inproceedings{brewitt2023ogrit,
   title={Verifiable Goal Recognition for Autonomous Driving with Occlusions},
   author={Cillian Brewitt and Massimiliano Tamborski and Cheng Wang and Stefano V. Albrecht},
   booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems},
   year={2023}
}

Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone
Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
IEEE International Conference on Robotics and Automation, 2023
Abstract | BibTex | arXiv
ICRA deep-rl autonomous-driving

@inproceedings{christianos2023planning,
   title={Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models},
   author={Filippos Christianos and Peter Karkus and Boris Ivanovic and Stefano V. Albrecht and Marco Pavone},
   booktitle={International Conference on Robotics and Automation (ICRA)},
   year={2023}
}

Cillian Brewitt, Massimiliano Tamborski, Cheng Wang, Stefano V. Albrecht
Verifiable Goal Recognition for Autonomous Driving with Occlusions
ICRA Workshop on Scalable Autonomous Driving, 2023
Abstract | BibTex | arXiv
ICRA autonomous-driving goal-recognition explainable-ai

@misc{brewitt2023verifiable,
   title={Verifiable Goal Recognition for Autonomous Driving with Occlusions},
   author={Cillian Brewitt and Massimiliano Tamborski and Cheng Wang and Stefano V. Albrecht},
   booktitle={ICRA 2023 Workshop on Scalable Autonomous Driving},
   year={2023}
}

Giuseppe Vecchio, Simone Palazzo, Dario C Guastella, Riccardo E. Sarpietro, Ignacio Carlucho, Stefano V. Albrecht, Giovanni Muscato, Concetto Spampinato
MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments
RSS Workshop on Multi-Agent Planning and Navigation in Challenging Environments, 2023
Abstract | BibTex | arXiv
RSS simulator deep-rl

@inproceedings{vecchio2022midgard,
   title={MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments},
   author={Vecchio, Giuseppe and Palazzo, Simone and Guastella, Dario C and Sarpietro, Riccardo E. and Carlucho, Ignacio and Albrecht, Stefano V. and Muscato, Giovanni and Spampinato, Concetto},
   booktitle={RSS 2023 Workshop on Multi-Agent Planning and Navigation in Challenging Environments},
   year={2023}
}

Balint Gyevnar, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht
Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making
AAMAS Workshop on Explainable and Transparent AI and Multi-Agent Systems, 2023
Abstract | BibTex | arXiv | Code
AAMAS autonomous-driving explainable-ai causal

@inproceedings{gyevnar2023causal,
   title={Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making},
   author={Balint Gyevnar and Cheng Wang and Christopher G. Lucas and Shay B. Cohen and Stefano V. Albrecht},
   booktitle={5th International Workshop on EXplainable and TRAnsparent AI and Multi-Agent Systems},
   year={2023}
}

Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
AAMAS Workshop on Optimization and Learning in Multiagent Systems, 2023
Abstract | BibTex | arXiv
AAMAS deep-rl multi-agent-rl

@inproceedings{christianos2023pareto,
   title={Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Georgios Papoudakis and Stefano V. Albrecht},
   booktitle={AAMAS Workshop on Optimization and Learning in Multiagent Systems},
   year={2023}
}

Elliot Fosong, Arrasy Rahman, Ignacio Carlucho, Stefano V. Albrecht
Learning Complex Teamwork Tasks Using a Sub-task Curriculum
AAMAS Workshop on Multiagent Sequential Decision Making Under Uncertainty, 2023
Abstract | BibTex | arXiv | Code
AAMAS multi-agent-rl ad-hoc-teamwork transfer-learning

@inproceedings{fosong2023learning,
   title={Learning complex teamwork tasks using a sub-task curriculum},
   author={Elliot Fosong, Arrasy Rahman, Ignacio Carlucho and Stefano V. Albrecht},
   booktitle={AAMAS Workshop on Multiagent Sequential Decision Making under Uncertainty},
   year={2023},
}

Adam Michalski, Filippos Christianos, Stefano V. Albrecht
SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning
AAMAS Workshop on Multiagent Sequential Decision Making Under Uncertainty, 2023
Abstract | BibTex | arXiv | Code
AAMAS deep-rl multi-agent-rl

@inproceedings{michalski2023smaclite,
   title={SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning},
   author={Adam Michalski and Filippos Christianos and Stefano V. Albrecht},
   booktitle={AAMAS workshop on Multiagent Sequential Decision Making Under Uncertainty (MSDM)},
   year={2023}
}

Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
AAMAS Workshop on Adaptive and Learning Agents, 2023
Abstract | BibTex | arXiv
AAMAS multi-agent-rl deep-rl

@inproceedings{schaefer2023emax,
   title={Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning},
   author={Lukas Schäfer and Oliver Slumbers and Stephen McAleer and Yali Du and Stefano V. Albrecht and David Mguni},
   year={2023},
   booktitle={AAMAS Workshop on Adaptive and Learning Agents (ALA)},
}

Callum Tilbury, Filippos Christianos, Stefano V. Albrecht
Revisiting the Gumbel-Softmax in MADDPG
AAMAS Workshop on Adaptive and Learning Agents, 2023
Abstract | BibTex | arXiv | Code
AAMAS multi-agent-rl deep-rl

@inproceedings{tilbury2023revisitingmaddpg,
   title={Revisiting the Gumbel-Softmax in MADDPG},
   author={Callum Tilbury and Filippos Christianos and Stefano V. Albrecht},
   year={2023},
   booktitle={AAMAS Workshop on Adaptive and Learning Agents (ALA)},
}

Alain Andres, Lukas Schäfer, Esther Villar-Rodriguez, Stefano V. Albrecht, Javier Del Ser
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
AAMAS Workshop on Adaptive and Learning Agents, 2023
Abstract | BibTex | arXiv
AAMAS deep-rl

@inproceedings{andres2023using,
   title={Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments},
   author={Andres, Alain and Schäfer, Lukas and Villar-Rodriguez, Esther and Albrecht, Stefano V. and Del Ser, Javier},
   booktitle={AAMAS Workshop on Adaptive and Learning Agents (ALA)},
   year={2023}
}

Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren
Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning
IJCAI Workshop on Planning and Reinforcement Learning, 2023
Abstract | BibTex | arXiv
IJCAI deep-rl causal

@inproceedings{azran2023contextual,
   title={Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning},
   author={Guy Azran and Mohamad H. Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={IJCAI Workshop on Planning and Reinforcement Learning (https://prl-theworkshop.github.io/)},
   year={2023}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning
European Workshop on Reinforcement Learning, 2023
Abstract | BibTex | arXiv | Code
EWRL deep-rl causal generalisation

@inproceedings{dunion2023cmid,
   title={Conditional Mutual Information for Disentangled Representations in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={European Workshop on Reinforcement Learning},
   year={2023}
}

Aleksandar Krnjaic, Raul D. Steleac, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Andrew Wing Keung To, Kuan-Ho Lao, Murat Cubuktepe, Matthew Haley, Peter Börsting, Stefano V. Albrecht
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
arXiv:2212.11498, 2023
Abstract | BibTex | arXiv | Website
multi-agent-rl simulator

@misc{krnjaic2023scalable,
   title={Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers},
   author={Aleksandar Krnjaic and Raul D. Steleac and Jonathan D. Thomas and Georgios Papoudakis and Lukas Sch\"afer and Andrew Wing Keung To and Kuan-Ho Lao and Murat Cubuktepe and Matthew Haley and Peter B\"orsting and Stefano V. Albrecht},
   year={2023},
   eprint={2212.11498},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Samuel Garcin, James Doran, Shangmin Guo, Christopher G. Lucas, Stefano V. Albrecht
How the level sampling process impacts zero-shot generalisation in deep reinforcement learning
arXiv:2310.03494, 2023
Abstract | BibTex | arXiv
deep-rl

@misc{garcin2023level,
   title={How the level sampling process impacts zero-shot generalisation in deep reinforcement learning},
   author={Samuel Garcin and James Doran and Shangmin Guo and Christopher G. Lucas and Stefano V. Albrecht},
   year={2023},
   eprint={2310.03494},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Trevor McInroe, Stefano V. Albrecht, Amos Storkey
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
arXiv:2310.05723, 2023
Abstract | BibTex | arXiv
deep-rl

@misc{mcinroe2023planning,
   title={Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning},
   author={Trevor McInroe and Stefano V. Albrecht and Amos Storkey},
   year={2023},
   eprint={2310.05723},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

2022

Stefano V. Albrecht, Michael Wooldridge
Special Issue on Multi-Agent Systems Research in the United Kingdom: Guest Editorial
AI Communications, 2022
Abstract | BibTex | Publisher | Special Issue
AIC survey deep-rl multi-agent-rl agent-modelling

@article{albrecht2020special,
   title = {Special Issue on Multi-Agent Systems Research in the United Kingdom: Guest Editorial},
   author = {Stefano V. Albrecht and Michael Wooldridge},
   journal = {AI Communications},
   volume = {35},
   number = {4},
   year = {2022},
   publisher = {IOS Press},
   url = {https://content.iospress.com/articles/ai-communications/aic229003}
}

Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht
Deep Reinforcement Learning for Multi-Agent Interaction
AI Communications, 2022
Abstract | BibTex | arXiv | Publisher
AIC survey deep-rl multi-agent-rl ad-hoc-teamwork agent-modelling goal-recognition security explainable-ai autonomous-driving

@article{albrecht2022aic,
   author = {Ahmed, Ibrahim H. and Brewitt, Cillian and Carlucho, Ignacio and Christianos, Filippos and Dunion, Mhairi and Fosong, Elliot and Garcin, Samuel and Guo, Shangmin and Gyevnar, Balint and McInroe, Trevor and Papoudakis, Georgios and Rahman, Arrasy and Schäfer, Lukas and Tamborski, Massimiliano and Vecchio, Giuseppe and Wang, Cheng and Albrecht, Stefano V.},
   title = {Deep Reinforcement Learning for Multi-Agent Interaction},
   journal = {AI Communications, Special Issue on Multi-Agent Systems Research in the UK},
   year = {2022}
}

Majd Hawasly, Jonathan Sadeghi, Morris Antonello, Stefano V. Albrecht, John Redford, Subramanian Ramamoorthy
Perspectives on the System-level Design of a Safe Autonomous Driving Stack
AI Communications, 2022
Abstract | BibTex | arXiv | Publisher
AIC survey autonomous-driving goal-recognition explainable-ai

@article{albrecht2022aic,
   author = {Majd Hawasly and Jonathan Sadeghi and Morris Antonello and Stefano V. Albrecht and John Redford and Subramanian Ramamoorthy},
   title = {Perspectives on the System-level Design of a Safe Autonomous Driving Stack},
   journal = {AI Communications, Special Issue on Multi-Agent Systems Research in the UK},
   year = {2022}
}

Rujie Zhong, Duohan Zhang, Lukas Schäfer, Stefano V. Albrecht, Josiah P. Hanna
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning
Conference on Neural Information Processing Systems, 2022
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl

@inproceedings{zhong2022datacollection,
   title={Robust On-Policy Data Collection for Data Efficient Policy Evaluation},
   author={Rujie Zhong and Duohan Zhang and Lukas Sch\"afer and Stefano V. Albrecht and Josiah P. Hanna},
   booktitle={Conference on Neural Information Processing Systems},
   year={2022}
}

Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht
Learning Representations for Reinforcement Learning with Hierarchical Forward Models
NeurIPS Workshop on Deep Reinforcement Learning, 2022
Abstract | BibTex | arXiv
NeurIPS deep-rl generalisation

@inproceedings{mcinroe2022hksl,
   title={Learning Representations for Reinforcement Learning with Hierarchical Forward Models},
   author={Trevor McInroe and Lukas Schäfer and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Deep RL},
   year={2022}
}

Mhairi Dunion, Trevor McInroe, Kevin Sebastian Luck, Josiah Hanna, Stefano V. Albrecht
Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning
NeurIPS Workshop on Deep Reinforcement Learning, 2022
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl generalisation causal

@inproceedings{dunion2022ted,
   title={Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning},
   author={Mhairi Dunion and Trevor McInroe and Kevin Sebastian Luck and Josiah Hanna and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Deep Reinforcement Learning},
   year={2022}
}

Cillian Brewitt, Massimiliano Tamborski, Stefano V. Albrecht
Verifiable Goal Recognition for Autonomous Driving with Occlusions
NeurIPS Workshop on Machine Learning for Autonomous Driving, 2022
Abstract | BibTex | arXiv | Code
NeurIPS autonomous-driving goal-recognition explainable-ai

@inproceedings{brewitt2022,
   title={Verifiable Goal Recognition for Autonomous Driving with Occlusions},
   author={Cillian Brewitt and Massimiliano Tamborski and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Machine Learning for Autonomous Driving},
   year={2022}
}

Shangmin Guo, Yi Ren, Stefano V. Albrecht, Kenny Smith
Sample Relationships through the Lens of Learning Dynamics with Label Information
NeurIPS Workshop on Interpolation and Beyond, 2022
Abstract | BibTex | arXiv
NeurIPS iterated-learning deep-learning transfer-learning

@inproceedings{guo2022relationship,
   title={Sample Relationships through the Lens of Learning Dynamics with Label Information},
   author={Shangmin Guo and Yi Ren and Stefano V. Albrecht and Kenny Smith},
   booktitle={NeurIPS 2022 Workshop on Interpolation and Beyond},
   year={2022}
}

Guy Azran, Mohamad Hosein Danesh, Stefano V. Albrecht, Sarah Keren
Enhancing Transfer of Reinforcement Learning Agents with Abstract Contextual Embeddings
NeurIPS Workshop on Neuro Causal and Symbolic AI, 2022
Abstract | BibTex
NeurIPS deep-rl causal

@inproceedings{Azran2022enhancing,
   title={Enhancing Transfer of Reinforcement Learning Agents with Abstract Contextual Embeddings},
   author={Guy Azran and Mohamad Hosein Danesh and Stefano V. Albrecht and Sarah Keren},
   booktitle={NeurIPS Workshop on Neuro Causal and Symbolic AI (https://ncsi.cause-lab.net)},
   year={2022}
}

Shangmin Guo, Yi Ren, Kory Mathewson, Simon Kirby, Stefano V. Albrecht, Kenny Smith
Expressivity of Emergent Languages is a Trade-off between Contextual Complexity and Unpredictability
International Conference on Learning Representations, 2022
Abstract | BibTex | arXiv | Code
ICLR multi-agent-rl emergent-communication

@inproceedings{guo2022expressivity,
   title={Expressivity of Emergent Languages is a Trade-off between Contextual Complexity and Unpredictability},
   author={Shangmin Guo and Yi Ren and Kory Mathewson and Simon Kirby and Stefano V. Albrecht and Kenny Smith},
   booktitle={International Conference on Learning Representations (ICLR)},
   year={2022}
}

Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht
Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration
International Conference on Autonomous Agents and Multi-Agent Systems, 2022
Abstract | BibTex | arXiv | Code
AAMAS deep-rl intrinsic-reward

@inproceedings{schaefer2022derl,
   title={Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration},
   author={Lukas Schäfer and Filippos Christianos and Josiah P. Hanna and Stefano V. Albrecht},
   booktitle={International Conference on Autonomous Agents and Multiagent Systems (AAMAS)},
   year={2022}
}

Lukas Schäfer
Task Generalisation in Multi-Agent Reinforcement Learning
International Conference on Autonomous Agents and Multiagent Systems, Doctoral Consortium, 2022
Abstract | BibTex | Paper
AAMAS multi-agent-rl

@inproceedings{schaefer2022task,
   title={Task Generalisation in Multi-Agent Reinforcement Learning},
   author={Lukas Schäfer},
   booktitle={Doctoral Consortium at the International Conference on Autonomous Agents and Multiagent Systems},
   year={2022}
}

Filippos Christianos
Collaborative Training of Multiple Autonomous Agents
International Conference on Autonomous Agents and Multiagent Systems, Doctoral Consortium, 2022
Abstract | BibTex | Paper
AAMAS multi-agent-rl

@inproceedings{christianos2022collaborative,
   title={Collaborative Training of Multiple Autonomous Agents},
   author={Filippos Christianos},
   booktitle={Doctoral Consortium at the International Conference on Autonomous Agents and Multiagent Systems},
   year={2022}
}

Francisco Eiras, Majd Hawasly, Stefano V. Albrecht, Subramanian Ramamoorthy
A Two-Stage Optimization-based Motion Planner for Safe Urban Driving
IEEE Transactions on Robotics, 2022
Abstract | BibTex | arXiv | Publisher | Video
T-RO autonomous-driving

@article{eiras2021twostage,
   title = {A Two-Stage Optimization-based Motion Planner for Safe Urban Driving},
   author = {Francisco Eiras and Majd Hawasly and Stefano V. Albrecht and Subramanian Ramamoorthy},
   journal = {IEEE Transactions on Robotics},
   volume = {38},
   number = {2},
   pages = {822--834},
   year = {2022},
   doi = {10.1109/TRO.2021.3088009}
}

Morris Antonello, Mihai Dobre, Stefano V. Albrecht, John Redford, Subramanian Ramamoorthy
Flash: Fast and Light Motion Prediction for Autonomous Driving with Bayesian Inverse Planning and Learned Motion Profiles
IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022
Abstract | BibTex | arXiv
IROS autonomous-driving state-estimation

@inproceedings{antonello2022flash,
   title={Flash: Fast and Light Motion Prediction for Autonomous Driving with {Bayesian} Inverse Planning and Learned Motion Profiles},
   author={Morris Antonello, Mihai Dobre, Stefano V. Albrecht, John Redford, Subramanian Ramamoorthy},
   booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
   year={2022}
}

Giuseppe Vecchio, Simone Palazzo, Dario C Guastella, Ignacio Carlucho, Stefano V. Albrecht, Giovanni Muscato, Concetto Spampinato
MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments
ICRA Workshop on Releasing Robots into the Wild: Simulations, Benchmarks, and Deployment, 2022
Abstract | BibTex | arXiv
ICRA deep-rl simulator

@misc{Vecchio2022MIDGARD,
   title={MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments},
   author={Giuseppe Vecchio, Simone Palazzo, Dario C Guastella, Ignacio Carlucho, Stefano V. Albrecht, Giovanni Muscato, Concetto Spampinato},
   year={2022},
   eprint={2205.08389},
   archivePrefix={arXiv},
   primaryClass={cs.MA}
}

Balint Gyevnar, Massimiliano Tamborski, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht
A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning
IJCAI Workshop on Artificial Intelligence for Autonomous Driving, 2022
Abstract | BibTex | arXiv | Code
IJCAI autonomous-driving explainable-ai causal

@inproceedings{gyevnar2022humancentric,
   title={A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning},
   author={Balint Gyevnar and Massimiliano Tamborski and Cheng Wang and Christopher G. Lucas and Shay B. Cohen and Stefano V. Albrecht},
   booktitle={IJCAI Workshop on Artificial Intelligence for Autonomous Driving},
   year={2022}
}

Arrasy Rahman, Elliot Fosong, Ignacio Carlucho, Stefano V. Albrecht
Towards Robust Ad Hoc Teamwork Agents By Creating Diverse Training Teammates
IJCAI Workshop on Ad Hoc Teamwork, 2022
Abstract | BibTex | arXiv | Code
IJCAI ad-hoc-teamwork multi-agent-rl

@inproceedings{rahman2022towards,
   title={Towards Robust Ad Hoc Teamwork Agents By Creating Diverse Training Teammates},
   author={Arrasy Rahman and Elliot Fosong and Ignacio Carlucho and Stefano V. Albrecht},
   booktitle={IJCAI Workshop on Ad Hoc Teamwork},
   year={2022}
}

Elliot Fosong, Arrasy Rahman, Ignacio Carlucho, Stefano V. Albrecht
Few-Shot Teamwork
IJCAI Workshop on Ad Hoc Teamwork, 2022
Abstract | BibTex | arXiv
IJCAI ad-hoc-teamwork multi-agent-rl

@inproceedings{fosong2022fewshot,
   title={Few-Shot Teamwork},
   author={Elliot Fosong and Arrasy Rahman and Ignacio Carlucho and Stefano V. Albrecht},
   booktitle={IJCAI Workshop on Ad Hoc Teamwork},
   year={2022}
}

Ignacio Carlucho, Arrasy Rahman, William Ard, Elliot Fosong, Corina Barbalata, Stefano V. Albrecht
Cooperative Marine Operations Via Ad Hoc Teams
IJCAI Workshop on Ad Hoc Teamwork, 2022
Abstract | BibTex | arXiv
IJCAI ad-hoc-teamwork multi-agent-rl

@inproceedings{Carlucho2022UnderwaterAHT,
   title={Cooperative Marine Operations Via Ad Hoc Teams},
   author={Ignacio Carlucho, Arrasy Rahman, William Ard, Elliot Fosong, Corina Barbalata, Stefano V. Albrecht},
   booktitle={IJCAI Workshop on Ad Hoc Teamwork},
   year={2022}
}

Reuth Mirsky, Ignacio Carlucho, Arrasy Rahman, Elliot Fosong, William Macke, Mohan Sridharan, Peter Stone, Stefano V. Albrecht
A Survey of Ad Hoc Teamwork Research
European Conference on Multi-Agent Systems, 2022
Abstract | BibTex | arXiv
EUMAS survey ad-hoc-teamwork

@inproceedings{mirsky2022survey,
   title={A Survey of Ad Hoc Teamwork Research},
   author={Reuth Mirsky and Ignacio Carlucho and Arrasy Rahman and Elliot Fosong and William Macke and Mohan Sridharan and Peter Stone and Stefano V. Albrecht},
   booktitle={European Conference on Multi-Agent Systems (EUMAS)},
   year={2022}
}

Arrasy Rahman, Ignacio Carlucho, Niklas Höpner, Stefano V. Albrecht
A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
arXiv:2210.05448, 2022
Abstract | BibTex | arXiv
ad-hoc-teamwork deep-rl agent-modelling

@misc{Rahman2022POGPL,
   title={A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning},
   author={Arrasy Rahman and Ignacio Carlucho and Niklas H\"opner and Stefano V. Albrecht},
   year={2022},
   eprint={2210.05448},
   archivePrefix={arXiv}
}

Aleksandar Krnjaic, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Peter Börsting, Stefano V. Albrecht
Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
arXiv:2212.11498, 2022
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{Krnjaic2022HSNAC,
   title={Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers},
   author={Aleksandar Krnjaic and Jonathan D. Thomas and Georgios Papoudakis and Lukas Sch\"afer and Peter B\"orsting and Stefano V. Albrecht,
   year={2022},
   eprint={2212.11498},
   archivePrefix={arXiv}
}

Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
arxiv:2207.02249, 2022
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{schaefer2022mate,
   title={Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning},
   author={Lukas Schäfer and Filippos Christianos and Amos Storkey and Stefano V. Albrecht},
   year={2022},
   eprint={2207.02249},
   archivePrefix={arXiv},
   primaryClass={cs.MA}
}

Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht
Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning
arXiv:2209.14344, 2022
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{christianos2022pareto,
   title={Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Georgios Papoudakis and Stefano V. Albrecht},
   year={2022},
   eprint={2209.14344},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone
Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
arXiv:2210.14584, 2022
Abstract | BibTex | arXiv
autonomous-driving

@misc{christianos2022bivo,
   title={Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models},
   author={Filippos Christianos and Peter Karkus and Boris Ivanovic and Stefano V. Albrecht and Marco Pavone},
   year={2022},
   eprint={2210.14584},
   archivePrefix={arXiv}
}

Anthony Knittel, Majd Hawasly, Stefano V. Albrecht, John Redford, Subramanian Ramamoorthy
DiPA: Diverse and Probabilistically Accurate Interactive Prediction
arXiv:2210.06106, 2022
Abstract | BibTex | arXiv
autonomous-driving state-estimation

@misc{brewitt2022verifiable,
   title={{DiPA:} Diverse and Probabilistically Accurate Interactive Prediction},
   author={Anthony Knittel and Majd Hawasly and Stefano V. Albrecht and John Redford and Subramanian Ramamoorthy},
   year={2022},
   eprint={2210.06106},
   archivePrefix={arXiv},
   primaryClass={cs.RO}
}

2021

Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Conference on Neural Information Processing Systems, Datasets and Benchmarks Track, 2021
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl multi-agent-rl

@inproceedings{papoudakis2021benchmarking,
   title={Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks},
   author={Georgios Papoudakis and Filippos Christianos and Lukas Sch\"afer and Stefano V. Albrecht},
   booktitle = {Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks (NeurIPS)},
   year={2021},
   url = {http://arxiv.org/abs/2006.07869},
   openreview = {https://openreview.net/forum?id=cIrPX-Sn5n},
   code = {https://github.com/uoe-agents/epymarl}
}

Georgios Papoudakis, Filippos Christianos, Stefano V. Albrecht
Agent Modelling under Partial Observability for Deep Reinforcement Learning
Conference on Neural Information Processing Systems, 2021
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl agent-modelling

@inproceedings{papoudakis2021local,
   title={Agent Modelling under Partial Observability for Deep Reinforcement Learning},
   author={Georgios Papoudakis and Filippos Christianos and Stefano V. Albrecht},
   booktitle = {Proceedings of the Neural Information Processing Systems (NeurIPS)},
   year = {2021}
}

Rujie Zhong, Josiah P. Hanna, Lukas Schäfer, Stefano V. Albrecht
Robust On-Policy Data Collection for Data-Efficient Policy Evaluation
NeurIPS Workshop on Offline Reinforcement Learning, 2021
Abstract | BibTex | arXiv | Code
NeurIPS deep-rl

@inproceedings{zhong2021robust,
   title={Robust On-Policy Data Collection for Data-Efficient Policy Evaluation},
   author={Rujie Zhong and Josiah P. Hanna and Lukas Sch\"afer and Stefano V. Albrecht},
   booktitle={NeurIPS Workshop on Offline Reinforcement Learning (OfflineRL)},
   year={2021}
}

Arrasy Rahman, Niklas Höpner, Filippos Christianos, Stefano V. Albrecht
Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
International Conference on Machine Learning, 2021
Abstract | BibTex | arXiv | Video | Code
ICML deep-rl agent-modelling ad-hoc-teamwork

@inproceedings{rahman2021open,
   title={Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning},
   author={Arrasy Rahman and Niklas H\"opner and Filippos Christianos and Stefano V. Albrecht},
   booktitle={International Conference on Machine Learning (ICML)},
   year={2021}
}

Filippos Christianos, Georgios Papoudakis, Arrasy Rahman, Stefano V. Albrecht
Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
International Conference on Machine Learning, 2021
Abstract | BibTex | arXiv | Video | Code
ICML deep-rl multi-agent-rl

@inproceedings{christianos2021scaling,
   title={Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing},
   author={Filippos Christianos and Georgios Papoudakis and Arrasy Rahman and Stefano V. Albrecht},
   booktitle={International Conference on Machine Learning (ICML)},
   year={2021}
}

Lukas Schäfer, Filippos Christianos, Josiah Hanna, Stefano V. Albrecht
Decoupling Exploration and Exploitation in Reinforcement Learning
ICML Workshop on Unsupervised Reinforcement Learning, 2021
Abstract | BibTex | arXiv | Code
ICML deep-rl intrinsic-reward

@inproceedings{schaefer2021decoupling,
   title={Decoupling Exploration and Exploitation in Reinforcement Learning},
   author={Lukas Schäfer and Filippos Christianos and Josiah Hanna and Stefano V. Albrecht},
   booktitle={ICML Workshop on Unsupervised Reinforcement Learning (URL)},
   year={2021}
}

Stefano V. Albrecht, Cillian Brewitt, John Wilhelm, Balint Gyevnar, Francisco Eiras, Mihai Dobre, Subramanian Ramamoorthy
Interpretable Goal-based Prediction and Planning for Autonomous Driving
IEEE International Conference on Robotics and Automation, 2021
Abstract | BibTex | arXiv | Video | Code
ICRA autonomous-driving goal-recognition explainable-ai

@inproceedings{albrecht2020igp2,
   title={Interpretable Goal-based Prediction and Planning for Autonomous Driving},
   author={Stefano V. Albrecht and Cillian Brewitt and John Wilhelm and Balint Gyevnar and Francisco Eiras and Mihai Dobre and Subramanian Ramamoorthy},
   booktitle={IEEE International Conference on Robotics and Automation (ICRA)},
   year={2021}
}

Cillian Brewitt, Balint Gyevnar, Samuel Garcin, Stefano V. Albrecht
GRIT: Fast, Interpretable, and Verifiable Goal Recognition with Learned Decision Trees for Autonomous Driving
IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021
Abstract | BibTex | arXiv | Video | Code
IROS autonomous-driving goal-recognition explainable-ai

@inproceedings{brewitt2021grit,
   title={{GRIT:} Fast, Interpretable, and Verifiable Goal Recognition with Learned Decision Trees for Autonomous Driving},
   author={Cillian Brewitt and Balint Gyevnar and Samuel Garcin and Stefano V. Albrecht},
   booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
   year={2021}
}

Josiah P. Hanna, Arrasy Rahman, Elliot Fosong, Francisco Eiras, Mihai Dobre, John Redford, Subramanian Ramamoorthy, Stefano V. Albrecht
Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles
IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021
Abstract | BibTex | arXiv
IROS autonomous-driving goal-recognition explainable-ai

@inproceedings{hanna2021interpretable,
   title={Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles},
   author={Josiah P. Hanna and Arrasy Rahman and Elliot Fosong and Francisco Eiras and Mihai Dobre and John Redford and Subramanian Ramamoorthy and Stefano V. Albrecht},
   booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
   year={2021}
}

Henry Pulver, Francisco Eiras, Ludovico Carozza, Majd Hawasly, Stefano V. Albrecht, Subramanian Ramamoorthy
PILOT: Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving
IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021
Abstract | BibTex | arXiv | Video
IROS autonomous-driving

@inproceedings{pulver2020pilot,
   title={{PILOT:} Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving},
   author={Henry Pulver and Francisco Eiras and Ludovico Carozza and Majd Hawasly and Stefano V. Albrecht and Subramanian Ramamoorthy},
   booktitle={IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
   year={2021}
}

Ibrahim H. Ahmed, Josiah P. Hanna, Elliot Fosong, Stefano V. Albrecht
Towards Quantum-Secure Authentication and Key Agreement via Abstract Multi-Agent Interaction
International Conference on Practical Applications of Agents and Multi-Agent Systems, 2021
Abstract | BibTex | arXiv | Publisher | Code
PAAMS security agent-modelling

@inproceedings{ahmed2021quantum,
   title={Towards Quantum-Secure Authentication and Key Agreement via Abstract Multi-Agent Interaction},
   author={Ibrahim H. Ahmed and Josiah P. Hanna and Elliot Fosong and Stefano V. Albrecht},
   booktitle={International Conference on Practical Applications of Agents and Multi-Agent Systems (PAAMS)},
   year={2021}
}

Shangmin Guo, Yi Ren, Kory Mathewson, Simon Kirby, Stefano V. Albrecht, Kenny Smith
Expressivity of Emergent Language is a Trade-off between Contextual Complexity and Unpredictability
arXiv:2106.03982, 2021
Abstract | BibTex | arXiv
multi-agent-rl emergent-communication

@misc{guo2021expressivity,
   title={Expressivity of Emergent Language is a Trade-off between Contextual Complexity and Unpredictability},
   author={Shangmin Guo and Yi Ren and Kory Mathewson and Simon Kirby and Stefano V. Albrecht and Kenny Smith},
   year={2021},
   eprint={2106.03982},
   archivePrefix={arXiv},
   primaryClass={cs.CL}
}

Trevor McInroe, Lukas Schäfer, Stefano V. Albrecht
Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning
arXiv:2110.04935, 2021
Abstract | BibTex | arXiv | Code
deep-rl

@misc{mcinroe2021learning,
   title={Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning},
   author={Trevor McInroe and Lukas Schäfer and Stefano V. Albrecht},
   year={2021},
   eprint={2110.04935},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

2020

Stefano V. Albrecht, Peter Stone, Michael P. Wellman
Special Issue on Autonomous Agents Modelling Other Agents: Guest Editorial
Artificial Intelligence, 2020
Abstract | BibTex | Publisher | Special Issue
AIJ survey agent-modelling

@article{albrecht2020special,
   title = {Special Issue on Autonomous Agents Modelling Other Agents: Guest Editorial},
   author = {Stefano V. Albrecht and Peter Stone and Michael P. Wellman},
   journal = {Artificial Intelligence},
   volume = {285},
   year = {2020},
   publisher = {Elsevier},
   url = {https://doi.org/10.1016/j.artint.2020.103292}
}

Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht
Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning
Conference on Neural Information Processing Systems, 2020
Abstract | BibTex | arXiv
NeurIPS deep-rl multi-agent-rl

@inproceedings{christianos2020shared,
   title={Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning},
   author={Filippos Christianos and Lukas Sch\"afer and Stefano V. Albrecht},
   booktitle={34th Conference on Neural Information Processing Systems},
   year={2020}
}

Georgios Papoudakis, Stefano V. Albrecht
Variational Autoencoders for Opponent Modeling in Multi-Agent Systems
AAAI Workshop on Reinforcement Learning in Games, 2020
Abstract | BibTex | arXiv
AAAI deep-rl agent-modelling

@inproceedings{papoudakis2020variational,
   title={Variational Autoencoders for Opponent Modeling in Multi-Agent Systems},
   author={Georgios Papoudakis and Stefano V. Albrecht},
   booktitle={AAAI Workshop on Reinforcement Learning in Games},
   year={2020}
}

Arrasy Rahman, Niklas Höpner, Filippos Christianos, Stefano V. Albrecht
Open Ad Hoc Teamwork using Graph-based Policy Learning
arXiv:2006.10412, 2020
Abstract | BibTex | arXiv
deep-rl agent-modelling ad-hoc-teamwork

@misc{rahman2020open,
   title={Open Ad Hoc Teamwork using Graph-based Policy Learning},
   author={Arrasy Rahman and Niklas H\"opner and Filippos Christianos and Stefano V. Albrecht},
   year={2020},
   eprint={2006.10412},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Georgios Papoudakis, Filippos Christianos , Lukas Schäfer, Stefano V. Albrecht
Comparative Evaluation of Multi-Agent Deep Reinforcement Learning Algorithms
arXiv:2006.07869, 2020
Abstract | BibTex | arXiv
deep-rl multi-agent-rl

@misc{papoudakis2020comparative,
   title={Comparative Evaluation of Multi-Agent Deep Reinforcement Learning Algorithms},
   author={Georgios Papoudakis and Filippos Christianos and Lukas Sch\"afer and Stefano V. Albrecht},
   year={2020},
   eprint={2006.07869},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Georgios Papoudakis, Filippos Christianos, Stefano V. Albrecht
Local Information Opponent Modelling Using Variational Autoencoders
arXiv:2006.09447, 2020
Abstract | BibTex | arXiv
deep-rl agent-modelling

@misc{papoudakis2020opponent,
   title={Local Information Opponent Modelling Using Variational Autoencoders},
   author={Georgios Papoudakis and Filippos Christianos and Stefano V. Albrecht},
   year={2020},
   eprint={2006.09447},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Ibrahim H. Ahmed, Josiah P. Hanna, Stefano V. Albrecht
Quantum-Secure Authentication via Abstract Multi-Agent Interaction
arXiv:2007.09327, 2020
Abstract | BibTex | arXiv
security agent-modelling

@misc{ahmed2020quantumsecure,
   title={Quantum-Secure Authentication via Abstract Multi-Agent Interaction},
   author={Ibrahim H. Ahmed and Josiah P. Hanna and Stefano V. Albrecht},
   year={2020},
   eprint={2007.09327},
   archivePrefix={arXiv},
   primaryClass={cs.CR}
}

Stefano V. Albrecht, Cillian Brewitt, John Wilhelm, Balint Gyevnar, Francisco Eiras, Mihai Dobre, Subramanian Ramamoorthy
Interpretable Goal-based Prediction and Planning for Autonomous Driving
arXiv:2002.02277, 2020
Abstract | BibTex | arXiv
autonomous-driving goal-recognition explainable-ai

@misc{albrecht2020integrating,
   title={Interpretable Goal-based Prediction and Planning for Autonomous Driving},
   author={Stefano V. Albrecht and Cillian Brewitt and John Wilhelm and Balint Gyevnar and Francisco Eiras and Mihai Dobre and Subramanian Ramamoorthy},
   year={2020},
   eprint={2002.02277},
   archivePrefix={arXiv},
   primaryClass={cs.RO}
}

Henry Pulver, Francisco Eiras, Ludovico Carozza, Majd Hawasly, Stefano V. Albrecht, Subramanian Ramamoorthy
PILOT: Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving
arXiv:2011.00509, 2020
Abstract | BibTex | arXiv
autonomous-driving

@misc{pulver2020pilot,
   title={{PILOT:} Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving},
   author={Henry Pulver and Francisco Eiras and Ludovico Carozza and Majd Hawasly and Stefano V. Albrecht and Subramanian Ramamoorthy},
   year={2020},
   eprint={2011.00509},
   archivePrefix={arXiv},
   primaryClass={cs.RO}
}

Francisco Eiras, Majd Hawasly, Stefano V. Albrecht, Subramanian Ramamoorthy
Two-Stage Optimization-based Motion Planner for Safe Urban Driving
arXiv:2002.02215, 2020
Abstract | BibTex | arXiv
autonomous-driving

@misc{eiras2020twostage,
   title={Two-Stage Optimization-based Motion Planner for Safe Urban Driving},
   author={Francisco Eiras and Majd Hawasly and Stefano V. Albrecht and Subramanian Ramamoorthy},
   year={2020},
   eprint={2002.02215},
   archivePrefix={arXiv},
   primaryClass={cs.RO}
}

2019

Maciej Wiatrak, Stefano V. Albrecht, Andrew Nystrom
Stabilizing Generative Adversarial Networks: A Survey
arXiv:1910.00927, 2019
Abstract | BibTex | arXiv
survey security

@misc{wiatrak2019stabilizing,
   title={Stabilizing Generative Adversarial Networks: A Survey},
   author={Maciej Wiatrak and Stefano V. Albrecht and Andrew Nystrom},
   year={2019},
   eprint={1910.00927},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

Georgios Papoudakis, Filippos Christianos, Arrasy Rahman, Stefano V. Albrecht
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning
arXiv:1906.04737, 2019
Abstract | BibTex | arXiv
survey deep-rl multi-agent-rl

@misc{papoudakis2019dealing,
   title={Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning},
   author={Georgios Papoudakis and Filippos Christianos and Arrasy Rahman and Stefano V. Albrecht},
   year={2019},
   eprint={1906.04737},
   archivePrefix={arXiv},
   primaryClass={cs.LG}
}

2018

Stefano V. Albrecht, Peter Stone
Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems
Artificial Intelligence, 2018
Abstract | BibTex | arXiv | Publisher
AIJ survey agent-modelling goal-recognition

@article{ albrecht2018modelling,
   title = {Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems},
   author = {Stefano V. Albrecht and Peter Stone},
   journal = {Artificial Intelligence},
   volume = {258},
   pages = {66--95},
   year = {2018},
   publisher = {Elsevier},
   note = {DOI: 10.1016/j.artint.2018.01.002}
}

Craig Innes, Alex Lascarides, Stefano V. Albrecht, Subramanian Ramamoorthy, Benjamin Rosman
Reasoning about Unforeseen Possibilities During Policy Learning
arXiv:1801.03331, 2018
Abstract | BibTex | arXiv
causal

@misc{innes2018reasoning,
   title={Reasoning about Unforeseen Possibilities During Policy Learning},
   author={Craig Innes and Alex Lascarides and Stefano V. Albrecht and Subramanian Ramamoorthy and Benjamin Rosman},
   year={2018},
   eprint={1801.03331},
   archivePrefix={arXiv},
   primaryClass={cs.AI}
}

2017

Stefano V. Albrecht, Somchaya Liemhetcharat, Peter Stone
Special Issue on Multiagent Interaction without Prior Coordination: Guest Editorial
Journal of Autonomous Agents and Multi-Agent Systems, 2017
Abstract | BibTex | Publisher | MIPC Workshop Series
JAAMAS survey ad-hoc-teamwork

@article{ albrecht2017special,
   title = {Special Issue on Multiagent Interaction without Prior Coordination: Guest Editorial},
   author = {Stefano V. Albrecht and Somchaya Liemhetcharat and Peter Stone},
   journal = {Autonomous Agents and Multi-Agent Systems},
   volume = {31},
   issue = {4},
   pages = {765--766},
   year = {2017},
   publisher = {Springer},
   url = {http://dx.doi.org/10.1007/s10458-016-9358-0}
}

Stefano V. Albrecht, Peter Stone
Reasoning about Hypothetical Agent Behaviours and their Parameters
International Conference on Autonomous Agents and Multiagent Systems, 2017
Abstract | BibTex | arXiv
AAMAS ad-hoc-teamwork agent-modelling

@inproceedings{ albrecht2017reasoning,
   title = {Reasoning about Hypothetical Agent Behaviours and their Parameters},
   author = {Stefano V. Albrecht and Peter Stone},
   booktitle = {Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems},
   pages = {547--555},
   year = {2017}
}

Stefano V. Albrecht, Subramanian Ramamoorthy
Exploiting Causality for Selective Belief Filtering in Dynamic Bayesian Networks (Extended Abstract)
International Joint Conference on Artificial Intelligence, 2017
Abstract | BibTex | arXiv
IJCAI state-estimation causal

@inproceedings{ albrecht2017causality,
   title = {Exploiting Causality for Selective Belief Filtering in Dynamic {B}ayesian Networks (Extended Abstract)},
   author = {Stefano V. Albrecht and Subramanian Ramamoorthy},
   booktitle = {Proceedings of the 26th International Joint Conference on Artificial Intelligence},
   address = {Melbourne, Australia},
   month = {August},
   year = {2017}
}

2016

Stefano V. Albrecht, Jacob W. Crandall, Subramanian Ramamoorthy
Belief and Truth in Hypothesised Behaviours
Artificial Intelligence, 2016
Abstract | BibTex | arXiv | Publisher
AIJ agent-modelling ad-hoc-teamwork

@article{ albrecht2016belief,
   title = {Belief and Truth in Hypothesised Behaviours},
   author = {Stefano V. Albrecht and Jacob W. Crandall and Subramanian Ramamoorthy},
   journal = {Artificial Intelligence},
   volume = {235},
   pages = {63--94},
   year = {2016},
   publisher = {Elsevier},
   note = {DOI: 10.1016/j.artint.2016.02.004}
}

Stefano V. Albrecht, Subramanian Ramamoorthy
Exploiting Causality for Selective Belief Filtering in Dynamic Bayesian Networks
Journal of Artificial Intelligence Research, 2016
Abstract | BibTex | arXiv | Publisher
JAIR state-estimation causal

@article{ albrecht2016causality,
   title = {Exploiting Causality for Selective Belief Filtering in Dynamic {B}ayesian Networks},
   author = {Stefano V. Albrecht and Subramanian Ramamoorthy},
   journal = {Journal of Artificial Intelligence Research},
   volume = {55},
   pages = {1135--1178},
   year = {2016},
   publisher = {AI Access Foundation},
   note = {DOI: 10.1613/jair.5044}
}

2015

Stefano V. Albrecht, Subramanian Ramamoorthy
Are You Doing What I Think You Are Doing? Criticising Uncertain Agent Models
Conference on Uncertainty in Artificial Intelligence, 2015
Abstract | BibTex | arXiv
UAI agent-modelling

@inproceedings{ albrecht2015criticising,
   title = {Are You Doing What {I} Think You Are Doing? Criticising Uncertain Agent Models},
   author = {Stefano V. Albrecht and Subramanian Ramamoorthy},
   booktitle = {Proceedings of the 31st Conference on Uncertainty in Artificial Intelligence},
   pages = {52--61},
   year = {2015}
}

Stefano V. Albrecht, Jacob W. Crandall, Subramanian Ramamoorthy
An Empirical Study on the Practical Impact of Prior Beliefs over Policy Types
AAAI Conference on Artificial Intelligence, 2015
Abstract | BibTex | arXiv | Appendix
AAAI agent-modelling ad-hoc-teamwork

@inproceedings{ albrecht2015empirical,
   title = {An Empirical Study on the Practical Impact of Prior Beliefs over Policy Types},
   author = {Stefano V. Albrecht and Jacob W. Crandall and Subramanian Ramamoorthy},
   booktitle = {Proceedings of the 29th AAAI Conference on Artificial Intelligence},
   pages = {1988--1994},
   year = {2015}
}

Stefano V. Albrecht, Jacob W. Crandall, Subramanian Ramamoorthy
E-HBA: Using Action Policies for Expert Advice and Agent Typification
AAAI Workshop on Multiagent Interaction without Prior Coordination, 2015
Abstract | BibTex | arXiv | Appendix
AAAI agent-modelling ad-hoc-teamwork

@inproceedings{ albrecht2015ehba,
   title = {{E-HBA}: Using Action Policies for Expert Advice and Agent Typification},
   author = {Stefano V. Albrecht and Jacob W. Crandall and Subramanian Ramamoorthy},
   booktitle = {AAAI Workshop on Multiagent Interaction without Prior Coordination},
   address = {Austin, Texas, USA},
   month = {January},
   year = {2015}
}

2014

Stefano V. Albrecht, Subramanian Ramamoorthy
On Convergence and Optimality of Best-Response Learning with Policy Types in Multiagent Systems
Conference on Uncertainty in Artificial Intelligence, 2014
Abstract | BibTex | arXiv | Appendix
UAI agent-modelling

@inproceedings{ albrecht2014convergence,
   title = {On Convergence and Optimality of Best-Response Learning with Policy Types in Multiagent Systems},
   author = {Stefano V. Albrecht and Subramanian Ramamoorthy},
   booktitle = {Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence},
   pages = {12--21},
   year = {2014}
}

2013

Stefano V. Albrecht, Subramanian Ramamoorthy
A Game-Theoretic Model and Best-Response Learning Method for Ad Hoc Coordination in Multiagent Systems
International Conference on Autonomous Agents and Multiagent Systems, 2013
Abstract | BibTex | arXiv (full technical report) | Extended Abstract
AAMAS ad-hoc-teamwork agent-modelling

@inproceedings{ albrecht2013game,
   title = {A Game-Theoretic Model and Best-Response Learning Method for Ad Hoc Coordination in Multiagent Systems},
   author = {Stefano V. Albrecht and Subramanian Ramamoorthy},
   booktitle = {Proceedings of the 12th International Conference on Autonomous Agents and Multiagent Systems},
   address = {St. Paul, Minnesota, USA},
   month = {May},
   year = {2013}
}

2012

Stefano V. Albrecht, Subramanian Ramamoorthy
Comparative Evaluation of Multiagent Learning Algorithms in a Diverse Set of Ad Hoc Team Problems
International Conference on Autonomous Agents and Multiagent Systems, 2012
Abstract | BibTex | arXiv
AAMAS multi-agent-rl ad-hoc-teamwork

@inproceedings{ albrecht2012comparative,
   title = {Comparative Evaluation of {MAL} Algorithms in a Diverse Set of Ad Hoc Team Problems},
   author = {Stefano V. Albrecht and Subramanian Ramamoorthy},
   booktitle = {Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems},
   pages = {349--356},
   year = {2012}
}