Abstract
We investigate an approach for enabling a reinforcement learning
agent to learn about dangerous states or constraints from stop-
feedback preventing the agent from taking any further, potentially
dangerous, actions. Such feedback could be provided by human
supervisors overseeing the RL agent’s behavior while carrying
out some complex tasks. To enable the RL agent to learn from
the supervisor’s feedback, we propose a probabilistic model for
approximating how the supervisor’s feedback could have been
generated and consider a Bayesian approach for inferring dangerous
states. We evaluated our approach using an OpenAI Safety Gym
environment and demonstrated that our agent can effectively infer
the imposed safety constraints. Furthermore, we conducted a user
study to validate our human-inspired feedback model and to obtain
insights into the human provision of stop-feedback.
agent to learn about dangerous states or constraints from stop-
feedback preventing the agent from taking any further, potentially
dangerous, actions. Such feedback could be provided by human
supervisors overseeing the RL agent’s behavior while carrying
out some complex tasks. To enable the RL agent to learn from
the supervisor’s feedback, we propose a probabilistic model for
approximating how the supervisor’s feedback could have been
generated and consider a Bayesian approach for inferring dangerous
states. We evaluated our approach using an OpenAI Safety Gym
environment and demonstrated that our agent can effectively infer
the imposed safety constraints. Furthermore, we conducted a user
study to validate our human-inspired feedback model and to obtain
insights into the human provision of stop-feedback.
| Original language | English |
|---|---|
| Title of host publication | he proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS-2023) |
| ISBN (Electronic) | 978-1-4503-9432-1 |
| Publication status | Published - 29 May 2023 |
| Event | 22nd International Conference on Autonomous Agents and Multiagent Systems - London, United Kingdom Duration: 29 May 2023 → 2 Jun 2023 https://aamas2023.soton.ac.uk/ |
Conference
| Conference | 22nd International Conference on Autonomous Agents and Multiagent Systems |
|---|---|
| Abbreviated title | AAMAS |
| Country/Territory | United Kingdom |
| City | London |
| Period | 29/05/23 → 2/06/23 |
| Internet address |
Austrian Fields of Science 2012
- 102001 Artificial intelligence
Fingerprint
Dive into the research topics of 'Learning Constraints From Human Stop-Feedback in Reinforcement Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver