Reinforcement Learning with Qualitative Feedback
Name: Reinforcement Learning with Qualitative Feedback
Code Name: RLQF
Funding: DFG - Part of the Priority Programm "Autonomous Learning"
Our goal in this project is to generalize the standard RL framework so as to allow more general types of feedback, notably non-numerical rewards and qualitative advice. Building on novel methods, such as ranking functions that allow for sorting such models. While the focus of the project is on the development of theoretical and methodological foundations of a "preference-based reinforcement learning", we also envision two case studies putting our ideas into practice, one in the field of game playing and another one in a medical context.