podcast -- Yahoo Answers users seek advice, opinion, as well as expertise in research by Mark Ackerman, Lada Adamic and STIET fellow Eytan Bakshy
Podcast discussing the STIET research program with Jeff MacKie-Mason and Tom Finholt
podcast -- Yahoo Answers users seek advice, opinion, as well as expertise in research by Mark Ackerman, Lada Adamic and STIET fellow Eytan Bakshy
Podcast discussing the STIET research program with Jeff MacKie-Mason and Tom FinholtSatinder Singh
Professor of Electrical Engineering and Computer Science, University of Michigan
4-5:30 pm
UM: 411 West Hall
WSU: 313 State Hall (via videoconference)
In the computational reinforcement learning (RL) framework, rewards—more specifically, reward functions—determine the problem the learning agent is trying to solve. Properties of the reward function influence how easy or hard the problem is, and how well an agent may do, but RL theory and algorithms are completely insensitive to the source of rewards. This is a strength of the framework because of the generality it confers, but it is also a weakness because it defers key questions about the nature of reward functions. In this talk, I address this weakness from two directions. First, I consider the role of evolution in determining where rewards come from in natural agents. Specifically, I present a computational framework in which evolved rewards capture regularities across environments leaving the agent to learn regularities within its environment during its lifetime. Second, I describe how in designing artificial agents the current use of rewards confounds their role in defining preferences over behaviors and their role as parameters of actual agent behavior (RL agents act so as to maximize reward). Disentangling this "preferences parameters confound" can be beneficial in designing artificial agents. I will present many empirical illustrations of both of these aspects of rethinking rewards in RL.
* This talk describe joint work with Richard Lewis, Andrew G. Barto, Jonathan Sorg and Akram Helou.
Satinder's website is http://www.eecs.umich.edu/~baveja/
Satinder Singh Baveja is a Professor of Electrical Engineering and Computer Science. His main research interest is in the old-fashioned goal of Artificial Intelligence (AI), that of building autonomous agents that can learn to be broadly competent in complex, dynamic, and uncertain environments. The field of reinforcement learning (RL) has focused on this goal and accordingly my deepest contributions are in RL. More recently, he has been taking seriously the challenge of building agents that can interact with other agents and even humans in both artificial and natural environments. This has led to research in: human-computer interaction, computational game theory, and mechanism design.