Search

Scholarly Works (2 results)

Article
Peer Reviewed

A computationally rational model of human reinforcment learning

Proceedings of the Annual Meeting of the Cognitive Science Society, Volume 43 (2021)

Human learning efficiency in reinforcement learning tasks decreases when the number of the presented stimuli increases, a finding known as the "set size effect". From the computational rationality perspective, this effect can be interpreted as the brain’s balancing task performance against rising cognitive costs. Still, it remains unclear how best to quantify cognitive cost in learning tasks. One candidate is policy complexity, defined in terms of information theory as the mutual information between the sensory input and behavioral response. However, using a published data set (Collins & Frank, 2012), we show that policy complexity alone cannot explain the set size effect because the optimal policy complexity does not necessarily increase with the set size. We therefore propose a computational model and conduct a model-based analysis to show the minimal constituents of cognitive cost are policy complexity and representation complexity---the information quantity conveyed from sensory inputs to internal representations.

Cover page: A computationally rational model of human reinforcment learning

Creative Commons 'BY' version 4.0 license

Article
Peer Reviewed

A computationally rational analysis of response strategy in a probability learning task

Proceedings of the Annual Meeting of the Cognitive Science Society, Volume 44 (2022)

Intelligent behavior requires the ability to adapt to an ever-changing environment. But are humans rational or normative in this ability? We apply a resource-rational analysis to the data from a probability learning task (Gagne et al., 2020). Our analysis hypothesizes that people seek to maximize the expected utility of behavior, while simultaneously minimizing the complexity of their behavioral policies. We report evidence consistent with this hypothesis. We also show that people adopt simpler policies in situations of greater environmental stability, and interpret this as a consequence of reward maximization.

Cover page: A computationally rational analysis of response strategy in a probability learning task