A reinforcement learning model of bounded optimal strategy learning

Xiuli Chen; Andrew Howes

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

In this paper we report a reinforcement learning model of how individuals learn the value of strategies for remembering. The model learns from experience about the changing speed and accuracy of memory strategies. The reward function was sensitive to the internal information processing constraints (limited working memory capacity) of the participants. In addition, because the value of strategies for remembering changed with practice, experience was discounted according to a recency-weighted function. The model was used to generate predictions of the behavioural data of 40 participants who were asked to copy appointment information from an email message to a calendar. The experience discounting parameter for a model of each individual participant was set so as to maximize the expected rewards for that participant. The predictions of this bounded optimal control model were compared with the observed data. The result suggests that people may be able to choose remembering strategies on the basis of optimally discounted past experience.
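The abstract does not give the update rule or the reward function, so the following is only a minimal sketch of one common way to discount experience by recency: an exponential recency-weighted average with a constant step size. The strategy names, reward values, and the `step_size` parameter are hypothetical illustrations, not taken from the paper.

```python
def update_value(value, reward, step_size):
    """Move the estimate toward the latest reward.

    With a constant step size in (0, 1], older rewards are
    down-weighted exponentially (a recency-weighted average).
    """
    return value + step_size * (reward - value)


def learn_strategy_values(experience, step_size, initial_value=0.0):
    """Replay (strategy, reward) pairs; return one value estimate per strategy."""
    values = {}
    for strategy, reward in experience:
        current = values.get(strategy, initial_value)
        values[strategy] = update_value(current, reward, step_size)
    return values


def choose_strategy(values):
    """Greedy choice: the strategy with the highest current estimate."""
    return max(values, key=values.get)


# Hypothetical experience: names and rewards are made up for illustration.
experience = [
    ("memorise_all", 1.0),
    ("memorise_all", 0.0),
    ("copy_in_chunks", 0.5),
    ("copy_in_chunks", 1.0),
]
values = learn_strategy_values(experience, step_size=0.5)
best = choose_strategy(values)
```

A larger `step_size` weights recent outcomes more heavily, which suits strategies whose speed and accuracy change with practice. Fitting that parameter per participant, as the abstract describes, would require the paper's actual reward function, which is not reproduced here.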

Original language	English
Title of host publication	Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012
Pages	193-198
Number of pages	6
Publication status	Published - 2012
Event	11th International Conference on Cognitive Modeling, ICCM 2012 - Berlin, Germany; Duration: 13 Apr 2012 → 15 Apr 2012

Publication series

Name	Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012

Conference

Conference	11th International Conference on Cognitive Modeling, ICCM 2012
Country/Territory	Germany
City	Berlin
Period	13/04/12 → 15/04/12

Bibliographical note

Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.

Keywords

Bounded optimal
Information processing bounds
Memory constraints
Reinforcement learning

ASJC Scopus subject areas

Artificial Intelligence
Modelling and Simulation

Cite this

@inproceedings{9ce587da59984c5cba4e32a8765adf5b,
title = "A reinforcement learning model of bounded optimal strategy learning",
abstract = "In this paper we report a reinforcement learning model of how individuals learn the value of strategies for remembering. The model learns from experience about the changing speed and accuracy of memory strategies. The reward function was sensitive to the internal information processing constraints (limited working memory capacity) of the participants. In addition, because the value of strategies for remembering changed with practice, experience was discounted according to a recency-weighted function. The model was used to generate predictions of the behavioural data of 40 participants who were asked to copy appointment information from an email message to a calendar. The experience discounting parameter for a model of each individual participant was set so as to maximize the expected rewards for that participant. The predictions of this bounded optimal control model were compared with the observed data. The result suggests that people may be able to choose remembering strategies on the basis of optimally discounted past experience.",
keywords = "Bounded optimal, Information processing bounds, Memory constraints, Reinforcement learning",
author = "Xiuli Chen and Andrew Howes",
year = "2012",
language = "English",
isbn = "9783798324084",
series = "Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012",
pages = "193--198",
booktitle = "Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012",
}

A reinforcement learning model of bounded optimal strategy learning. / Chen, Xiuli; Howes, Andrew.
Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012. 2012. p. 193-198 (Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012).


TY - GEN
T1 - A reinforcement learning model of bounded optimal strategy learning
AU - Chen, Xiuli
AU - Howes, Andrew
PY - 2012
Y1 - 2012
N2 - In this paper we report a reinforcement learning model of how individuals learn the value of strategies for remembering. The model learns from experience about the changing speed and accuracy of memory strategies. The reward function was sensitive to the internal information processing constraints (limited working memory capacity) of the participants. In addition, because the value of strategies for remembering changed with practice, experience was discounted according to a recency-weighted function. The model was used to generate predictions of the behavioural data of 40 participants who were asked to copy appointment information from an email message to a calendar. The experience discounting parameter for a model of each individual participant was set so as to maximize the expected rewards for that participant. The predictions of this bounded optimal control model were compared with the observed data. The result suggests that people may be able to choose remembering strategies on the basis of optimally discounted past experience.
AB - In this paper we report a reinforcement learning model of how individuals learn the value of strategies for remembering. The model learns from experience about the changing speed and accuracy of memory strategies. The reward function was sensitive to the internal information processing constraints (limited working memory capacity) of the participants. In addition, because the value of strategies for remembering changed with practice, experience was discounted according to a recency-weighted function. The model was used to generate predictions of the behavioural data of 40 participants who were asked to copy appointment information from an email message to a calendar. The experience discounting parameter for a model of each individual participant was set so as to maximize the expected rewards for that participant. The predictions of this bounded optimal control model were compared with the observed data. The result suggests that people may be able to choose remembering strategies on the basis of optimally discounted past experience.
KW - Bounded optimal
KW - Information processing bounds
KW - Memory constraints
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=84877777964&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84877777964
SN - 9783798324084
T3 - Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012
SP - 193
EP - 198
BT - Proceedings of the 11th International Conference on Cognitive Modeling, ICCM 2012
T2 - 11th International Conference on Cognitive Modeling, ICCM 2012
Y2 - 13 April 2012 through 15 April 2012
ER -
