“Adaptive learning” as a mechanistic candidate for reaching optimal task-set representations flexibly

Ardid, Salva; Balcarras, Matthew; Womelsdorf, Thilo

doi:10.1186/1471-2202-15-S1-P8

Volume 15 Supplement 1

Abstracts from the Twenty Third Annual Computational Neuroscience Meeting: CNS*2014

Poster presentation
Open access
Published: 21 July 2014

“Adaptive learning” as a mechanistic candidate for reaching optimal task-set representations flexibly

Salva Ardid¹,
Matthew Balcarras¹ &
Thilo Womelsdorf¹

BMC Neuroscience volume 15, Article number: P8 (2014) Cite this article

1293 Accesses
1 Citations
Metrics details

Top-down inputs from prefrontal cortex impact on sensory neurons [1, 2], enhancing their selectivity to attended stimuli, while sensory processing of distractors is suppressed [1, 3, 4]. However, what are the neuro-computational mechanisms that identify the behaviorally relevant information that is worth to bias? Previous studies linked selective attention to learned value [5], suggesting that attentional selection relies on internal mechanisms that track the relevance of sensory information. Consistent with this view, we recently introduced a reinforcement learning (RL) approach for the deployment of selective attention [6]. We tested model-free and model-based versions of RL to identify the mechanisms that most accurately predict behavior: whereas model-based prioritizes attentional selection to task features that are systematically associated with reward, model-free considers all available features. Our results proved that the optimal task-set representation significantly improved predictive power, suggesting that monkeys benefited from model-based mechanisms [6]. Yet, model-based presents two important limitations: i) it is unable to adapt to changes in the association of reward with sensory features that are not included in the learned model, and ii) it excludes the mechanism of learning by which subjects derive the proper task-set representation. The question remains then of how a prioritized task set can be learned to exploit the benefits of employing a model.

With the aim to workaround model-based limitations and shed light on the underlying mechanisms that make model-based benefits possible, we propose here the “adaptive learning” mechanism for flexible task-set representation. The mechanism is dynamically tuned according to the statistics of association between sensory features and reward outcome, flexibly adjusting its regime of operation between model-free and model-based systems. To test the adaptive learning mechanism, we developed it in a RL model and extended our previous analysis of monkey behavior to both, cued [5] and uncued [6], versions of the same attention task. This RL model was able to transition from a naive starting point to an optimal task-set representation, and to flexibly adapt among optimal task-set representations upon changes in reward contingencies. The model achieved so by tracking a separate learning rate α for each stimulus feature in the environment. If the selection of a stimulus feature was systematically correlated with a particular outcome (reward or no-reward), α increased. In contrast, when a feature was unable to consistently predict reward, α decayed. Thus, α changed dynamically, and over a large number of trials any α associated with a non-predictive feature went to zero, effectively eliminating it from the task set, which tuned performance towards that of the model-based system.

The adaptive learning mechanism introduced here represents a step further in our understanding of the origins of selective attention. Notably, our results prove that the adaptive learning is an optimal mechanistic candidate to support arbitrary prioritized model-based formation under generic conditions of covert attentional selection, and regardless of whether attentional selection operates on cued or uncued tasks.

References

Ardid S, Wang X-J, Compte A: An integrated microcircuit model of attentional processing in the neocortex. J Neurosci. 2007, 27 (32): 8486-8495. 10.1523/JNEUROSCI.1145-07.2007.
Article CAS PubMed Google Scholar
Gregoriou GG, Gotts SJ, Zhou H, Desimone R: High-frequency, long-range coupling between prefrontal and visual cortex during attention. Science. 2009, 324 (5931): 1207-1210. 10.1126/science.1171402.
Article PubMed Central CAS PubMed Google Scholar
Reynolds JH, Chelazzi L: Attentional modulation of visual processing. Annu Rev Neurosci. 2004, 27: 611-647. 10.1146/annurev.neuro.26.041002.131039.
Article CAS PubMed Google Scholar
Maunsell JH, Treue S: Feature-based attention in visual cortex. Trends Neurosci. 2006, 29 (6): 317-322. 10.1016/j.tins.2006.04.001.
Article CAS PubMed Google Scholar
Kaping D, Vinck M, Hutchison RM, Everling S, Womelsdorf T: Specific contributions of ventromedial, anterior cingulate, and lateral prefrontal cortex for attentional selection and stimulus valuation. PLoS Biol. 2011, 9 (12): e1001224-10.1371/journal.pbio.1001224.
Article PubMed Central CAS PubMed Google Scholar
Balcarras M, Ardid S, Kaping D, Everling S, Womelsdorf T: Learning of expected stimulus values in a selective attentional foraging task by nonhuman primates. Soc Neurosci Abstr. 2012
Google Scholar

Download references

Acknowledgements

This work is supported by the Canadian Institutes of Health Research and Natural Sciences and Engineering Research Council of Canada.

Author information

Authors and Affiliations

Department of Biology and Centre for Vision Research, York University, Toronto, Ontario, Canada, M3J 1P3
Salva Ardid, Matthew Balcarras & Thilo Womelsdorf

Authors

Salva Ardid
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Balcarras
View author publications
You can also search for this author in PubMed Google Scholar
Thilo Womelsdorf
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Salva Ardid.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Ardid, S., Balcarras, M. & Womelsdorf, T. “Adaptive learning” as a mechanistic candidate for reaching optimal task-set representations flexibly. BMC Neurosci 15 (Suppl 1), P8 (2014). https://0-doi-org.brum.beds.ac.uk/10.1186/1471-2202-15-S1-P8

Download citation

Published: 21 July 2014
DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/1471-2202-15-S1-P8

Abstracts from the Twenty Third Annual Computational Neuroscience Meeting: CNS*2014

“Adaptive learning” as a mechanistic candidate for reaching optimal task-set representations flexibly

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

BMC Neuroscience

Contact us

Abstracts from the Twenty Third Annual Computational Neuroscience Meeting: CNS*2014

“Adaptive learning” as a mechanistic candidate for reaching optimal task-set representations flexibly

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Neuroscience

Contact us