A biologically inspired architecture with switching units can learn to generalize across backgrounds

Doris Voina; Eric Shea-Brown; Stefan Mihalas

doi:10.1016/j.neunet.2023.09.014

A biologically inspired architecture with switching units can learn to generalize across backgrounds

Neural Netw. 2023 Nov:168:615-630. doi: 10.1016/j.neunet.2023.09.014. Epub 2023 Sep 17.

Authors

Doris Voina¹, Eric Shea-Brown², Stefan Mihalas²

Affiliations

¹ Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA. Electronic address: dvoina@uw.edu.
² Department of Applied Mathematics, Computational Neuroscience Center, University of Washington, Seattle, WA 98195, USA; Allen Institute for Brain Science, 615 Westlake Ave N, Seattle, WA 98109, USA.

PMID: 37839332
PMCID: PMC10843013 (available on 2024-11-01)
DOI: 10.1016/j.neunet.2023.09.014

Abstract

Humans and other animals navigate different environments effortlessly, their brains rapidly and accurately generalizing across contexts. Despite recent progress in deep learning, this flexibility remains a challenge for many artificial systems. Here, we show how a bio-inspired network motif can explicitly address this issue. We do this using a dataset of MNIST digits of varying transparency, set on one of two backgrounds of different statistics that define two contexts: a pixel-wise noise or a more naturalistic background from the CIFAR-10 dataset. After learning digit classification when both contexts are shown sequentially, we find that both shallow and deep networks have sharply decreased performance when returning to the first background - an instance of the catastrophic forgetting phenomenon known from continual learning. To overcome this, we propose the bottleneck-switching network or switching network for short. This is a bio-inspired architecture analogous to a well-studied network motif in the visual cortex, with additional "switching" units that are activated in the presence of a new background, assuming a priori a contextual signal to turn these units on or off. Intriguingly, only a few of these switching units are sufficient to enable the network to learn the new context without catastrophic forgetting through inhibition of redundant background features. Further, the bottleneck-switching network can generalize to novel contexts similar to contexts it has learned. Importantly, we find that - again as in the underlying biological network motif, recurrently connecting the switching units to network layers is advantageous for context generalization.

Keywords: Bio-inspired; Context; Continual learning; Domain adaptation; Generalization; Switching network.

MeSH terms

Brain* / physiology
Generalization, Psychological
Humans
Neural Networks, Computer*

Abstract

MeSH terms

Grants and funding