Knowledge transfer from machine learning to neuroscience

Abstract

An increasingly popular methodological view in systems neuroscience holds that the best way to understand a large-scale neural system is to (i) define a set of goal-directed behaviors for which that system is responsible, (ii) train an artificial neural network to perform those behaviors, (iii) study how the artificial network generates them, and (iv) use that knowledge to make inferences about how the biological network does so. Call this ML-neuroscience. As with any novel methodological doctrine, ML-neuroscience has attracted controversy. Skeptics say that biological networks are so different from artificial networks that such comparisons are likely to be misleading. One response to this skepticism is to say that, insofar as we are interested in information-processing properties, artificial neural networks really do exemplify the crucial properties of biological networks. Here, I want to offer a different response that concedes more to the skeptic, but nevertheless manages to defend ML-neuroscience. My strategy is to conceptualize the transfer of knowledge from machine learning models to neurobiological systems as an instance of the more general phenomenon of trans-domain modeling. The history of science is full of cases in which mathematical models developed in one discipline get redeployed in other disciplines, despite the lack of readily observable empirical similarities between the respective target systems. What makes such trans-domain modeling possible? Usually, it is not that the two target systems turn out to be two instances of the same natural kind. If that were the case, we should expect to develop a new body of theory that extends to both systems, and a set of theoretical terms that refer to elements in both. This expectation of theoretical unity is sometimes encouraged by defenders of ML-neuroscience, but it ought not to be.
Rather, trans-domain modeling is often possible because the two systems share rather abstract structural properties that are hard to notice without the use of mathematics. Capturing these abstract structural properties often spurs scientific progress even in the absence of theoretical unity. I will illustrate this by means of the well-known Lotka-Volterra model in population biology, which was rediscovered in economic theory. I will then use this case as a guide in considering which level of abstraction is appropriate for making inferences from machine learning models to neuroscience.
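
For reference, the Lotka-Volterra model mentioned above is usually written in the following textbook form. The symbols are conventional notation and are not taken from the abstract: x is the prey population, y is the predator population, and α, β, γ, δ are positive rate constants.

```latex
\begin{aligned}
\frac{dx}{dt} &= \alpha x - \beta x y, \\
\frac{dy}{dt} &= \delta x y - \gamma y.
\end{aligned}
```

The abstract does not say which economic model is meant. One well-known candidate, offered here only as an assumption, is Goodwin's growth-cycle model, in which the employment rate and the wage share obey equations of this same form. On that reading, what the two domains share is the coupled structure of the equations, not any empirical resemblance between animal populations and labor markets.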

Date
Dec 1, 2021 2:00 PM
Location
Berlin