Amr Farahat @AmrFouad_ tweet - 🧵 time! 1/15 Why are CNNs so good at predicting neural responses in the primate visual system? Is it their design (architecture) or learning (training)? And does this change along the visual hierarchy? https://t.co/wqPuLiHEVM

Amr Farahat

@AmrFouad_

8 months

🧵 time! 1/15 Why are CNNs so good at predicting neural responses in the primate visual system? Is it their design (architecture) or learning (training)? And does this change along the visual hierarchy?

Amr Farahat

@AmrFouad_

10 months

🚨Preprint Alert New work with @martin_a_vinck We elucidate the architectural bias that enables CNNs to predict early visual cortex responses in macaques and humans even without optimization of convolutional kernels. 🧠🤖

230

Replies

Amr Farahat

@AmrFouad_

8 months

2/15 We found that training CNNs for object recognition doesn’t improve V1 encoding as much as it does for higher visual areas (like IT in monkeys or VO in humans)! Is V1 encoding more about architecture than learning?

Amr Farahat

@AmrFouad_

8 months

3/15 Surprisingly, we found out that even training simple CNN models directly on V1 data did not improve encoding performance substantially, unlike IT. However, that was only true for CNNs using ReLU activation functions and/or max pooling.

Amr Farahat

@AmrFouad_

8 months

4/15 We quantified the complexity of the models' transformations and found that ReLU models and max pooling models had considerably higher complexity. Complexity explained substantial variance in V1 encoding performance in comparison to IT (63%) and VO (55%) (not shown here)

Amr Farahat

@AmrFouad_

8 months

5/15 This means that predicting responses in higher visual areas (e.g., IT, VO) strongly depends on precise weight configurations acquired through training in contrast to V1, highlighting the functional specialization of those areas.

Amr Farahat

@AmrFouad_

8 months

6/15 Even when we shuffled the trained weights of the convolutional filters, V1 models were way less affected than IT

Amr Farahat

@AmrFouad_

8 months

7/15 Importantly, these findings hold true both for firing rates in monkeys and human fMRI data, suggesting their generalizability.

Amr Farahat

@AmrFouad_

8 months

8/15 ReLU was introduced to DNNs inspired by sparsity and i/o function of biological neurons. To test its biological relevance, we looked for characteristics of early visual processing: orientation selectivity and the capacity to support texture discrimination

Amr Farahat

@AmrFouad_

8 months

9/15 We quantified the orientation selectivity (OS) of artificial neurons using circular variance and calculated how their distribution deviates from the distribution of an independent dataset of experimentally recorded v1 neurons

Amr Farahat

@AmrFouad_

8 months

10/15 We found that trained ReLU networks are the most V1-like concerning OS. Moreover, random ReLU networks were the most V1-like among random networks and even on par with other fully trained networks.

Amr Farahat

@AmrFouad_

8 months

11/15 Then we tested for the ability of random networks to support texture discrimination, a task known to involve early visual cortex. We created Texture-MNIST, a dataset that allows for training for two tasks: object (Digit) recognition and texture discrimination

Amr Farahat

@AmrFouad_

8 months

12/15 We found that random ReLU networks performed the best among random networks and only slightly worse than the fully trained counterpart.

Amr Farahat

@AmrFouad_

8 months

13/15 Our results suggest that the architecture bias of CNNs is key to predicting neural responses in the early visual cortex, which aligns with results in computer vision, showing that random convolutions suffice for several visual tasks.

Amr Farahat

@AmrFouad_

8 months

14/15 Our results also emphasize the importance of rigorous controls when using black box models like DNNs in neural modeling. They can show what makes a good neural model, and help us generate hypotheses about brain computations

Amr Farahat

@AmrFouad_

8 months

15/15 It is also important to use various ways to assess model strengths and weaknesses, not just one like prediction accuracy.

Amr Farahat

@AmrFouad_

8 months