Research Challenges the Dominance of Primary Auditory Cortex in Speech Models

The intricate process of speech perception involves the human auditory system deciphering linguistic abstractions from speech signals. While traditional models using linear feature-encoding have had limited success in understanding this complex process, artificial neural networks, particularly deep neural network (DNN) models, have shown promise in speech recognition tasks.

A recent groundbreaking study utilized state-of-the-art DNN models to explore neural coding from the auditory nerve to the speech cortex, unravelling the correlation between DNN representations and neural activity in the ascending auditory system. 

The research unveiled several key findings: 

  • The hierarchy in DNNs learning speech representations correlates well with the ascending auditory pathway, demonstrating the alignment of computational structures. 
  • Unsupervised speech models performed on par with or even better than purely supervised or fine-tuned models, showcasing the DNNs’ ability to learn meaningful representations without explicit linguistic knowledge. 
  • Deeper layers of DNNs exhibited better correlation with neural activity in higher-order auditory cortex regions, aligning with phonemic and syllabic structures in speech. 
  • DNN-based models revealed language-specific properties in cross-language speech perception, offering insights into language-specific coding in the superior temporal gyrus (STG) during cross-language perception. 

The study employed a neural encoding framework to systematically evaluate the similarity between the auditory pathway and DNN models with different architectures and training strategies. Importantly, it used a cross-linguistic paradigm, going beyond the constraints of a single language, to uncover both language-invariant and language-specific aspects during speech perception. 

The results challenged traditional cognitive models and neural encoding models, showcasing the limitations of linear encoding models in capturing higher-order speech information. DNNs, with their nonlinearity and dynamic temporal integration of phonological contextual information, outperformed traditional models, especially in predicting responses in the nonprimary auditory cortex. 

The study also shed light on the computational attributes of DNN models, indicating that different architectures better correlated with different parts of the auditory pathway. While convolution layers were suitable for the auditory periphery, deeper transformer-encoder and LSTM layers better fit the speech–auditory cortex, demonstrating the DNNs’ ability to adapt to different levels of processing. 

The findings have implications for interpreting the functions of the primary and nonprimary auditory cortical areas. The study challenged the notion of the primary auditory cortex’s sole contribution to advanced computational models of speech processing, emphasizing the role of the entire auditory pathway in speech perception. 

In conclusion, this research offers new insights into neural coding in the auditory cortex, demonstrating the potential of DNN models to correlate with and enhance our understanding of the human auditory system. The study’s approach opens avenues for data-driven computational models of sensory perception and emphasizes the importance of considering dynamic temporal integration and nonlinearity in modelling speech processing across the auditory pathway. 

Journal Reference  

Li, Y., Anumanchipalli, G.K., Mohamed, A. et al. Dissecting neural computations in the human auditory pathway using deep neural networks for speech. Nat Neurosci (2023). https://doi.org/10.1038/s41593-023-01468-4 

Latest Posts

Free CME credits

Both our subscription plans include Free CME/CPD AMA PRA Category 1 credits.

Digital Certificate PDF

On course completion, you will receive a full-sized presentation quality digital certificate.

medtigo Simulation

A dynamic medical simulation platform designed to train healthcare professionals and students to effectively run code situations through an immersive hands-on experience in a live, interactive 3D environment.

medtigo Points

medtigo points is our unique point redemption system created to award users for interacting on our site. These points can be redeemed for special discounts on the medtigo marketplace as well as towards the membership cost itself.
 
  • Registration with medtigo = 10 points
  • 1 visit to medtigo’s website = 1 point
  • Interacting with medtigo posts (through comments/clinical cases etc.) = 5 points
  • Attempting a game = 1 point
  • Community Forum post/reply = 5 points

    *Redemption of points can occur only through the medtigo marketplace, courses, or simulation system. Money will not be credited to your bank account. 10 points = $1.

All Your Certificates in One Place

When you have your licenses, certificates and CMEs in one place, it's easier to track your career growth. You can easily share these with hospitals as well, using your medtigo app.

Our Certificate Courses