Search | arXiv e-print repository

arXiv:2406.04733 [pdf]

Unsupervised representation learning with Hebbian synaptic and structural plasticity in brain-like feedforward neural networks

Authors: Naresh Ravichandran, Anders Lansner, Pawel Herman

Abstract: Neural networks that can capture key principles underlying brain computation offer exciting new opportunities for developing artificial intelligence and brain-like computing algorithms. Such networks remain biologically plausible while leveraging localized forms of synaptic learning rules and modular network architecture found in the neocortex. Compared to backprop-driven deep learning approches,… ▽ More Neural networks that can capture key principles underlying brain computation offer exciting new opportunities for developing artificial intelligence and brain-like computing algorithms. Such networks remain biologically plausible while leveraging localized forms of synaptic learning rules and modular network architecture found in the neocortex. Compared to backprop-driven deep learning approches, they provide more suitable models for deploying on neuromorphic hardware and have greater potential for scalability on large-scale computing clusters. The development of such brain-like neural networks depends on having a learning procedure that can build effective internal representations from data. In this work, we introduce and evaluate a brain-like neural network model capable of unsupervised representation learning. It builds on the Bayesian Confidence Propagation Neural Network (BCPNN), which has earlier been implemented as abstract as well as biophyscially detailed recurrent attractor neural networks explaining various cortical associative memory phenomena. Here we developed a feedforward BCPNN model to perform representation learning by incorporating a range of brain-like attributes derived from neocortical circuits such as cortical columns, divisive normalization, Hebbian synaptic plasticity, structural plasticity, sparse activity, and sparse patchy connectivity. The model was tested on a diverse set of popular machine learning benchmarks: grayscale images (MNIST, Fashion-MNIST), RGB natural images (SVHN, CIFAR-10), QSAR (MUV, HIV), and malware detection (EMBER). The performance of the model when using a linear classifier to predict the class labels fared competitively with conventional multi-layer perceptrons and other state-of-the-art brain-like neural networks. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.03054 [pdf]

Spiking representation learning for associative memories

Authors: Naresh Ravichandran, Anders Lansner, Pawel Herman

Abstract: Networks of interconnected neurons communicating through spiking signals offer the bedrock of neural computations. Our brains spiking neural networks have the computational capacity to achieve complex pattern recognition and cognitive functions effortlessly. However, solving real-world problems with artificial spiking neural networks (SNNs) has proved to be difficult for a variety of reasons. Cruc… ▽ More Networks of interconnected neurons communicating through spiking signals offer the bedrock of neural computations. Our brains spiking neural networks have the computational capacity to achieve complex pattern recognition and cognitive functions effortlessly. However, solving real-world problems with artificial spiking neural networks (SNNs) has proved to be difficult for a variety of reasons. Crucially, scaling SNNs to large networks and processing large-scale real-world datasets have been challenging, especially when compared to their non-spiking deep learning counterparts. The critical operation that is needed of SNNs is the ability to learn distributed representations from data and use these representations for perceptual, cognitive and memory operations. In this work, we introduce a novel SNN that performs unsupervised representation learning and associative memory operations leveraging Hebbian synaptic and activity-dependent structural plasticity coupled with neuron-units modelled as Poisson spike generators with sparse firing (~1 Hzへるつ mean and ~100 Hzへるつ maximum firing rate). Crucially, the architecture of our model derives from the neocortical columnar organization and combines feedforward projections for learning hidden representations and recurrent projections for forming associative memories. We evaluated the model on properties relevant for attractor-based associative memories such as pattern completion, perceptual rivalry, distortion resistance, and prototype extraction. △ Less

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2405.17921 [pdf]

Towards Clinical AI Fairness: Filling Gaps in the Puzzle

Authors: Mingxuan Liu, Yilin Ning, Salinelat Teixayavong, Xiaoxuan Liu, Mayli Mertens, Yuqing Shang, Xin Li, Di Miao, Jie Xu, Daniel Shu Wei Ting, Lionel Tim-Ee Cheng, Jasmine Chiat Ling Ong, Zhen Ling Teo, Ting Fang Tan, Narrendar RaviChandran, Fei Wang, Leo Anthony Celi, Marcus Eng Hock Ong, Nan Liu

Abstract: The ethical integration of Artificial Intelligence (AI) in healthcare necessitates addressing fairness-a concept that is highly context-specific across medical fields. Extensive studies have been conducted to expand the technical components of AI fairness, while tremendous calls for AI fairness have been raised from healthcare. Despite this, a significant disconnect persists between technical adva… ▽ More The ethical integration of Artificial Intelligence (AI) in healthcare necessitates addressing fairness-a concept that is highly context-specific across medical fields. Extensive studies have been conducted to expand the technical components of AI fairness, while tremendous calls for AI fairness have been raised from healthcare. Despite this, a significant disconnect persists between technical advancements and their practical clinical applications, resulting in a lack of contextualized discussion of AI fairness in clinical settings. Through a detailed evidence gap analysis, our review systematically pinpoints several deficiencies concerning both healthcare data and the provided AI fairness solutions. We highlight the scarcity of research on AI fairness in many medical domains where AI technology is increasingly utilized. Additionally, our analysis highlights a substantial reliance on group fairness, aiming to ensure equality among demographic groups from a macro healthcare system perspective; in contrast, individual fairness, focusing on equity at a more granular level, is frequently overlooked. To bridge these gaps, our review advances actionable strategies for both the healthcare and AI research communities. Beyond applying existing AI fairness methods in healthcare, we further emphasize the importance of involving healthcare professionals to refine AI fairness concepts and methods to ensure contextually relevant and ethically sound AI applications in healthcare. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2402.17400 [pdf, other]

Investigating Continual Pretraining in Large Language Models: Insights and Implications

Authors: Çağatay Yıldız, Nishaanth Kanna Ravichandran, Prishruit Punia, Matthias Bethge, Beyza Ermis

Abstract: This paper studies the evolving domain of Continual Learning (CL) in large language models (LLMs), with a focus on developing strategies for efficient and sustainable training. Our primary emphasis is on continual domain-adaptive pretraining, a process designed to equip LLMs with the ability to integrate new information from various domains while retaining previously learned knowledge and enhancin… ▽ More This paper studies the evolving domain of Continual Learning (CL) in large language models (LLMs), with a focus on developing strategies for efficient and sustainable training. Our primary emphasis is on continual domain-adaptive pretraining, a process designed to equip LLMs with the ability to integrate new information from various domains while retaining previously learned knowledge and enhancing cross-domain knowledge transfer without relying on domain-specific identification. Unlike previous studies, which mostly concentrate on a limited selection of tasks or domains and primarily aim to address the issue of forgetting, our research evaluates the adaptability and capabilities of LLMs to changing data landscapes in practical scenarios. To this end, we introduce a new benchmark designed to measure the adaptability of LLMs to these evolving data environments, offering a comprehensive framework for evaluation. We examine the impact of model size on learning efficacy and forgetting, as well as how the progression and similarity of emerging domains affect the knowledge transfer within these models. Our findings uncover several key insights: (i) when the sequence of domains shows semantic similarity, continual pretraining enables LLMs to better specialize in the current domain compared to stand-alone fine-tuning, (ii) training across a diverse range of domains enhances both backward and forward knowledge transfer, and (iii) smaller models are particularly sensitive to continual pretraining, showing the most significant rates of both forgetting and learning. We posit that our research marks a shift towards establishing a more realistic benchmark for investigating CL in LLMs, and has the potential to play a key role in guiding the direction of future research in the field. △ Less

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2401.00335 [pdf]

Benchmarking Hebbian learning rules for associative memory

Authors: Anders Lansner, Naresh B Ravichandran, Pawel Herman

Abstract: Associative memory or content addressable memory is an important component function in computer science and information processing and is a key concept in cognitive and computational brain science. Many different neural network architectures and learning rules have been proposed to model associative memory of the brain while investigating key functions like pattern completion and rivalry, noise re… ▽ More Associative memory or content addressable memory is an important component function in computer science and information processing and is a key concept in cognitive and computational brain science. Many different neural network architectures and learning rules have been proposed to model associative memory of the brain while investigating key functions like pattern completion and rivalry, noise reduction, and storage capacity. A less investigated but important function is prototype extraction where the training set comprises pattern instances generated by distorting prototype patterns and the task of the trained network is to recall the correct prototype pattern given a new instance. In this paper we characterize these different aspects of associative memory performance and benchmark six different learning rules on storage capacity and prototype extraction. We consider only models with Hebbian plasticity that operate on sparse distributed representations with unit activities in the interval [0,1]. We evaluate both non-modular and modular network architectures and compare performance when trained and tested on different kinds of sparse random binary pattern sets, including correlated ones. We show that covariance learning has a robust but low storage capacity under these conditions and that the Bayesian Confidence Propagation learning rule (BCPNN) is superior with a good margin in all cases except one, reaching a three times higher composite score than the second best learning rule tested. △ Less

Submitted 30 December, 2023; originally announced January 2024.

Comments: 24 pages, 9 figures

arXiv:2312.03886 [pdf, other]

On The Fairness Impacts of Hardware Selection in Machine Learning

Authors: Sree Harsha Nelaturu, Nishaanth Kanna Ravichandran, Cuong Tran, Sara Hooker, Ferdinando Fioretto

Abstract: In the machine learning ecosystem, hardware selection is often regarded as a mere utility, overshadowed by the spotlight on algorithms and data. This oversight is particularly problematic in contexts like ML-as-a-service platforms, where users often lack control over the hardware used for model deployment. How does the choice of hardware impact generalization properties? This paper investigates th… ▽ More In the machine learning ecosystem, hardware selection is often regarded as a mere utility, overshadowed by the spotlight on algorithms and data. This oversight is particularly problematic in contexts like ML-as-a-service platforms, where users often lack control over the hardware used for model deployment. How does the choice of hardware impact generalization properties? This paper investigates the influence of hardware on the delicate balance between model performance and fairness. We demonstrate that hardware choices can exacerbate existing disparities, attributing these discrepancies to variations in gradient flows and loss surfaces across different demographic groups. Through both theoretical and empirical analysis, the paper not only identifies the underlying factors but also proposes an effective strategy for mitigating hardware-induced performance imbalances. △ Less

Submitted 6 December, 2023; originally announced December 2023.

arXiv:2305.03866 [pdf]

Spiking neural networks with Hebbian plasticity for unsupervised representation learning

Authors: Naresh Ravichandran, Anders Lansner, Pawel Herman

Abstract: We introduce a novel spiking neural network model for learning distributed internal representations from data in an unsupervised procedure. We achieved this by transforming the non-spiking feedforward Bayesian Confidence Propagation Neural Network (BCPNN) model, employing an online correlation-based Hebbian-Bayesian learning and rewiring mechanism, shown previously to perform representation learni… ▽ More We introduce a novel spiking neural network model for learning distributed internal representations from data in an unsupervised procedure. We achieved this by transforming the non-spiking feedforward Bayesian Confidence Propagation Neural Network (BCPNN) model, employing an online correlation-based Hebbian-Bayesian learning and rewiring mechanism, shown previously to perform representation learning, into a spiking neural network with Poisson statistics and low firing rate comparable to in vivo cortical pyramidal neurons. We evaluated the representations learned by our spiking model using a linear classifier and show performance close to the non-spiking BCPNN, and competitive with other Hebbian-based spiking networks when trained on MNIST and F-MNIST machine learning benchmarks. △ Less

Submitted 10 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

arXiv:2304.11622 [pdf, other]

doi 10.1103/PhysRevB.108.155201

Dramatic Failure of the Callaway Description of Heat Flow in Boron Arsenide and Boron Antimonide Driven by Phonon Scattering Selection Rules

Authors: Nikhil Malviya, Navaneetha K. Ravichandran

Abstract: Callaway's simplified heat flow model is often used to confirm experimental realizations of unconventional, hydrodynamic and Poiseuille phonon transport in ultrahigh thermal conductivity ($κかっぱ$) materials, due to its simplicity and low computational cost. Here, we show that the Callaway model works exceptionally well for most ultrahigh-$κかっぱ$ materials like diamond and boron nitride, but fails dramatic… ▽ More Callaway's simplified heat flow model is often used to confirm experimental realizations of unconventional, hydrodynamic and Poiseuille phonon transport in ultrahigh thermal conductivity ($κかっぱ$) materials, due to its simplicity and low computational cost. Here, we show that the Callaway model works exceptionally well for most ultrahigh-$κかっぱ$ materials like diamond and boron nitride, but fails dramatically for boron arsenide (BAs) and boron antimonide (BSb). This failure is driven by the inability of the Callaway model to effectively describe the severely restricted phonon scattering in BAs and BSb, where many scattering selection rules are activated simultaneously. Our work highlights the powerful predictive capability of the Callaway model, and gives insights into the nature of phonon scattering in ultrahigh-$κかっぱ$ materials and the suitability of the Callaway's description of heat flow through them. △ Less

Submitted 1 September, 2023; v1 submitted 23 April, 2023; originally announced April 2023.

Journal ref: Phys. Rev. B 108, 155201 (2023)

arXiv:2206.15036 [pdf]

Brain-like combination of feedforward and recurrent network components achieves prototype extraction and robust pattern recognition

Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

Abstract: Associative memory has been a prominent candidate for the computation performed by the massively recurrent neocortical networks. Attractor networks implementing associative memory have offered mechanistic explanation for many cognitive phenomena. However, attractor memory models are typically trained using orthogonal or random patterns to avoid interference between memories, which makes them unfea… ▽ More Associative memory has been a prominent candidate for the computation performed by the massively recurrent neocortical networks. Attractor networks implementing associative memory have offered mechanistic explanation for many cognitive phenomena. However, attractor memory models are typically trained using orthogonal or random patterns to avoid interference between memories, which makes them unfeasible for naturally occurring complex correlated stimuli like images. We approach this problem by combining a recurrent attractor network with a feedforward network that learns distributed representations using an unsupervised Hebbian-Bayesian learning rule. The resulting network model incorporates many known biological properties: unsupervised learning, Hebbian plasticity, sparse distributed activations, sparse connectivity, columnar and laminar cortical architecture, etc. We evaluate the synergistic effects of the feedforward and recurrent network components in complex pattern recognition tasks on the MNIST handwritten digits dataset. We demonstrate that the recurrent attractor component implements associative memory when trained on the feedforward-driven internal (hidden) representations. The associative memory is also shown to perform prototype extraction from the training data and make the representations robust to severely distorted input. We argue that several aspects of the proposed integration of feedforward and recurrent computations are particularly attractive from a machine learning perspective. △ Less

Submitted 3 September, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

arXiv:2106.15546 [pdf]

Semi-supervised learning with Bayesian Confidence Propagation Neural Network

Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

Abstract: Learning internal representations from data using no or few labels is useful for machine learning research, as it allows using massive amounts of unlabeled data. In this work, we use the Bayesian Confidence Propagation Neural Network (BCPNN) model developed as a biologically plausible model of the cortex. Recent work has demonstrated that these networks can learn useful internal representations fr… ▽ More Learning internal representations from data using no or few labels is useful for machine learning research, as it allows using massive amounts of unlabeled data. In this work, we use the Bayesian Confidence Propagation Neural Network (BCPNN) model developed as a biologically plausible model of the cortex. Recent work has demonstrated that these networks can learn useful internal representations from data using local Bayesian-Hebbian learning rules. In this work, we show how such representations can be leveraged in a semi-supervised setting by introducing and comparing different classifiers. We also evaluate and compare such networks with other popular semi-supervised classifiers. △ Less

Submitted 29 June, 2021; originally announced June 2021.

arXiv:2106.05373 [pdf, other]

doi 10.1145/3468044.3468052

StreamBrain: An HPC Framework for Brain-like Neural Networks on CPUs, GPUs and FPGAs

Authors: Artur Podobas, Martin Svedin, Steven W. D. Chien, Ivy B. Peng, Naresh Balaji Ravichandran, Pawel Herman, Anders Lansner, Stefano Markidis

Abstract: The modern deep learning method based on backpropagation has surged in popularity and has been used in multiple domains and application areas. At the same time, there are other -- less-known -- machine learning algorithms with a mature and solid theoretical foundation whose performance remains unexplored. One such example is the brain-like Bayesian Confidence Propagation Neural Network (BCPNN). In… ▽ More The modern deep learning method based on backpropagation has surged in popularity and has been used in multiple domains and application areas. At the same time, there are other -- less-known -- machine learning algorithms with a mature and solid theoretical foundation whose performance remains unexplored. One such example is the brain-like Bayesian Confidence Propagation Neural Network (BCPNN). In this paper, we introduce StreamBrain -- a framework that allows neural networks based on BCPNN to be practically deployed in High-Performance Computing systems. StreamBrain is a domain-specific language (DSL), similar in concept to existing machine learning (ML) frameworks, and supports backends for CPUs, GPUs, and even FPGAs. We empirically demonstrate that StreamBrain can train the well-known ML benchmark dataset MNIST within seconds, and we are the first to demonstrate BCPNN on STL-10 size networks. We also show how StreamBrain can be used to train with custom floating-point formats and illustrate the impact of using different bfloat variations on BCPNN using FPGAs. △ Less

Submitted 9 June, 2021; originally announced June 2021.

Comments: Accepted for publication at the International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies (HEART 2021)

arXiv:2011.07661 [pdf, ps, other]

doi 10.1016/j.mlwa.2021.100112

hyper-sinh: An Accurate and Reliable Function from Shallow to Deep Learning in TensorFlow and Keras

Authors: Luca Parisi, Renfei Ma, Narrendar RaviChandran, Matteo Lanzillotta

Abstract: This paper presents the 'hyper-sinh', a variation of the m-arcsinh activation function suitable for Deep Learning (DL)-based algorithms for supervised learning, such as Convolutional Neural Networks (CNN). hyper-sinh, developed in the open source Python libraries TensorFlow and Keras, is thus described and validated as an accurate and reliable activation function for both shallow and deep neural n… ▽ More This paper presents the 'hyper-sinh', a variation of the m-arcsinh activation function suitable for Deep Learning (DL)-based algorithms for supervised learning, such as Convolutional Neural Networks (CNN). hyper-sinh, developed in the open source Python libraries TensorFlow and Keras, is thus described and validated as an accurate and reliable activation function for both shallow and deep neural networks. Improvements in accuracy and reliability in image and text classification tasks on five (N = 5) benchmark data sets available from Keras are discussed. Experimental results demonstrate the overall competitive classification performance of both shallow and deep neural networks, obtained via this novel function. This function is evaluated with respect to gold standard activation functions, demonstrating its overall competitive accuracy and reliability for both image and text classification. △ Less

Submitted 15 November, 2020; originally announced November 2020.

Comments: 19 pages, 6 listings/Python code snippets, 4 figures, 5 tables

MSC Class: 68T07; 68T10; 68T45; 68T50; 68U35 ACM Class: I.2.1; I.2.7; I.2.10; I.4.9; I.5.1; I.5.4; I.5.5

arXiv:2010.15264 [pdf, other]

How do defects limit the ultrahigh thermal conductivity of BAs? A first principles study

Authors: Mauro Fava, Nakib Haider Protik, Chunhua Li, Navaneetha Krishnan Ravichandran, Jesús Carrete, Ambroise van Roekeghem, Georg K. H. Madsen, Natalio Mingo, David Broido

Abstract: The promise enabled by BAs high thermal conductivity in power electronics cannot be assessed without taking into account the reduction incurred when doping the material. Using first principles calculations, we determine the thermal conductivity reduction induced by different group IV impurities in BAs as a function of concentration and charge state. We unveil a general trend, where neutral impurit… ▽ More The promise enabled by BAs high thermal conductivity in power electronics cannot be assessed without taking into account the reduction incurred when doping the material. Using first principles calculations, we determine the thermal conductivity reduction induced by different group IV impurities in BAs as a function of concentration and charge state. We unveil a general trend, where neutral impurities scatter phonons more strongly than the charged ones. $\text{C}_{\text{B}}$ and $\text{Ge}_{\text{As}}$ impurities show by far the weakest phonon scattering and retain BAs $κかっぱ$ values of over $\sim$ 1000 $\text{W}\cdot\text{K}^{-1}\cdot\text{m}^{-1}$ even up to high densities making them ideal n-type and p-type dopants. Furthermore, going beyond the doping compensation threshold associated to Fermi level pinning triggers observable changes in the thermal conductivity. This informs design considerations on the doping of BAs, and it also suggests a direct way to determine the onset of compensation doping in experimental samples. △ Less

Submitted 28 October, 2020; originally announced October 2020.

arXiv:2009.01464 [pdf, other]

doi 10.1038/s41467-021-23618-7

Exposing the hidden influence of selection rules on phonon-phonon scattering by pressure and temperature tuning

Authors: Navaneetha K. Ravichandran, David Broido

Abstract: Using ab initio calculations, we show that the hidden influence of selection rules on three-phonon scattering can be exposed through anomalous signatures in the pressure ($P$) and temperature ($T$) dependence of the thermal conductivities, $κかっぱ$, of certain compounds. Boron phosphide reveals such underlying behavior through an exceptionally sharp initial rise in $κかっぱ$ with increasing $P$, which may be… ▽ More Using ab initio calculations, we show that the hidden influence of selection rules on three-phonon scattering can be exposed through anomalous signatures in the pressure ($P$) and temperature ($T$) dependence of the thermal conductivities, $κかっぱ$, of certain compounds. Boron phosphide reveals such underlying behavior through an exceptionally sharp initial rise in $κかっぱ$ with increasing $P$, which may be the steepest of any material, and also a peak and decrease in $κかっぱ$ at high $P$. These features are in stark contrast to the measured behavior for many solids, and occur at experimentally accessible conditions. Similar anomalous behavior is predicted for silicon carbide and other related materials. △ Less

Submitted 3 September, 2020; originally announced September 2020.

arXiv:2005.03476 [pdf, other]

Brain-like approaches to unsupervised learning of hidden representations -- a comparative study

Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

Abstract: Unsupervised learning of hidden representations has been one of the most vibrant research directions in machine learning in recent years. In this work we study the brain-like Bayesian Confidence Propagating Neural Network (BCPNN) model, recently extended to extract sparse distributed high-dimensional representations. The usefulness and class-dependent separability of the hidden representations whe… ▽ More Unsupervised learning of hidden representations has been one of the most vibrant research directions in machine learning in recent years. In this work we study the brain-like Bayesian Confidence Propagating Neural Network (BCPNN) model, recently extended to extract sparse distributed high-dimensional representations. The usefulness and class-dependent separability of the hidden representations when trained on MNIST and Fashion-MNIST datasets is studied using an external linear classifier and compared with other unsupervised learning methods that include restricted Boltzmann machines and autoencoders. △ Less

Submitted 16 April, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

Comments: arXiv admin note: text overlap with arXiv:2003.12415

arXiv:2003.12415 [pdf]

doi 10.1109/IJCNN48605.2020.9207061

Learning representations in Bayesian Confidence Propagation neural networks

Authors: Naresh Balaji Ravichandran, Anders Lansner, Pawel Herman

Abstract: Unsupervised learning of hierarchical representations has been one of the most vibrant research directions in deep learning during recent years. In this work we study biologically inspired unsupervised strategies in neural networks based on local Hebbian learning. We propose new mechanisms to extend the Bayesian Confidence Propagating Neural Network (BCPNN) architecture, and demonstrate their capa… ▽ More Unsupervised learning of hierarchical representations has been one of the most vibrant research directions in deep learning during recent years. In this work we study biologically inspired unsupervised strategies in neural networks based on local Hebbian learning. We propose new mechanisms to extend the Bayesian Confidence Propagating Neural Network (BCPNN) architecture, and demonstrate their capability for unsupervised learning of salient hidden representations when tested on the MNIST dataset. △ Less

Submitted 27 March, 2020; originally announced March 2020.

Journal ref: 2020 International Joint Conference on Neural Networks (IJCNN)

arXiv:2003.08893 [pdf, other]

doi 10.1103/PhysRevX.10.021063

Phonon-Phonon Interactions in Strongly Bonded Solids: Selection Rules and Higher-Order Processes

Authors: Navaneetha K. Ravichandran, David Broido

Abstract: We show that the commonly used lowest-order theory of phonon-phonon interactions frequently fails to accurately describe the anharmonic phonon decay rates and thermal conductivity ($κかっぱ$), even among strongly bonded crystals. Applying a first principles theory that includes both the lowest-order three-phonon and the higher-order four-phonon processes to seventeen zinc blende semiconductors, we find… ▽ More We show that the commonly used lowest-order theory of phonon-phonon interactions frequently fails to accurately describe the anharmonic phonon decay rates and thermal conductivity ($κかっぱ$), even among strongly bonded crystals. Applying a first principles theory that includes both the lowest-order three-phonon and the higher-order four-phonon processes to seventeen zinc blende semiconductors, we find that the lowest-order theory drastically overestimates the measured $κかっぱ$ for many of these materials, while inclusion of four-phonon scattering gives significantly improved agreement with measurements. We have identified new selection rules on three-phonon processes that help explain many of these failures in terms of anomalously weak anharmonic phonon decay rates predicted by the lowest-order theory competing with four-phonon processes. We also show that zinc blende compounds containing boron (B), carbon (C) or nitrogen (N) atoms have exceptionally weak four-phonon scattering, much weaker than in compounds that do not contain B, C or N atoms. This new understanding helps explain the ultrahigh $κかっぱ$ in several technologically important materials like cubic boron arsenide, boron phosphide and silicon carbide. At the same time, it not only makes the possibility of achieving high $κかっぱ$ in materials without B, C or N atoms unlikely, but it also suggests that it may be necessary to include four-phonon processes in many future studies. Our work gives new insights into the nature of anharmonic processes in solids and demonstrates the broad importance of higher-order phonon-phonon interactions in assessing the thermal properties of materials. △ Less

Submitted 19 March, 2020; originally announced March 2020.

Comments: 18 pages, 7 figures

Journal ref: Phys. Rev. X 10, 021063 (2020)

arXiv:1612.08401 [pdf, other]

doi 10.1103/PhysRevB.95.205423

Experimental metrology to obtain thermal phonon transmission coefficients at solid interfaces

Authors: Chengyun Hua, Xiangwen Chen, Navaneetha K. Ravichandran, Austin J. Minnich

Abstract: Interfaces play an essential role in phonon-mediated heat conduction in solids, impacting applications ranging from thermoelectric waste heat recovery to heat dissipation in electronics. From the microscopic perspective, interfacial phonon transport is described by transmission coefficients that link vibrational modes in the materials composing the interface. However, direct experimental determina… ▽ More Interfaces play an essential role in phonon-mediated heat conduction in solids, impacting applications ranging from thermoelectric waste heat recovery to heat dissipation in electronics. From the microscopic perspective, interfacial phonon transport is described by transmission coefficients that link vibrational modes in the materials composing the interface. However, direct experimental determination of these coefficients is challenging because most experiments provide a mode-averaged interface conductance that obscures the microscopic detail. Here, we report a metrology to extract thermal phonon transmission coefficients at solid interfaces using ab-initio phonon transport modeling and a thermal characterization technique, time-domain thermoreflectance. In combination with transmission electron microscopy characterization of the interface, our approach allows us to link the atomic structure of an interface to the spectral content of the heat crossing it. Our work provides a useful perspective on the microscopic processes governing interfacial heat conduction. △ Less

Submitted 26 December, 2016; originally announced December 2016.

Comments: arXiv admin note: substantial text overlap with arXiv:1509.07806

Journal ref: Phys. Rev. B 95, 205423 (2017)

arXiv:1511.03312 [pdf, ps, other]

doi 10.1103/PhysRevB.93.035314

The Role of Thermalizing and Non-thermalizing Walls in Phonon Heat Conduction along Thin Films

Authors: Navaneetha K. Ravichandran, Austin J. Minnich

Abstract: Phonon boundary scattering is typically treated using the Fuchs-Sondheimer theory, which assumes that phonons are thermalized to the local temperature at the boundary. However, whether such a thermalization process actually occurs and its effect on thermal transport remains unclear. Here we examine thermal transport along thin films with both thermalizing and non-thermalizing walls by solving the… ▽ More Phonon boundary scattering is typically treated using the Fuchs-Sondheimer theory, which assumes that phonons are thermalized to the local temperature at the boundary. However, whether such a thermalization process actually occurs and its effect on thermal transport remains unclear. Here we examine thermal transport along thin films with both thermalizing and non-thermalizing walls by solving the spectral Boltzmann transport equation (BTE) for steady state and transient transport. We find that in steady state, the thermal transport is governed by the Fuchs-Sondheimer theory and is insensitive to whether the boundaries are thermalizing or not. In contrast, under transient conditions, the thermal decay rates are significantly different for thermalizing and non-thermalizing walls. We also show that, for transient transport, the thermalizing boundary condition is unphysical due to violation of heat flux conservation at the boundaries. Our results provide insights into the boundary scattering process of thermal phonons over a range of heating length scales that are useful for interpreting thermal measurements on nanostructures. △ Less

Submitted 10 November, 2015; originally announced November 2015.

arXiv:1509.07806 [pdf, other]

Fresnel transmission coefficients for thermal phonons at solid interfaces

Authors: Chengyun Hua, Xiangwen Chen, Navaneetha K. Ravichandran, Austin J. Minnich

Abstract: Interfaces play an essential role in phonon-mediated heat conduction in solids, impacting applications ranging from thermoelectric waste heat recovery to heat dissipation in electronics. From a microscopic perspective, interfacial phonon transport is described by transmission and reflection coefficients, analogous to the well-known Fresnel coefficients for light. However, these coefficients have n… ▽ More Interfaces play an essential role in phonon-mediated heat conduction in solids, impacting applications ranging from thermoelectric waste heat recovery to heat dissipation in electronics. From a microscopic perspective, interfacial phonon transport is described by transmission and reflection coefficients, analogous to the well-known Fresnel coefficients for light. However, these coefficients have never been directly measured, and thermal transport processes at interfaces remain poorly understood despite considerable effort. Here, we report the first measurements of the Fresnel transmission coefficients for thermal phonons at a metal-semiconductor interface using ab-initio phonon transport modeling and a thermal characterization technique, time-domain thermoreflectance. Our measurements show that interfaces act as thermal phonon filters that transmit primarily low frequency phonons, leading to these phonons being the dominant energy carriers across the interface despite the larger density of states of high frequency phonons. Our work realizes the long-standing goal of directly measuring thermal phonon transmission coefficients and demonstrates a general route to study microscopic processes governing interfacial heat conduction. △ Less

Submitted 25 September, 2015; originally announced September 2015.

arXiv:1403.7647 [pdf, other]

doi 10.1103/PhysRevB.89.205432

Coherent and Incoherent Thermal Transport in Nanomeshes

Authors: Navaneetha K. Ravichandran, Austin J. Minnich

Abstract: Coherent thermal transport in nanopatterned structures is a topic of considerable interest, but whether it occurs in certain structures remains unclear due to poor understanding of which phonons conduct heat. Here, we perform the first fully three-dimensional, frequency-dependent simulations of thermal transport in nanomeshes by solving the Boltzmann transport equation with a novel, efficient Mont… ▽ More Coherent thermal transport in nanopatterned structures is a topic of considerable interest, but whether it occurs in certain structures remains unclear due to poor understanding of which phonons conduct heat. Here, we perform the first fully three-dimensional, frequency-dependent simulations of thermal transport in nanomeshes by solving the Boltzmann transport equation with a novel, efficient Monte Carlo method. From the spectral information in our simulations, we show that thermal transport in nanostructures that can be created with available lithographic techniques is dominated by incoherent boundary scattering at room temperature. Our result provides important insights into the conditions required for coherent thermal transport to occur in artificial structures. △ Less

Submitted 29 March, 2014; originally announced March 2014.

Comments: 12 pages, 5 figures

Showing 1–21 of 21 results for author: RaviChandran, N