Search | arXiv e-print repository

Proof-of-concept: Using ChatGPT to Translate and Modernize an Earth System Model from Fortran to Python/JAX

Authors: Anthony Zhou, Linnia Hawkins, Pierre Gentine

Abstract: Earth system models (ESMs) are vital for understanding past, present, and future climate, but they suffer from legacy technical infrastructure. ESMs are primarily implemented in Fortran, a language that poses a high barrier of entry for early career scientists and lacks a GPU runtime, which has become essential for continued advancement as GPU power increases and CPU scaling slows. Fortran also lacks differentiability - the capacity to differentiate through numerical code - which enables hybrid models that integrate machine learning methods. Converting an ESM from Fortran to Python/JAX could resolve these issues. This work presents a semi-automated method for translating individual model components from Fortran to Python/JAX using a large language model (GPT-4). By translating the photosynthesis model from the Community Earth System Model (CESM), we demonstrate that the Python/JAX version results in up to 100x faster runtimes using GPU parallelization, and enables parameter estimation via automatic differentiation. The Python code is also easy to read and run and could be used by instructors in the classroom. This work illustrates a path towards the ultimate goal of making climate models fast, inclusive, and differentiable.

Submitted 13 February, 2024; originally announced May 2024.

arXiv:2403.02215 [pdf, other]

Joint Parameter and Parameterization Inference with Uncertainty Quantification through Differentiable Programming

Authors: Yongquan Qu, Mohamed Aziz Bhouri, Pierre Gentine

Abstract: Accurate representations of unknown and sub-grid physical processes through parameterizations (or closure) in numerical simulations with quantified uncertainty are critical for resolving the coarse-grained partial differential equations that govern many problems ranging from weather and climate prediction to turbulence simulations. Recent advances have seen machine learning (ML) increasingly applied to model these subgrid processes, resulting in the development of hybrid physics-ML models through the integration with numerical solvers. In this work, we introduce a novel framework for the joint estimation of physical parameters and machine learning parameterizations with uncertainty quantification. Our framework incorporates online training and efficient Bayesian inference within a high-dimensional parameter space, facilitated by differentiable programming. This proof of concept underscores the substantial potential of differentiable programming in synergistically combining machine learning with differential equations, thereby enhancing the capabilities of hybrid physics-ML modeling.

Submitted 6 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Comments: Accepted at ICLR 2024 Workshop on AI4Differential Equations in Science

arXiv:2402.03079 [pdf, other]

Improving Atmospheric Processes in Earth System Models with Deep Learning Ensembles and Stochastic Parameterizations

Authors: Gunnar Behrens, Tom Beucler, Fernando Iglesias-Suarez, Sungduk Yu, Pierre Gentine, Michael Pritchard, Mierk Schwabe, Veronika Eyring

Abstract: Deep learning has proven to be a valuable tool to represent subgrid processes in climate models, but most application cases have so far used idealized settings and deterministic approaches. Here, we develop ensemble and stochastic parameterizations with calibrated uncertainty quantification to learn subgrid convective and turbulent processes and surface radiative fluxes of a superparameterization embedded in an Earth System Model (ESM). We explore three methods to construct stochastic parameterizations: 1) a single Deep Neural Network (DNN) with Monte Carlo Dropout; 2) a multi-network ensemble; and 3) a Variational Encoder Decoder with latent space perturbation. We show that the multi-network ensembles improve the representation of convective processes in the planetary boundary layer compared to individual DNNs. The respective uncertainty quantification illustrates that the two latter methods are advantageous compared to a dropout-based DNN ensemble regarding the spread of convective processes. We develop a novel partial coupling strategy to sidestep issues in condensate emulation to evaluate the multi-network parameterizations in online runs coupled to the ESM. We can conduct Earth-like stable runs over more than 5 months with the ensemble approach, while such simulations using individual DNNs fail within days. Moreover, we show that our novel ensemble parameterizations improve the representation of extreme precipitation and the underlying diurnal cycle compared to a traditional parameterization, although faithfully representing the mean precipitation pattern remains challenging. Our results pave the way towards a new generation of parameterizations using machine learning with realistic uncertainty quantification that significantly improve the representation of subgrid effects.

Submitted 5 February, 2024; originally announced February 2024.

Comments: main: 34 pages, 8 figures, 1 table; supporting information: 39 pages, 23 figures, 4 tables ; submitted to Journal of Advances in Modeling Earth Systems (JAMES)

arXiv:2312.04291 [pdf, other]

Simulating the Air Quality Impact of Prescribed Fires Using Graph Neural Network-Based PM$_{2.5}$ Forecasts

Authors: Kyleen Liao, Jatan Buch, Kara Lamb, Pierre Gentine

Abstract: The increasing size and severity of wildfires across the western United States have generated dangerous levels of PM$_{2.5}$ concentrations in recent years. In a changing climate, expanding the use of prescribed fires is widely considered to be the most robust fire mitigation strategy. However, reliably forecasting the potential air quality impact from prescribed fires, which is critical in planning the prescribed fires' location and time, at hourly to daily time scales remains a challenging problem. In this paper, we introduce a spatial-temporal graph neural network (GNN) based forecasting model for hourly PM$_{2.5}$ predictions across California. Using a two-step approach, we leverage our forecasting model to estimate the PM$_{2.5}$ contribution of wildfires. Integrating the GNN-based PM$_{2.5}$ forecasting model with prescribed fire simulations, we propose a novel framework to forecast the PM$_{2.5}$ pollution of prescribed fires. This framework helps determine March as the optimal month for implementing prescribed fires in California and quantifies the potential air quality trade-offs involved in conducting more prescribed fires outside the fire season.

Submitted 23 May, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

Comments: 10 pages; multiple figures; matches version submitted to Environmental Data Science

arXiv:2311.14987 [pdf]

Reconstruction of a Long-term spatially Contiguous Solar-Induced Fluorescence (LCSIF) over 1982-2022

Authors: Jianing Fang, Xu Lian, Youngryel Ryu, Sungchan Jeong, Chongya Jiang, Pierre Gentine

Abstract: Satellite-observed solar-induced chlorophyll fluorescence (SIF) is a powerful proxy for diagnosing the photosynthetic characteristics of terrestrial ecosystems. Despite the increasing spatial and temporal resolutions of these satellite retrievals, records of SIF are primarily limited to the recent decade, impeding their application in detecting long-term dynamics of ecosystem function and structure. In this study, we leverage the two surface reflectance bands (red and near-infrared) available both from Advanced Very High-Resolution Radiometer (AVHRR, 1982-2022) and MODerate-resolution Imaging Spectroradiometer (MODIS, 2001-2022). Importantly, we calibrate and orbit-correct the AVHRR bands against their MODIS counterparts during their overlapping period. Using the long-term bias-corrected reflectance data, a neural network is then built to reproduce the Orbiting Carbon Observatory-2 SIF using AVHRR and MODIS, and used to map SIF globally over the entire 1982-2022 period. Compared with the previous MODIS-based CSIF product relying on four reflectance bands, our two-band-based product has similar skill but can be advantageously extended to the bias-corrected AVHRR period. Further comparison with three widely used vegetation indices (NDVI, kNDVI, NIRv; all based empirically on red and near-infrared bands) shows a higher or comparable correlation of LCSIF with satellite SIF and site-level GPP estimates across vegetation types, ensuring a greater capacity of LCSIF for representing terrestrial photosynthesis. Globally, LCSIF-AVHRR shows an accelerating upward trend since 1982, with an average rate of 0.0025 mW m-2 nm-1 sr-1 per decade during 1982-2000 and 0.0038 mW m-2 nm-1 sr-1 per decade during 2001-2022. Our LCSIF data provide opportunities to better understand the long-term dynamics of ecosystem photosynthesis and their underlying driving processes.

Submitted 19 June, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

arXiv:2311.03251 [pdf, other]

Interpretable multiscale Machine Learning-Based Parameterizations of Convection for ICON

Authors: Helge Heuer, Mierk Schwabe, Pierre Gentine, Marco A. Giorgetta, Veronika Eyring

Abstract: In order to improve climate projections, machine learning (ML)-based parameterizations have been developed for Earth System Models (ESMs) with the goal to better represent subgrid-scale processes or to accelerate computations by emulating existent parameterizations. These data-driven models have shown some success in approximating subgrid-scale processes. However, most studies have used a particular machine learning method to parameterize the subgrid tendencies or fluxes originating from the compound effect of various small-scale processes (e.g., turbulence, radiation, convection, gravity waves) in mostly idealized settings or from superparameterizations. Here, we use a filtering technique to explicitly separate convection from these processes in data produced by the Icosahedral Non-hydrostatic modelling framework (ICON) in a realistic setting. We use a method improved by incorporating density fluctuations for computing the subgrid fluxes and compare various different machine learning algorithms to predict these fluxes. We further examine the predictions of the best performing non-deep learning model (Gradient Boosted Tree regression) and the best deep-learning model (U-Net). We discover that the U-Net learns non-causal relations between convective precipitation and convective subgrid fluxes and develop an ablated model excluding precipitating tracer species. We connect the learned relations of the U-Net to physical processes in contrast to non-deep learning-based algorithms. The ML schemes are coupled online to the host ICON model and the non-causal links reveal weaknesses in stability and precipitation predictions. Predicted precipitation extremes of the ablated U-Net show higher accuracy over the conventional convection parameterization. Thus, our results provide a significant advance upon existing ML subgrid representation in ESMs.

Submitted 14 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

arXiv:2309.16177 [pdf, other]

Sampling Hybrid Climate Simulation at Scale to Reliably Improve Machine Learning Parameterization

Authors: Jerry Lin, Sungduk Yu, Liran Peng, Tom Beucler, Eliot Wong-Toi, Zeyuan Hu, Pierre Gentine, Margarita Geleta, Mike Pritchard

Abstract: Machine-learning (ML) parameterizations of subgrid processes (here of turbulence, convection, and radiation) may one day replace conventional parameterizations by emulating high-resolution physics without the cost of explicit simulation. However, their development has been stymied by uncertainty surrounding whether or not improved offline performance translates to improved online performance (i.e., when coupled to a large-scale general circulation model (GCM)). A key barrier has been the limited sampling of the online effects of the ML design decisions and tuning due to the complexity of performing large ensembles of hybrid physics-ML climate simulations. Our work examines the coupled behavior of full-physics ML parameterizations using large ensembles of hybrid simulations, totalling 2,970 in our case. With extensive sampling, we statistically confirm that lowering offline error lowers online error (given certain constraints). However, we also reveal that decisions decreasing online error, like removing dropout, can trade off against hybrid model stability and vice versa. Nevertheless, we are able to identify design decisions that yield unambiguous improvements to offline and online performance, namely incorporating memory and training on multiple climates. We also find that converting moisture input from specific to relative humidity enhances online stability and that using a Mean Absolute Error (MAE) loss breaks the aforementioned offline/online error relationship. By enabling rapid online experimentation at scale, we empirically answer previously unresolved questions regarding subgrid ML parameterization design.

Submitted 4 July, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

Comments: 16 pages, 4 figures

arXiv:2309.14780 [pdf]

Transferring climate change knowledge

Authors: Francesco Immorlano, Veronika Eyring, Thomas le Monnier de Gouville, Gabriele Accarino, Donatello Elia, Giovanni Aloisio, Pierre Gentine

Abstract: Accurate and precise climate projections are required for climate adaptation and mitigation, but Earth system models still exhibit great uncertainties. Several approaches have been developed to reduce the spread of climate projections and feedbacks, yet those methods cannot capture the non-linear complexity inherent in the climate system. Using a Transfer Learning approach, we show that Machine Learning can be used to optimally leverage and merge the knowledge gained from Earth system models simulations and historical observations to more accurately project global surface air temperature fields in the 21st century. We reach an uncertainty reduction of more than 50% with respect to state-of-the-art approaches. We give evidence that our novel method provides narrower projection uncertainty together with more accurate mean climate projections, urgently required for climate adaptation.

Submitted 19 June, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

arXiv:2309.10231 [pdf, other]

Multi-fidelity climate model parameterization for better generalization and extrapolation

Authors: Mohamed Aziz Bhouri, Liran Peng, Michael S. Pritchard, Pierre Gentine

Abstract: Machine-learning-based parameterizations (i.e. representation of sub-grid processes) of global climate models or turbulent simulations have recently been proposed as a powerful alternative to physical, but empirical, representations, offering a lower computational cost and higher accuracy. Yet, those approaches still suffer from a lack of generalization and extrapolation beyond the training data, which is however critical to projecting climate change or unobserved regimes of turbulence. Here we show that a multi-fidelity approach, which integrates datasets of different accuracy and abundance, can provide the best of both worlds: the capacity to extrapolate leveraging the physically-based parameterization and a higher accuracy using the machine-learning-based parameterizations. In an application to climate modeling, the multi-fidelity framework yields more accurate climate projections without requiring major increase in computational resources. Our multi-fidelity randomized prior networks (MF-RPNs) combine physical parameterization data as low-fidelity and storm-resolving historical run's data as high-fidelity. To extrapolate beyond the training data, the MF-RPNs are tested on high-fidelity warming scenarios, $+4K$, data. We show the MF-RPN's capacity to return much more skillful predictions compared to either low- or high-fidelity (historical data) simulations trained only on one regime while providing trustworthy uncertainty quantification across a wide range of scenarios. Our approach paves the way for the use of machine-learning based methods that can optimally leverage historical observations or high-fidelity simulations and extrapolate to unseen regimes such as climate change.

Submitted 18 September, 2023; originally announced September 2023.

Comments: 27 pages, 16 figures

arXiv:2306.08754 [pdf, other]

ClimSim-Online: A Large Multi-scale Dataset and Framework for Hybrid ML-physics Climate Emulation

Authors: Sungduk Yu, Zeyuan Hu, Akshay Subramaniam, Walter Hannah, Liran Peng, Jerry Lin, Mohamed Aziz Bhouri, Ritwik Gupta, Björn Lütjens, Justus C. Will, Gunnar Behrens, Julius J. M. Busecke, Nora Loose, Charles I. Stern, Tom Beucler, Bryce Harrop, Helge Heuer, Benjamin R. Hillman, Andrea Jenney, Nana Liu, Alistair White, Tian Zheng, Zhiming Kuang, Fiaz Ahmed, Elizabeth Barnes, et al. (22 additional authors not shown)

Abstract: Modern climate projections lack adequate spatial and temporal resolution due to computational constraints, leading to inaccuracies in representing critical processes like thunderstorms that occur on the sub-resolution scale. Hybrid methods combining physics with machine learning (ML) offer faster, higher fidelity climate simulations by outsourcing compute-hungry, high-resolution simulations to ML emulators. However, these hybrid ML-physics simulations require domain-specific data and workflows that have been inaccessible to many ML experts. As an extension of the ClimSim dataset (Yu et al., 2024), we present ClimSim-Online, which also includes an end-to-end workflow for developing hybrid ML-physics simulators. The ClimSim dataset includes 5.7 billion pairs of multivariate input/output vectors, capturing the influence of high-resolution, high-fidelity physics on a host climate simulator's macro-scale state. The dataset is global and spans ten years at a high sampling frequency. We provide a cross-platform, containerized pipeline to integrate ML models into operational climate simulators for hybrid testing. We also implement various ML baselines, alongside a hybrid baseline simulator, to highlight the ML challenges of building stable, skillful emulators. The data (https://huggingface.co/datasets/LEAP/ClimSim_high-res) and code (https://leap-stc.github.io/ClimSim and https://github.com/leap-stc/climsim-online) are publicly released to support the development of hybrid ML-physics and high-fidelity climate simulations.

Submitted 8 July, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

Comments: This manuscript is an expanded version of our paper that received the Outstanding Paper Award at the NeurIPS 2023 conference

arXiv:2304.12952 [pdf]

doi 10.1029/2023JD039202

Causally-informed deep learning to improve climate models and projections

Authors: Fernando Iglesias-Suarez, Pierre Gentine, Breixo Solino-Fernandez, Tom Beucler, Michael Pritchard, Jakob Runge, Veronika Eyring

Abstract: Climate models are essential to understand and project climate change, yet long-standing biases and uncertainties in their projections remain. This is largely associated with the representation of subgrid-scale processes, particularly clouds and convection. Deep learning can learn these subgrid-scale processes from computationally expensive storm-resolving models while retaining many features at a fraction of computational cost. Yet, climate simulations with embedded neural network parameterizations are still challenging and highly depend on the deep learning solution. This is likely associated with spurious non-physical correlations learned by the neural networks due to the complexity of the physical dynamical system. Here, we show that the combination of causality with deep learning helps removing spurious correlations and optimizing the neural network algorithm. To resolve this, we apply a causal discovery method to unveil causal drivers in the set of input predictors of atmospheric subgrid-scale processes of a superparameterized climate model in which deep convection is explicitly resolved. The resulting causally-informed neural networks are coupled to the climate model, hence, replacing the superparameterization and radiation scheme. We show that the climate simulations with causally-informed neural network parameterizations retain many convection-related properties and accurately generate the climate of the original high-resolution climate model, while retaining similar generalization capabilities to unseen climates compared to the non-causal approach. The combination of causal discovery and deep learning is a new and promising approach that leads to stable and more trustworthy climate simulations and paves the way towards more physically-based causal deep learning approaches also in other scientific disciplines.

Submitted 20 March, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

Journal ref: Journal of Geophysical Research: Atmospheres, 129, e2023JD039202

arXiv:2304.08063 [pdf, other]

Data-Driven Equation Discovery of a Cloud Cover Parameterization

Authors: Arthur Grundner, Tom Beucler, Pierre Gentine, Veronika Eyring

Abstract: A promising method for improving the representation of clouds in climate models, and hence climate projections, is to develop machine learning-based parameterizations using output from global storm-resolving models. While neural networks can achieve state-of-the-art performance within their training distribution, they can make unreliable predictions outside of it. Additionally, they often require post-hoc tools for interpretation. To avoid these limitations, we combine symbolic regression, sequential feature selection, and physical constraints in a hierarchical modeling framework. This framework allows us to discover new equations diagnosing cloud cover from coarse-grained variables of global storm-resolving model simulations. These analytical equations are interpretable by construction and easily transferable to other grids or climate models. Our best equation balances performance and complexity, achieving a performance comparable to that of neural networks ($R^2=0.94$) while remaining simple (with only 11 trainable parameters). It reproduces cloud cover distributions more accurately than the Xu-Randall scheme across all cloud regimes (Hellinger distances $<0.09$), and matches neural networks in condensate-rich regimes. When applied and fine-tuned to the ERA5 reanalysis, the equation exhibits superior transferability to new data compared to all other optimal cloud cover schemes. Our findings demonstrate the effectiveness of symbolic regression in discovering interpretable, physically-consistent, and nonlinear equations to parameterize cloud cover.

Submitted 19 February, 2024; v1 submitted 17 April, 2023; originally announced April 2023.

Comments: 35 pages, 10 figures, Submitted to 'Journal of Advances in Modeling Earth Systems' (JAMES)

Journal ref: Journal of Advances in Modeling Earth Systems (JAMES), 2024
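
Note on the Hellinger distance quoted in the abstract above: for two discrete distributions p and q it is commonly defined as H(p, q) = sqrt(0.5 * sum_i (sqrt(p_i) - sqrt(q_i))^2), which ranges from 0 (identical) to 1 (no overlap). The sketch below is illustrative only and is not taken from the paper; the function name `hellinger` and the choice to accept raw histogram counts over shared bins are assumptions made here.

```python
import math


def hellinger(p_counts, q_counts):
    """Hellinger distance between two histograms defined on the same bins.

    Hypothetical helper, not the authors' code: both inputs are raw,
    non-negative bin counts and are normalized to probabilities first.
    """
    if len(p_counts) != len(q_counts):
        raise ValueError("histograms must share the same bins")
    p_total = sum(p_counts)
    q_total = sum(q_counts)
    if p_total <= 0 or q_total <= 0:
        raise ValueError("each histogram needs a positive total count")
    # Sum of squared differences between square-rooted probabilities.
    squared = sum(
        (math.sqrt(p / p_total) - math.sqrt(q / q_total)) ** 2
        for p, q in zip(p_counts, q_counts)
    )
    return math.sqrt(0.5 * squared)
```

Under this definition, a value below 0.09 indicates that the two cloud cover distributions overlap almost completely, since the distance can never exceed 1.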

arXiv:2301.04027 [pdf]

doi 10.1038/s43017-023-00450-9

Differentiable modeling to unify machine learning and physical models and advance Geosciences

Authors: Chaopeng Shen, Alison P. Appling, Pierre Gentine, Toshiyuki Bandai, Hoshin Gupta, Alexandre Tartakovsky, Marco Baity-Jesi, Fabrizio Fenicia, Daniel Kifer, Li Li, Xiaofeng Liu, Wei Ren, Yi Zheng, Ciaran J. Harman, Martyn Clark, Matthew Farthing, Dapeng Feng, Praveen Kumar, Doaa Aboelyazeed, Farshid Rahmani, Hylke E. Beck, Tadd Bindas, Dipankar Dwivedi, Kuai Fang, Marvin Höge, et al. (5 additional authors not shown)

Abstract: Process-Based Modeling (PBM) and Machine Learning (ML) are often perceived as distinct paradigms in the geosciences. Here we present differentiable geoscientific modeling as a powerful pathway toward dissolving the perceived barrier between them and ushering in a paradigm shift. For decades, PBM offered benefits in interpretability and physical consistency but struggled to efficiently leverage large datasets. ML methods, especially deep networks, presented strong predictive skills yet lacked the ability to answer specific scientific questions. While various methods have been proposed for ML-physics integration, an important underlying theme -- differentiable modeling -- is not sufficiently recognized. Here we outline the concepts, applicability, and significance of differentiable geoscientific modeling (DG). "Differentiable" refers to accurately and efficiently calculating gradients with respect to model variables, critically enabling the learning of high-dimensional unknown relationships. DG refers to a range of methods connecting varying amounts of prior knowledge to neural networks and training them together, capturing a different scope than physics-guided machine learning and emphasizing first principles. Preliminary evidence suggests DG offers better interpretability and causality than ML, improved generalizability and extrapolation capability, and strong potential for knowledge discovery, while approaching the performance of purely data-driven ML. DG models require less training data while scaling favorably in performance and efficiency with increasing amounts of data. With DG, geoscientists may be better able to frame and investigate questions, test hypotheses, and discover unrecognized linkages.

Submitted 26 December, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Journal ref: Nat Rev Earth Environ 4, 552-567 (2023)

arXiv:2210.14488 [pdf, other]

History-Based, Bayesian, Closure for Stochastic Parameterization: Application to Lorenz '96

Authors: Mohamed Aziz Bhouri, Pierre Gentine

Abstract: Physical parameterizations are used as representations of unresolved subgrid processes within weather and global climate models or coarse-scale turbulent models, whose resolutions are too coarse to resolve small-scale processes. These parameterizations are typically grounded on physically-based, yet empirical, representations of the underlying small-scale processes. Machine learning-based parameterizations have recently been proposed as an alternative and have shown great promises to reduce uncertainties associated with small-scale processes. Yet, those approaches still show some important mismatches that are often attributed to stochasticity in the considered process. This stochasticity can be due to noisy data, unresolved variables or simply to the inherent chaotic nature of the process. To address these issues, we develop a new type of parameterization (closure) which is based on a Bayesian formalism for neural networks, to account for uncertainty quantification, and includes memory, to account for the non-instantaneous response of the closure. To overcome the curse of dimensionality of Bayesian techniques in high-dimensional spaces, the Bayesian strategy is based on a Hamiltonian Monte Carlo Markov Chain sampling strategy that takes advantage of the likelihood function and kinetic energy's gradients with respect to the parameters to accelerate the sampling process. We apply the proposed Bayesian history-based parameterization to the Lorenz '96 model in the presence of noisy and sparse data, similar to satellite observations, and show its capacity to predict skillful forecasts of the resolved variables while returning trustworthy uncertainty quantifications for different sources of error. This approach paves the way for the use of Bayesian approaches for closure problems.

Submitted 26 October, 2022; originally announced October 2022.

arXiv:2209.06086 [pdf]

Carbon Monitor-Power: near-real-time monitoring of global power generation on hourly to daily scales

Authors: Biqing Zhu, Xuanren Song, Zhu Deng, Wenli Zhao, Da Huo, Taochun Sun, Piyu Ke, Duo Cui, Chenxi Lu, Haiwang Zhong, Chaopeng Hong, Jian Qiu, Steven J. Davis, Pierre Gentine, Philippe Ciais, Zhu Liu

Abstract: We constructed a frequently updated, near-real-time global power generation dataset, Carbon Monitor-Power, covering the period since January 2016 at national levels with near-global coverage and hourly-to-daily time resolution. The data presented here are collected from 37 countries across all continents for eight source groups, including three types of fossil sources (coal, gas, and oil), nuclear energy and four groups of renewable energy sources (solar energy, wind energy, hydro energy and other renewables including biomass, geothermal, etc.). The global near-real-time power dataset shows the dynamics of the global power system, including its hourly, daily, weekly and seasonal patterns as influenced by daily periodical activities, weekends, seasonal cycles, regular and irregular events (e.g., holidays) and extreme events (e.g., the COVID-19 pandemic). The Carbon Monitor-Power dataset reveals that the COVID-19 pandemic caused strong disruptions in some countries (e.g., China and India), leading to a temporary or long-lasting shift to low carbon intensity, while it had little impact in some other countries (e.g., Australia). This dataset offers a large range of opportunities for power-related scientific research and policy-making.

Submitted 13 September, 2022; originally announced September 2022.

arXiv:2208.11843 [pdf, other]

Comparing Storm Resolving Models and Climates via Unsupervised Machine Learning

Authors: Griffin Mooers, Mike Pritchard, Tom Beucler, Prakhar Srivastava, Harshini Mangipudi, Liran Peng, Pierre Gentine, Stephan Mandt

Abstract: Global Storm-Resolving Models (GSRMs) have gained widespread interest because of the unprecedented detail with which they resolve the global climate. However, it remains difficult to quantify objective differences in how GSRMs resolve complex atmospheric formations. This lack of comprehensive tools for comparing model similarities is a problem in many disparate fields that involve simulation tools for complex data. To address this challenge we develop methods to estimate distributional distances based on both nonlinear dimensionality reduction and vector quantization. Our approach automatically learns physically meaningful notions of similarity from low-dimensional latent data representations that the different models produce. This enables an intercomparison of nine GSRMs based on their high-dimensional simulation data (2D vertical velocity snapshots) and reveals that only six are similar in their representation of atmospheric dynamics. Furthermore, we uncover signatures of the convective response to global warming in a fully unsupervised way. Our study provides a path toward evaluating future high-resolution simulation data more objectively.

Submitted 2 December, 2023; v1 submitted 24 August, 2022; originally announced August 2022.

Comments: 26 pages, 21 figures. In revision at Scientific Reports

arXiv:2205.00743 [pdf, other]

doi 10.1109/TGRS.2023.3237008

Machine-learned cloud classes from satellite data for process-oriented climate model evaluation

Authors: A. Kaps, A. Lauer, G. Camps-Valls, P. Gentine, L. Gómez-Chova, V. Eyring

Abstract: Clouds play a key role in regulating climate change but are difficult to simulate within Earth system models (ESMs). Improving the representation of clouds is one of the key tasks towards more robust climate change projections. This study introduces a new machine-learning based framework relying on satellite observations to improve understanding of the representation of clouds and their relevant processes in climate models. The proposed method is capable of assigning distributions of established cloud types to coarse data. It facilitates a more objective evaluation of clouds in ESMs and improves the consistency of cloud process analysis. The method is built on satellite data from the MODIS instrument labelled by deep neural networks with cloud types defined by the World Meteorological Organization (WMO), using cloud type labels from CloudSat as ground truth. The method is applicable to datasets with information about physical cloud variables comparable to MODIS satellite data and at sufficiently high temporal resolution. We apply the method to alternative satellite data from the Cloud_cci project (ESA Climate Change Initiative), coarse-grained to typical resolutions of climate models. The resulting cloud type distributions are physically consistent and the horizontal resolutions typical of ESMs are sufficient to apply our method. We recommend outputting crucial variables required by our method for future ESM data evaluation. This will enable the use of labelled satellite data for a more systematic evaluation of clouds in climate models.

Submitted 28 October, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

Comments: Main Paper 16 pages, 11 figures. Supporting material 7 Pages, 8 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2204.08708 [pdf, other]

doi 10.1029/2022MS003130

Non-Linear Dimensionality Reduction with a Variational Encoder Decoder to Understand Convective Processes in Climate Models

Authors: Gunnar Behrens, Tom Beucler, Pierre Gentine, Fernando Iglesias-Suarez, Michael Pritchard, Veronika Eyring

Abstract: Deep learning can accurately represent sub-grid-scale convective processes in climate models, learning from high resolution simulations. However, deep learning methods usually lack interpretability due to large internal dimensionality, resulting in reduced trustworthiness in these methods. Here, we use Variational Encoder Decoder structures (VED), a non-linear dimensionality reduction technique, to learn and understand convective processes in an aquaplanet superparameterized climate model simulation, where deep convective processes are simulated explicitly. We show that similar to previous deep learning studies based on feed-forward neural nets, the VED is capable of learning and accurately reproducing convective processes. In contrast to past work, we show this can be achieved by compressing the original information into only five latent nodes. As a result, the VED can be used to understand convective processes and delineate modes of convection through the exploration of its latent dimensions. A close investigation of the latent space enables the identification of different convective regimes: a) stable conditions are clearly distinguished from deep convection with low outgoing longwave radiation and strong precipitation; b) high optically thin cirrus-like clouds are separated from low optically thick cumulus clouds; and c) shallow convective processes are associated with large-scale moisture content and surface diabatic heating.
Our results demonstrate that VEDs can accurately represent convective processes in climate models, while enabling interpretability and better understanding of sub-grid-scale physical processes, paving the way to increasingly interpretable machine learning parameterizations with promising generative properties.

Submitted 26 July, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

Comments: main paper: 30 pages, 11 figures; supporting information: 37 pages, 19 figures, 11 tables; Submitted to 'Journal of Advances in Modeling Earth Systems' (JAMES)

arXiv:2112.11317 [pdf, other]

doi 10.1029/2021MS002959

Deep Learning Based Cloud Cover Parameterization for ICON

Authors: Arthur Grundner, Tom Beucler, Pierre Gentine, Fernando Iglesias-Suarez, Marco A. Giorgetta, Veronika Eyring

Abstract: A promising approach to improve cloud parameterizations within climate models and thus climate projections is to use deep learning in combination with training data from storm-resolving model (SRM) simulations. The ICOsahedral Non-hydrostatic (ICON) modeling framework permits simulations ranging from numerical weather prediction to climate projections, making it an ideal target to develop neural network (NN) based parameterizations for sub-grid scale processes. Within the ICON framework, we train NN based cloud cover parameterizations with coarse-grained data based on realistic regional and global ICON SRM simulations. We set up three different types of NNs that differ in the degree of vertical locality they assume for diagnosing cloud cover from coarse-grained atmospheric state variables. The NNs accurately estimate sub-grid scale cloud cover from coarse-grained data that has similar geographical characteristics as their training data. Additionally, globally trained NNs can reproduce sub-grid scale cloud cover of the regional SRM simulation. Using the game-theory based interpretability library SHapley Additive exPlanations, we identify an overemphasis on specific humidity and cloud ice as the reason why our column-based NN cannot perfectly generalize from the global to the regional coarse-grained SRM data.
The interpretability tool also helps visualize similarities and differences in feature importance between regionally and globally trained column-based NNs, and reveals a local relationship between their cloud cover predictions and the thermodynamic environment. Our results show the potential of deep learning to derive accurate yet interpretable cloud cover parameterizations from global SRMs, and suggest that neighborhood-based models may be a good compromise between accuracy and generalizability.

Submitted 6 December, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

Comments: 42 pages, 17 figures, Submitted to 'Journal of Advances in Modeling Earth Systems' (JAMES)

Journal ref: Journal of Advances in Modeling Earth Systems (JAMES), 2022

arXiv:2112.08440 [pdf, other]

Climate-Invariant Machine Learning

Authors: Tom Beucler, Pierre Gentine, Janni Yuval, Ankitesh Gupta, Liran Peng, Jerry Lin, Sungduk Yu, Stephan Rasp, Fiaz Ahmed, Paul A. O'Gorman, J. David Neelin, Nicholas J. Lutsko, Michael Pritchard

Abstract: Projecting climate change is a generalization problem: we extrapolate the recent past using physical models across past, present, and future climates. Current climate models require representations of processes that occur at scales smaller than model grid size, which have been the main source of model projection uncertainty. Recent machine learning (ML) algorithms hold promise to improve such process representations, but tend to extrapolate poorly to climate regimes they were not trained on. To get the best of the physical and statistical worlds, we propose a new framework - termed "climate-invariant" ML - incorporating knowledge of climate processes into ML algorithms, and show that it can maintain high offline accuracy across a wide range of climate conditions and configurations in three distinct atmospheric models. Our results suggest that explicitly incorporating physical knowledge into data-driven models of Earth system processes can improve their consistency, data efficiency, and generalizability across climate regimes.

Submitted 17 January, 2024; v1 submitted 14 December, 2021; originally announced December 2021.

Comments: 26+28 pages, 9+15 figures, 0+3 tables in the main text + supplementary materials. Accepted for publication in Science Advances on Jan 5, 2024

arXiv:2107.10197 [pdf, other]

Zero-Shot Learning of Aerosol Optical Properties with Graph Neural Networks

Authors: Kara D. Lamb, Pierre Gentine

Abstract: Aerosols sourced from combustion such as black carbon (BC) are important short-lived climate forcers whose direct radiative forcing and atmospheric lifetime depend on their morphology. These aerosols' complex morphology makes modeling their optical properties difficult, contributing to uncertainty in both their direct and indirect climate effects. Accurate and fast calculations of BC optical properties are needed for remote sensing inversions and for radiative forcing calculations in atmospheric models, but current methods to accurately calculate the optical properties of these aerosols are computationally expensive and are compiled in extensive databases off-line to be used as a look-up table. Recent advances in machine learning approaches have shown the potential of graph neural networks (GNNs) for various physical science applications, demonstrating skill in generalizing beyond initial training data by learning internal properties and small-scale interactions defining the emergent behavior of the larger system. Here we demonstrate that a GNN trained to predict the optical properties of numerically-generated BC fractal aggregates can accurately generalize to arbitrarily shaped particles, even over much larger (10x) aggregates than in the training dataset, providing a fast and accurate method to calculate aerosol optical properties in models and for observational retrievals.
This zero-shot learning approach could be integrated into atmospheric models or remote sensing inversions to predict the physical properties of realistically-shaped aerosol and cloud particles. In addition, GNNs can be used to gain physical intuition on the relationship between small-scale interactions (here of the spheres' positions and interactions) and large-scale properties (here of the radiative properties of aerosols).

Submitted 21 July, 2021; originally announced July 2021.

Comments: 8 pages, 4 figures, Supplementary Information

arXiv:2107.08586 [pdf]

doi 10.1016/j.xinn.2021.100182

Global Gridded Daily CO$_2$ Emissions

Authors: Xinyu Dou, Yilong Wang, Philippe Ciais, Frédéric Chevallier, Steven J. Davis, Monica Crippa, Greet Janssens-Maenhout, Diego Guizzardi, Efisio Solazzo, Feifan Yan, Da Huo, Zheng Bo, Zhu Deng, Biqing Zhu, Hengqi Wang, Qiang Zhang, Pierre Gentine, Zhu Liu

Abstract: Precise and high-resolution carbon dioxide (CO$_2$) emission data is of great importance for achieving carbon neutrality around the world. Here we present for the first time the near-real-time Global Gridded Daily CO$_2$ Emission Datasets (called GRACED) from fossil fuel and cement production with a global spatial resolution of 0.1$^\circ$ by 0.1$^\circ$ and a temporal resolution of 1 day. Gridded fossil emissions are computed for different sectors based on the daily national CO$_2$ emissions from the near-real-time dataset (Carbon Monitor), the spatial patterns of the point source emission dataset Global Carbon Grid (GID), the Emission Database for Global Atmospheric Research (EDGAR) and spatiotemporal patterns of satellite nitrogen dioxide (NO$_2$) retrievals. Our study on the global CO$_2$ emissions responds to the growing and urgent need for high-quality, fine-grained near-real-time CO$_2$ emissions estimates to support global emissions monitoring across various spatial scales. We show the spatial patterns of emission changes for power, industry, residential consumption, ground transportation, domestic and international aviation, and international shipping sectors between 2019 and 2020. This helps us give insights into the relative contributions of various sectors and provides a faster and more fine-grained overview of where and when fossil CO$_2$ emissions have decreased and rebounded in response to emergencies (e.g. COVID-19) and other disturbances of human activities than any previously published dataset.
As the world recovers from the pandemic and decarbonizes its energy systems, regular updates of this dataset will allow policymakers to more closely monitor the effectiveness of climate and energy policies and quickly adapt.

Submitted 18 July, 2021; originally announced July 2021.

arXiv:2105.00912 [pdf, other]

Causal inference for process understanding in Earth sciences

Authors: Adam Massmann, Pierre Gentine, Jakob Runge

Abstract: There is growing interest in the study of causal methods in the Earth sciences. However, most applications have focused on causal discovery, i.e. inferring the causal relationships and causal structure from data. This paper instead examines causality through the lens of causal inference and how expert-defined causal graphs, a fundamental from causal theory, can be used to clarify assumptions, identify tractable problems, and aid interpretation of results and their causality in Earth science research. We apply causal theory to generic graphs of the Earth system to identify where causal inference may be most tractable and useful to address problems in Earth Science, and avoid potentially incorrect conclusions. Specifically, causal inference may be useful when: (1) the effect of interest is only causally affected by the observed portion of the state space; or: (2) the cause of interest can be assumed to be independent of the evolution of the system's state; or: (3) the state space of the system is reconstructable from lagged observations of the system. However, we also highlight through examples how causal graphs can be used to explicitly define and communicate assumptions and hypotheses, and help to structure analyses, even if causal inference is ultimately challenging given the data availability, limitations and uncertainties.

Submitted 3 May, 2021; originally announced May 2021.

arXiv:2104.06904 [pdf]

Unprecedented decarbonization of China's power system in the post-COVID era

Authors: Biqing Zhu, Rui Guo, Zhu Deng, Wenli Zhao, Piyu Ke, Xinyu Dou, Steven J. Davis, Philippe Ciais, Pierre Gentine, Zhu Liu

Abstract: In October of 2020, China announced that it aims to start reducing its carbon dioxide (CO2) emissions before 2030 and achieve carbon neutrality before 2060. The surprise announcement came in the midst of the COVID-19 pandemic which caused a transient drop in China's emissions in the first half of 2020. Here, we show an unprecedented de-carbonization of China's power system in late 2020: although China's power related carbon emissions were 0.5% higher in 2020 than 2019, the majority (92.9%) of the increased power demand was met by increases in low-carbon (renewables and nuclear) generation (increased by 9.3%), as compared to only 0.4% increase for fossil fuels. China's low-carbon generation grew in the second half of 2020, supplying a record high of 36.7% (increased by 1.9% compared to 2019) of total electricity in 2020, when the fossil production dropped to a historical low of 63.3%. Combined, the carbon intensity of China's power sector decreased to an historical low of 519.9 tCO2/GWh in 2020. If the fast decarbonization and slowed down power demand growth from 2019 to 2020 were to continue, by 2030, over half (50.8%) of China's power demand could be provided by low carbon sources. Our results thus reveal that China made progress towards its carbon neutrality target during the pandemic, and suggest the potential for substantial further decarbonization in the next few years if the latest trends persist.

Submitted 14 April, 2021; originally announced April 2021.

arXiv:2103.02526 [pdf]

doi 10.1038/s41561-022-00965-8

Global Daily CO$_2$ emissions for the year 2020

Authors: Zhu Liu, Zhu Deng, Philippe Ciais, Jianguang Tan, Biqing Zhu, Steven J. Davis, Robbie Andrew, Olivier Boucher, Simon Ben Arous, Pep Canadel, Xinyu Dou, Pierre Friedlingstein, Pierre Gentine, Rui Guo, Chaopeng Hong, Robert B. Jackson, Daniel M. Kammen, Piyu Ke, Corinne Le Quere, Crippa Monica, Greet Janssens-Maenhout, Glen Peters, Katsumasa Tanaka, Yilong Wang, Bo Zheng, et al. (3 additional authors not shown)

Abstract: The diurnal cycle CO$_2$ emissions from fossil fuel combustion and cement production reflect seasonality, weather conditions, working days, and more recently the impact of the COVID-19 pandemic. Here, for the first time we provide a daily CO$_2$ emission dataset for the whole year of 2020 calculated from inventory and near-real-time activity data (called Carbon Monitor project: https://carbonmonitor.org). It was previously suggested from preliminary estimates that did not cover the entire year of 2020 that the pandemic may have caused more than 8% annual decline of global CO$_2$ emissions. Here we show from detailed estimates of the full year data that the global reduction was only 5.4% (-1,901 MtCO$_2$). This decrease is 5 times larger than the annual emission drop at the peak of the 2008 Global Financial Crisis. However, global CO$_2$ emissions gradually recovered towards 2019 levels from late April with global partial re-opening. More importantly, global CO$_2$ emissions even increased slightly by +0.9% in December 2020 compared with 2019, indicating a rebound trend in global emissions. Later waves of COVID-19 infections in late 2020 and corresponding lockdowns have caused further CO$_2$ emissions reductions particularly in western countries, but to a much smaller extent than the declines in the first wave. That even substantial world-wide lockdowns of activity led to a one-time decline in global CO$_2$ emissions of only 5.4% in one year highlights the significant challenges for climate change mitigation that we face in the post-COVID era.
These declines are significant, but will be quickly overtaken with new emissions unless the COVID-19 crisis is utilized as a break-point with our fossil-fuel trajectory, notably through policies that make the COVID-19 recovery an opportunity to green national energy and development plans.

Submitted 3 March, 2021; originally announced March 2021.

arXiv:2102.03240 [pdf]

De-carbonization of global energy use during the COVID-19 pandemic

Authors: Zhu Liu, Biqing Zhu, Philippe Ciais, Steven J. Davis, Chenxi Lu, Haiwang Zhong, Piyu Ke, Yanan Cui, Zhu Deng, Duo Cui, Taochun Sun, Xinyu Dou, Jianguang Tan, Rui Guo, Bo Zheng, Katsumasa Tanaka, Wenli Zhao, Pierre Gentine

Abstract: The COVID-19 pandemic has disrupted human activities, leading to unprecedented decreases in both global energy demand and GHG emissions. Yet it is little known that there was also a low-carbon shift of the global energy system in 2020. Here, using the near-real-time data on energy-related GHG emissions from 30 countries (about 70% of global power generation), we show that the pandemic caused an unprecedented de-carbonization of the global power system, represented by a dramatic decrease in the carbon intensity of the power sector that reached a historical low of 414.9 tCO2eq/GWh in 2020. Moreover, the share of energy derived from renewable and low-carbon sources (nuclear, hydro-energy, wind, solar, geothermal, and biomass) exceeded that from coal and oil for the first time in history in May of 2020. The decrease in global net energy demand (-1.3% in the first half of 2020 relative to the average of the period in 2016-2019) masks a large down-regulation of fossil-fuel-burning power plants supply (-6.1%) coincident with a surge of low-carbon sources (+6.2%). Concomitant changes in the diurnal cycle of electricity demand also favored low-carbon generators, including a flattening of the morning ramp, a lower midday peak, and delays in both the morning and midday load peaks in most countries. However, emission intensities in the power sector have since rebounded in many countries, and a key question for climate mitigation is thus to what extent countries can achieve and maintain lower, pandemic-level carbon intensities of electricity as part of a green recovery.

Submitted 5 February, 2021; originally announced February 2021.

arXiv:2101.06450 [pdf]

Transportation CO$_2$ emissions stayed high despite recurrent COVID outbreaks

Authors: Yilong Wang, Zhu Deng, Philippe Ciais, Zhu Liu, Steven J. Davis, Pierre Gentine, Thomas Lauvaux, Quansheng Ge

Abstract: After steep drops and then rebounds in transportation-related CO$_2$ emissions over the first half of 2020, a second wave of COVID-19 this fall has caused further -- but less substantial -- emissions reductions. Here, we use near-real-time estimates of daily emissions to explore differences in human behavior and restriction policies over the course of 2020.

Submitted 16 January, 2021; originally announced January 2021.

arXiv:2010.12996 [pdf, other]

doi 10.1029/2020MS002385

Assessing the Potential of Deep Learning for Emulating Cloud Superparameterization in Climate Models with Real-Geography Boundary Conditions

Authors: Griffin Mooers, Mike Pritchard, Tom Beucler, Jordan Ott, Galen Yacalis, Pierre Baldi, Pierre Gentine

Abstract: We explore the potential of feed-forward deep neural networks (DNNs) for emulating cloud superparameterization in realistic geography, using offline fits to data from the Super Parameterized Community Atmospheric Model. To identify the network architecture of greatest skill, we formally optimize hyperparameters using ~250 trials. Our DNN explains over 70 percent of the temporal variance at the 15-minute sampling scale throughout the mid-to-upper troposphere. Autocorrelation timescale analysis compared against DNN skill suggests the less good fit in the tropical, marine boundary layer is driven by neural network difficulty emulating fast, stochastic signals in convection. However, spectral analysis in the temporal domain indicates skillful emulation of signals on diurnal to synoptic scales. A close look at the diurnal cycle reveals correct emulation of land-sea contrasts and vertical structure in the heating and moistening fields, but some distortion of precipitation. Sensitivity tests targeting precipitation skill reveal complementary effects of adding positive constraints vs. hyperparameter tuning, motivating the use of both in the future. A first attempt to force an offline land model with DNN emulated atmospheric fields produces reassuring results further supporting neural network emulation viability in real-geography settings. Overall, the fit skill is competitive with recent attempts by sophisticated Residual and Convolutional Neural Network architectures trained on added information, including memory of past states.
Our results confirm the parameterizability of superparameterized convection with continents through machine learning and we highlight advantages of casting this problem locally in space and time for accurate emulation and hopefully quick implementation of hybrid climate models.

Submitted 20 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

Comments: 32 Pages, 13 Figures, Revised Version Submitted to Journal of Advances in Modeling Earth Systems April 2021

arXiv:2002.09776 [pdf]

doi 10.1103/PhysRevFluids.6.034606

On the logarithmic profile of temperature in the atmospheric convective boundary layers

Authors: Yu Cheng, Qi Li, Pierre Gentine

Abstract: Wall-bounded turbulent flows are widely observed in natural and engineering systems, such as air flows near the Earth's surface, water flows in rivers, and flows around a car or a plane. The universal logarithmic velocity profile in wall-bounded turbulent flows proposed by von Kármán in 1930 is one of the few exact physical descriptions of turbulence. However, the mean velocity and temperature profiles cannot be adequately described by this universal log law when buoyancy effects are present. Monin-Obukhov similarity theory (MOST), proposed in 1954, has been the cornerstone theory to account for these buoyancy effects and to describe the atmospheric boundary layer. MOST has been used in almost all global weather, climate and hydrological models to describe the dependence of the mean velocity, temperature and scalar profiles on buoyancy. According to MOST, the logarithmic temperature profile breaks down as buoyancy effects become important. In contrast, here we show that this long-standing MOST theory does not apply for temperature. We propose a new theory for the logarithmic profile of near-wall temperature, which corrects MOST pitfalls and is supported by both high-resolution direct numerical simulations and field observations of the convective atmospheric boundary layer. Buoyancy effects do not modify the logarithmic nature but instead modulate the slope of the temperature profile compared to the universal von Kármán slope. The new formulation has widespread applications such as in climate models, where the proposed new temperature log law should lead to more realistic continental surface temperatures, which are strongly impacted by buoyancy.

Submitted 22 February, 2020; originally announced February 2020.

Comments: 18 pages, 11 figures

Journal ref: Phys. Rev. Fluids 6, 034606 (2021)

arXiv:2002.08525 [pdf, other]

Towards Physically-consistent, Data-driven Models of Convection

Authors: Tom Beucler, Michael Pritchard, Pierre Gentine, Stephan Rasp

Abstract: Data-driven algorithms, in particular neural networks, can emulate the effect of sub-grid scale processes in coarse-resolution climate models if trained on high-resolution climate simulations. However, they may violate key physical constraints and lack the ability to generalize outside of their training set. Here, we show that physical constraints can be enforced in neural networks, either approximately by adapting the loss function or to within machine precision by adapting the architecture. As these physical constraints are insufficient to guarantee generalizability, we additionally propose to physically rescale the training and validation data to improve the ability of neural networks to generalize to unseen climates.

Submitted 17 April, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

Comments: Accepted for oral presentation at the 2020 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). 5 pages, 5 figures, 1 table

arXiv:1910.12125 [pdf]

Deep learning for subgrid-scale turbulence modeling in large-eddy simulations of the atmospheric boundary layer

Authors: Yu Cheng, Marco Giometto, Pit Kauffmann, Ling Lin, Chen Cao, Cody Zupnick, Harold Li, Qi Li, Ryan Abernathey, Pierre Gentine

Abstract: In large-eddy simulations, subgrid-scale (SGS) processes are parameterized as a function of filtered grid-scale variables. First-order, algebraic SGS models are based on the eddy-viscosity assumption, which does not always hold for turbulence. Here we apply supervised deep neural networks (DNNs) to learn SGS stresses from a set of neighboring coarse-grained velocity from direct numerical simulations (DNSs) of the atmospheric boundary layer at friction Reynolds numbers Re_τ up to 1243 without invoking the eddy-viscosity assumption. The DNN model was found to produce higher correlation of SGS stresses compared to the Smagorinsky model and the Smagorinsky-Bardina mixed model in the surface and mixed layers and can be applied to different grid resolutions and various stability conditions ranging from near neutral to very unstable. The additional information on potential temperature and pressure was found not to be useful for SGS modeling. Deep learning thus demonstrates great potential for LESs of geophysical turbulence.

Submitted 26 October, 2019; originally announced October 2019.

Comments: 33 pages, 11 figures, 3 tables

arXiv:1909.00912 [pdf, other]

doi 10.1103/PhysRevLett.126.098302

Enforcing Analytic Constraints in Neural-Networks Emulating Physical Systems

Authors: Tom Beucler, Michael Pritchard, Stephan Rasp, Jordan Ott, Pierre Baldi, Pierre Gentine

Abstract: Neural networks can emulate nonlinear physical systems with high accuracy, yet they may produce physically-inconsistent results when violating fundamental constraints. Here, we introduce a systematic way of enforcing nonlinear analytic constraints in neural networks via constraints in the architecture or the loss function. Applied to convective processes for climate modeling, architectural constraints enforce conservation laws to within machine precision without degrading performance. Enforcing constraints also reduces errors in the subsets of the outputs most impacted by the constraints.

Submitted 27 January, 2021; v1 submitted 2 September, 2019; originally announced September 2019.

Comments: 21 pages, 11 figures, 9 tables. Submitted to Physical Review Letters

Journal ref: Phys. Rev. Lett. 126, 098302 (2021)

arXiv:1906.06786 [pdf, other]

Recovering the parameters underlying the Lorenz-96 chaotic dynamics

Authors: Soukayna Mouatadid, Pierre Gentine, Wei Yu, Steve Easterbrook

Abstract: Climate projections suffer from uncertain equilibrium climate sensitivity. The reason behind this uncertainty is the resolution of global climate models, which is too coarse to resolve key processes such as clouds and convection. These processes are approximated using heuristics in a process called parameterization. The selection of these parameters can be subjective, leading to significant uncertainties in the way clouds are represented in global climate models. Here, we explore three deep network algorithms to infer these parameters in an objective and data-driven way. We compare the performance of a fully-connected network, a one-dimensional convolutional network, and a two-dimensional convolutional network in recovering the underlying parameters of the Lorenz-96 model, a non-linear dynamical system that has similar behavior to the climate system.

Submitted 16 June, 2019; originally announced June 2019.

Comments: ICML 2019 workshop on climate change

arXiv:1906.06622 [pdf, other]

Achieving Conservation of Energy in Neural Network Emulators for Climate Modeling

Authors: Tom Beucler, Stephan Rasp, Michael Pritchard, Pierre Gentine

Abstract: Artificial neural-networks have the potential to emulate cloud processes with higher accuracy than the semi-empirical emulators currently used in climate models. However, neural-network models do not intrinsically conserve energy and mass, which is an obstacle to using them for long-term climate predictions. Here, we propose two methods to enforce linear conservation laws in neural-network emulators of physical models: Constraining (1) the loss function or (2) the architecture of the network itself. Applied to the emulation of explicitly-resolved cloud processes in a prototype multi-scale climate model, we show that architecture constraints can enforce conservation laws to satisfactory numerical precision, while all constraints help the neural-network better generalize to conditions outside of its training set, such as global warming.

Submitted 15 June, 2019; originally announced June 2019.

Comments: ICML 2019 Workshop. Climate Change: How Can AI Help? 3 pages, 3 figures, 1 table

arXiv:1811.09608 [pdf]

On the Power-law Scaling of Turbulence Cospectra Part 1: Stably Stratified Atmospheric Boundary Layer

Authors: Yu Cheng, Qi Li, Andrey Grachev, Stefania Argentini, Harindra J. S. Fernando, Pierre Gentine

Abstract: Turbulent fluxes in the atmospheric surface layer are key input for the prediction of weather, hydrology, and carbon dioxide concentration. In numerical modelling of turbulent fluxes, a -7/3 power-law scaling in turbulence cospectra is usually assumed at high wavenumbers. In eddy-covariance (EC) measurements of turbulent fluxes, an assumed shape of turbulence cospectra is typically required for high-frequency spectral corrections, typically assuming a -7/3 power law. The derivation of -7/3 power-law scaling is based primarily on dimensional analysis, and other cospectral scaling has also been observed. Here we examine the shape of turbulence cospectra at high wavenumbers from extensive field measurements of wind velocity, temperature, water vapour and CO2 concentrations in various stably stratified atmospheric conditions. We propose a turbulence cospectral shape with -2 power law rather than -7/3 law for high wavenumber equilibrium range of the stable atmospheric boundary layer. This finding contributes to improved estimation of turbulent fluxes in both modelling and observation.

Submitted 21 November, 2018; originally announced November 2018.

Comments: 21 pages, 15 figures

arXiv:1806.04731 [pdf, other]

doi 10.1073/pnas.1810286115

Deep learning to represent sub-grid processes in climate models

Authors: Stephan Rasp, Michael S. Pritchard, Pierre Gentine

Abstract: The representation of nonlinear sub-grid processes, especially clouds, has been a major source of uncertainty in climate models for decades. Cloud-resolving models better represent many of these processes and can now be run globally but only for short-term simulations of at most a few years because of computational limitations. Here we demonstrate that deep learning can be used to capture many advantages of cloud-resolving modeling at a fraction of the computational cost. We train a deep neural network to represent all atmospheric sub-grid processes in a climate model by learning from a multi-scale model in which convection is treated explicitly. The trained neural network then replaces the traditional sub-grid parameterizations in a global general circulation model in which it freely interacts with the resolved dynamics and the surface-flux scheme. The prognostic multi-year simulations are stable and closely reproduce not only the mean climate of the cloud-resolving simulation but also key aspects of variability, including precipitation extremes and the equatorial wave spectrum. Furthermore, the neural network approximately conserves energy despite not being explicitly instructed to. Finally, we show that the neural network parameterization generalizes to new surface forcing patterns but struggles to cope with temperatures far outside its training manifold. Our results show the feasibility of using deep learning for climate model parameterization. In a broader context, we anticipate that data-driven Earth System Model development could play a key role in reducing climate prediction uncertainty in the coming decade.

Submitted 7 September, 2018; v1 submitted 12 June, 2018; originally announced June 2018.

Comments: View official PNAS version at https://doi.org/10.1073/pnas.1810286115

Journal ref: Proceedings of the National Academy of Sciences Sep 2018, 201810286; DOI: 10.1073/pnas.1810286115

arXiv:1805.05444 [pdf, other]

doi 10.1029/2019MS001790

When does vapor pressure deficit drive or reduce evapotranspiration?

Authors: Adam Massmann, Pierre Gentine, Changjie Lin

Abstract: Increasing vapor pressure deficit (VPD) increases atmospheric demand for water. While increased evapotranspiration (ET) in response to increased atmospheric demand seems intuitive, plants are capable of reducing ET in response to increased VPD by closing their stomata. We examine which effect dominates the response to increasing VPD: atmospheric demand and increases in ET, or plant response (stomata closure) and decreases in ET. We use Penman-Monteith, combined with semi-empirical optimal stomatal regulation theory and underlying water use efficiency, to develop a theoretical framework for assessing ET response to VPD. The theory suggests that depending on the environment and plant characteristics, ET response to increasing VPD can vary from strongly decreasing to increasing, highlighting the diversity of plant water regulation strategies. The ET response varies due to: 1) climate, with tropical and temperate climates more likely to exhibit a positive ET response to increasing VPD than boreal and arctic climates; 2) photosynthesis strategy, with C3 plants more likely to exhibit a positive ET response than C4 plants; and 3) plant type, with crops more likely to exhibit a positive ET response, and shrubs and gymnosperm trees more likely to exhibit a negative ET response. These results, derived from previous literature connecting plant parameters to plant and climate characteristics, highlight the utility of our simplified framework for understanding complex land atmosphere systems in terms of idealized scenarios in which ET responds to VPD only. This response is otherwise challenging to assess in an environment where many processes co-evolve together.

Submitted 18 September, 2019; v1 submitted 14 May, 2018; originally announced May 2018.

Journal ref: Journal of Advances in Modeling Earth Systems, 11. (2019)

arXiv:1801.05847 [pdf, ps, other]

Turbulence Spectra in the Stable Atmospheric Boundary Layer

Authors: Yu Cheng, Qi Li, Stefania Argentini, Chadi Sayde, Pierre Gentine

Abstract: Stratification can cause turbulence spectra to deviate from Kolmogorov's isotropic -5/3 power-law scaling in the universal equilibrium range at high Reynolds numbers. However, a consensus has not been reached with regard to the exact shape of the spectra. Here we propose a theoretically-derived shape of the turbulent kinetic energy (TKE) and temperature spectra in horizontal wavenumber that consists of three regimes at small Froude number: the buoyancy subrange, a transition region and isotropic inertial subrange through derivation based on previous research. These regimes are confirmed by various observations in the atmospheric boundary layer. We also show that DNS may not apply in the study of very stable atmospheric boundary layers at very high Reynolds numbers as they cannot correctly represent the observed spectral regimes because of the lack of scale separation limited by current computational capacity. In addition, the spectrum in the transition regime explains why Monin-Obukhov similarity theory cannot entirely describe the behavior of the stable atmospheric boundary.

Submitted 18 October, 2018; v1 submitted 17 January, 2018; originally announced January 2018.

Comments: 10 pages, 5 figures

Showing 1–38 of 38 results for author: Gentine, P