Search | arXiv e-print repository

Showing 1–50 of 138 results for author: E, W

  1. arXiv:2407.06152  [pdf, other]

    physics.chem-ph cs.AI

    Uni-ELF: A Multi-Level Representation Learning Framework for Electrolyte Formulation Design

    Authors: Boshen Zeng, Sian Chen, Xinxin Liu, Changhong Chen, Bin Deng, Xiaoxu Wang, Zhifeng Gao, Yuzhi Zhang, Weinan E, Linfeng Zhang

    Abstract: Advancements in lithium battery technology heavily rely on the design and engineering of electrolytes. However, current schemes for molecular design and recipe optimization of electrolytes lack an effective computational-experimental closed loop and often fall short in accurately predicting diverse electrolyte formulation properties. In this work, we introduce Uni-ELF, a novel multi-level represen… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2407.01178  [pdf, other]

    cs.CL cs.AI cs.LG

    $\text{Memory}^3$: Language Modeling with Explicit Memory

    Authors: Hongkang Yang, Zehao Lin, Wenjin Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, Jinbo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong, Linpeng Tang, Weinan E

    Abstract: The training and inference of large language models (LLMs) are together a costly process that transports knowledge from raw data to meaningful computation. Inspired by the memory hierarchy of the human brain, we reduce this cost by equipping LLMs with explicit memory, a memory format cheaper than model parameters and text retrieval-augmented generation (RAG). Conceptually, with most of its knowled… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  3. arXiv:2406.14969  [pdf, other]

    cs.LG cs.AI

    Uni-Mol2: Exploring Molecular Pretraining Model at Scale

    Authors: Xiaohong Ji, Zhen Wang, Zhifeng Gao, Hang Zheng, Linfeng Zhang, Guolin Ke, Weinan E

    Abstract: In recent years, pretraining models have made significant advancements in the fields of natural language processing (NLP), computer vision (CV), and life sciences. The significant advancements in NLP and CV are predominantly driven by the expansion of model parameters and data size, a phenomenon now recognized as the scaling laws. However, research exploring scaling law in molecular pretraining mo… ▽ More

    Submitted 1 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2405.20763  [pdf, other]

    cs.LG math.OC stat.ML

    Improving Generalization and Convergence by Enhancing Implicit Regularization

    Authors: Mingze Wang, Haotian He, Jinbo Wang, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

    Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 35 pages

  5. arXiv:2405.12356  [pdf, other]

    physics.bio-ph cs.LG physics.chem-ph physics.data-an

    Coarse-graining conformational dynamics with multi-dimensional generalized Langevin equation: how, when, and why

    Authors: Pinchen Xie, Yunrui Qiu, Weinan E

    Abstract: A data-driven ab initio generalized Langevin equation (AIGLE) approach is developed to learn and simulate high-dimensional, heterogeneous, coarse-grained conformational dynamics. Constrained by the fluctuation-dissipation theorem, the approach can build coarse-grained models in dynamical consistency with all-atom molecular dynamics. We also propose practical criteria for AIGLE to enforce long-term… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2402.18785  [pdf, ps, other]

    gr-qc hep-th

    On the semiclassical bounce with strong minimal assumptions

    Authors: Wagno Cesar e Silva, Ilya L. Shapiro

    Abstract: We explore the possibility of avoiding cosmological singularity with a bounce solution in the early Universe. The main finding is that simple and well-known semiclassical correction, which describes the mixing of radiation and gravity in the effective action, may provide an analytic solution with a bounce. The solution requires a positive beta function for the total radiation term and the contract… ▽ More

    Submitted 26 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 14 pages, 3 figures. Added discussion related to the energy conditions and some references

    MSC Class: 81T20; 83C47; 83F05

  7. arXiv:2402.00522  [pdf, ps, other]

    cs.LG stat.ML

    Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling

    Authors: Mingze Wang, Weinan E

    Abstract: We conduct a systematic study of the approximation properties of Transformer for sequence modeling with long, sparse and complicated memory. We investigate the mechanisms through which different components of Transformer, such as the dot-product self-attention, positional encoding and feed-forward layer, affect its expressive power, and we study their combined effects through establishing explicit… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 70 pages

  8. arXiv:2401.08309  [pdf, other]

    cs.CL cs.LG

    Anchor function: a type of benchmark functions for studying language models

    Authors: Zhongwang Zhang, Zhiwei Wang, Junjie Yao, Zhangchen Zhou, Xiaolong Li, Weinan E, Zhi-Qin John Xu

    Abstract: Understanding transformer-based language models is becoming increasingly crucial, particularly as they play pivotal roles in advancing towards artificial general intelligence. However, language model research faces significant challenges, especially for academic research groups with constrained resources. These challenges include complex data structures, unknown target functions, high computationa… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  9. arXiv:2401.01220  [pdf, other]

    math.NA

    Solving multiscale dynamical systems by deep learning

    Authors: Zhi-Qin John Xu, Junjie Yao, Yuxiao Yi, Liangkai Hang, Weinan E, Yaoyu Zhang, Tianhan Zhang

    Abstract: Multiscale dynamical systems, modeled by high-dimensional stiff ordinary differential equations (ODEs) with wide-ranging characteristic timescales, arise across diverse fields of science and engineering, but their numerical solvers often encounter severe efficiency bottlenecks. This paper introduces a novel DeePODE method, which consists of a global multiscale sampling method and a fitting by deep… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 7 pages, 6 figures

  10. arXiv:2312.15492  [pdf, other]

    physics.chem-ph cond-mat.mtrl-sci physics.comp-ph

    DPA-2: Towards a universal large atomic model for molecular and material simulation

    Authors: Duo Zhang, Xinzijian Liu, Xiangyu Zhang, Chengqian Zhang, Chun Cai, Hangrui Bi, Yiming Du, Xuejian Qin, Jiameng Huang, Bowen Li, Yifan Shan, Jinzhe Zeng, Yuzhi Zhang, Siyuan Liu, Yifan Li, Junhan Chang, Xinyan Wang, Shuo Zhou, Jianchuan Liu, Xiaoshan Luo, Zhenyu Wang, Wanrun Jiang, Jing Wu, Yudi Yang, Jiyuan Yang , et al. (17 additional authors not shown)

    Abstract: The rapid development of artificial intelligence (AI) is driving significant changes in the field of atomic modeling, simulation, and design. AI-based potential energy models have been successfully used to perform large-scale and long-time simulations with the accuracy of ab initio electronic structure methods. However, the model generation process still hinders applications at scale. We envision… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  11. arXiv:2312.11691  [pdf, other]

    hep-th

    SUSY QED with Lorentz-asymmetric fermionic matter and a glance at the electron's EDM

    Authors: João Paulo S. Melo, Wagno Cesar e Silva, José A. Helayël-Neto

    Abstract: This contribution sets out to pursue the investigation of a supersymmetric electrodynamics model with Lorentz-symmetry violation (LSV) manifested by a space-time unbalance in the propagation of the fermionic charged matter. Despite violation of Lorentz symmetry, the supersymmetry algebra is kept untouched. We then adopt a superspace approach to build up an $\mathcal{N}=1$-supersymmetric Abelian ga… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 32 pages, 1 figure

  12. arXiv:2311.17749  [pdf, other]

    math.OC cs.RO

    Learning Free Terminal Time Optimal Closed-loop Control of Manipulators

    Authors: Wei Hu, Yue Zhao, Weinan E, Jiequn Han, Jihao Long

    Abstract: This paper presents a novel approach to learning free terminal time closed-loop control for robotic manipulation tasks, enabling dynamic adjustment of task duration and control inputs to enhance performance. We extend the supervised learning approach, namely solving selected optimal open-loop problems and utilizing them as training data for a policy network, to the free terminal time scenario. Thr… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  13. arXiv:2307.04638  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    DeePTB: A deep learning-based tight-binding approach with $ab$ $initio$ accuracy

    Authors: Qiangqiang Gu, Zhanghao Zhouyin, Shishir Kumar Pandey, Peng Zhang, Linfeng Zhang, Weinan E

    Abstract: Simulating electronic behavior in materials and devices with realistic large system sizes remains a formidable task within the $ab$ $initio$ framework. We propose DeePTB, an efficient deep learning-based tight-binding (TB) approach with $ab$ $initio$ accuracy to address this issue. By training with $ab$ $initio$ eigenvalues, our method can efficiently predict TB Hamiltonians for unseen structures.… ▽ More

    Submitted 11 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 14 pages, 12 figures

  14. arXiv:2305.01243  [pdf]

    physics.comp-ph cs.LG

    Invertible Coarse Graining with Physics-Informed Generative Artificial Intelligence

    Authors: Jun Zhang, Xiaohan Lin, Weinan E, Yi Qin Gao

    Abstract: Multiscale molecular modeling is widely applied in scientific research of molecular properties over large time and length scales. Two specific challenges are commonly present in multiscale modeling, provided that information between the coarse and fine representations of molecules needs to be properly exchanged: One is to construct coarse grained models by passing information from the fine to coar… ▽ More

    Submitted 20 July, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 16 pages, 5 figures

  15. arXiv:2304.09409  [pdf, other]

    physics.chem-ph physics.atm-clus

    DeePMD-kit v2: A software package for Deep Potential models

    Authors: Jinzhe Zeng, Duo Zhang, Denghui Lu, Pinghui Mo, Zeyu Li, Yixiao Chen, Marián Rynik, Li'ang Huang, Ziyao Li, Shaochen Shi, Yingze Wang, Haotian Ye, Ping Tuo, Jiabin Yang, Ye Ding, Yifan Li, Davide Tisi, Qiyu Zeng, Han Bao, Yu Xia, Jiameng Huang, Koki Muraoka, Yibo Wang, Junhan Chang, Fengbo Yuan , et al. (22 additional authors not shown)

    Abstract: DeePMD-kit is a powerful open-source software package that facilitates molecular dynamics simulations using machine learning potentials (MLP) known as Deep Potential (DP) models. This package, which was released in 2017, has been widely used in the fields of physics, chemistry, biology, and material science for studying atomistic systems. The current version of DeePMD-kit offers numerous advanced… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 51 pages, 2 figures

    ACM Class: J.2

    Journal ref: J. Chem. Phys. 159, 054801 (2023)

  16. arXiv:2304.06913  [pdf, other]

    math.NA physics.comp-ph

    The Random Feature Method for Time-dependent Problems

    Authors: Jingrun Chen, Weinan E, Yixin Luo

    Abstract: We present a framework for solving time-dependent partial differential equations (PDEs) in the spirit of the random feature method. The numerical solution is constructed using a space-time partition of unity and random feature functions. Two different ways of constructing the random feature functions are investigated: feature functions that treat the spatial and temporal variables (STC) on the sam… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 26 pages, 12 figures

    MSC Class: 65M20; 65M55; 65M70

  17. arXiv:2302.03498  [pdf, other]

    cs.CL cs.SD eess.AS

    MAC: A unified framework boosting low resource automatic speech recognition

    Authors: Zeping Min, Qian Ge, Zhong Li, Weinan E

    Abstract: We propose a unified framework for low resource automatic speech recognition tasks named meta audio concatenation (MAC). It is easy to implement and can be carried out in extremely low resource environments. Mathematically, we give a clear description of MAC framework from the perspective of bayesian sampling. In this framework, we leverage a novel concatenative synthesis text-to-speech system to… ▽ More

    Submitted 15 February, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

  18. Effective approach to the Antoniadis-Mottola model: quantum decoupling of the higher derivative terms

    Authors: Wagno Cesar e Silva, Ilya L. Shapiro

    Abstract: We explore the decoupling of massive ghost mode in the $4D$ (four-dimensional) theory of the conformal factor of the metric. The model was introduced by Antoniadis and Mottola in [1] and can be regarded as a close analog of the fourth-derivative quantum gravity. The analysis of the derived one-loop nonlocal form factors includes their asymptotic behavior in the UV and IR limits. In the UV (high en… ▽ More

    Submitted 12 July, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: 35 pages, 20 figures. Several detailed explanations, discussions, references and Appendices added to improve presentation. Fits the published version

    Journal ref: J. High Energ. Phys. 2023 (2023) 97

  19. arXiv:2211.10824  [pdf, other]

    physics.comp-ph physics.chem-ph quant-ph

    Hybrid Auxiliary Field Quantum Monte Carlo for Molecular Systems

    Authors: Yixiao Chen, Linfeng Zhang, Weinan E, Roberto Car

    Abstract: We propose a quantum Monte Carlo approach to solve the ground state many-body Schrodinger equation for the electronic ground state. The method combines optimization from variational Monte Carlo and propagation from auxiliary field quantum Monte Carlo, in a way that significantly alleviates the sign problem. In application to molecular systems, we obtain highly accurate results for configurations d… ▽ More

    Submitted 20 April, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

  20. arXiv:2211.06558  [pdf, other]

    physics.comp-ph cond-mat.mes-hall cond-mat.mtrl-sci

    Ab Initio Generalized Langevin Equation

    Authors: Pinchen Xie, Roberto Car, Weinan E

    Abstract: We introduce a machine learning-based approach called ab initio generalized Langevin equation (AIGLE) to model the dynamics of slow collective variables in materials and molecules. In this scheme, the parameters are learned from atomistic simulations based on ab initio quantum mechanical models. Force field, memory kernel, and noise generator are constructed in the context of the Mori-Zwanzig form… ▽ More

    Submitted 15 February, 2024; v1 submitted 11 November, 2022; originally announced November 2022.

  21. arXiv:2209.04078  [pdf, other]

    math.OC

    Initial Value Problem Enhanced Sampling for Closed-Loop Optimal Control Design with Deep Neural Networks

    Authors: Xuanxi Zhang, Jihao Long, Wei Hu, Weinan E, Jiequn Han

    Abstract: Closed-loop optimal control design for high-dimensional nonlinear systems has been a long-standing challenge. Traditional methods, such as solving the associated Hamilton-Jacobi-Bellman equation, suffer from the curse of dimensionality. Recent literature proposed a new promising approach based on supervised learning, by leveraging powerful open-loop optimal control solvers to generate training dat… ▽ More

    Submitted 9 July, 2023; v1 submitted 8 September, 2022; originally announced September 2022.

  22. arXiv:2207.13380  [pdf, other]

    math.NA physics.comp-ph

    Bridging Traditional and Machine Learning-based Algorithms for Solving PDEs: The Random Feature Method

    Authors: Jingrun Chen, Xurong Chi, Weinan E, Zhouwang Yang

    Abstract: One of the oldest and most studied subject in scientific computing is algorithms for solving partial differential equations (PDEs). A long list of numerical methods have been proposed and successfully used for various applications. In recent years, deep learning methods have shown their superiority for high-dimensional PDEs where traditional methods fail. However, for low dimensional problems, it… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

  23. arXiv:2206.05946  [pdf, other]

    hep-ex

    Measurement of the branching fraction for the decay $B \to K^{\ast}(892)\ell^+\ell^-$ at Belle II

    Authors: Belle II Collaboration, F. Abudinén, I. Adachi, R. Adak, K. Adamczyk, L. Aggarwal, P. Ahlburg, H. Ahmed, J. K. Ahn, H. Aihara, N. Akopov, A. Aloisio, F. Ameli, L. Andricek, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aulchenko, T. Aushev, V. Aushev, T. Aziz, V. Babu, S. Bacher, H. Bae, S. Baehr , et al. (569 additional authors not shown)

    Abstract: We report a measurement of the branching fraction of $B \to K^{\ast}(892)\ell^+\ell^-$ decays, where $\ell^+\ell^- = μ^+μ^-$ or $e^+e^-$, using electron-positron collisions recorded at an energy at or near the $Υ(4S)$ mass and corresponding to an integrated luminosity of $189$ fb$^{-1}$. The data was collected during 2019--2021 by the Belle II experiment at the SuperKEKB $e^{+}e^{-}$ asymmetric-en… ▽ More

    Submitted 19 September, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Report number: BELLE2-CONF-PH-2022-009

  24. arXiv:2205.11839  [pdf, other]

    cond-mat.mtrl-sci

    Ab initio multi-scale modeling of ferroelectrics: The case of PbTiO3

    Authors: Pinchen Xie, Yixiao Chen, Weinan E, Roberto Car

    Abstract: We report an ab initio multi-scale study of lead titanate using the Deep Potential (DP) models, a family of machine learning-based atomistic models, trained on first-principles density functional theory data, to represent potential and polarization surfaces. Our approach includes anharmonic effects beyond the limitations of reduced models and of the linear approximation for the polarization. The c… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  25. arXiv:2205.08622  [pdf, other]

    math.OC

    Solving optimal control of rigid-body dynamics with collisions using the hybrid minimum principle

    Authors: Wei Hu, Jihao Long, Yaohua Zang, Weinan E, Jiequn Han

    Abstract: Collisions are common in many dynamical systems with real applications. They can be formulated as hybrid dynamical systems with discontinuities automatically triggered when states transverse certain manifolds. We present an algorithm for the optimal control problem of such hybrid dynamical systems based on solving the equations derived from the hybrid minimum principle (HMP). The algorithm is an i… ▽ More

    Submitted 10 May, 2023; v1 submitted 17 May, 2022; originally announced May 2022.

    MSC Class: 49Mxx

  26. arXiv:2205.07990  [pdf, other]

    math.OC

    Empowering Optimal Control with Machine Learning: A Perspective from Model Predictive Control

    Authors: Weinan E, Jiequn Han, Jihao Long

    Abstract: Solving complex optimal control problems have confronted computational challenges for a long time. Recent advances in machine learning have provided us with new opportunities to address these challenges. This paper takes model predictive control, a popular optimal control method, as the primary example to survey recent progress that leverages machine learning techniques to empower optimal control… ▽ More

    Submitted 20 July, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

  27. arXiv:2203.06753  [pdf, other]

    math.OC

    A Machine Learning Enhanced Algorithm for the Optimal Landing Problem

    Authors: Yaohua Zang, Jihao Long, Xuanxi Zhang, Wei Hu, Weinan E, Jiequn Han

    Abstract: We propose a machine learning enhanced algorithm for solving the optimal landing problem. Using Pontryagin's minimum principle, we derive a two-point boundary value problem for the landing problem. The proposed algorithm uses deep learning to predict the optimal landing time and a space-marching technique to provide good initial guesses for the boundary value problem solver. The performance of the… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

  28. arXiv:2203.00393  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    Deep Potentials for Materials Science

    Authors: Tongqi Wen, Linfeng Zhang, Han Wang, Weinan E, David J. Srolovitz

    Abstract: To fill the gap between accurate (and expensive) ab initio calculations and efficient atomistic simulations based on empirical interatomic potentials, a new class of descriptions of atomic interactions has emerged and been widely applied; i.e., machine learning potentials (MLPs). One recently developed type of MLP is the Deep Potential (DP) method. In this review, we provide an introduction to DP… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

  29. Trace anomaly and induced action for a metric-scalar background

    Authors: Manuel Asorey, Wagno Cesar e Silva, Ilya L. Shapiro, Públio R. B. do Vale

    Abstract: The conformal anomaly and anomaly-induced effective action represent useful and economic ways to describe semiclassical contributions to the action of gravity. We discuss the anomaly in the case when the background is formed by metric and scalar fields and formulate the induced action in two standard covariant forms. The analysis of induced action at low energies reveals existing connection to the… ▽ More

    Submitted 7 February, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Explanations and references added. Fits the version to be published in EPJC. 22 pages, no figures

    MSC Class: 81T50; 81T20; 81T15; 83C45

    Journal ref: Eur. Phys. J. C 83 (2023) 157

  30. arXiv:2201.03549  [pdf, other]

    physics.chem-ph cs.LG math.NA physics.comp-ph physics.flu-dyn

    A multi-scale sampling method for accurate and robust deep neural network to predict combustion chemical kinetics

    Authors: Tianhan Zhang, Yuxiao Yi, Yifan Xu, Zhi X. Chen, Yaoyu Zhang, Weinan E, Zhi-Qin John Xu

    Abstract: Machine learning has long been considered as a black box for predicting combustion chemical kinetics due to the extremely large number of parameters and the lack of evaluation standards and reproducibility. The current work aims to understand two basic questions regarding the deep neural network (DNN) method: what data the DNN needs and how general the DNN method can be. Sampling and preprocessing… ▽ More

    Submitted 12 August, 2022; v1 submitted 9 January, 2022; originally announced January 2022.

  31. arXiv:2201.02025  [pdf, other]

    cs.LG math.OC

    A deep learning-based model reduction (DeePMR) method for simplifying chemical kinetics

    Authors: Zhiwei Wang, Yaoyu Zhang, Enhan Zhao, Yiguang Ju, Weinan E, Zhi-Qin John Xu, Tianhan Zhang

    Abstract: A deep learning-based model reduction (DeePMR) method for simplifying chemical kinetics is proposed and validated using high-temperature auto-ignitions, perfectly stirred reactors (PSR), and one-dimensional freely propagating flames of n-heptane/air mixtures. The mechanism reduction is modeled as an optimization problem on Boolean space, where a Boolean vector, each entry corresponding to a specie… ▽ More

    Submitted 8 September, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

  32. arXiv:2112.14798  [pdf, other]

    physics.comp-ph cs.LG physics.flu-dyn

    DeePN$^2$: A deep learning-based non-Newtonian hydrodynamic model

    Authors: Lidong Fang, Pei Ge, Lei Zhang, Weinan E, Huan Lei

    Abstract: A long standing problem in the modeling of non-Newtonian hydrodynamics of polymeric flows is the availability of reliable and interpretable hydrodynamic models that faithfully encode the underlying micro-scale polymer dynamics. The main complication arises from the long polymer relaxation time, the complex molecular structure and heterogeneous interaction. DeePN$^2$, a deep learning-based non-Newt… ▽ More

    Submitted 13 April, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

  33. arXiv:2112.14377  [pdf, other]

    econ.GN cs.LG

    DeepHAM: A Global Solution Method for Heterogeneous Agent Models with Aggregate Shocks

    Authors: Jiequn Han, Yucheng Yang, Weinan E

    Abstract: An efficient, reliable, and interpretable global solution method, the Deep learning-based algorithm for Heterogeneous Agent Models (DeepHAM), is proposed for solving high dimensional heterogeneous agent models with aggregate shocks. The state distribution is approximately represented by a set of optimal generalized moments. Deep neural networks are used to approximate the value and policy function… ▽ More

    Submitted 21 February, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: Slides available at https://users.flatironinstitute.org/~jhan/files/DeepHAM_slides.pdf

  34. arXiv:2112.13327  [pdf, other]

    physics.chem-ph physics.comp-ph

    A deep potential model with long-range electrostatic interactions

    Authors: Linfeng Zhang, Han Wang, Maria Carolina Muniz, Athanassios Z. Panagiotopoulos, Roberto Car, Weinan E

    Abstract: Machine learning models for the potential energy of multi-atomic systems, such as the deep potential (DP) model, make possible molecular simulations with the accuracy of quantum mechanical density functional theory, at a cost only moderately higher than that of empirical force fields. However, the majority of these models lack explicit long-range interactions and fail to describe properties that d… ▽ More

    Submitted 16 February, 2022; v1 submitted 26 December, 2021; originally announced December 2021.

  35. Measurement of inclusive electrons from open heavy-flavor hadron decays in $p$+$p$ collisions at $\sqrt{s} = 200$ GeV with the STAR detector

    Authors: STAR Collaboration, M. S. Abdallah, B. E. Aboona, J. Adam, L. Adamczyk, J. R. Adams, J. K. Adkins, G. Agakishiev, I. Aggarwal, M. M. Aggarwal, Z. Ahammed, I. Alekseev, D. M. Anderson, A. Aparin, E. C. Aschenauer, M. U. Ashraf, F. G. Atetalla, A. Attri, G. S. Averichev, V. Bairathi, W. Baker, J. G. Ball Cap, K. Barish, A. Behera, R. Bellwied , et al. (372 additional authors not shown)

    Abstract: We report a new measurement of the production cross section for inclusive electrons from open heavy-flavor hadron decays as a function of transverse momentum ($p_{\rm T}$) at mid-rapidity ($|y|<$ 0.7) in $p$+$p$ collisions at $\sqrt{s} = 200$ GeV. The result is presented for 2.5 $<p_{\rm T}<$ 10 GeV/$c$ with an improved precision above 6 GeV/$c$ with respect to the previous measurements, providing… ▽ More

    Submitted 3 March, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Journal ref: Phys. Rev. D 105, 032007 (2022)

  36. arXiv:2109.06502  [pdf]

    cond-mat.other physics.chem-ph

    Subsurface Carbon-Induced Local Charge of Copper for On-Surface Displacement Reaction

    Authors: Shaoshan Wang, Pengcheng Ding, Zhuo Li, Cristina Mattioli, Wenlong E, Ye Sun, André Gourdon, Lev Kantorovich, Flemming Besenbacher, Xueming Yang, Miao Yu

    Abstract: Transition metal carbides have sparked unprecedented enthusiasm as high-performance catalysts in recent years. Still, the catalytic properties of copper (Cu) carbide remain unexplored. By introducing subsurface carbon (C) to Cu(111), displacement reaction of proton in carboxyl acid group with single Cu atom is demonstrated at the atomic scale and room temperature. Its occurrence is attributed to t… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Angewandte Chemie International Edition, Wiley-VCH Verlag, 2021

  37. On the vector conformal models in an arbitrary dimension

    Authors: Manuel Asorey, Lesław Rachwał, Ilya L. Shapiro, Wagno Cesar e Silva

    Abstract: The conventional model of the gauge vector field is invariant under the local conformal symmetry only in the four-dimensional space ($4d$). Conformal generalization to an arbitrary dimension $d$ is impossible even for the free theory, differently from scalar and fermion fields. We discuss how to overcome this restriction and eventually construct four vector conformal actions. One of these models i… ▽ More

    Submitted 3 October, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: 15 pages. A few formulations made more precise, fits the version accepted in EPJP

    Journal ref: Eur. Phys. J. Plus 136 (2021) 1043

  38. MOD-Net: A Machine Learning Approach via Model-Operator-Data Network for Solving PDEs

    Authors: Lulu Zhang, Tao Luo, Yaoyu Zhang, Weinan E, Zhi-Qin John Xu, Zheng Ma

    Abstract: In this paper, we propose a machine learning approach via model-operator-data network (MOD-Net) for solving PDEs. A MOD-Net is driven by a model to solve PDEs based on operator representation with regularization from data. For linear PDEs, we use a DNN to parameterize the Green's function and obtain the neural operator to approximate the solution according to the Green's method. To train the DNN… ▽ More

    Submitted 28 December, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  39. arXiv:2107.03633  [pdf, other]

    cs.LG stat.ML

    Generalization Error of GAN from the Discriminator's Perspective

    Authors: Hongkang Yang, Weinan E

    Abstract: The generative adversarial network (GAN) is a well-known model for learning high-dimensional distributions, but the mechanism for its generalization ability is not understood. In particular, GAN is vulnerable to the memorization phenomenon, the eventual convergence to the empirical distribution. We consider a simplified GAN model with the generator replaced by a density, and analyze how the discri… ▽ More

    Submitted 5 November, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

    MSC Class: 68T07; 62G07; 60-08

  40. arXiv:2104.07794  [pdf, ps, other]

    cs.LG

    An $L^2$ Analysis of Reinforcement Learning in High Dimensions with Kernel and Neural Network Approximation

    Authors: Jihao Long, Jiequn Han, Weinan E

    Abstract: Reinforcement learning (RL) algorithms based on high-dimensional function approximation have achieved tremendous empirical success in large-scale problems with an enormous number of states. However, most analysis of such algorithms gives rise to error bounds that involve either the number of states or the number of features. This paper considers the situation where the function approximation is ma… ▽ More

    Submitted 15 February, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

  41. Efficient sampling of high-dimensional free energy landscapes using adaptive reinforced dynamics

    Authors: Dongdong Wang, Yanze Wang, Junhan Chang, Linfeng Zhang, Han Wang, Weinan E

    Abstract: Enhanced sampling methods such as metadynamics and umbrella sampling have become essential tools for exploring the configuration space of molecules and materials. At the same time, they have long faced a number of issues such as the inefficiency when dealing with a large number of collective variables (CVs) or systems with high free energy barriers. In this work, we show that with the cluste…

    Submitted 26 December, 2021; v1 submitted 4 April, 2021; originally announced April 2021.

  42. arXiv:2103.16810  [pdf, other]

    stat.ME

    An Expectation-Maximization Algorithm for Continuous-time Hidden Markov Models

    Authors: Qingcan Wang, Weinan E

    Abstract: We propose a unified framework that extends the inference methods for classical hidden Markov models to continuous settings, where both the hidden states and observations occur in continuous time. Two different settings are analyzed: hidden jump process with a finite state space, and hidden diffusion process with a continuous state space. For each setting, we first estimate the hidden states given…

    Submitted 17 June, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    MSC Class: 60J25; 65C40

  43. The Phase Diagram of a Deep Potential Water Model

    Authors: Linfeng Zhang, Han Wang, Roberto Car, Weinan E

    Abstract: Using the Deep Potential methodology, we construct a model that reproduces accurately the potential energy surface of the SCAN approximation of density functional theory for water, from low temperature and pressure to about 2400 K and 50 GPa, excluding the vapor stability region. The computational efficiency of the model makes it possible to predict its phase diagram using molecular dynamics. Sati…

    Submitted 11 February, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

    Journal ref: Phys. Rev. Lett. 126, 236001 (2021)

  44. arXiv:2012.14615  [pdf, other]

    physics.chem-ph physics.comp-ph

    DeePKS-kit: a package for developing machine learning-based chemically accurate energy and density functional models

    Authors: Yixiao Chen, Linfeng Zhang, Han Wang, Weinan E

    Abstract: We introduce DeePKS-kit, an open-source software package for developing machine learning based energy and density functional models. DeePKS-kit is interfaced with PyTorch, an open-source machine learning library, and PySCF, an ab initio computational chemistry program that provides simple and customized tools for developing quantum chemistry codes. It supports the DeePHF and DeePKS methods. In add…

    Submitted 21 June, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

  45. arXiv:2012.12654  [pdf]

    physics.chem-ph cs.LG math.NA

    A deep learning-based ODE solver for chemical kinetics

    Authors: Tianhan Zhang, Yaoyu Zhang, Weinan E, Yiguang Ju

    Abstract: Developing efficient and accurate algorithms for chemistry integration is a challenging task due to its strong stiffness and high dimensionality. The current work presents a deep learning-based numerical method called DeepCombustion0.0 to solve stiff ordinary differential equation systems. The homogeneous autoignition of DME/air mixture, including 54 species, is adopted as an example to illustrate…

    Submitted 23 November, 2020; originally announced December 2020.

  46. Bounce and stability in the early cosmology with anomaly-induced corrections

    Authors: Wagno Cesar e Silva, Ilya L. Shapiro

    Abstract: An extremely fast exponential expansion of the Universe is typical for the stable version of the inflationary model, based on the anomaly-induced action of gravity. The total amount of exponential $e$-folds could be very large, before the transition to the unstable version and the beginning of the Starobinsky inflation. Thus, the stable exponential expansion can be seen as a pre-inflationary semic…

    Submitted 25 December, 2020; v1 submitted 18 December, 2020; originally announced December 2020.

    Comments: 22 pages, 12 figures. Extended version, added more details concerning phase diagrams, many new references and a few comments. Article written by invitation for the Special Issue "Physics and Mathematics of the Dark Universe" in Symmetry (Editor Sergei Ketov)

    Journal ref: Symmetry 13 (2021) 50

  47. arXiv:2012.05420  [pdf, ps, other]

    cs.LG stat.ML

    On the emergence of simplex symmetry in the final and penultimate layers of neural network classifiers

    Authors: Weinan E, Stephan Wojtowytsch

    Abstract: A recent numerical study observed that neural network classifiers enjoy a large degree of symmetry in the penultimate layer. Namely, if $h(x) = Af(x) +b$ where $A$ is a linear map and $f$ is the output of the penultimate layer of the network (after activation), then all data points $x_{i, 1}, \dots, x_{i, N_i}$ in a class $C_i$ are mapped to a single point $y_i$ by $f$ and the points $y_i$ are loc…

    Submitted 4 June, 2021; v1 submitted 9 December, 2020; originally announced December 2020.

    MSC Class: 68T07; 62H30

  48. arXiv:2012.01484  [pdf, ps, other]

    math.AP cs.LG

    Some observations on high-dimensional partial differential equations with Barron data

    Authors: Weinan E, Stephan Wojtowytsch

    Abstract: We use explicit representation formulas to show that solutions to certain partial differential equations lie in Barron spaces or multilayer spaces if the PDE data lie in such function spaces. Consequently, these solutions can be represented efficiently using artificial neural networks, even in high dimension. Conversely, we present examples in which the solution fails to lie in the function space…

    Submitted 4 June, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

    MSC Class: 68T07; 35C15; 65M80

  49. arXiv:2011.14269  [pdf, other]

    stat.ML cs.LG

    Generalization and Memorization: The Bias Potential Model

    Authors: Hongkang Yang, Weinan E

    Abstract: Models for learning probability distributions such as generative models and density estimators behave quite differently from models for learning functions. One example is found in the memorization phenomenon, namely the ultimate convergence to the empirical distribution, that occurs in generative adversarial networks (GANs). For this reason, the issue of generalization is more subtle than that for…

    Submitted 1 March, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: Added new section on regularized model

    MSC Class: 68T07; 60-08

  50. arXiv:2010.05627  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Towards Theoretically Understanding Why SGD Generalizes Better Than ADAM in Deep Learning

    Authors: Pan Zhou, Jiashi Feng, Chao Ma, Caiming Xiong, Steven Hoi, Weinan E

    Abstract: It is not clear yet why ADAM-alike adaptive gradient algorithms suffer from worse generalization performance than SGD despite their faster training speed. This work aims to provide understandings on this generalization gap by analyzing their local convergence behaviors. Specifically, we observe the heavy tails of gradient noise in these algorithms. This motivates us to analyze these algorithms thr…

    Submitted 28 November, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020