(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–20 of 20 results for author: Olson, L N

.
  1. arXiv:2408.03413  [pdf, other

    cs.LG cs.CE physics.flu-dyn

    A TVD neural network closure and application to turbulent combustion

    Authors: Seung Won Suh, Jonathan F MacArt, Luke N Olson, Jonathan B Freund

    Abstract: Trained neural networks (NN) have attractive features for closing governing equations, but in the absence of additional constraints, they can stray from physical reality. A NN formulation is introduced to preclude spurious oscillations that violate solution boundedness or positivity. It is embedded in the discretized equations as a machine learning closure and strictly constrained, inspired by tot… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  2. arXiv:2407.07253  [pdf, other

    math.NA

    Monolithic Multigrid Preconditioners for High-Order Discretizations of Stokes Equations

    Authors: Alexey Voronin, Graham Harper, Scott MacLachlan, Luke N. Olson, Raymond S. Tuminaro

    Abstract: This work introduces and assesses the efficiency of a monolithic $ph$MG multigrid framework designed for high-order discretizations of stationary Stokes systems using Taylor-Hood and Scott-Vogelius elements. The proposed approach integrates coarsening in both approximation order ($p$) and mesh resolution ($h$), to address the computational and memory efficiency challenges that are often encountere… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  3. arXiv:2306.06795  [pdf, other

    math.NA

    Monolithic Algebraic Multigrid Preconditioners for the Stokes Equations

    Authors: Alexey Voronin, Scott MacLachlan, Luke N. Olson, Raymond Tuminaro

    Abstract: We investigate a novel monolithic algebraic multigrid (AMG) preconditioner for the Taylor-Hood ($\pmb{\mathbb{P}}_2/\mathbb{P}_1$) and Scott-Vogelius ($\pmb{\mathbb{P}}_2/\mathbb{P}_1^{disc}$) discretizations of the Stokes equations. The algorithm is based on the use of the lower-order $\pmb{\mathbb{P}}_1\text{iso}\kern1pt\pmb{\mathbb{P}}_2/\mathbb{P}_1$ operator within a defect-correction setting… ▽ More

    Submitted 31 August, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

  4. arXiv:2212.05159  [pdf, other

    cs.LG cs.MS math.NA

    Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation

    Authors: Nicolas Nytko, Ali Taghibakhshi, Tareq Uz Zaman, Scott MacLachlan, Luke N. Olson, Matt West

    Abstract: Sparse matrix representations are ubiquitous in computational science and machine learning, leading to significant reductions in compute time, in comparison to dense representation, for problems that have local connectivity. The adoption of sparse representation in leading ML frameworks such as PyTorch is incomplete, however, with support for both automatic differentiation and GPU acceleration mis… ▽ More

    Submitted 9 November, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

  5. arXiv:2209.06141  [pdf, other

    cs.DC

    Characterizing the Performance of Node-Aware Strategies for Irregular Point-to-Point Communication on Heterogeneous Architectures

    Authors: Shelby Lockhart, Amanda Bienz, William D. Gropp, Luke N. Olson

    Abstract: Supercomputer architectures are trending toward higher computational throughput due to the inclusion of heterogeneous compute nodes. These multi-GPU nodes increase on-node computational efficiency, while also increasing the amount of data to be communicated and the number of potential data flow paths. In this work, we characterize the performance of irregular point-to-point communication with MPI… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 14 pages, 13 figures

  6. arXiv:2205.11580  [pdf, other

    math.NA

    On Computing Coercivity Constants in Linear Variational Problems Through Eigenvalue Analysis

    Authors: Peter Sentz, Jehanzeb Hameed Chaudhry, Luke N. Olson

    Abstract: In this work, we investigate the convergence of numerical approximations to coercivity constants of variational problems. These constants are essential components of rigorous error bounds for reduced-order modeling; extension of these bounds to the error with respect to exact solutions requires an understanding of convergence rates for discrete coercivity constants. The results are obtained by cha… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 29 pages, 3 figures

  7. arXiv:2110.10775  [pdf, other

    math.NA

    Reduced Basis Approximations of Parameterized Dynamical Partial Differential Equations via Neural Networks

    Authors: Peter Sentz, Kristian Beckwith, Eric C. Cyr, Luke N. Olson, Ravi Patel

    Abstract: Projection-based reduced order models are effective at approximating parameter-dependent differential equations that are parametrically separable. When parametric separability is not satisfied, which occurs in both linear and nonlinear problems, projection-based methods fail to adequately reduce the computational complexity. Devising alternative reduced order models is crucial for obtaining effici… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: 21 pages, 10 figures

  8. Performance of Low Synchronization Orthogonalization Methods in Anderson Accelerated Fixed Point Solvers

    Authors: Shelby Lockhart, David J. Gardner, Carol S. Woodward, Stephen Thomas, Luke N. Olson

    Abstract: Anderson Acceleration (AA) is a method to accelerate the convergence of fixed point iterations for nonlinear, algebraic systems of equations. Due to the requirement of solving a least squares problem at each iteration and a reliance on modified Gram-Schmidt for updating the iteration space, AA requires extra costly synchronization steps for global reductions. Moreover, the number of reductions in… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 11 pages, 6 figures

    Journal ref: Proceedings of the 2022 SIAM Conference on Parallel Processing for Scientific Computing

  9. arXiv:2105.13280  [pdf, other

    math.NA

    Coarse-Grid Selection Using Simulated Annealing

    Authors: Tareq. U. Zaman, Scott P. MacLachlan, Luke N. Olson, Matt West

    Abstract: Multilevel techniques are efficient approaches for solving the large linear systems that arise from discretized partial differential equations and other problems. While geometric multigrid requires detailed knowledge about the underlying problem and its discretization, algebraic multigrid aims to be less intrusive, requiring less knowledge about the origin of the linear system. A key step in algeb… ▽ More

    Submitted 19 January, 2023; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: 22 pages, 12 figures

  10. Low-order preconditioning of the Stokes equations

    Authors: Alexey Voronin, Yunhui He, Scott MacLachlan, Luke N. Olson, Ray Tuminaro

    Abstract: A well-known strategy for building effective preconditioners for higher-order discretizations of some PDEs, such as Poisson's equation, is to leverage effective preconditioners for their low-order analogs. In this work, we show that high-quality preconditioners can also be derived for the Taylor-Hood discretization of the Stokes equations in much the same manner. In particular, we investigate the… ▽ More

    Submitted 21 April, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: In the process of being to submitted to NLA@Wiley

    Journal ref: Numer Linear Algebra Appl. 2022;29:e2426

  11. arXiv:2010.10378  [pdf, other

    cs.DC

    Modeling Data Movement Performance on Heterogeneous Architectures

    Authors: Amanda Bienz, Luke N. Olson, William D. Gropp, Shelby Lockhart

    Abstract: The cost of data movement on parallel systems varies greatly with machine architecture, job partition, and nearby jobs. Performance models that accurately capture the cost of data movement provide a tool for analysis, allowing for communication bottlenecks to be pinpointed. Modern heterogeneous architectures yield increased variance in data movement as there are a number of viable paths for inter-… ▽ More

    Submitted 16 July, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 7 pages, 6 Figures, Preprint

  12. arXiv:2003.04555  [pdf, other

    math.NA

    A Least-Squares Finite Element Reduced Basis Method

    Authors: Jehanzeb Hameed Chaudhry, Luke N. Olson, Peter Sentz

    Abstract: We present a reduced basis (RB) method for parametrized linear elliptic partial differential equations (PDEs) in a least-squares finite element framework. A rigorous and reliable error estimate is developed, and is shown to bound the error with respect to the exact solution of the PDE, in contrast to estimates that measure error with respect to a finite-dimensional (high-fidelity) approximation. I… ▽ More

    Submitted 23 September, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: 25 pages, 10 figures

    ACM Class: G.1.8

  13. arXiv:1910.09650  [pdf, other

    cs.DC

    Node-Aware Improvements to Allreduce

    Authors: Amanda Bienz, Luke N. Olson, William D. Gropp

    Abstract: The \texttt{MPI\_Allreduce} collective operation is a core kernel of many parallel codebases, particularly for reductions over a single value per process. The commonly used allreduce recursive-doubling algorithm obtains the lower bound message count, yielding optimality for small reduction sizes based on node-agnostic performance models. However, this algorithm yields duplicate messages between se… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: 10 pages, 11 figures, ExaMPI Workshop at SC19

  14. arXiv:1806.02030  [pdf, other

    cs.DC

    Improving Performance Models for Irregular Point-to-Point Communication

    Authors: Amanda Bienz, William D. Gropp, Luke N. Olson

    Abstract: Parallel applications are often unable to take full advantage of emerging parallel architectures due to scaling limitations, which arise due to inter-process communication. Performance models are used to analyze the sources of communication costs. However, traditional models for point-to-point communication fail to capture the full cost of many irregular operations, such as sparse matrix methods.… ▽ More

    Submitted 6 June, 2018; originally announced June 2018.

    Comments: 8 pages, 11 figures

  15. High-order Finite Element--Integral Equation Coupling on Embedded Meshes

    Authors: Natalie N. Beams, Andreas Klöckner, Luke N. Olson

    Abstract: This paper presents a high-order method for solving an interface problem for the Poisson equation on embedded meshes through a coupled finite element and integral equation approach. The method is capable of handling homogeneous or inhomogeneous jump conditions without modification and retains high-order convergence close to the embedded interface. We present finite element-integral equation (FE-IE… ▽ More

    Submitted 16 August, 2018; v1 submitted 8 April, 2018; originally announced April 2018.

    MSC Class: 65N30; 65N38; 65N85

  16. arXiv:1803.02481  [pdf, other

    cs.MS cs.PF math.NA physics.comp-ph

    Scaling Structured Multigrid to 500K+ Cores through Coarse-Grid Redistribution

    Authors: Andrew Reisner, Luke N. Olson, J. David Moulton

    Abstract: The efficient solution of sparse, linear systems resulting from the discretization of partial differential equations is crucial to the performance of many physics-based simulations. The algorithmic optimality of multilevel approaches for common discretizations makes them a good candidate for an efficient parallel solver. Yet, modern architectures for high-performance computing systems continue to… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

    Comments: 21 pages

    Report number: Los Alamos Report LA-UR-17-22886

  17. arXiv:1612.08060  [pdf, other

    cs.DC cs.MS

    Node Aware Sparse Matrix-Vector Multiplication

    Authors: Amanda Bienz, William D. Gropp, Luke N. Olson

    Abstract: The sparse matrix-vector multiply (SpMV) operation is a key computational kernel in many simulations and linear solvers. The large communication requirements associated with a reference implementation of a parallel SpMV result in poor parallel scalability. The cost of communication depends on the physical locations of the send and receive processes: messages injected into the network are more cost… ▽ More

    Submitted 15 November, 2017; v1 submitted 23 December, 2016; originally announced December 2016.

    Comments: 27 pages, 16 figures

  18. A Root-Node Based Algebraic Multigrid Method

    Authors: Thomas A. Manteuffel, Luke N. Olson, Jacob B. Schroder, Ben S. Southworth

    Abstract: This paper provides a unified and detailed presentation of root-node style algebraic multigrid (AMG). Algebraic multigrid is a popular and effective iterative method for solving large, sparse linear systems that arise from discretizing partial differential equations. However, while AMG is designed for symmetric positive definite matrices (SPD), certain SPD problems, such as anisotropic diffusion,… ▽ More

    Submitted 28 January, 2018; v1 submitted 10 October, 2016; originally announced October 2016.

    Comments: 35 pages, 10 figures

    MSC Class: 65F10; 65M22; 65M55

  19. arXiv:1512.04629  [pdf, other

    cs.DC math.NA

    Reducing Parallel Communication in Algebraic Multigrid through Sparsification

    Authors: Amanda Bienz, Robert D. Falgout William Gropp, Luke N. Olson, Jacob B. Schroder

    Abstract: Algebraic multigrid (AMG) is an $\mathcal{O}(n)$ solution process for many large sparse linear systems. A hierarchy of progressively coarser grids is constructed that utilize complementary relaxation and interpolation operators. High-energy error is reduced by relaxation, while low-energy error is mapped to coarse-grids and reduced there. However, large parallel communication costs often limit par… ▽ More

    Submitted 14 December, 2015; originally announced December 2015.

    Comments: 27 pages, 19 figures, submitted to SISC, multigrid, algebraic multigrid, non-Galerkin multigrid, high performance computing

    MSC Class: 65F50

  20. arXiv:1503.08509  [pdf, ps, other

    math.NA

    A Finite Element Based P3M Method for N-body Problems

    Authors: Natalie N. Beams, Luke N. Olson, Jonathan B. Freund

    Abstract: We introduce a fast mesh-based method for computing N-body interactions that is both scalable and accurate. The method is founded on a particle-particle--particle-mesh P3M approach, which decomposes a potential into rapidly decaying short-range interactions and smooth, mesh-resolvable long-range interactions. However, in contrast to the traditional approach of using Gaussian screen functions to ac… ▽ More

    Submitted 29 March, 2015; originally announced March 2015.

    Comments: 20 pages, submitted to SISC

    MSC Class: 7008; 70F10; 65N30; 65N99 ACM Class: G.1.0; G.1.8