(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–5 of 5 results for author: Pelenitsyn, A

.
  1. arXiv:2407.00829  [pdf, other

    cs.DC cs.PF

    SABLE: Staging Blocked Evaluation of Sparse Matrix Computations

    Authors: Pratyush Das, Adhitha Dias, Anxhelo Xhebraj, Artem Pelenitsyn, Kirshanthan Sundararajah, Milind Kulkarni

    Abstract: Sparse Matrices found in the real world often have some structure in how the dense elements are organized. While the inspector-executor model inspects matrices for structure, its generality can overlook further specialization. We propose a system that - if the sparse matrix is stored in a blocked storage format - can generate more efficient code by constructing regular loops over these blocks. Our… ▽ More

    Submitted 3 April, 2024; originally announced July 2024.

  2. arXiv:2405.17590  [pdf, other

    cs.PL cs.PF

    Optimizing Layout of Recursive Datatypes with Marmoset

    Authors: Vidush Singhal, Chaitanya Koparkar, Joseph Zullo, Artem Pelenitsyn, Michael Vollmer, Mike Rainey, Ryan Newton, Milind Kulkarni

    Abstract: While programmers know that the low-level memory representation of data structures can have significant effects on performance, compiler support to optimize the layout of those structures is an under-explored field. Prior work has optimized the layout of individual, non-recursive structures without considering how collections of those objects in linked or recursive data structures are laid out. Th… ▽ More

    Submitted 3 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Journal ref: European Conference on Object Oriented Programming 2024

  3. arXiv:2311.09549  [pdf, other

    cs.PL

    SparseAuto: An Auto-Scheduler for Sparse Tensor Computations Using Recursive Loop Nest Restructuring

    Authors: Adhitha Dias, Logan Anderson, Kirshanthan Sundararajah, Artem Pelenitsyn, Milind Kulkarni

    Abstract: Automated code generation and performance optimizations for sparse tensor algebra are cardinal since they have become essential in many real-world applications like quantum computing, physics, chemistry, and machine learning. General sparse tensor algebra compilers are not always versatile enough to generate asymptotically optimal code for sparse tensor contractions. This paper shows how to optimi… ▽ More

    Submitted 5 January, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  4. arXiv:2311.09168  [pdf, other

    cs.GR cs.DC cs.PF

    Arkade: k-Nearest Neighbor Search With Non-Euclidean Distances using GPU Ray Tracing

    Authors: Durga Mandarapu, Vani Nagarajan, Artem Pelenitsyn, Milind Kulkarni

    Abstract: High-performance implementations of $k$-Nearest Neighbor Search ($k$NN) in low dimensions use tree-based data structures. Tree algorithms are hard to parallelize on GPUs due to their irregularity. However, newer Nvidia GPUs offer hardware support for tree operations through ray-tracing cores. Recent works have proposed using RT cores to implement $k$NN search, but they all have a hardware-imposed… ▽ More

    Submitted 21 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  5. Type Stability in Julia: Avoiding Performance Pathologies in JIT Compilation (Extended Version)

    Authors: Artem Pelenitsyn, Julia Belyakova, Benjamin Chung, Ross Tate, Jan Vitek

    Abstract: As a scientific programming language, Julia strives for performance but also provides high-level productivity features. To avoid performance pathologies, Julia users are expected to adhere to a coding discipline that enables so-called type stability. Informally, a function is type stable if the type of the output depends only on the types of the inputs, not their values. This paper provides a form… ▽ More

    Submitted 17 November, 2021; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: OOPSLA '21, extended version