Search | arXiv e-print repository

Showing 1–50 of 63 results for author: Salim, A

  1. arXiv:2406.11929  [pdf, other]

    cs.LG math.PR

    Long-time asymptotics of noisy SVGD outside the population limit

    Authors: Victor Priser, Pascal Bianchi, Adil Salim

    Abstract: Stein Variational Gradient Descent (SVGD) is a widely used sampling algorithm that has been successfully applied in several areas of Machine Learning. SVGD operates by iteratively moving a set of interacting particles (which represent the samples) to approximate the target distribution. Despite recent studies on the complexity of SVGD and its variants, their long-time asymptotic behavior (i.e., a…

    Submitted 21 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2404.14219  [pdf, other]

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra, et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset…

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  3. arXiv:2311.12825  [pdf, ps, other]

    cs.AI cs.LG stat.ME

    A PSO Based Method to Generate Actionable Counterfactuals for High Dimensional Data

    Authors: Shashank Shekhar, Asif Salim, Adesh Bansode, Vivaswan Jinturkar, Anirudha Nayak

    Abstract: Counterfactual explanations (CFE) are methods that explain a machine learning model by giving an alternate class prediction of a data point with some minimal changes in its features. It helps the users to identify their data attributes that caused an undesirable prediction like a loan or credit card rejection. We describe an efficient and an actionable counterfactual (CF) generation method based o…

    Submitted 30 November, 2023; v1 submitted 30 September, 2023; originally announced November 2023.

    Comments: Accepted in IEEE CSDE 2023

  4. A comment on singular and non-singular black holes using the Gaussian distribution

    Authors: D. Batic, M. Nowakowski, S. A. Salim

    Abstract: In this work, we join the controversial discussion on singular and non-singular black holes using the Gaussian distribution. Our result which uses correct boundary conditions shifts the debate in favour of regular black holes at the centre. The present findings add new insights into the ongoing discussions surrounding singularities in black hole solutions of the Einstein equations.

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: 6 pages

  5. arXiv:2306.16308  [pdf, other]

    math.PR cs.LG math.ST stat.ML

    Gaussian random field approximation via Stein's method with applications to wide random neural networks

    Authors: Krishnakumar Balasubramanian, Larry Goldstein, Nathan Ross, Adil Salim

    Abstract: We derive upper bounds on the Wasserstein distance ($W_1$), with respect to $\sup$-norm, between any continuous $\mathbb{R}^d$ valued random field indexed by the $n$-sphere and the Gaussian, based on Stein's method. We develop a novel Gaussian smoothing technique that allows us to transfer a bound in a smoother metric to the $W_1$ distance. The smoothing is based on covariance functions constructe…

    Submitted 30 April, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: To appear in Applied and Computational Harmonic Analysis

  6. arXiv:2306.11644  [pdf, other]

    cs.CL cs.AI cs.LG

    Textbooks Are All You Need

    Authors: Suriya Gunasekar, Yi Zhang, Jyoti Aneja, Caio César Teodoro Mendes, Allie Del Giorno, Sivakanth Gopi, Mojan Javaheripi, Piero Kauffmann, Gustavo de Rosa, Olli Saarikivi, Adil Salim, Shital Shah, Harkirat Singh Behl, Xin Wang, Sébastien Bubeck, Ronen Eldan, Adam Tauman Kalai, Yin Tat Lee, Yuanzhi Li

    Abstract: We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of ``textbook quality'' data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens). Despite this small scale, phi-1 attains pass@1 accu…

    Submitted 2 October, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 26 pages; changed color scheme of plot. fixed minor typos and added couple clarifications

  7. arXiv:2305.11798  [pdf, ps, other]

    cs.LG math.ST stat.ML

    The probability flow ODE is provably fast

    Authors: Sitan Chen, Sinho Chewi, Holden Lee, Yuanzhi Li, Jianfeng Lu, Adil Salim

    Abstract: We provide the first polynomial-time convergence guarantees for the probability flow ODE implementation (together with a corrector step) of score-based generative modeling. Our analysis is carried out in the wake of recent results obtaining such guarantees for the SDE-based implementation (i.e., denoising diffusion probabilistic modeling or DDPM), but requires the development of novel techniques f…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 23 pages, 2 figures

  8. arXiv:2304.05398  [pdf, other]

    math.ST cs.LG math.OC

    Forward-backward Gaussian variational inference via JKO in the Bures-Wasserstein Space

    Authors: Michael Diao, Krishnakumar Balasubramanian, Sinho Chewi, Adil Salim

    Abstract: Variational inference (VI) seeks to approximate a target distribution $\pi$ by an element of a tractable family of distributions. Of key interest in statistics and machine learning is Gaussian VI, which approximates $\pi$ by minimizing the Kullback-Leibler (KL) divergence to $\pi$ over the space of Gaussians. In this work, we develop the (Stochastic) Forward-Backward Gaussian Variational Inference (FB-G…

    Submitted 10 April, 2023; originally announced April 2023.

  9. arXiv:2302.09487  [pdf]

    cs.HC cs.AI cs.LG

    Understanding how the use of AI decision support tools affect critical thinking and over-reliance on technology by drug dispensers in Tanzania

    Authors: Ally Salim Jr, Megan Allen, Kelvin Mariki, Kevin James Masoy, Jafary Liana

    Abstract: The use of AI in healthcare is designed to improve care delivery and augment the decisions of providers to enhance patient outcomes. When deployed in clinical settings, the interaction between providers and AI is a critical component for measuring and understanding the effectiveness of these digital tools on broader health outcomes. Even in cases where AI algorithms have high diagnostic accuracy,…

    Submitted 22 February, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

  10. arXiv:2209.11215  [pdf, ps, other]

    cs.LG math.ST

    Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions

    Authors: Sitan Chen, Sinho Chewi, Jerry Li, Yuanzhi Li, Adil Salim, Anru R. Zhang

    Abstract: We provide theoretical convergence guarantees for score-based generative models (SGMs) such as denoising diffusion probabilistic models (DDPMs), which constitute the backbone of large-scale real-world generative models such as DALL$\cdot$E 2. Our main result is that, assuming accurate score estimates, such SGMs can efficiently sample from essentially any realistic data distribution. In contrast to…

    Submitted 15 April, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 29 pages

  11. arXiv:2209.07513  [pdf, other]

    math.OC

    On the complexity of finding stationary points of smooth functions in one dimension

    Authors: Sinho Chewi, Sébastien Bubeck, Adil Salim

    Abstract: We characterize the query complexity of finding stationary points of one-dimensional non-convex but smooth functions. We consider four settings, based on whether the algorithms under consideration are deterministic or randomized, and whether the oracle outputs $1^{\rm st}$-order or both $0^{\rm th}$- and $1^{\rm st}$-order information. Our results show that algorithms for this task provably benefi…

    Submitted 18 March, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: 17 pages, 3 figures

  12. arXiv:2206.00920  [pdf, ps, other]

    cs.LG

    Federated Learning with a Sampling Algorithm under Isoperimetry

    Authors: Lukang Sun, Adil Salim, Peter Richtárik

    Abstract: Federated learning uses a set of techniques to efficiently distribute the training of a machine learning algorithm across several devices, who own the training data. These techniques critically rely on reducing the communication cost -- the main bottleneck -- between the devices and a central server. Federated learning algorithms usually take an optimization approach: they are algorithms for minim…

    Submitted 7 June, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  13. arXiv:2203.12859  [pdf, other]

    stat.ME

    Making SMART decisions in prophylaxis and treatment studies

    Authors: Robert K. Mahar, Katherine J. Lee, Bibhas Chakraborty, Agus Salim, Julie A. Simpson

    Abstract: The optimal prophylaxis, and treatment if the prophylaxis fails, for a disease may be best evaluated using a sequential multiple assignment randomised trial (SMART). A SMART is a multi-stage study that randomises a participant to an initial treatment, observes some response to that treatment and then, depending on their observed response, randomises the same participant to an alternative treatment…

    Submitted 24 March, 2022; originally announced March 2022.

  14. arXiv:2202.06386  [pdf, ps, other]

    math.ST stat.ML

    Improved analysis for a proximal algorithm for sampling

    Authors: Yongxin Chen, Sinho Chewi, Adil Salim, Andre Wibisono

    Abstract: We study the proximal sampler of Lee, Shen, and Tian (2021) and obtain new convergence guarantees under weaker assumptions than strong log-concavity: namely, our results hold for (1) weakly log-concave targets, and (2) targets satisfying isoperimetric assumptions which allow for non-log-concavity. We demonstrate our results by obtaining new state-of-the-art sampling guarantees for several classes…

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: 34 pages

  15. arXiv:2202.05214  [pdf, other]

    math.ST stat.ML

    Towards a Theory of Non-Log-Concave Sampling: First-Order Stationarity Guarantees for Langevin Monte Carlo

    Authors: Krishnakumar Balasubramanian, Sinho Chewi, Murat A. Erdogdu, Adil Salim, Matthew Zhang

    Abstract: For the task of sampling from a density $\pi \propto \exp(-V)$ on $\mathbb{R}^d$, where $V$ is possibly non-convex but $L$-gradient Lipschitz, we prove that averaged Langevin Monte Carlo outputs a sample with $\varepsilon$-relative Fisher information after $O( L^2 d^2/\varepsilon^2)$ iterations. This is the sampling analogue of complexity bounds for finding an $\varepsilon$-approximate first-order st…

    Submitted 10 February, 2022; originally announced February 2022.

  16. arXiv:2201.08901  [pdf]

    cs.CV

    An Ensemble Model for Face Liveness Detection

    Authors: Shashank Shekhar, Avinash Patel, Mrinal Haloi, Asif Salim

    Abstract: In this paper, we present a passive method to detect face presentation attack a.k.a face liveness detection using an ensemble deep learning technique. Face liveness detection is one of the key steps involved in user identity verification of customers during the online onboarding/transaction processes. During identity verification, an unauthenticated user tries to bypass the verification system by…

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Accepted and presented at MLDM 2022. To be published in Lattice journal

  17. arXiv:2201.06433  [pdf, other]

    cs.LG

    A Comparative study of Hyper-Parameter Optimization Tools

    Authors: Shashank Shekhar, Adesh Bansode, Asif Salim

    Abstract: Most of the machine learning models have associated hyper-parameters along with their parameters. While the algorithm gives the solution for parameters, its utility for model performance is highly dependent on the choice of hyperparameters. For a robust performance of a model, it is necessary to find out the right hyper-parameter combination. Hyper-parameter optimization (HPO) is a systematic proc…

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: Selected and presented at IEEE CSDE 2021. To be published in Proceedings of IEEE CSDE 2021

  18. arXiv:2106.03076  [pdf, ps, other]

    cs.LG math.OC

    A Convergence Theory for SVGD in the Population Limit under Talagrand's Inequality T1

    Authors: Adil Salim, Lukang Sun, Peter Richtárik

    Abstract: Stein Variational Gradient Descent (SVGD) is an algorithm for sampling from a target density which is known up to a multiplicative constant. Although SVGD is a popular algorithm in practice, its theoretical study is limited to a few recent works. We study the convergence of SVGD in the population limit, (i.e., with an infinite number of particles) to sample from a non-logconcave target distributio…

    Submitted 16 June, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

  19. arXiv:2104.14123  [pdf, other]

    cs.LG

    An efficient scheme based on graph centrality to select nodes for training for effective learning

    Authors: CR Sandeep, Asif Salim, R Sethunadh, S Sumitra

    Abstract: The process of selecting points for training a machine learning model is often a challenging task. Many times, we will have a lot of data, but for training, we require the labels and labeling is often costly. So we need to select the points for training in an efficient manner so that the model trained on the points selected will be better than the ones trained on any other training set. We propose…

    Submitted 19 May, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

  20. arXiv:2102.11079  [pdf, ps, other]

    math.OC

    An Optimal Algorithm for Strongly Convex Minimization under Affine Constraints

    Authors: Adil Salim, Laurent Condat, Dmitry Kovalev, Peter Richtárik

    Abstract: Optimization problems under affine constraints appear in various areas of machine learning. We consider the task of minimizing a smooth strongly convex function F(x) under the affine constraint Kx=b, with an oracle providing evaluations of the gradient of F and multiplications by K and its transpose. We provide lower bounds on the number of gradient computations and matrix multiplications to achie…

    Submitted 10 April, 2022; v1 submitted 22 February, 2021; originally announced February 2021.

  21. arXiv:2012.02896  [pdf, other]

    eess.SY

    Experimental Implementation of an Adaptive Digital Autopilot

    Authors: Ankit Goel, Juan Augusto Paredes, Harshil Dadhaniya, Syed Aseem Ul Islam, Abdulazeez Mohammed Salim, Sai Ravela, Dennis Bernstein

    Abstract: This paper develops an adaptive digital autopilot for quadcopters and presents experimental results. The adaptive digital autopilot is constructed by augmenting the PX4 autopilot control system architecture with adaptive digital control laws based on retrospective cost adaptive control (RCAC). In order to investigate the performance of the adaptive digital autopilot, the default gains of the fixed…

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: Submitted to ACC 2021

  22. Neighborhood Preserving Kernels for Attributed Graphs

    Authors: Asif Salim, Shiju. S. S, Sumitra. S

    Abstract: We describe the design of a reproducing kernel suitable for attributed graphs, in which the similarity between the two graphs is defined based on the neighborhood information of the graph nodes with the aid of a product graph formulation. We represent the proposed kernel as the weighted sum of two other kernels of which one is an R-convolution kernel that processes the attribute information of the…

    Submitted 13 October, 2020; originally announced October 2020.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022

  23. arXiv:2009.13801  [pdf, other]

    cs.LG stat.ML

    Framework for Designing Filters of Spectral Graph Convolutional Neural Networks in the Context of Regularization Theory

    Authors: Asif Salim, Sumitra S

    Abstract: Graph convolutional neural networks (GCNNs) have been widely used in graph learning. It has been observed that the smoothness functional on graphs can be defined in terms of the graph Laplacian. This fact points out in the direction of using Laplacian in deriving regularization operators on graphs and its consequent use with spectral GCNN filter designs. In this work, we explore the regularization…

    Submitted 29 September, 2020; originally announced September 2020.

  24. arXiv:2006.11773  [pdf, other]

    math.OC stat.ML

    Optimal and Practical Algorithms for Smooth and Strongly Convex Decentralized Optimization

    Authors: Dmitry Kovalev, Adil Salim, Peter Richtárik

    Abstract: We consider the task of decentralized minimization of the sum of smooth strongly convex functions stored across the nodes of a network. For this problem, lower bounds on the number of gradient computations and the number of communication rounds required to achieve $\varepsilon$ accuracy have recently been proven. We propose two new algorithms for this decentralized optimization problem and equip t…

    Submitted 13 November, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

  25. arXiv:2006.09797  [pdf, other]

    stat.ML cs.LG

    A Non-Asymptotic Analysis for Stein Variational Gradient Descent

    Authors: Anna Korba, Adil Salim, Michael Arbel, Giulia Luise, Arthur Gretton

    Abstract: We study the Stein Variational Gradient Descent (SVGD) algorithm, which optimises a set of particles to approximate a target probability distribution $\pi \propto e^{-V}$ on $\mathbb{R}^d$. In the population limit, SVGD performs gradient descent in the space of probability distributions on the KL divergence with respect to $\pi$, where the gradient is smoothed through a kernel integral operator. In thi…

    Submitted 3 January, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted to Neurips 2020

  26. arXiv:2006.09270  [pdf, other]

    stat.ML cs.LG math.OC

    Primal Dual Interpretation of the Proximal Stochastic Gradient Langevin Algorithm

    Authors: Adil Salim, Peter Richtárik

    Abstract: We consider the task of sampling with respect to a log concave probability distribution. The potential of the target distribution is assumed to be composite, \textit{i.e.}, written as the sum of a smooth convex term, and a nonsmooth convex term possibly taking infinite values. The target distribution can be seen as a minimizer of the Kullback-Leibler divergence defined on the Wasserstein space (\t…

    Submitted 22 February, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  27. arXiv:2006.00416  [pdf, other]

    eess.SY cs.RO

    Adaptive Digital PID Control of a Quadcopter with Unknown Dynamics

    Authors: Ankit Goel, Abdulazeez Mohammed Salim, Ahmad Ansari, Sai Ravela, Dennis Bernstein

    Abstract: This paper develops an adaptive autopilot for quadcopters with unknown dynamics. To do this, the PX4 autopilot architecture is modified so that the feedback and feedforward controllers are replaced by adaptive control laws based on retrospective cost adaptive control (RCAC). The present paper provides a numerical investigation of the performance of the adaptive autopilot on a quadcopter with unkno…

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: Submitted to ACC2020

  28. arXiv:2004.12354  [pdf, ps, other]

    math.GR

    Subgroups of a finitary linear group

    Authors: V. A. Bovdi, O. Yu. Dashkova, M. A. Salim

    Abstract: Let FL_s(K) be the finitary linear group of degree s over an associative ring K with unity. We prove that the torsion subgroups of FL_s(K) are locally finite for certain classes of rings K. A description of some f.g. solvable subgroups of FL_s(K) is given.

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: 7 pages

    MSC Class: 20H25

  29. arXiv:2004.02635  [pdf, other]

    math.OC cs.LG stat.ML

    Dualize, Split, Randomize: Toward Fast Nonsmooth Optimization Algorithms

    Authors: Adil Salim, Laurent Condat, Konstantin Mishchenko, Peter Richtárik

    Abstract: We consider minimizing the sum of three convex functions, where the first one F is smooth, the second one is nonsmooth and proximable and the third one is the composition of a nonsmooth proximable function with a linear operator L. This template problem has many applications, for instance, in image processing and machine learning. First, we propose a new primal-dual algorithm, which we call PDDY,…

    Submitted 26 July, 2022; v1 submitted 3 April, 2020; originally announced April 2020.

  30. Operators on positive semidefinite inner product spaces

    Authors: Victor A. Bovdi, Tetiana Klymchuk, Tetiana Rybalkina, Mohamed A. Salim, Vladimir V. Sergeichuk

    Abstract: We give canonical forms of selfadjoint and isometric operators on a complex vector space $U$ with scalar product given by a positive semidefinite Hermitian form, and of Hermitian forms on $U$. For an arbitrary system of semiunitary spaces and linear mappings on/between them, we give an algorithm that reduces their matrices to canonical form.

    Submitted 15 March, 2020; originally announced March 2020.

    Comments: 28 pages

    MSC Class: 15A21; 15A42; 15A63; 47B50

    Journal ref: Linear Algebra Appl. 596 (2020) 82-105

  31. Derivations of group rings

    Authors: Orest D. Artemovych, Victor A. Bovdi, Mohamed A. Salim

    Abstract: Let R[G] be the group ring of a group G over an associative ring R with unity such that all prime divisors of orders of elements of G are invertible in R. If R is finite and G is a Chernikov (torsion FC-) group, then each R-derivation of R[G] is inner. Similar results also are obtained for other classes of groups G and rings R.

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 17 pages

    MSC Class: 20C05; 16S34; 20F45; 20F19; 16W25

  32. arXiv:2002.03035  [pdf, other]

    math.OC stat.ML

    The Wasserstein Proximal Gradient Algorithm

    Authors: Adil Salim, Anna Korba, Giulia Luise

    Abstract: Wasserstein gradient flows are continuous time dynamics that define curves of steepest descent to minimize an objective function over the space of probability measures (i.e., the Wasserstein space). This objective is typically a divergence w.r.t. a fixed target distribution. In recent years, these continuous time dynamics have been used to study the convergence of machine learning algorithms aimin…

    Submitted 21 February, 2021; v1 submitted 7 February, 2020; originally announced February 2020.

  33. arXiv:1912.09925  [pdf, other]

    cs.LG cs.DC math.NA math.OC

    Distributed Fixed Point Methods with Compressed Iterates

    Authors: Sélim Chraibi, Ahmed Khaled, Dmitry Kovalev, Peter Richtárik, Adil Salim, Martin Takáč

    Abstract: We propose basic and natural assumptions under which iterative optimization methods with compressed iterates can be analyzed. This problem is motivated by the practice of federated learning, where a large model stored in the cloud is compressed before it is sent to a mobile device, which then proceeds with training based on local data. We develop standard and variance reduced methods, and establis…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 15 pages, 4 algorithms, 4 Theorems

  34. arXiv:1910.04405  [pdf, ps, other]

    math.OC math.PR

    A Strong Law of Large Numbers for Random Monotone Operators

    Authors: Adil Salim

    Abstract: Random monotone operators are stochastic versions of maximal monotone operators which play an important role in stochastic nonsmooth optimization. Several stochastic nonsmooth optimization algorithms have been shown to converge to a zero of a mean operator defined as the expectation, in the sense of the Aumann integral, of a random monotone operator. In this note, we prove a strong law of large…

    Submitted 20 October, 2023; v1 submitted 10 October, 2019; originally announced October 2019.

  35. The variety of dual mock-Lie algebras

    Authors: Luisa M. Camacho, Ivan Kaygorodov, Victor Lopatkin, Mohamed A. Salim

    Abstract: We classify all complex $7$- and $8$-dimensional dual mock-Lie algebras in both an algebraic and a geometric way. We also find all non-trivial complex $9$-dimensional dual mock-Lie algebras.

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1905.05361 and text overlap with arXiv:1907.00685

  36. arXiv:1909.08704  [pdf, other]

    cs.DC

    Balsam: Automated Scheduling and Execution of Dynamic, Data-Intensive HPC Workflows

    Authors: Michael A. Salim, Thomas D. Uram, J. Taylor Childers, Prasanna Balaprakash, Venkatram Vishwanath, Michael E. Papka

    Abstract: We introduce the Balsam service to manage high-throughput task scheduling and execution on supercomputing systems. Balsam allows users to populate a task database with a variety of tasks ranging from simple independent tasks to dynamic multi-task workflows. With abstractions for the local resource scheduler and MPI environment, Balsam dynamically packages tasks into ensemble jobs and manages their…

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: SC '18: 8th Workshop on Python for High-Performance and Scientific Computing (PyHPC 2018)

  37. arXiv:1906.04370  [pdf, other]

    stat.ML cs.LG

    Maximum Mean Discrepancy Gradient Flow

    Authors: Michael Arbel, Anna Korba, Adil Salim, Arthur Gretton

    Abstract: We construct a Wasserstein gradient flow of the maximum mean discrepancy (MMD) and study its convergence properties. The MMD is an integral probability metric defined for a reproducing kernel Hilbert space (RKHS), and serves as a metric on probability measures for a sufficiently rich RKHS. We obtain conditions for convergence of the gradient flow towards a global optimum, that can be related to…

    Submitted 3 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  38. arXiv:1905.11768  [pdf, other]

    stat.ML cs.LG math.OC math.ST

    Stochastic Proximal Langevin Algorithm: Potential Splitting and Nonasymptotic Rates

    Authors: Adil Salim, Dmitry Kovalev, Peter Richtárik

    Abstract: We propose a new algorithm---Stochastic Proximal Langevin Algorithm (SPLA)---for sampling from a log concave distribution. Our method is a generalization of the Langevin algorithm to potentials expressed as the sum of one stochastic smooth term and multiple stochastic nonsmooth terms. In each iteration, our splitting technique only requires access to a stochastic gradient of the smooth term and a…

    Submitted 16 June, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

    Journal ref: Neurips 2019 (Spotlight)

  39. arXiv:1901.08170  [pdf, ps, other]

    math.OC stat.ML

    A Fully Stochastic Primal-Dual Algorithm

    Authors: Pascal Bianchi, Walid Hachem, Adil Salim

    Abstract: A new stochastic primal--dual algorithm for solving a composite optimization problem is proposed. It is assumed that all the functions/operators that enter the optimization problem are given as statistical expectations. These expectations are unknown but revealed across time through i.i.d. realizations. The proposed algorithm is proven to converge to a saddle point of the Lagrangian function. In t…

    Submitted 22 June, 2020; v1 submitted 23 January, 2019; originally announced January 2019.

  40. arXiv:1808.06444  [pdf]

    cs.LG stat.ML

    Synthetic Patient Generation: A Deep Learning Approach Using Variational Autoencoders

    Authors: Ally Salim Jr

    Abstract: Artificial Intelligence in healthcare is a new and exciting frontier and the possibilities are endless. With deep learning approaches beating human performances in many areas, the logical next step is to attempt their application in the health space. For these and other Machine Learning approaches to produce good results and have their potential realized, the need for, and importance of, large amo…

    Submitted 20 August, 2018; originally announced August 2018.

    MSC Class: 68T00

  41. arXiv:1804.00934  [pdf, other]

    math.OC stat.ML

    A Constant Step Stochastic Douglas-Rachford Algorithm with Application to Non Separable Regularizations

    Authors: Adil Salim, Pascal Bianchi, Walid Hachem

    Abstract: The Douglas Rachford algorithm is an algorithm that converges to a minimizer of a sum of two convex functions. The algorithm consists in fixed point iterations involving computations of the proximity operators of the two functions separately. The paper investigates a stochastic version of the algorithm where both functions are random and the step size is constant. We establish that the iterates of…

    Submitted 3 April, 2018; originally announced April 2018.

  42. arXiv:1712.09186  [pdf, ps, other]

    math.RA math.GN

    Completely simple endomorphism rings of modules

    Authors: V. A. Bovdi, M. A. Salim, Mihail Ursul

    Abstract: It is proved that if A_p is a countable elementary abelian p-group, then: (i) The ring End(A_p) does not admit a nondiscrete locally compact ring topology. (ii) Under (CH) the simple ring End(A_p)/I, where I is the ideal of End(A_p) consisting of all endomorphisms with finite images, does not admit a nondiscrete locally compact ring topology. (iii) The finite topology on End(A_p) is the only secon…

    Submitted 26 December, 2017; originally announced December 2017.

    Comments: 16 pages

    MSC Class: 16W80; 16N20; 16S50; 16N40

  43. Reduction of a pair of skew-symmetric matrices to its canonical form under congruence

    Authors: V. A. Bovdi, T. G. Gerasimova, M. A. Salim, V. V. Sergeichuk

    Abstract: Let $(A,B)$ be a pair of skew-symmetric matrices over a field of characteristic not 2. Its regularization decomposition is a direct sum \[ (\underline{\underline A},\underline{\underline B})\oplus (A_1,B_1)\oplus\dots\oplus(A_t,B_t) \] that is congruent to $(A,B)$, in which $(\underline{\underline A},\underline{\underline B})$ is a pair of nonsingular matrices and $(A_1,B_1),$ $\dots,$…

    Submitted 23 December, 2017; originally announced December 2017.

    Comments: 16 pages

    MSC Class: 15A21; 15A22; 15A63; 51A50

    Journal ref: Linear Algebra Appl. 543 (2018) 17-30

  44. arXiv:1712.07027  [pdf, other]

    math.OC cs.LG stat.ML

    Snake: a Stochastic Proximal Gradient Algorithm for Regularized Problems over Large Graphs

    Authors: Adil Salim, Pascal Bianchi, Walid Hachem

    Abstract: A regularized optimization problem over a large unstructured graph is studied, where the regularization term is tied to the graph geometry. Typical regularization examples include the total variation and the Laplacian regularizations over the graph. When applying the proximal gradient algorithm to solve this problem, there exist quite affordable methods to implement the proximity operator (backwar…

    Submitted 19 December, 2017; originally announced December 2017.

  45. Symplectic spaces and pairs of symmetric and nonsingular skew-symmetric matrices under congruence

    Authors: Victor A. Bovdi, Roger A. Horn, Mohamed A. Salim, Vladimir V. Sergeichuk

    Abstract: Let $\mathbb F$ be a field of characteristic not $2$, and let $(A,B)$ be a pair of $n\times n$ matrices over $\mathbb F$, in which $A$ is symmetric and $B$ is skew-symmetric. A canonical form of $(A,B)$ with respect to congruence transformations $(S^TAS,S^TBS)$ was given by Sergeichuk (1988) up to classification of symmetric and Hermitian forms over finite extensions of $\mathbb F$. We obtain a si…

    Submitted 29 September, 2017; originally announced September 2017.

    Comments: 19 pages

    MSC Class: 15A21; 15A22; 15A63; 51A50

    Journal ref: Linear Algebra and Its Applications 537 (2018) 84-99

  46. arXiv:1702.04144  [pdf, ps, other]

    math.OC

    A constant step Forward-Backward algorithm involving random maximal monotone operators

    Authors: Pascal Bianchi, Walid Hachem, Adil Salim

    Abstract: A stochastic Forward-Backward algorithm with a constant step is studied. At each time step, this algorithm involves an independent copy of a couple of random maximal monotone operators. Defining a mean operator as a selection integral, the differential inclusion built from the sum of the two mean operators is considered. As a first result, it is shown that the interpolated process obtained from th…

    Submitted 4 April, 2018; v1 submitted 14 February, 2017; originally announced February 2017.

  47. arXiv:1612.03831  [pdf, ps, other]

    math.PR

    Constant Step Stochastic Approximations Involving Differential Inclusions: Stability, Long-Run Convergence and Applications

    Authors: Pascal Bianchi, Walid Hachem, Adil Salim

    Abstract: We consider a Markov chain $(x_n)$ whose kernel is indexed by a scaling parameter $γ>0$, referred to as the step size. The aim is to analyze the behavior of the Markov chain in the doubly asymptotic regime where $n\to\infty$ then $γ\to 0$. First, under mild assumptions on the so-called drift of the Markov chain, we show that the interpolated process converges narrowly to the solutions of a Differen…

    Submitted 14 December, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  48. Neighborhood radius estimation for Arnold's miniversal deformations of complex and $p$-adic matrices

    Authors: Victor A. Bovdi, Mohammed A. Salim, Vladimir V. Sergeichuk

    Abstract: V.I. Arnold (1971) constructed a simple normal form to which all complex matrices $B$ in a neighborhood $U$ of a given square matrix $A$ can be reduced by similarity transformations that smoothly depend on the entries of $B$. We calculate the radius of the neighborhood $U$. A.A. Mailybaev (1999, 2001) constructed a reducing similarity transformation in the form of Taylor series; we construct this…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: 19 pages

    MSC Class: 15A21; 15B33; 37J40

    Journal ref: Linear Algebra Appl. 512 (2017) 97-112

  49. Differential Modulation for Asynchronous Two-Way-Relay Systems over Frequency-Selective Fading Channels

    Authors: Ahmad Salim, Tolga M. Duman

    Abstract: In this paper, we propose two schemes for asynchronous multi-relay two-way relay (MR-TWR) systems in which neither the users nor the relays know the channel state information (CSI). In an MR-TWR system, two users exchange their messages with the help of $N_R$ relays. Most of the existing works on MR-TWR systems based on differential modulation assume perfect symbol-level synchronization between al…

    Submitted 23 October, 2016; originally announced October 2016.

    Journal ref: Wirel. Commun. Mob. Comput., 16: 2422 to 2435 (2016)

  50. arXiv:1606.07589  [pdf, ps, other]

    math.RA math.GR math.RT

    Group algebras whose groups of normalized units have exponent 4

    Authors: V. A. Bovdi, M. A. Salim

    Abstract: We give a full description of locally finite p-groups G such that the normalized group of units V(FG) of the group algebra FG over a field F of characteristic p has exponent 4.

    Submitted 21 July, 2016; v1 submitted 24 June, 2016; originally announced June 2016.

    Comments: 7 pages

    MSC Class: 16S34; 16U60