(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–16 of 16 results for author: Kachman, T

.
  1. arXiv:2311.10468  [pdf, other

    cs.LG cs.AI cs.CE cs.GT cs.MA

    Using Cooperative Game Theory to Prune Neural Networks

    Authors: Mauricio Diaz-Ortiz Jr, Benjamin Kempinski, Daphne Cornelisse, Yoram Bachrach, Tal Kachman

    Abstract: We show how solution concepts from cooperative game theory can be used to tackle the problem of pruning neural networks. The ever-growing size of deep neural networks (DNNs) increases their performance, but also their computational requirements. We introduce a method called Game Theory Assisted Pruning (GTAP), which reduces the neural network's size while preserving its predictive accuracy. GTAP… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  2. arXiv:2309.09968  [pdf, other

    cs.LG

    Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees

    Authors: Alexia Jolicoeur-Martineau, Kilian Fatras, Tal Kachman

    Abstract: Tabular data is hard to acquire and is subject to missing values. This paper introduces a novel approach for generating and imputing mixed-type (continuous and categorical) tabular data utilizing score-based diffusion and conditional flow matching. In contrast to prior methods that rely on neural networks to learn the score function or the vector field, we adopt XGBoost, a widely used Gradient-Boo… ▽ More

    Submitted 19 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Code: https://github.com/SamsungSAILMontreal/ForestDiffusion

  3. arXiv:2305.16192  [pdf, other

    cs.LG cs.AI physics.chem-ph q-bio.QM

    Explainability Techniques for Chemical Language Models

    Authors: Stefan Hödl, William Robinson, Yoram Bachrach, Wilhelm Huck, Tal Kachman

    Abstract: Explainability techniques are crucial in gaining insights into the reasons behind the predictions of deep learning models, which have not yet been applied to chemical language models. We propose an explainable AI technique that attributes the importance of individual atoms towards the predictions made by these models. Our method backpropagates the relevance information towards the chemical input s… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  4. arXiv:2304.05907  [pdf, ps, other

    cs.LG cs.AI math.NA

    Diffusion models with location-scale noise

    Authors: Alexia Jolicoeur-Martineau, Kilian Fatras, Ke Li, Tal Kachman

    Abstract: Diffusion Models (DMs) are powerful generative models that add Gaussian noise to the data and learn to remove it. We wanted to determine which noise distribution (Gaussian or non-Gaussian) led to better generated data in DMs. Since DMs do not work by design with non-Gaussian noise, we built a framework that allows reversing a diffusion process with non-Gaussian location-scale noise. We use that fr… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  5. arXiv:2304.01335  [pdf, other

    cond-mat.stat-mech cs.LG

    Charting the Topography of the Neural Network Landscape with Thermal-Like Noise

    Authors: Theo Jules, Gal Brener, Tal Kachman, Noam Levi, Yohai Bar-Sinai

    Abstract: The training of neural networks is a complex, high-dimensional, non-convex and noisy optimization problem whose theoretical understanding is interesting both from an applicative perspective and for fundamental reasons. A core challenge is to understand the geometry and topography of the landscape that guides the optimization. In this work, we employ standard Statistical Mechanics methods, namely,… ▽ More

    Submitted 18 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 7 pages, 4 figures

  6. arXiv:2210.01801  [pdf, other

    cs.LG cs.AI

    Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation

    Authors: Yannick Hogewind, Thiago D. Simao, Tal Kachman, Nils Jansen

    Abstract: We address the problem of safe reinforcement learning from pixel observations. Inherent challenges in such settings are (1) a trade-off between reward optimization and adhering to safety constraints, (2) partial observability, and (3) high-dimensional observations. We formalize the problem in a constrained, partially observable Markov decision process framework, where an agent obtains distinct rew… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

  7. arXiv:2208.08798  [pdf, other

    cs.LG cs.AI cs.GT cs.MA econ.TH

    Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members

    Authors: Daphne Cornelisse, Thomas Rood, Mateusz Malinowski, Yoram Bachrach, Tal Kachman

    Abstract: In many multi-agent settings, participants can form teams to achieve collective outcomes that may far surpass their individual capabilities. Measuring the relative contributions of agents and allocating them shares of the reward that promote long-lasting cooperation are difficult tasks. Cooperative game theory offers solution concepts identifying distribution schemes, such as the Shapley value, th… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

  8. arXiv:2112.14570  [pdf, other

    cs.GT cs.LG cs.MA

    Lyapunov Exponents for Diversity in Differentiable Games

    Authors: Jonathan Lorraine, Paul Vicol, Jack Parker-Holder, Tal Kachman, Luke Metz, Jakob Foerster

    Abstract: Ridge Rider (RR) is an algorithm for finding diverse solutions to optimization problems by following eigenvectors of the Hessian ("ridges"). RR is designed for conservative gradient systems (i.e., settings involving a single loss function), where it branches at saddles - easy-to-find bifurcation points. We generalize this idea to non-conservative, multi-agent gradient systems by proposing a method… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: AAMAS2022, 24 pages

  9. arXiv:2111.05803  [pdf, other

    cs.LG stat.ML

    Gradients are Not All You Need

    Authors: Luke Metz, C. Daniel Freeman, Samuel S. Schoenholz, Tal Kachman

    Abstract: Differentiable programming techniques are widely used in the community and are responsible for the machine learning renaissance of the past several decades. While these methods are powerful, they have limits. In this short report, we discuss a common chaos based failure mode which appears in a variety of differentiable circumstances, ranging from recurrent neural networks and numerical physics sim… ▽ More

    Submitted 20 January, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

  10. arXiv:2111.05127  [pdf, other

    math.PR cond-mat.stat-mech

    Anomalous Diffusion: Fractional Brownian Motion vs. Fractional Ito Motion

    Authors: Iddo Eliazar, Tal Kachman

    Abstract: Generalizing Brownian motion (BM), fractional Brownian motion (FBM) is a paradigmatic selfsimilar model for anomalous diffusion. Specifically, varying its Hurst exponent, FBM spans: sub-diffusion, regular diffusion, and super-diffusion. As BM, also FBM is a symmetric and Gaussian process, with a continuous trajectory, and with a stationary velocity. In contrast to BM, FBM is neither a Markov proce… ▽ More

    Submitted 11 November, 2021; v1 submitted 8 November, 2021; originally announced November 2021.

  11. arXiv:2105.14080  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    Gotta Go Fast When Generating Data with Score-Based Models

    Authors: Alexia Jolicoeur-Martineau, Ke Li, Rémi Piché-Taillefer, Tal Kachman, Ioannis Mitliagkas

    Abstract: Score-based (denoising diffusion) generative models have recently gained a lot of success in generating realistic and diverse data. These approaches define a forward diffusion process for transforming data to noise and generate data by reversing it (thereby going from noise to data). Unfortunately, current score-based models generate data very slowly due to the sheer number of score network evalua… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: Code is available on https://github.com/AlexiaJM/score_sde_fast_sampling

  12. arXiv:1904.04917  [pdf, other

    stat.ML cs.LG

    Novel Uncertainty Framework for Deep Learning Ensembles

    Authors: Tal Kachman, Michal Moshkovitz, Michal Rosen-Zvi

    Abstract: Deep neural networks have become the default choice for many of the machine learning tasks such as classification and regression. Dropout, a method commonly used to improve the convergence of deep neural networks, generates an ensemble of thinned networks with extensive weight sharing. Recent studies that dropout can be viewed as an approximate variational inference in Gaussian processes, and used… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  13. arXiv:1805.11837  [pdf, other

    cs.CV cs.LG

    Learning multiple non-mutually-exclusive tasks for improved classification of inherently ordered labels

    Authors: Vadim Ratner, Yoel Shoshan, Tal Kachman

    Abstract: Medical image classification involves thresholding of labels that represent malignancy risk levels. Usually, a task defines a single threshold, and when developing computer-aided diagnosis tools, a single network is trained per such threshold, e.g. as screening out healthy (very low risk) patients to leave possibly sick ones for further analysis (low threshold), or trying to find malignant cases a… ▽ More

    Submitted 21 November, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

  14. arXiv:1501.03132  [pdf, other

    nlin.CD cond-mat.stat-mech

    Numerical implementation of the multiscale and averaging methods for quasi periodic systems

    Authors: Tal Kachman, Shmuel Fishman, Avy Soffer

    Abstract: We consider the problem of numerically solving the Schrödinger equation with a potential that is quasi periodic in space and time. We introduce a numerical scheme based on a newly developed multi-time scale and averaging technique. We demonstrate that with this novel method we can solve efficiently and with rigorous control of the error such an equation for long times. A comparison with the standa… ▽ More

    Submitted 25 July, 2016; v1 submitted 13 January, 2015; originally announced January 2015.

    Comments: 26 pages, 8 figures

    MSC Class: 37D45

  15. arXiv:1405.5808  [pdf, other

    cond-mat.stat-mech

    Dynamics of a Classical Particle in a Quasi Periodic Potential

    Authors: Yaniv Tenenbaum Katan, Tal Kachman, Shmuel Fishman, Avy Soffer

    Abstract: We study the dynamics of a one-dimensional classical particle in a space and time dependent potential with randomly chosen parameters. The focus of this work is a quasi-periodic potential, which only includes a finite number of Fourier components. The momentum is calculated analytically for short time within a self-consistent approximation, under certain conditions. We find that the dynamics can… ▽ More

    Submitted 4 January, 2015; v1 submitted 22 May, 2014; originally announced May 2014.

  16. arXiv:1404.7174  [pdf

    cs.CV

    Computer vision-based recognition of liquid surfaces and phase boundaries in transparent vessels, with emphasis on chemistry applications

    Authors: Sagi Eppel, Tal Kachman

    Abstract: The ability to recognize the liquid surface and the liquid level in transparent containers is perhaps the most commonly used evaluation method when dealing with fluids. Such recognition is essential in determining the liquid volume, fill level, phase boundaries and phase separation in various fluid systems. The recognition of liquid surfaces is particularly important in solution chemistry, where i… ▽ More

    Submitted 6 November, 2014; v1 submitted 28 April, 2014; originally announced April 2014.

    Comments: Source code for phase boundary and liquid surface recognition available at: http://www.mathworks.com/matlabcentral/fileexchange/46893-computer-vision-based-recognition-of-liquid-surface-and-liquid-level-of-liquid-of-transparent-vessel