(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–12 of 12 results for author: Saukh, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.13349  [pdf, other

    cs.LG

    REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints

    Authors: Francesco Corti, Balz Maag, Joachim Schauer, Ulrich Pferschy, Olga Saukh

    Abstract: Deep models deployed on edge devices frequently encounter resource variability, which arises from fluctuating energy levels, timing constraints, or prioritization of other critical tasks within the system. State-of-the-art machine learning pipelines generate resource-agnostic models, not capable to adapt at runtime. In this work we introduce Resource-Efficient Deep Subnetworks (REDS) to tackle mod… ▽ More

    Submitted 20 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

  2. arXiv:2311.11029  [pdf, other

    cs.CV cs.AI

    Geometric Data Augmentations to Mitigate Distribution Shifts in Pollen Classification from Microscopic Images

    Authors: Nam Cao, Olga Saukh

    Abstract: Distribution shifts are characterized by differences between the training and test data distributions. They can significantly reduce the accuracy of machine learning models deployed in real-world scenarios. This paper explores the distribution shift problem when classifying pollen grains from microscopic images collected in the wild with a low-cost camera sensor. We leverage the domain knowledge t… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 16 pages, 6 figures, ICPADS 2023

  3. arXiv:2305.13536  [pdf, other

    cs.LG

    Subspace-Configurable Networks

    Authors: Dong Wang, Olga Saukh, Xiaoxi He, Lothar Thiele

    Abstract: While the deployment of deep learning models on edge devices is increasing, these models often lack robustness when faced with dynamic changes in sensed data. This can be attributed to sensor drift, or variations in the data compared to what was used during offline training due to factors such as specific sensor placement or naturally changing sensing conditions. Hence, achieving the desired robus… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted by the Third Conference on Lifelong Learning Agents (CoLLAs), 2024

  4. arXiv:2304.14108  [pdf, other

    cs.CV cs.CL cs.LG

    DataComp: In search of the next generation of multimodal datasets

    Authors: Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander Ratner, Shuran Song , et al. (9 additional authors not shown)

    Abstract: Multimodal datasets are a critical component in recent breakthroughs such as Stable Diffusion and GPT-4, yet their design does not receive the same research attention as model architectures or training algorithms. To address this shortcoming in the ML ecosystem, we introduce DataComp, a testbed for dataset experiments centered around a new candidate pool of 12.8 billion image-text pairs from Commo… ▽ More

    Submitted 20 October, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks Track

  5. arXiv:2302.13602  [pdf, other

    cs.CV cs.LG

    The Role of Pre-training Data in Transfer Learning

    Authors: Rahim Entezari, Mitchell Wortsman, Olga Saukh, M. Moein Shariatnia, Hanie Sedghi, Ludwig Schmidt

    Abstract: The transfer learning paradigm of model pre-training and subsequent fine-tuning produces high-accuracy models. While most studies recommend scaling the pre-training size to benefit most from transfer learning, a question remains: what data and method should be used for pre-training? We investigate the impact of pre-training data distribution on the few-shot and full fine-tuning performance using 3… ▽ More

    Submitted 1 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  6. arXiv:2211.08403  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    REPAIR: REnormalizing Permuted Activations for Interpolation Repair

    Authors: Keller Jordan, Hanie Sedghi, Olga Saukh, Rahim Entezari, Behnam Neyshabur

    Abstract: In this paper we look into the conjecture of Entezari et al. (2021) which states that if the permutation invariance of neural networks is taken into account, then there is likely no loss barrier to the linear interpolation between SGD solutions. First, we observe that neuron alignment methods alone are insufficient to establish low-barrier linear connectivity between SGD solutions due to a phenome… ▽ More

    Submitted 25 September, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  7. arXiv:2207.00200  [pdf, other

    cs.LG cs.CV

    Studying the impact of magnitude pruning on contrastive learning methods

    Authors: Francesco Corti, Rahim Entezari, Sara Hooker, Davide Bacciu, Olga Saukh

    Abstract: We study the impact of different pruning techniques on the representation learned by deep neural networks trained with contrastive loss functions. Our work finds that at high sparsity levels, contrastive learning results in a higher number of misclassified examples relative to models trained with traditional cross-entropy loss. To understand this pronounced difference, we use metrics such as the n… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

  8. arXiv:2206.10915  [pdf, other

    cs.CV

    Understanding the effect of sparsity on neural networks robustness

    Authors: Lukas Timpl, Rahim Entezari, Hanie Sedghi, Behnam Neyshabur, Olga Saukh

    Abstract: This paper examines the impact of static sparsity on the robustness of a trained network to weight perturbations, data corruption, and adversarial examples. We show that, up to a certain sparsity achieved by increasing network width and depth while keeping the network capacity fixed, sparsified networks consistently match and often outperform their initially dense versions. Robustness and accuracy… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  9. arXiv:2206.07422  [pdf, other

    eess.IV cs.CV

    Deep Neural Network Pruning for Nuclei Instance Segmentation in Hematoxylin & Eosin-Stained Histological Images

    Authors: Amirreza Mahbod, Rahim Entezari, Isabella Ellinger, Olga Saukh

    Abstract: Recently, pruning deep neural networks (DNNs) has received a lot of attention for improving accuracy and generalization power, reducing network size, and increasing inference speed on specialized hardwares. Although pruning was mainly tested on computer vision tasks, its application in the context of medical image analysis has hardly been explored. This work investigates the impact of well-known p… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  10. arXiv:2110.06296  [pdf, other

    cs.LG

    The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks

    Authors: Rahim Entezari, Hanie Sedghi, Olga Saukh, Behnam Neyshabur

    Abstract: In this paper, we conjecture that if the permutation invariance of neural networks is taken into account, SGD solutions will likely have no barrier in the linear interpolation between them. Although it is a bold conjecture, we show how extensive empirical attempts fall short of refuting it. We further provide a preliminary theoretical result to support our conjecture. Our conjecture has implicatio… ▽ More

    Submitted 5 July, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  11. arXiv:2011.10144  [pdf, other

    cs.LG cs.CY

    Interpretable and Transferable Models to Understand the Impact of Lockdown Measures on Local Air Quality

    Authors: Johanna Einsiedler, Yun Cheng, Franz Papst, Olga Saukh

    Abstract: The COVID-19 related lockdown measures offer a unique opportunity to understand how changes in economic activity and traffic affect ambient air quality and how much pollution reduction potential can the society offer through digitalization and mobilitylimiting policies. In this work, we estimate pollution reduction over the lockdown period by using the measurements from ground air pollution monito… ▽ More

    Submitted 26 March, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

  12. arXiv:1909.10364  [pdf, other

    cs.LG cs.CV

    Class-dependent Compression of Deep Neural Networks

    Authors: Rahim Entezari, Olga Saukh

    Abstract: Today's deep neural networks require substantial computation resources for their training, storage, and inference, which limits their effective use on resource-constrained devices. Many recent research activities explore different options for compressing and optimizing deep models. On the one hand, in many real-world applications, we face the data imbalance challenge, i.e. when the number of label… ▽ More

    Submitted 19 April, 2020; v1 submitted 23 September, 2019; originally announced September 2019.