(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–24 of 24 results for author: Sener, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.08719  [pdf, other

    stat.ML cs.LG stat.ME

    Addressing Misspecification in Simulation-based Inference through Data-driven Calibration

    Authors: Antoine Wehenkel, Juan L. Gamella, Ozan Sener, Jens Behrmann, Guillermo Sapiro, Marco Cuturi, Jörn-Henrik Jacobsen

    Abstract: Driven by steady progress in generative modeling, simulation-based inference (SBI) has enabled inference over stochastic simulators. However, recent work has demonstrated that model misspecification can harm SBI's reliability. This work introduces robust posterior estimation (ROPE), a framework that overcomes model misspecification with a small real-world calibration set of ground truth parameter… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  2. arXiv:2402.08823  [pdf, other

    cs.CV cs.LG

    RanDumb: A Simple Approach that Questions the Efficacy of Continual Representation Learning

    Authors: Ameya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip H. S. Torr, Ozan Sener, Puneet K. Dokania

    Abstract: We propose RanDumb to examine the efficacy of continual representation learning. RanDumb embeds raw pixels using a fixed random transform which approximates an RBF-Kernel, initialized before seeing any data, and learns a simple linear classifier on top. We present a surprising and consistent finding: RanDumb significantly outperforms the continually learned representations using deep networks acro… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Tech Report

  3. arXiv:2308.15856  [pdf, other

    cs.LG stat.ML

    Domain Generalization without Excess Empirical Risk

    Authors: Ozan Sener, Vladlen Koltun

    Abstract: Given data from diverse sets of distinct distributions, domain generalization aims to learn models that generalize to unseen distributions. A common approach is designing a data-driven surrogate penalty to capture generalization and minimize the empirical risk jointly with the penalty. We argue that a significant failure mode of this recipe is an excess risk due to an erroneous penalty or hardness… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Published at NeurIPS 2022

  4. arXiv:2308.09228  [pdf, other

    cs.CV cs.LG stat.ML

    Generalized Sum Pooling for Metric Learning

    Authors: Yeti Z. Gurbuz, Ozan Sener, A. Aydın Alatan

    Abstract: A common architectural choice for deep metric learning is a convolutional neural network followed by global average pooling (GAP). Albeit simple, GAP is a highly effective way to aggregate information. One possible explanation for the effectiveness of GAP is considering each feature vector as representing a different semantic entity and GAP as a convex combination of them. Following this perspecti… ▽ More

    Submitted 21 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted as a conference paper at International Conference on Computer Vision (ICCV) 2023

  5. arXiv:2307.13918  [pdf, other

    stat.ML cs.LG q-bio.QM

    Simulation-based Inference for Cardiovascular Models

    Authors: Antoine Wehenkel, Jens Behrmann, Andrew C. Miller, Guillermo Sapiro, Ozan Sener, Marco Cuturi, Jörn-Henrik Jacobsen

    Abstract: Over the past decades, hemodynamics simulators have steadily evolved and have become tools of choice for studying cardiovascular systems in-silico. While such tools are routinely used to simulate whole-body hemodynamics from physiological parameters, solving the corresponding inverse problem of mapping waveforms back to plausible physiological parameters remains both promising and challenging. Mot… ▽ More

    Submitted 29 July, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  6. arXiv:2305.09253  [pdf, other

    cs.CV cs.LG

    Online Continual Learning Without the Storage Constraint

    Authors: Ameya Prabhu, Zhipeng Cai, Puneet Dokania, Philip Torr, Vladlen Koltun, Ozan Sener

    Abstract: Traditional online continual learning (OCL) research has primarily focused on mitigating catastrophic forgetting with fixed and limited storage allocation throughout an agent's lifetime. However, a broad range of real-world applications are primarily constrained by computational costs rather than storage limitations. In this paper, we target such applications, investigating the online continual le… ▽ More

    Submitted 2 November, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Tech Report [Additional Experiments and Improved ACM]

  7. arXiv:2211.14721  [pdf, other

    cs.LG math.OC

    Generalizing Gaussian Smoothing for Random Search

    Authors: Katelyn Gao, Ozan Sener

    Abstract: Gaussian smoothing (GS) is a derivative-free optimization (DFO) algorithm that estimates the gradient of an objective using perturbations of the current parameters sampled from a standard normal distribution. We generalize it to sampling perturbations from a larger family of distributions. Based on an analysis of DFO for non-convex functions, we propose to choose a distribution for perturbations t… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: This work was published at ICML 2022. This version contains some minor corrections and a link to a code repository

  8. arXiv:2210.06401  [pdf, other

    cs.CV

    Improving information retention in large scale online continual learning

    Authors: Zhipeng Cai, Vladlen Koltun, Ozan Sener

    Abstract: Given a stream of data sampled from non-stationary distributions, online continual learning (OCL) aims to adapt efficiently to new data while retaining existing knowledge. The typical approach to address information retention (the ability to retain previous knowledge) is keeping a replay buffer of a fixed size and computing gradients using a mixture of new data and the replay buffer. Surprisingly,… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  9. arXiv:2112.13762  [pdf, other

    cs.CV

    MSeg: A Composite Dataset for Multi-domain Semantic Segmentation

    Authors: John Lambert, Zhuang Liu, Ozan Sener, James Hays, Vladlen Koltun

    Abstract: We present MSeg, a composite dataset that unifies semantic segmentation datasets from different domains. A naive merge of the constituent datasets yields poor performance due to inconsistent taxonomies and annotation practices. We reconcile the taxonomies and bring the pixel-level annotations into alignment by relabeling more than 220,000 object masks in more than 80,000 images, requiring more tha… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

  10. arXiv:2108.09020  [pdf, other

    cs.LG cs.CV

    Online Continual Learning with Natural Distribution Shifts: An Empirical Study with Visual Data

    Authors: Zhipeng Cai, Ozan Sener, Vladlen Koltun

    Abstract: Continual learning is the problem of learning and retaining knowledge through time over multiple tasks and environments. Research has primarily focused on the incremental classification setting, where new tasks/classes are added at discrete time intervals. Such an "offline" setting does not evaluate the ability of agents to learn effectively and efficiently, since an agent can perform multiple lea… ▽ More

    Submitted 22 September, 2021; v1 submitted 20 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  11. arXiv:2010.12916  [pdf, other

    cs.LG math.OC stat.ML

    Modeling and Optimization Trade-off in Meta-learning

    Authors: Katelyn Gao, Ozan Sener

    Abstract: By searching for shared inductive biases across tasks, meta-learning promises to accelerate learning on novel tasks, but with the cost of solving a complex bilevel optimization problem. We introduce and rigorously define the trade-off between accurate modeling and optimization ease in meta-learning. At one end, classic meta-learning algorithms account for the structure of meta-learning but solve a… ▽ More

    Submitted 13 April, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

    Comments: To appear at NeurIPS 2020

  12. arXiv:2007.09335  [pdf, other

    cs.LG cs.CL stat.ML

    Drinking from a Firehose: Continual Learning with Web-scale Natural Language

    Authors: Hexiang Hu, Ozan Sener, Fei Sha, Vladlen Koltun

    Abstract: Continual learning systems will interact with humans, with each other, and with the physical world through time -- and continue to learn and adapt as they do. An important open problem for continual learning is a large-scale benchmark that enables realistic evaluation of algorithms. In this paper, we study a natural setting for continual learning on a massive scale. We introduce the problem of per… ▽ More

    Submitted 1 November, 2020; v1 submitted 18 July, 2020; originally announced July 2020.

    Comments: Dataset Downloader: https://github.com/firehose-dataset/downloader Source Code: https://github.com/firehose-dataset/congrad

  13. Hausdorff Dimension, Heavy Tails, and Generalization in Neural Networks

    Authors: Umut Şimşekli, Ozan Sener, George Deligiannidis, Murat A. Erdogdu

    Abstract: Despite its success in a wide range of applications, characterizing the generalization properties of stochastic gradient descent (SGD) in non-convex deep learning problems is still an important challenge. While modeling the trajectories of SGD via stochastic differential equations (SDE) under heavy-tailed gradient noise has recently shed light over several peculiar characteristics of SGD, a rigoro… ▽ More

    Submitted 22 May, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Published at NeurIPS 2020 (Spotlight) -- an imprecision in Definition 2 and a mistake in the statement and the proof of Theorem 2 are fixed

  14. arXiv:2004.12214  [pdf, other

    cs.LG math.OC stat.ML

    Learning to Guide Random Search

    Authors: Ozan Sener, Vladlen Koltun

    Abstract: We are interested in derivative-free optimization of high-dimensional functions. The sample complexity of existing methods is high and depends on problem dimensionality, unlike the dimensionality-independent rates of first-order methods. The recent success of deep learning suggests that many datasets lie on low-dimensional manifolds that can be represented by deep nonlinear models. We therefore co… ▽ More

    Submitted 25 April, 2020; originally announced April 2020.

    Comments: Published at ICLR 2020, Code is available at: https://github.com/intel-isl/LMRS

  15. arXiv:1810.04650  [pdf, other

    cs.LG stat.ML

    Multi-Task Learning as Multi-Objective Optimization

    Authors: Ozan Sener, Vladlen Koltun

    Abstract: In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise is to optimize a proxy objective that minimizes a weighted linear combination of per-task losses. However, this workaround is only valid when the tasks do not… ▽ More

    Submitted 11 January, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: In Neural Information Processing Systems (NeurIPS) 2018

  16. arXiv:1805.12018  [pdf, other

    cs.CV

    Generalizing to Unseen Domains via Adversarial Data Augmentation

    Authors: Riccardo Volpi, Hongseok Namkoong, Ozan Sener, John Duchi, Vittorio Murino, Silvio Savarese

    Abstract: We are concerned with learning models that generalize well to different \emph{unseen} domains. We consider a worst-case formulation over data distributions that are near the source domain in the feature space. Only using training data from a single source distribution, we propose an iterative procedure that augments the dataset with examples from a fictitious target domain that is "hard" under the… ▽ More

    Submitted 6 November, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: Accepted to NIPS 2018 (camera ready)

  17. arXiv:1805.11614  [pdf, other

    cs.LG stat.ML

    Deep Learning under Privileged Information Using Heteroscedastic Dropout

    Authors: John Lambert, Ozan Sener, Silvio Savarese

    Abstract: Unlike machines, humans learn through rapid, abstract model-building. The role of a teacher is not simply to hammer home right or wrong answers, but rather to provide intuitive comments, comparisons, and explanations to a pupil. This is what the Learning Under Privileged Information (LUPI) paradigm endeavors to model by utilizing extra knowledge only available during training. We propose a new LUP… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: CVPR 2018

  18. arXiv:1708.00489  [pdf, other

    stat.ML cs.CV cs.LG

    Active Learning for Convolutional Neural Networks: A Core-Set Approach

    Authors: Ozan Sener, Silvio Savarese

    Abstract: Convolutional neural networks (CNNs) have been successfully applied to many recognition and learning tasks using a universal recipe; training a deep model on a very large dataset of supervised examples. However, this approach is rather restrictive in practice since collecting a large set of labeled images is very expensive. One way to ease this problem is coming up with smart ways for choosing ima… ▽ More

    Submitted 1 June, 2018; v1 submitted 1 August, 2017; originally announced August 2017.

    Comments: ICLR 2018 Paper

  19. arXiv:1605.03324  [pdf, other

    cs.CV cs.RO stat.ML

    Unsupervised Semantic Action Discovery from Video Collections

    Authors: Ozan Sener, Amir Roshan Zamir, Chenxia Wu, Silvio Savarese, Ashutosh Saxena

    Abstract: Human communication takes many forms, including speech, text and instructional videos. It typically has an underlying structure, with a starting point, ending, and certain objective steps between them. In this paper, we consider instructional videos where there are tens of millions of them on the Internet. We propose a method for parsing a video into such semantic steps in an unsupervised way. O… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: First version of this paper arXiv:1506.08438 appeared in ICCV 2015. This extended version has more details on the learning algorithm and hierarchical clustering with full derivation, additional analysis on the robustness to the subtitle noise, and a novel application on robotics

  20. arXiv:1603.03541  [pdf, other

    cs.CV cs.LG cs.RO

    Watch-n-Patch: Unsupervised Learning of Actions and Relations

    Authors: Chenxia Wu, Jiemi Zhang, Ozan Sener, Bart Selman, Silvio Savarese, Ashutosh Saxena

    Abstract: There is a large variation in the activities that humans perform in their everyday lives. We consider modeling these composite human activities which comprises multiple basic level actions in a completely unsupervised setting. Our model learns high-level co-occurrence and temporal relations between the actions. We consider the video as a sequence of short-term action clips, which contains human-wo… ▽ More

    Submitted 11 March, 2016; originally announced March 2016.

    Comments: arXiv admin note: text overlap with arXiv:1512.04208

  21. arXiv:1602.03534  [pdf, other

    stat.ML cs.LG

    Unsupervised Transductive Domain Adaptation

    Authors: Ozan Sener, Hyun Oh Song, Ashutosh Saxena, Silvio Savarese

    Abstract: Supervised learning with large scale labeled datasets and deep layered models has made a paradigm shift in diverse areas in learning and recognition. However, this approach still suffers generalization issues under the presence of a domain shift between the training and the test data distribution. In this regard, unsupervised domain adaptation algorithms have been proposed to directly address the… ▽ More

    Submitted 25 March, 2016; v1 submitted 10 February, 2016; originally announced February 2016.

  22. arXiv:1506.08438  [pdf, other

    cs.CV

    Unsupervised Semantic Parsing of Video Collections

    Authors: Ozan Sener, Amir Zamir, Silvio Savarese, Ashutosh Saxena

    Abstract: Human communication typically has an underlying structure. This is reflected in the fact that in many user generated videos, a starting point, ending, and certain objective steps between these two can be identified. In this paper, we propose a method for parsing a video into such semantic steps in an unsupervised way. The proposed method is capable of providing a semantic "storyline" of the video… ▽ More

    Submitted 27 January, 2016; v1 submitted 28 June, 2015; originally announced June 2015.

  23. arXiv:1412.0691  [pdf, other

    cs.AI cs.RO

    RoboBrain: Large-Scale Knowledge Engine for Robots

    Authors: Ashutosh Saxena, Ashesh Jain, Ozan Sener, Aditya Jami, Dipendra K. Misra, Hema S. Koppula

    Abstract: In this paper we introduce a knowledge engine, which learns and shares knowledge representations, for robots to carry out a variety of tasks. Building such an engine brings with it the challenge of dealing with multiple data modalities including symbols, natural language, haptic senses, robot trajectories, visual features and many others. The \textit{knowledge} stored in the engine comes from mult… ▽ More

    Submitted 12 April, 2015; v1 submitted 1 December, 2014; originally announced December 2014.

    Comments: 10 pages, 9 figures

  24. Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters

    Authors: Ozan Sener, Kemal Ugur, A. Aydin Alatan

    Abstract: Segmentation of an object from a video is a challenging task in multimedia applications. Depending on the application, automatic or interactive methods are desired; however, regardless of the application type, efficient computation of video object segmentation is crucial for time-critical applications; specifically, mobile and interactive applications require near real-time efficiencies. In this p… ▽ More

    Submitted 26 October, 2014; v1 submitted 22 January, 2013; originally announced January 2013.

    Comments: Multimedia, IEEE Transactions on (Volume:16, Issue: 5, Aug. 2014)