Search | arXiv e-print repository

Showing 1–10 of 10 results for author: Ravichandran, A

Searching in archive stat.
  1. arXiv:2306.12394  [pdf, ps, other]

    stat.ME stat.AP

    Optimal allocation of sample size for randomization-based inference from $2^K$ factorial designs

    Authors: Arun Ravichandran, Nicole E. Pashley, Brian Libgober, Tirthankar Dasgupta

    Abstract: Optimizing the allocation of units into treatment groups can help researchers improve the precision of causal estimators and decrease costs when running factorial experiments. However, existing optimal allocation results typically assume a super-population model and that the outcome data comes from a known family of distributions. Instead, we focus on randomization-based causal inference for the f…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 27 pages

    Journal ref: Journal of Causal Inference 12, no. 1 (2024)

  2. arXiv:2101.06640  [pdf, other]

    cs.LG stat.ML

    Estimating informativeness of samples with Smooth Unique Information

    Authors: Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini, Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

    Abstract: We define a notion of information that an individual sample provides to the training of a neural network, and we specialize it to measure both how much a sample informs the final weights and how much it informs the function computed by the weights. Though related, we show that these quantities have a qualitatively different behavior. We give efficient approximations of these quantities using a lin…

    Submitted 28 March, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

    Comments: ICLR 2021, 22 pages

  3. arXiv:2012.11140  [pdf, other]

    cs.LG cs.CV stat.ML

    LQF: Linear Quadratic Fine-Tuning

    Authors: Alessandro Achille, Aditya Golatkar, Avinash Ravichandran, Marzia Polito, Stefano Soatto

    Abstract: Classifiers that are linear in their parameters, and trained by optimizing a convex loss function, have predictable behavior with respect to changes in the training data, initial conditions, and optimization. Such desirable properties are absent in deep neural networks (DNNs), typically trained by non-linear fine-tuning of a pre-trained model. Previous attempts to linearize DNNs have led to intere…

    Submitted 21 December, 2020; originally announced December 2020.

  4. arXiv:2008.12478  [pdf, other]

    cs.LG stat.ML

    Predicting Training Time Without Training

    Authors: Luca Zancato, Alessandro Achille, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

    Abstract: We tackle the problem of predicting the number of optimization steps that a pre-trained deep network needs to converge to a given value of the loss function. To do so, we leverage the fact that the training dynamics of a deep network during fine-tuning are well approximated by those of a linearized model. This allows us to approximate the training loss and accuracy at any point during training by…

    Submitted 28 August, 2020; originally announced August 2020.

  5. arXiv:2002.11770  [pdf, other]

    cs.CV cs.LG stat.ML

    Rethinking the Hyperparameters for Fine-tuning

    Authors: Hao Li, Pratik Chaudhari, Hao Yang, Michael Lam, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

    Abstract: Fine-tuning from pre-trained ImageNet models has become the de-facto standard for various computer vision tasks. Current practices for fine-tuning typically involve selecting an ad-hoc choice of hyperparameters and keeping them fixed to values normally used for training from scratch. This paper re-examines several common practices of setting hyperparameters for fine-tuning. Our findings are based…

    Submitted 19 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper at ICLR 2020

  6. arXiv:2002.04162  [pdf, other]

    cs.LG cs.CV stat.ML

    Incremental Meta-Learning via Indirect Discriminant Alignment

    Authors: Qing Liu, Orchid Majumder, Alessandro Achille, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

    Abstract: Majority of the modern meta-learning methods for few-shot classification tasks operate in two phases: a meta-training phase where the meta-learner learns a generic representation by solving multiple few-shot tasks sampled from a large dataset and a testing phase, where the meta-learner leverages its learnt internal representation for a specific few-shot task involving classes which were not seen d…

    Submitted 21 April, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

  7. arXiv:1911.12528  [pdf, other]

    cs.LG cs.CV stat.ML

    Unbiased Evaluation of Deep Metric Learning Algorithms

    Authors: Istvan Fehervari, Avinash Ravichandran, Srikar Appalaraju

    Abstract: Deep metric learning (DML) is a popular approach for images retrieval, solving verification (same or not) problems and addressing open set classification. Arguably, the most common DML approach is with triplet loss, despite significant advances in the area of DML. Triplet loss suffers from several issues such as collapse of the embeddings, high sensitivity to sampling schemes and more importantly…

    Submitted 27 November, 2019; originally announced November 2019.

  8. arXiv:1909.02729  [pdf, other]

    cs.LG cs.CV stat.ML

    A Baseline for Few-Shot Image Classification

    Authors: Guneet S. Dhillon, Pratik Chaudhari, Avinash Ravichandran, Stefano Soatto

    Abstract: Fine-tuning a deep network trained with the standard cross-entropy loss is a strong baseline for few-shot learning. When fine-tuned transductively, this outperforms the current state-of-the-art on standard datasets such as Mini-ImageNet, Tiered-ImageNet, CIFAR-FS and FC-100 with the same hyper-parameters. The simplicity of this approach enables us to demonstrate the first few-shot learning results…

    Submitted 21 October, 2020; v1 submitted 6 September, 2019; originally announced September 2019.

    Journal ref: International Conference on Learning Representations (ICLR), 2020

  9. arXiv:1905.04398  [pdf, other]

    cs.LG cs.CV stat.ML

    Few-Shot Learning with Embedded Class Models and Shot-Free Meta Training

    Authors: Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

    Abstract: We propose a method for learning embeddings for few-shot learning that is suitable for use with any number of ways and any number of shots (shot-free). Rather than fixing the class prototypes to be the Euclidean average of sample embeddings, we allow them to live in a higher-dimensional space (embedded class models) and learn the prototypes along with the model parameters. The class representation…

    Submitted 21 April, 2020; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: Accepted to ICCV 2019

  10. arXiv:1902.03545  [pdf, other]

    cs.LG cs.AI stat.ML

    Task2Vec: Task Embedding for Meta-Learning

    Authors: Alessandro Achille, Michael Lam, Rahul Tewari, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Stefano Soatto, Pietro Perona

    Abstract: We introduce a method to provide vectorial representations of visual classification tasks which can be used to reason about the nature of those tasks and their relations. Given a dataset with ground-truth labels and a loss function defined over those labels, we process images through a "probe network" and compute an embedding based on estimates of the Fisher information matrix associated with the…

    Submitted 10 February, 2019; originally announced February 2019.