Search | arXiv e-print repository

Showing 1–9 of 9 results for author: Detommaso, G

  1. arXiv:2404.04689  [pdf, other]

    stat.ML cs.CL cs.LG

    Multicalibration for Confidence Scoring in LLMs

    Authors: Gianluca Detommaso, Martin Bertran, Riccardo Fogliato, Aaron Roth

    Abstract: This paper proposes the use of "multicalibration" to yield interpretable and reliable confidence scores for outputs generated by large language models (LLMs). Multicalibration asks for calibration not just marginally, but simultaneously across various intersecting groupings of the data. We show how to form groupings for prompt/completion pairs that are correlated with the probability of correctnes…

    Submitted 6 April, 2024; originally announced April 2024.

  2. arXiv:2302.04019  [pdf, other]

    cs.LG stat.ML

    Fortuna: A Library for Uncertainty Quantification in Deep Learning

    Authors: Gianluca Detommaso, Alberto Gasparin, Michele Donini, Matthias Seeger, Andrew Gordon Wilson, Cedric Archambeau

    Abstract: We present Fortuna, an open-source library for uncertainty quantification in deep learning. Fortuna supports a range of calibration techniques, such as conformal prediction that can be applied to any trained neural network to generate reliable uncertainty estimates, and scalable Bayesian inference methods that can be applied to Flax-based deep neural networks trained from scratch for improved unce…

    Submitted 8 February, 2023; originally announced February 2023.

  3. arXiv:2207.08200  [pdf, other]

    stat.ML cs.AI cs.LG

    Uncertainty Calibration in Bayesian Neural Networks via Distance-Aware Priors

    Authors: Gianluca Detommaso, Alberto Gasparin, Andrew Wilson, Cedric Archambeau

    Abstract: As we move away from the data, the predictive uncertainty should increase, since a great variety of explanations are consistent with the little available information. We introduce Distance-Aware Prior (DAP) calibration, a method to correct overconfidence of Bayesian deep learning models outside of the training domain. We define DAPs as prior distributions over the model parameters that depend on t…

    Submitted 17 July, 2022; originally announced July 2022.

  4. arXiv:2106.09762  [pdf, other]

    stat.ME cs.AI cs.LG stat.ML

    Causal Bias Quantification for Continuous Treatments

    Authors: Gianluca Detommaso, Michael Brückner, Philip Schulz, Victor Chernozhukov

    Abstract: We extend the definition of the marginal causal effect to the continuous treatment setting and develop a novel characterization of causal bias in the framework of structural causal models. We prove that our derived bias expression is zero if, and only if, the causal effect is identifiable via covariate adjustment. We show that under some restrictions on the structural equations, the causal bias ca…

    Submitted 30 January, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

  5. arXiv:1910.12431  [pdf, other]

    stat.CO math.NA stat.ME

    Multilevel Dimension-Independent Likelihood-Informed MCMC for Large-Scale Inverse Problems

    Authors: Tiangang Cui, Gianluca Detommaso, Robert Scheichl

    Abstract: We present a non-trivial integration of dimension-independent likelihood-informed (DILI) MCMC (Cui, Law, Marzouk, 2016) and the multilevel MCMC (Dodwell et al., 2015) to explore the hierarchy of posterior distributions. This integration offers several advantages: First, DILI-MCMC employs an intrinsic likelihood-informed subspace (LIS) (Cui et al., 2014) -- which involves a number of forward and ad…

    Submitted 29 November, 2023; v1 submitted 28 October, 2019; originally announced October 2019.

  6. arXiv:1905.10687  [pdf, other]

    stat.ML cs.AI cs.LG

    HINT: Hierarchical Invertible Neural Transport for Density Estimation and Bayesian Inference

    Authors: Jakob Kruse, Gianluca Detommaso, Ullrich Köthe, Robert Scheichl

    Abstract: Many recent invertible neural architectures are based on coupling block designs where variables are divided in two subsets which serve as inputs of an easily invertible (usually affine) triangular transformation. While such a transformation is invertible, its Jacobian is very sparse and thus may lack expressiveness. This work presents a simple remedy by noting that subdivision and (affine) couplin…

    Submitted 25 May, 2021; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: Published at AAAI 2021

  7. arXiv:1901.07987  [pdf, other]

    stat.ML cs.LG

    Stein Variational Online Changepoint Detection with Applications to Hawkes Processes and Neural Networks

    Authors: Gianluca Detommaso, Hanne Hoitzing, Tiangang Cui, Ardavan Alamir

    Abstract: Bayesian online changepoint detection (BOCPD) (Adams & MacKay, 2007) offers a rigorous and viable way to identify changepoints in complex systems. In this work, we introduce a Stein variational online changepoint detection (SVOCD) method to provide a computationally tractable generalization of BOCPD beyond the exponential family of probability distributions. We integrate the recently developed Ste…

    Submitted 25 May, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: 14 pages, 6 figures

  8. arXiv:1806.03085  [pdf, other]

    stat.ML cs.LG math.NA

    A Stein variational Newton method

    Authors: Gianluca Detommaso, Tiangang Cui, Alessio Spantini, Youssef Marzouk, Robert Scheichl

    Abstract: Stein variational gradient descent (SVGD) was recently proposed as a general purpose nonparametric variational inference algorithm [Liu & Wang, NIPS 2016]: it minimizes the Kullback-Leibler divergence between the target distribution and its approximation by implementing a form of functional gradient descent on a reproducing kernel Hilbert space. In this paper, we accelerate and generalize the SVGD…

    Submitted 29 October, 2018; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: 18 pages, 7 figures

    Journal ref: NIPS 2018

  9. arXiv:1802.07539  [pdf, other]

    math.NA

    Continuous Level Monte Carlo and Sample-Adaptive Model Hierarchies

    Authors: Gianluca Detommaso, Tim Dodwell, Rob Scheichl

    Abstract: In this paper, we present a generalisation of the Multilevel Monte Carlo (MLMC) method to a setting where the level parameter is a continuous variable. This Continuous Level Monte Carlo (CLMC) estimator provides a natural framework in PDE applications to adapt the model hierarchy to each sample. In addition, it can be made unbiased with respect to the expected value of the true quantity of interes…

    Submitted 21 February, 2018; originally announced February 2018.

    Comments: 22 pages, 4 figures