(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–2 of 2 results for author: Paren, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01424  [pdf, other

    cs.LG cs.AI cs.CL

    Universal In-Context Approximation By Prompting Fully Recurrent Models

    Authors: Aleksandar Petrov, Tom A. Lamb, Alasdair Paren, Philip H. S. Torr, Adel Bibi

    Abstract: Zero-shot and in-context learning enable solving tasks without model fine-tuning, making them essential for developing generative model solutions. Therefore, it is crucial to understand whether a pretrained model can be prompted to approximate any function, i.e., whether it is a universal in-context approximator. While it was recently shown that transformer models do possess this property, these r… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2201.12678  [pdf, ps, other

    cs.LG cs.CV

    A Stochastic Bundle Method for Interpolating Networks

    Authors: Alasdair Paren, Leonard Berrada, Rudra P. K. Poudel, M. Pawan Kumar

    Abstract: We propose a novel method for training deep neural networks that are capable of interpolation, that is, driving the empirical loss to zero. At each iteration, our method constructs a stochastic approximation of the learning objective. The approximation, known as a bundle, is a pointwise maximum of linear functions. Our bundle contains a constant function that lower bounds the empirical loss. This… ▽ More

    Submitted 29 January, 2022; originally announced January 2022.