(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–32 of 32 results for author: Backurs, A

.
  1. arXiv:2403.08917  [pdf, other

    cs.CR cs.DS cs.LG

    Efficiently Computing Similarities to Private Datasets

    Authors: Arturs Backurs, Zinan Lin, Sepideh Mahabadi, Sandeep Silwal, Jakub Tarnawski

    Abstract: Many methods in differentially private model training rely on computing the similarity between a query point (such as public or synthetic data) and private data. We abstract out this common subroutine and study the following fundamental algorithmic problem: Given a similarity function $f$ and a large high-dimensional private dataset $X \subset \mathbb{R}^d$, output a differentially private (DP) da… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: To appear at ICLR 2024

  2. arXiv:2403.01749  [pdf, other

    cs.CL

    Differentially Private Synthetic Data via Foundation Model APIs 2: Text

    Authors: Chulin Xie, Zinan Lin, Arturs Backurs, Sivakanth Gopi, Da Yu, Huseyin A Inan, Harsha Nori, Haotian Jiang, Huishuai Zhang, Yin Tat Lee, Bo Li, Sergey Yekhanin

    Abstract: Text data has become extremely valuable due to the emergence of machine learning algorithms that learn from it. A lot of high-quality text data generated in the real world is private and therefore cannot be shared or used freely due to privacy concerns. Generating synthetic replicas of private text data with a formal privacy guarantee, i.e., differential privacy (DP), offers a promising and scalab… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2310.16960  [pdf, other

    cs.LG cs.CR

    Privately Aligning Language Models with Reinforcement Learning

    Authors: Fan Wu, Huseyin A. Inan, Arturs Backurs, Varun Chandrasekaran, Janardhan Kulkarni, Robert Sim

    Abstract: Positioned between pre-training and user deployment, aligning large language models (LLMs) through reinforcement learning (RL) has emerged as a prevailing strategy for training instruction following-models such as ChatGPT. In this work, we initiate the study of privacy-preserving alignment of LLMs through Differential Privacy (DP) in conjunction with RL. Following the influential work of Ziegler e… ▽ More

    Submitted 3 May, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at ICLR 2024

  4. arXiv:2212.01539  [pdf, other

    cs.LG stat.ML

    Exploring the Limits of Differentially Private Deep Learning with Group-wise Clipping

    Authors: Jiyan He, Xuechen Li, Da Yu, Huishuai Zhang, Janardhan Kulkarni, Yin Tat Lee, Arturs Backurs, Nenghai Yu, Jiang Bian

    Abstract: Differentially private deep learning has recently witnessed advances in computational efficiency and privacy-utility trade-off. We explore whether further improvements along the two axes are possible and provide affirmative answers leveraging two instantiations of \emph{group-wise clipping}. To reduce the compute time overhead of private learning, we show that \emph{per-layer clipping}, where the… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: 25 pages

  5. arXiv:2206.04301  [pdf, other

    cs.LG cs.AI cs.CL

    Unveiling Transformers with LEGO: a synthetic reasoning task

    Authors: Yi Zhang, Arturs Backurs, Sébastien Bubeck, Ronen Eldan, Suriya Gunasekar, Tal Wagner

    Abstract: We propose a synthetic reasoning task, LEGO (Learning Equality and Group Operations), that encapsulates the problem of following a chain of reasoning, and we study how the Transformer architectures learn this task. We pay special attention to data effects such as pretraining (on seemingly unrelated NLP tasks) and dataset composition (e.g., differing chain length at training and test time), as well… ▽ More

    Submitted 17 February, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

  6. arXiv:2206.01838  [pdf, other

    cs.LG cs.CR

    Differentially Private Model Compression

    Authors: Fatemehsadat Mireshghallah, Arturs Backurs, Huseyin A Inan, Lukas Wutschitz, Janardhan Kulkarni

    Abstract: Recent papers have shown that large pre-trained language models (LLMs) such as BERT, GPT-2 can be fine-tuned on private data to achieve performance comparable to non-private models for many downstream Natural Language Processing (NLP) tasks while simultaneously guaranteeing differential privacy. The inference cost of these models -- which consist of hundreds of millions of parameters -- however, c… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  7. arXiv:2110.06500  [pdf, other

    cs.LG cs.CL cs.CR stat.ML

    Differentially Private Fine-tuning of Language Models

    Authors: Da Yu, Saurabh Naik, Arturs Backurs, Sivakanth Gopi, Huseyin A. Inan, Gautam Kamath, Janardhan Kulkarni, Yin Tat Lee, Andre Manoel, Lukas Wutschitz, Sergey Yekhanin, Huishuai Zhang

    Abstract: We give simpler, sparser, and faster algorithms for differentially private fine-tuning of large-scale pre-trained language models, which achieve the state-of-the-art privacy versus utility tradeoffs on many standard NLP tasks. We propose a meta-framework for this problem, inspired by the recent success of highly parameter-efficient methods for fine-tuning. Our experiments show that differentially… ▽ More

    Submitted 14 July, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: ICLR 2022. Code available at https://github.com/huseyinatahaninan/Differentially-Private-Fine-tuning-of-Language-Models

  8. arXiv:2102.08341  [pdf, other

    cs.DS cs.LG math.NA

    Faster Kernel Matrix Algebra via Density Estimation

    Authors: Arturs Backurs, Piotr Indyk, Cameron Musco, Tal Wagner

    Abstract: We study fast algorithms for computing fundamental properties of a positive semidefinite kernel matrix $K \in \mathbb{R}^{n \times n}$ corresponding to $n$ points $x_1,\ldots,x_n \in \mathbb{R}^d$. In particular, we consider estimating the sum of kernel matrix entries, along with its top eigenvalue and eigenvector. We show that the sum of matrix entries can be estimated to $1+εいぷしろん$ relative error i… ▽ More

    Submitted 17 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  9. arXiv:2101.08248  [pdf, other

    cs.CL cs.LG

    Data-to-text Generation by Splicing Together Nearest Neighbors

    Authors: Sam Wiseman, Arturs Backurs, Karl Stratos

    Abstract: We propose to tackle data-to-text generation tasks by directly splicing together retrieved segments of text from "neighbor" source-target pairs. Unlike recent work that conditions on retrieved neighbors but generates text token-by-token, left-to-right, we learn a policy that directly manipulates segments of neighbor text, by inserting or replacing them in partially constructed generations. Standar… ▽ More

    Submitted 28 October, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

    Comments: EMNLP 2021; figures updated/improved

  10. arXiv:2010.14181  [pdf, ps, other

    cs.CC cs.LG

    Impossibility Results for Grammar-Compressed Linear Algebra

    Authors: Amir Abboud, Arturs Backurs, Karl Bringmann, Marvin Künnemann

    Abstract: To handle vast amounts of data, it is natural and popular to compress vectors and matrices. When we compress a vector from size $N$ down to size $n \ll N$, it certainly makes it easier to store and transmit efficiently, but does it also make it easier to process? In this paper we consider lossless compression schemes, and ask if we can run our computations on the compressed data as efficiently a… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: NeurIPS'20, 20 pages

  11. arXiv:2008.13374  [pdf, ps, other

    cs.LG stat.ML

    Active Local Learning

    Authors: Arturs Backurs, Avrim Blum, Neha Gupta

    Abstract: In this work we consider active local learning: given a query point $x$, and active access to an unlabeled training set $S$, output the prediction $h(x)$ of a near-optimal $h \in H$ using significantly fewer labels than would be needed to actually learn $h$ fully. In particular, the number of label queries should be independent of the complexity of $H$, and the function $h$ should be well-defined,… ▽ More

    Submitted 3 September, 2020; v1 submitted 31 August, 2020; originally announced August 2020.

    Comments: Published at COLT 2020

  12. arXiv:2008.10577  [pdf, ps, other

    cs.DS

    Fast and Simple Modular Subset Sum

    Authors: Kyriakos Axiotis, Arturs Backurs, Karl Bringmann, Ce Jin, Vasileios Nakos, Christos Tzamos, Hongxun Wu

    Abstract: We revisit the Subset Sum problem over the finite cyclic group $\mathbb{Z}_m$ for some given integer $m$. A series of recent works has provided near-optimal algorithms for this problem under the Strong Exponential Time Hypothesis. Koiliaris and Xu (SODA'17, TALG'19) gave a deterministic algorithm running in time $\tilde{O}(m^{5/4})$, which was later improved to $O(m \log^7 m)$ randomized time by A… ▽ More

    Submitted 30 October, 2020; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: accepted at SOSA'21

  13. arXiv:2004.05494  [pdf, other

    cs.CG

    Submodular Clustering in Low Dimensions

    Authors: Arturs Backurs, Sariel Har-Peled

    Abstract: We study a clustering problem where the goal is to maximize the coverage of the input points by $k$ chosen centers. Specifically, given a set of $n$ points $P \subseteq \mathbb{R}^d$, the goal is to pick $k$ centers $C \subseteq \mathbb{R}^d$ that maximize the service $ \sum_{p \in P}\mathsf{\varphi}\bigl( \mathsf{d}(p,C) \bigr) $ to the points $P$, where $\mathsf{d}(p,C)$ is the distance of $p$ t… ▽ More

    Submitted 11 April, 2020; originally announced April 2020.

    Comments: To appear in SWAT 20

  14. arXiv:1910.04126  [pdf, other

    cs.DS

    Scalable Nearest Neighbor Search for Optimal Transport

    Authors: Arturs Backurs, Yihe Dong, Piotr Indyk, Ilya Razenshteyn, Tal Wagner

    Abstract: The Optimal Transport (a.k.a. Wasserstein) distance is an increasingly popular similarity measure for rich data domains, such as images or text documents. This raises the necessity for fast nearest neighbor search algorithms according to this distance, which poses a substantial computational bottleneck on massive datasets. In this work we introduce Flowtree, a fast and accurate approximation algor… ▽ More

    Submitted 28 September, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: ICML 2020

  15. arXiv:1902.03519  [pdf, other

    cs.DS cs.LG

    Scalable Fair Clustering

    Authors: Arturs Backurs, Piotr Indyk, Krzysztof Onak, Baruch Schieber, Ali Vakilian, Tal Wagner

    Abstract: We study the fair variant of the classic $k$-median problem introduced by Chierichetti et al. [2017]. In the standard $k$-median problem, given an input pointset $P$, the goal is to find $k$ centers $C$ and assign each input point to one of the centers in $C$ such that the average distance of points to their cluster center is minimized. In the fair variant of $k$-median, the points are colored,… ▽ More

    Submitted 10 June, 2019; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: ICML 2019

  16. arXiv:1808.08494  [pdf, other

    cs.DS

    Towards Tight Approximation Bounds for Graph Diameter and Eccentricities

    Authors: Arturs Backurs, Liam Roditty, Gilad Segal, Virginia Vassilevska Williams, Nicole Wein

    Abstract: Among the most important graph parameters is the Diameter, the largest distance between any two vertices. There are no known very efficient algorithms for computing the Diameter exactly. Thus, much research has been devoted to how fast this parameter can be approximated. Chechik et al. showed that the diameter can be approximated within a multiplicative factor of $3/2$ in $\tilde{O}(m^{3/2})$ time… ▽ More

    Submitted 29 March, 2021; v1 submitted 25 August, 2018; originally announced August 2018.

    Comments: Revised to implement referee comments

  17. arXiv:1807.04825  [pdf, ps, other

    cs.DS

    Fast Modular Subset Sum using Linear Sketching

    Authors: Kyriakos Axiotis, Arturs Backurs, Christos Tzamos

    Abstract: Given n positive integers, the Modular Subset Sum problem asks if a subset adds up to a given target t modulo a given integer m. This is a natural generalization of the Subset Sum problem (where m=+\infty) with ties to additive combinatorics and cryptography. Recently, in [Bringmann, SODA'17] and [Koiliaris and Xu, SODA'17], efficient algorithms have been developed for the non-modular case, runn… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

  18. Fine-Grained Complexity of Analyzing Compressed Data: Quantifying Improvements over Decompress-And-Solve

    Authors: Amir Abboud, Arturs Backurs, Karl Bringmann, Marvin Künnemann

    Abstract: Can we analyze data without decompressing it? As our data keeps growing, understanding the time complexity of problems on compressed inputs, rather than in convenient uncompressed forms, becomes more and more relevant. Suppose we are given a compression of size $n$ of data that originally has size $N$, and we want to solve a problem with time complexity $T(\cdot)$. The naive strategy of "decompres… ▽ More

    Submitted 2 March, 2018; originally announced March 2018.

    Comments: Presented at FOCS'17. Full version. 63 pages

    ACM Class: F.2.2

  19. arXiv:1704.02958  [pdf, ps, other

    cs.CC cs.DS cs.LG stat.ML

    On the Fine-Grained Complexity of Empirical Risk Minimization: Kernel Methods and Neural Networks

    Authors: Arturs Backurs, Piotr Indyk, Ludwig Schmidt

    Abstract: Empirical risk minimization (ERM) is ubiquitous in machine learning and underlies most supervised learning methods. While there has been a large body of work on algorithms for various ERM problems, the exact computational complexity of ERM is still not understood. We address this issue for multiple popular ERM problems including kernel SVMs, kernel ridge regression, and training the final layer of… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

  20. arXiv:1607.04229  [pdf, other

    cs.CC cs.DS

    Improving Viterbi is Hard: Better Runtimes Imply Faster Clique Algorithms

    Authors: Arturs Backurs, Christos Tzamos

    Abstract: The classic algorithm of Viterbi computes the most likely path in a Hidden Markov Model (HMM) that results in a given sequence of observations. It runs in time $O(Tn^2)$ given a sequence of $T$ observations from a HMM with $n$ states. Despite significant interest in the problem and prolonged effort by different communities, no known algorithm achieves more than a polylogarithmic speedup. In this… ▽ More

    Submitted 3 November, 2016; v1 submitted 14 July, 2016; originally announced July 2016.

  21. arXiv:1602.05837  [pdf, ps, other

    cs.DS cs.CC cs.CG

    Tight Hardness Results for Maximum Weight Rectangles

    Authors: Arturs Backurs, Nishanth Dikkala, Christos Tzamos

    Abstract: Given $n$ weighted points (positive or negative) in $d$ dimensions, what is the axis-aligned box which maximizes the total weight of the points it contains? The best known algorithm for this problem is based on a reduction to a related problem, the Weighted Depth problem [T. M. Chan, FOCS'13], and runs in time $O(n^d)$. It was conjectured [Barbay et al., CCCG'13] that this runtime is tight up to… ▽ More

    Submitted 2 March, 2016; v1 submitted 18 February, 2016; originally announced February 2016.

  22. arXiv:1511.07070  [pdf, ps, other

    cs.CC cs.DS

    Which Regular Expression Patterns are Hard to Match?

    Authors: Arturs Backurs, Piotr Indyk

    Abstract: Regular expressions constitute a fundamental notion in formal language theory and are frequently used in computer science to define search patterns. A classic algorithm for these problems constructs and simulates a non-deterministic finite automaton corresponding to the expression, resulting in an $O(mn)$ running time (where $m$ is the length of the pattern and $n$ is the length of the text). This… ▽ More

    Submitted 26 September, 2016; v1 submitted 22 November, 2015; originally announced November 2015.

  23. arXiv:1510.04622  [pdf, other

    cs.CC cs.DS

    Subtree Isomorphism Revisited

    Authors: Amir Abboud, Arturs Backurs, Thomas Dueholm Hansen, Virginia Vassilevska Williams, Or Zamir

    Abstract: The Subtree Isomorphism problem asks whether a given tree is contained in another given tree. The problem is of fundamental importance and has been studied since the 1960s. For some variants, e.g., ordered trees, near-linear time algorithms are known, but for the general case truly subquadratic algorithms remain elusive. Our first result is a reduction from the Orthogonal Vectors problem to Subt… ▽ More

    Submitted 15 October, 2015; originally announced October 2015.

  24. arXiv:1504.01431  [pdf, ps, other

    cs.CC cs.DS

    If the Current Clique Algorithms are Optimal, so is Valiant's Parser

    Authors: Amir Abboud, Arturs Backurs, Virginia Vassilevska Williams

    Abstract: The CFG recognition problem is: given a context-free grammar $\mathcal{G}$ and a string $w$ of length $n$, decide if $w$ can be obtained from $\mathcal{G}$. This is the most basic parsing question and is a core computer science problem. Valiant's parser from 1975 solves the problem in $O(n^ωおめが)$ time, where $ωおめが<2.373$ is the matrix multiplication exponent. Dozens of parsing algorithms have been propo… ▽ More

    Submitted 5 November, 2015; v1 submitted 6 April, 2015; originally announced April 2015.

  25. arXiv:1504.01076  [pdf, ps, other

    cs.DS cs.CG cs.IT

    Nearly-optimal bounds for sparse recovery in generic norms, with applications to $k$-median sketching

    Authors: Arturs Backurs, Piotr Indyk, Eric Price, Ilya Razenshteyn, David P. Woodruff

    Abstract: We initiate the study of trade-offs between sparsity and the number of measurements in sparse recovery schemes for generic norms. Specifically, for a norm $\|\cdot\|$, sparsity parameter $k$, approximation factor $K>0$, and probability of failure $P>0$, we ask: what is the minimal value of $m$ so that there is a distribution over $m \times n$ matrices $A$ with the property that for any $x$, given… ▽ More

    Submitted 4 April, 2015; originally announced April 2015.

    Comments: 29 pages

  26. arXiv:1501.07053  [pdf, ps, other

    cs.CC cs.DS

    Quadratic-Time Hardness of LCS and other Sequence Similarity Measures

    Authors: Amir Abboud, Arturs Backurs, Virginia Vassilevska Williams

    Abstract: Two important similarity measures between sequences are the longest common subsequence (LCS) and the dynamic time warping distance (DTWD). The computations of these measures for two given sequences are central tasks in a variety of applications. Simple dynamic programming algorithms solve these tasks in $O(n^2)$ time, and despite an extensive amount of research, no algorithms with significantly be… ▽ More

    Submitted 29 January, 2015; v1 submitted 28 January, 2015; originally announced January 2015.

  27. arXiv:1412.0348  [pdf, ps, other

    cs.CC cs.DS

    Edit Distance Cannot Be Computed in Strongly Subquadratic Time (unless SETH is false)

    Authors: Arturs Backurs, Piotr Indyk

    Abstract: The edit distance (a.k.a. the Levenshtein distance) between two strings is defined as the minimum number of insertions, deletions or substitutions of symbols needed to transform one string into another. The problem of computing the edit distance between two strings is a classical computational task, with a well-known algorithm based on dynamic programming. Unfortunately, all known algorithms for t… ▽ More

    Submitted 15 August, 2017; v1 submitted 30 November, 2014; originally announced December 2014.

    Comments: STOC'15

  28. arXiv:1302.4625  [pdf, ps, other

    cs.CC

    On the sum of $L1$ influences

    Authors: Artūrs Bačkurs, Mohammad Bavarian

    Abstract: For a function $f$ over the discrete cube, the total $L_1$ influence of $f$ is defined as $\sum_{i=1}^n \|\partial_i f\|_1$, where $\partial_i f$ denotes the discrete derivative of $f$ in the direction $i$. In this work, we show that the total $L_1$ influence of a $[-1,1]$-valued function $f$ can be upper bounded by a polynomial in the degree of $f$, resolving affirmatively an open problem of Aaro… ▽ More

    Submitted 12 April, 2014; v1 submitted 19 February, 2013; originally announced February 2013.

    Comments: Proceedings of CCC (2014)

  29. arXiv:1208.1122  [pdf, ps, other

    quant-ph

    Optimal quantum query bounds for almost all Boolean functions

    Authors: Andris Ambainis, Arturs Backurs, Juris Smotrovs, Ronald de Wolf

    Abstract: We show that almost all n-bit Boolean functions have bounded-error quantum query complexity at least n/2, up to lower-order terms. This improves over an earlier n/4 lower bound of Ambainis, and shows that van Dam's oracle interrogation is essentially optimal for almost all functions. Our proof uses the fact that the acceptance probability of a T-query algorithm can be written as the sum of squares… ▽ More

    Submitted 6 August, 2012; originally announced August 2012.

    Comments: 8 pages LaTeX

  30. arXiv:1112.3337  [pdf, ps, other

    quant-ph cs.CC cs.DS

    Search by quantum walks on two-dimensional grid without amplitude amplification

    Authors: Andris Ambainis, Arturs Backurs, Nikolajs Nahimovs, Raitis Ozols, Alexander Rivosh

    Abstract: We study search by quantum walk on a finite two dimensional grid. The algorithm of Ambainis, Kempe, Rivosh (quant-ph/0402107) takes O(\sqrt{N log N}) steps and finds a marked location with probability O(1/log N) for grid of size \sqrt{N} * \sqrt{N}. This probability is small, thus amplitude amplification is needed to achieve Θしーた(1) success probability. The amplitude amplification adds an additional… ▽ More

    Submitted 14 December, 2011; originally announced December 2011.

    Comments: 22 pages, 3 figures

  31. arXiv:1112.3330  [pdf, other

    quant-ph cs.GT

    Quantum strategies are better than classical in almost any XOR game

    Authors: Andris Ambainis, Arturs Backurs, Kaspars Balodis, Dmitry Kravcenko, Raitis Ozols, Juris Smotrovs, Madars Virza

    Abstract: We initiate a study of random instances of nonlocal games. We show that quantum strategies are better than classical for almost any 2-player XOR game. More precisely, for large n, the entangled value of a random 2-player XOR game with n questions to every player is at least 1.21... times the classical value, for 1-o(1) fraction of all 2-player XOR games.

    Submitted 14 December, 2011; originally announced December 2011.

    Comments: 22 pages, 1 figure

  32. arXiv:1112.2856  [pdf, ps, other

    quant-ph

    Worst case analysis of non-local games

    Authors: Andris Ambainis, Arturs Backurs, Kaspars Balodis, Agnis Skuskovniks, Juris Smotrovs, Madars Virza

    Abstract: Non-local games are studied in quantum information because they provide a simple way for proving the difference between the classical world and the quantum world. A non-local game is a cooperative game played by 2 or more players against a referee. The players cannot communicate but may share common random bits or a common quantum state. A referee sends an input $x_i$ to the $i^{th}$ player who th… ▽ More

    Submitted 13 December, 2011; originally announced December 2011.

    Comments: 14 pages