-
Optimal Preprocessing for Answering On-Line Product Queries
Authors:
Noga Alon,
Baruch Schieber
Abstract:
We examine the amount of preprocessing needed for answering certain on-line queries as fast as possible. We start with the following basic problem. Suppose we are given a semigroup $(S,\circ )$. Let $s_1 ,\ldots, s_n$ be elements of $S$. We want to answer on-line queries of the form, ``What is the product $s_i \circ s_{i+1} \circ \cdots \circ s_{j-1} \circ s_j$?'' for any given $1\le i\le j\le n$. We show that a preprocessing of $Θ(n λ(k,n))$ time and space is both necessary and sufficient to answer each such query in at most $k$ steps, for any fixed $k$. The function $λ(k,\cdot)$ is the inverse of a certain function at the $\lfloor {k/2}\rfloor$-th level of the primitive recursive hierarchy. In case linear preprocessing is desired, we show that one can answer each such query in $O( α(n))$ steps and that this is best possible. The function $α(n)$ is the inverse Ackermann function.
We also consider the following extended problem. Let $T$ be a tree with an element of $S$ associated with each of its vertices. We want to answer on-line queries of the form, ``What is the product of the elements associated with the vertices along the path from $u$ to $v$?'' for any pair of vertices $u$ and $v$ in $T$. We derive results that are similar to the above, for the preprocessing needed for answering such queries.
All our sequential preprocessing algorithms can be parallelized efficiently to give optimal parallel algorithms which run in $O(\log n)$ time on a CREW PRAM. These parallel algorithms are optimal in both running time and total number of operations.
Our algorithms, especially for the semigroup of the real numbers with the minimum or maximum operations, have various applications in certain graph algorithms, in the utilization of communication networks and in Database retrieval.
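As a hedged illustration of the basic query problem (not the construction from the paper), the sketch below uses a well-known structure often called a disjoint sparse table: it spends $O(n \log n)$ time and space on preprocessing, after which each query applies the semigroup operation at most once. The class name `DisjointSparseTable` and the 0-indexed `query(i, j)` interface are assumptions made for this example.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


class DisjointSparseTable:
    """Range products over any associative operation (hypothetical helper)."""

    def __init__(self, items: Sequence[T], op: Callable[[T, T], T]) -> None:
        self.op = op
        self.items: List[T] = list(items)
        n = len(self.items)
        self.levels: List[List[T]] = []
        # Level h cuts the array into blocks of size 2^(h+1) with a midpoint.
        for h in range(max(1, (n - 1).bit_length())):
            half = 1 << h
            row = list(self.items)
            for mid in range(half, n, 2 * half):
                # row[i] = items[i] o ... o items[mid-1] for i left of mid.
                for i in range(mid - 2, mid - half - 1, -1):
                    row[i] = op(self.items[i], row[i + 1])
                # row[i] = items[mid] o ... o items[i] for i right of mid.
                for i in range(mid + 1, min(mid + half, n)):
                    row[i] = op(row[i - 1], self.items[i])
            self.levels.append(row)

    def query(self, i: int, j: int) -> T:
        """Product items[i] o ... o items[j], with 0 <= i <= j < n."""
        if i == j:
            return self.items[i]
        # The highest bit where i and j differ picks the level whose
        # midpoint separates them.
        h = (i ^ j).bit_length() - 1
        row = self.levels[h]
        return self.op(row[i], row[j])


words = DisjointSparseTable(["a", "b", "c", "d", "e"], lambda x, y: x + y)
mins = DisjointSparseTable([5, 2, 7, 1, 9, 3], min)
```

String concatenation is used in the first instance because it is associative but not commutative, so the order of the factors is exercised; the second instance covers the minimum operation mentioned in the abstract.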
Submitted 10 June, 2024;
originally announced June 2024.
-
Rainbow Stackings of Random Edge-Colorings
Authors:
Noga Alon,
Colin Defant,
Noah Kravitz
Abstract:
A rainbow stacking of $r$-edge-colorings $χ_1, \ldots, χ_m$ of the complete graph on $n$ vertices is a way of superimposing $χ_1, \ldots, χ_m$ so that no edges of the same color are superimposed on each other. We determine a sharp threshold for $r$ (as a function of $m$ and $n$) governing the existence and nonexistence of rainbow stackings of random $r$-edge-colorings $χ_1,\ldots,χ_m$.
Submitted 23 May, 2024;
originally announced May 2024.
-
The Helly number of Hamming balls and related problems
Authors:
Noga Alon,
Zhihan Jin,
Benny Sudakov
Abstract:
We prove the following variant of Helly's classical theorem for Hamming balls with a bounded radius. For $n>t$ and any (finite or infinite) set $X$, if in a family of Hamming balls of radius $t$ in $X^n$ every subfamily of at most $2^{t+1}$ balls has a common point, then so do all members of the family. This is tight for all $|X|>1$ and all $n>t$. The proof of the main result is based on a novel variant of the so-called dimension argument, which allows one to prove upper bounds that do not depend on the dimension of the ambient space. We also discuss several related questions and connections to problems and results in extremal finite set theory and graph theory.
Submitted 2 June, 2024; v1 submitted 16 May, 2024;
originally announced May 2024.
-
Detecting and Deterring Manipulation in a Cognitive Hierarchy
Authors:
Nitay Alon,
Lion Schulz,
Joseph M. Barnby,
Jeffrey S. Rosenschein,
Peter Dayan
Abstract:
Social agents with finitely nested opponent models are vulnerable to manipulation by agents with deeper reasoning and more sophisticated opponent modelling. This imbalance, rooted in logic and the theory of recursive modelling frameworks, cannot be solved directly. We propose a computational framework, $\aleph$-IPOMDP, augmenting model-based RL agents' Bayesian inference with an anomaly detection algorithm and an out-of-belief policy. Our mechanism allows agents to realize they are being deceived, even if they cannot understand how, and to deter opponents via a credible threat. We test this framework in both a mixed-motive and a zero-sum game. Our results show the $\aleph$ mechanism's effectiveness, leading to more equitable outcomes and less exploitation by more sophisticated agents. We discuss implications for AI safety, cybersecurity, cognitive science, and psychiatry.
Submitted 3 May, 2024;
originally announced May 2024.
-
Sumsets in the Hypercube
Authors:
Noga Alon,
Or Zamir
Abstract:
A subset $S$ of the Boolean hypercube $\mathbb{F}_2^n$ is a sumset if $S = A+A = \{a + b \ | \ a, b\in A\}$ for some $A \subseteq \mathbb{F}_2^n$. We prove that the number of sumsets in $\mathbb{F}_2^n$ is asymptotically $(2^n-1)2^{2^{n-1}}$. Furthermore, we show that the family of sumsets in $\mathbb{F}_2^n$ is almost identical to the family of all subsets of $\mathbb{F}_2^n$ that contain a complete linear subspace of co-dimension $1$.
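The definition of a sumset can be made concrete with a small sketch. It assumes elements of $\mathbb{F}_2^n$ are encoded as Python integers in the range $0,\ldots,2^n-1$, so that addition in $\mathbb{F}_2^n$ is bitwise XOR. The function names `sumset` and `contains_hyperplane` are hypothetical, and the brute-force subspace check is only meant for tiny $n$.

```python
from typing import Iterable, Set


def sumset(a: Iterable[int]) -> Set[int]:
    """Return A + A = {x + y : x, y in A}, with addition taken as XOR."""
    elems = list(a)
    return {x ^ y for x in elems for y in elems}


def contains_hyperplane(s: Set[int], n: int) -> bool:
    """Check whether s contains a linear subspace of co-dimension 1.

    Each such subspace is {x : <x, v> = 0} for some nonzero v, where the
    inner product is the parity of the number of common 1-bits.
    """
    for v in range(1, 1 << n):
        hyperplane = {
            x for x in range(1 << n) if bin(x & v).count("1") % 2 == 0
        }
        if hyperplane <= s:
            return True
    return False


example = sumset({0b01, 0b10})
```

Here `example` is the set `{0b00, 0b11}`, which is itself a subspace of co-dimension 1 in $\mathbb{F}_2^2$.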
Submitted 16 April, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Erasure codes and Turán hypercube problems
Authors:
Noga Alon
Abstract:
We observe that several vertex Turán type problems for the hypercube that received a considerable amount of attention in the combinatorial community are equivalent to questions about erasure list-decodable codes. Analyzing a recent construction of Ellis, Ivan and Leader, and determining the Turán density of certain hypergraph augmentations, we obtain improved bounds for some of these problems.
Submitted 24 March, 2024;
originally announced March 2024.
-
Emergent Dominance Hierarchies in Reinforcement Learning Agents
Authors:
Ram Rachum,
Yonatan Nakar,
Bill Tomlinson,
Nitay Alon,
Reuth Mirsky
Abstract:
Modern Reinforcement Learning (RL) algorithms are able to outperform humans in a wide variety of tasks. Multi-agent reinforcement learning (MARL) settings present additional challenges, and successful cooperation in mixed-motive groups of agents depends on a delicate balancing act between individual and group objectives. Social conventions and norms, often inspired by human institutions, are used as tools for striking this balance.
In this paper, we examine a fundamental, well-studied social convention that underlies cooperation in both animal and human societies: dominance hierarchies.
We adapt the ethological theory of dominance hierarchies to artificial agents, borrowing the established terminology and definitions with as few amendments as possible. We demonstrate that populations of RL agents, operating without explicit programming or intrinsic rewards, can invent, learn, enforce, and transmit a dominance hierarchy to new populations. The dominance hierarchies that emerge have a similar structure to those studied in chickens, mice, fish, and other species.
Submitted 22 June, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
Partitioning the hypercube into smaller hypercubes
Authors:
Noga Alon,
Jozsef Balogh,
Vladimir N. Potapov
Abstract:
Denote by $Q_d$ the $d$-dimensional hypercube. Addressing a recent question, we estimate the number of ways the vertex set of $Q_d$ can be partitioned into vertex-disjoint smaller cubes. Among other results, we prove that the asymptotic order of this function is not much larger than the number of perfect matchings of $Q_d$. We also describe several new (and old) questions.
Submitted 3 February, 2024; v1 submitted 30 December, 2023;
originally announced January 2024.
-
Optimal Sample Complexity of Contrastive Learning
Authors:
Noga Alon,
Dmitrii Avdiukhin,
Dor Elboim,
Orr Fischer,
Grigory Yaroslavtsev
Abstract:
Contrastive learning is a highly successful technique for learning representations of data from labeled tuples, specifying the distance relations within the tuple. We study the sample complexity of contrastive learning, i.e. the minimum number of labeled tuples sufficient for getting high generalization accuracy. We give tight bounds on the sample complexity in a variety of settings, focusing on arbitrary distance functions, both general $\ell_p$-distances, and tree metrics. Our main result is an (almost) optimal bound on the sample complexity of learning $\ell_p$-distances for integer $p$. For any $p \ge 1$ we show that $\tilde Θ(\min(nd,n^2))$ labeled tuples are necessary and sufficient for learning $d$-dimensional representations of $n$-point datasets. Our results hold for an arbitrary distribution of the input samples and are based on giving the corresponding bounds on the Vapnik-Chervonenkis/Natarajan dimension of the associated problems. We further show that the theoretical bounds on sample complexity obtained via VC/Natarajan dimension can have strong predictive power for experimental results, in contrast with the folklore belief about a substantial gap between the statistical learning theory and the practice of deep learning.
Submitted 1 December, 2023;
originally announced December 2023.
-
Universality for graphs with bounded density
Authors:
Noga Alon,
Natalie Dodson,
Carmen Jackson,
Rose McCarty,
Rajko Nenadov,
Lani Southern
Abstract:
A graph $G$ is $\textit{universal}$ for a (finite) family $\mathcal{H}$ of graphs if every $H \in \mathcal{H}$ is a subgraph of $G$. For a given family $\mathcal{H}$, the goal is to determine the smallest number of edges an $\mathcal{H}$-universal graph can have. With the aim of unifying a number of recent results, we consider a family of graphs with bounded density. In particular, we construct a graph with $O_d\left( n^{2 - 1/(\lceil d \rceil + 1)} \right)$ edges which contains every $n$-vertex graph with density at most $d \in \mathbb{Q}$ ($d \ge 1$), which is close to a lower bound $Ω(n^{2 - 1/d - o(1)})$ obtained by counting lifts of a carefully chosen (small) graph. When restricting the maximum degree of such graphs to be constant, we obtain a near-optimal universality. If we further assume $d \in \mathbb{N}$, we get an asymptotically optimal construction.
Submitted 11 January, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
Essentially tight bounds for rainbow cycles in proper edge-colourings
Authors:
Noga Alon,
Matija Bucić,
Lisa Sauermann,
Dmitrii Zakharov,
Or Zamir
Abstract:
An edge-coloured graph is said to be rainbow if no colour appears more than once. Extremal problems involving rainbow objects have been a focus of much research over the last decade as they capture the essence of a number of interesting problems in a variety of areas. A particularly intensively studied question due to Keevash, Mubayi, Sudakov and Verstraëte from 2007 asks for the maximum possible average degree of a properly edge-coloured graph on $n$ vertices without a rainbow cycle. Improving upon a series of earlier bounds, Tomon proved an upper bound of $(\log n)^{2+o(1)}$ for this question. Very recently, Janzer-Sudakov and Kim-Lee-Liu-Tran independently removed the $o(1)$ term in Tomon's bound, showing a bound of $O(\log^2 n)$. We prove an upper bound of $(\log n)^{1+o(1)}$ for this maximum possible average degree when there is no rainbow cycle. Our result is tight up to the $o(1)$ term, and so it essentially resolves this question. In addition, we observe a connection between this problem and several questions in additive number theory, allowing us to extend existing results on these questions for abelian groups to the case of non-abelian groups.
Submitted 8 September, 2023;
originally announced September 2023.
-
The power of many colours
Authors:
Noga Alon,
Matija Bucić,
Micha Christoph,
Michael Krivelevich
Abstract:
A classical problem, due to Gerencsér and Gyárfás from 1967, asks how large a monochromatic connected component can we guarantee in any $r$-edge colouring of $K_n$? We consider how big a connected component can we guarantee in any $r$-edge colouring of $K_n$ if we allow ourselves to use up to $s$ colours. This is actually an instance of a more general question of Bollobás from about 20 years ago which asks for a $k$-connected subgraph in the same setting. We complete the picture in terms of the approximate behaviour of the answer by determining it up to a logarithmic term, provided $n$ is large enough. We obtain more precise results for certain regimes which solve a problem of Liu, Morris and Prince from 2007, as well as disprove a conjecture they pose in a strong form.
We also consider a generalisation in a similar direction of a question first studied by Erdős and Rényi in 1956: given $n$ and $m$, what is the smallest number of $m$-cliques which can cover all edges of $K_n$? This problem is essentially equivalent to the question of what is the minimum number of vertices that are certain to be incident to at least one edge of some colour in any $r$-edge colouring of $K_n$. We consider what happens if we allow ourselves to use up to $s$ colours. We obtain a more complete understanding of the answer to this question for large $n$, in particular determining it up to a constant factor for all $1\le s \le r$, as well as obtaining much more precise results for various ranges including the correct asymptotics for essentially the whole range.
Submitted 10 June, 2024; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Connectivity Graph-Codes
Authors:
Noga Alon
Abstract:
The symmetric difference of two graphs $G_1,G_2$ on the same set of vertices $V$ is the graph on $V$ whose set of edges are all edges that belong to exactly one of the two graphs $G_1,G_2$. For a fixed graph $H$ call a collection ${\cal G}$ of spanning subgraphs of $H$ a connectivity code for $H$ if the symmetric difference of any two distinct subgraphs in ${\cal G}$ is a connected spanning subgraph of $H$. It is easy to see that the maximum possible cardinality of such a collection is at most $2^{k'(H)} \leq 2^{δ(H)}$, where $k'(H)$ is the edge-connectivity of $H$ and $δ(H)$ is its minimum degree. We show that equality holds for any $d$-regular (mild) expander, and observe that equality does not hold in several natural examples including any large cubic graph, the square of a long cycle and products of a small clique with a long cycle.
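The definition of a connectivity code can be checked mechanically on small examples. The following is a minimal sketch, assuming each graph is stored as a `frozenset` of edges and each edge as a `frozenset` of two vertices; the names `is_connected_spanning` and `is_connectivity_code` are hypothetical, and the family is assumed to consist of distinct subgraphs.

```python
from itertools import combinations
from typing import FrozenSet, Hashable, Iterable, Sequence

Edge = FrozenSet[Hashable]
Graph = FrozenSet[Edge]


def is_connected_spanning(vertices: Sequence[Hashable], edges: Iterable[Edge]) -> bool:
    """True if the graph (vertices, edges) is connected on all the vertices."""
    if not vertices:
        return True
    adj = {v: set() for v in vertices}
    for e in edges:
        u, w = tuple(e)
        adj[u].add(w)
        adj[w].add(u)
    # Depth-first search from an arbitrary start vertex.
    seen = {vertices[0]}
    stack = [vertices[0]]
    while stack:
        u = stack.pop()
        for w in adj[u]:
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return len(seen) == len(vertices)


def is_connectivity_code(vertices: Sequence[Hashable], family: Sequence[Graph]) -> bool:
    """Check that every pairwise symmetric difference is connected and spanning."""
    return all(
        is_connected_spanning(vertices, g1 ^ g2)
        for g1, g2 in combinations(family, 2)
    )


# Example with H the triangle on vertices a, b, c.
ab, bc, ca = frozenset("ab"), frozenset("bc"), frozenset("ca")
triangle_code = [
    frozenset(),
    frozenset({ab, bc}),
    frozenset({bc, ca}),
    frozenset({ab, ca}),
]
```

For the triangle, the four subgraphs with an even number of edges form a connectivity code of size $4 = 2^2$, since any two of them differ in exactly two edges, which form a spanning path.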
Submitted 6 September, 2023; v1 submitted 15 August, 2023;
originally announced August 2023.
-
Ordering Candidates via Vantage Points
Authors:
Noga Alon,
Colin Defant,
Noah Kravitz,
Daniel G. Zhu
Abstract:
Given an $n$-element set $C\subseteq\mathbb{R}^d$ and a (sufficiently generic) $k$-element multiset $V\subseteq\mathbb{R}^d$, we can order the points in $C$ by ranking each point $c\in C$ according to the sum of the distances from $c$ to the points of $V$. Let $Ψ_k(C)$ denote the set of orderings of $C$ that can be obtained in this manner as $V$ varies, and let $ψ^{\mathrm{max}}_{d,k}(n)$ be the maximum of $\lvertΨ_k(C)\rvert$ as $C$ ranges over all $n$-element subsets of $\mathbb{R}^d$. We prove that $ψ^{\mathrm{max}}_{d,k}(n)=Θ_{d,k}(n^{2dk})$ when $d \geq 2$ and that $ψ^{\mathrm{max}}_{1,k}(n)=Θ_k(n^{4\lceil k/2\rceil -1})$. As a step toward proving this result, we establish a bound on the number of sign patterns determined by a collection of functions that are sums of radicals of nonnegative polynomials; this can be understood as an analogue of a classical theorem of Warren. We also prove several results about the set $Ψ(C)=\bigcup_{k\geq 1}Ψ_k(C)$; this includes an exact description of $Ψ(C)$ when $d=1$ and when $C$ is the set of vertices of a vertex-transitive polytope.
Submitted 9 August, 2023;
originally announced August 2023.
-
On bipartite coverings of graphs and multigraphs
Authors:
Noga Alon
Abstract:
A bipartite covering of a (multi)graph $G$ is a collection of bipartite graphs, so that each edge of $G$ belongs to at least one of them. The capacity of the covering is the sum of the numbers of vertices of these bipartite graphs. In this note we establish a (modest) strengthening of old results of Hansel and of Katona and Szemerédi, by showing that the capacity of any bipartite covering of a graph on $n$ vertices in which the maximum size of an independent set containing vertex number $i$ is $α_i$, is at least $\sum_i \log_2 (n/α_i).$ We also obtain slightly improved bounds for a recent result of Kim and Lee about the minimum possible capacity of a bipartite covering of complete multigraphs.
Submitted 31 July, 2023;
originally announced July 2023.
-
Sublinear Time Shortest Path in Expander Graphs
Authors:
Noga Alon,
Allan Grønlund,
Søren Fuglede Jørgensen,
Kasper Green Larsen
Abstract:
Computing a shortest path between two nodes in an undirected unweighted graph is among the most basic algorithmic tasks. Breadth first search solves this problem in linear time, which is clearly also a lower bound in the worst case. However, several works have shown how to solve this problem in sublinear time in expectation when the input graph is drawn from one of several classes of random graphs. In this work, we extend these results by giving sublinear time shortest path (and short path) algorithms for expander graphs. We thus identify a natural deterministic property of a graph (that is satisfied by typical random regular graphs) which suffices for sublinear time shortest paths. The algorithms are very simple, involving only bidirectional breadth first search and short random walks. We also complement our new algorithms by near-matching lower bounds.
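One of the two ingredients named above, bidirectional breadth first search, can be sketched as follows. This is a generic textbook version that returns the distance between two nodes; it does not include the random-walk component and makes no claim to reproduce the algorithms or the sublinear bounds of the paper. The function name and the adjacency-dictionary input format are assumptions for this example.

```python
from typing import Dict, Hashable, List, Optional


def bidirectional_bfs_distance(
    adj: Dict[Hashable, List[Hashable]], s: Hashable, t: Hashable
) -> Optional[int]:
    """Shortest-path length between s and t in an undirected unweighted graph.

    adj maps every vertex to the list of its neighbours. Returns None if
    s and t lie in different connected components.
    """
    if s == t:
        return 0
    dist_s, dist_t = {s: 0}, {t: 0}
    frontier_s, frontier_t = [s], [t]
    while frontier_s and frontier_t:
        # Always grow the search whose current frontier is smaller.
        expand_s = len(frontier_s) <= len(frontier_t)
        if expand_s:
            frontier, dist, other = frontier_s, dist_s, dist_t
        else:
            frontier, dist, other = frontier_t, dist_t, dist_s
        next_frontier = []
        for u in frontier:
            for v in adj[u]:
                if v in other:
                    # The two searches only expand whole levels and were
                    # disjoint before this level, so the first meeting
                    # already gives a shortest path.
                    return dist[u] + 1 + other[v]
                if v not in dist:
                    dist[v] = dist[u] + 1
                    next_frontier.append(v)
        if expand_s:
            frontier_s = next_frontier
        else:
            frontier_t = next_frontier
    return None


# A 6-cycle 0-1-2-3-4-5-0 plus an isolated vertex 6.
cycle = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
cycle[6] = []
```

Expanding the smaller frontier first is a common heuristic for keeping the number of explored vertices low; the returned distance does not depend on that choice.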
Submitted 31 July, 2023; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Strong blocking sets and minimal codes from expander graphs
Authors:
Noga Alon,
Anurag Bishnoi,
Shagnik Das,
Alessandro Neri
Abstract:
A strong blocking set in a finite projective space is a set of points that intersects each hyperplane in a spanning set. We provide a new graph theoretic construction of such sets: combining constant-degree expanders with asymptotically good codes, we explicitly construct strong blocking sets in the $(k-1)$-dimensional projective space over $\mathbb{F}_q$ that have size $O( q k )$. Since strong blocking sets have recently been shown to be equivalent to minimal linear codes, our construction gives the first explicit construction of $\mathbb{F}_q$-linear minimal codes of length $n$ and dimension $k$, for every prime power $q$, for which $n = O (q k)$. This solves one of the main open problems on minimal codes.
Submitted 24 May, 2023;
originally announced May 2023.
-
A Unified Characterization of Private Learnability via Graph Theory
Authors:
Noga Alon,
Shay Moran,
Hilla Schefler,
Amir Yehudayoff
Abstract:
We provide a unified framework for characterizing pure and approximate differentially private (DP) learnability. The framework uses the language of graph theory: for a concept class $\mathcal{H}$, we define the contradiction graph $G$ of $\mathcal{H}$. Its vertices are realizable datasets, and two datasets $S,S'$ are connected by an edge if they contradict each other (i.e., there is a point $x$ that is labeled differently in $S$ and $S'$). Our main finding is that the combinatorial structure of $G$ is deeply related to learning $\mathcal{H}$ under DP. Learning $\mathcal{H}$ under pure DP is captured by the fractional clique number of $G$. Learning $\mathcal{H}$ under approximate DP is captured by the clique number of $G$. Consequently, we identify graph-theoretic dimensions that characterize DP learnability: the clique dimension and fractional clique dimension. Along the way, we reveal properties of the contradiction graph which may be of independent interest. We also suggest several open questions and directions for future research.
Submitted 12 June, 2024; v1 submitted 8 April, 2023;
originally announced April 2023.
-
The limit points of the top and bottom eigenvalues of regular graphs
Authors:
Noga Alon,
Fan Wei
Abstract:
We prove that for each $d \geq 3$ the set of all limit points of the second largest eigenvalue of growing sequences of $d$-regular graphs is $[2\sqrt{d-1},d]$. A similar argument shows that the set of all limit points of the smallest eigenvalue of growing sequences of $d$-regular graphs with growing (odd) girth is $[-d, -2 \sqrt{d-1}]$. The more general question of identifying all vectors which are limit points of the vectors of the top $k$ eigenvalues of sequences of $d$-regular graphs is considered as well. As a byproduct, in the study of a discrete counterpart of the "scarring" phenomenon observed in the investigation of quantum ergodicity on manifolds, our technique provides a method to construct $d$-regular almost Ramanujan graphs with large girth and localized eigenvectors corresponding to eigenvalues larger than $2\sqrt{d-1}$, strengthening a result of Alon, Ganguly, and Srivastava.
Submitted 12 October, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
Unit and distinct distances in typical norms
Authors:
Noga Alon,
Matija Bucić,
Lisa Sauermann
Abstract:
Erdős' unit distance problem and Erdős' distinct distances problem are among the most classical and well-known open problems in all of discrete mathematics. They ask for the maximum number of unit distances, or the minimum number of distinct distances, respectively, determined by $n$ points in the Euclidean plane. The question of what happens in these problems if one considers normed spaces other than the Euclidean plane was raised in the 1980s by Ulam and Erdős and has attracted a lot of attention over the years. We give an essentially tight answer to both questions for almost all norms on $\mathbb{R}^d$, in a certain Baire categoric sense.
For the unit distance problem we prove that for almost all norms $\|\cdot\|$ on $\mathbb{R}^d$, any set of $n$ points defines at most $\frac{1}{2} d \cdot n \log_2 n$ unit distances according to $\|\cdot\|$. We also show that this is essentially tight, by proving that for every norm $\|\cdot\|$ on $\mathbb{R}^d$, for any large $n$, we can find $n$ points defining at least $\frac{1}{2}(d-1-o(1))\cdot n \log_2 n$ unit distances according to $\|\cdot\|$.
For the distinct distances problem, we prove that for almost all norms $\|\cdot\|$ on $\mathbb{R}^d$ any set of $n$ points defines at least $(1-o(1))n$ distinct distances according to $\|\cdot\|$. This is clearly tight up to the $o(1)$ term.
Our results settle, in a strong and somewhat surprising form, problems and conjectures of Brass, of Matoušek, and of Brass-Moser-Pach. The proofs combine combinatorial and geometric ideas with tools from Linear Algebra, Topology and Algebraic Geometry.
Submitted 17 February, 2023;
originally announced February 2023.
-
Graph-codes
Authors:
Noga Alon
Abstract:
The symmetric difference of two graphs $G_1,G_2$ on the same set of vertices $[n]=\{1,2, \ldots ,n\}$ is the graph on $[n]$ whose set of edges are all edges that belong to exactly one of the two graphs $G_1,G_2$. Let $H$ be a fixed graph with an even (positive) number of edges, and let $D_H(n)$ denote the maximum possible cardinality of a family of graphs on $[n]$ containing no two members whose symmetric difference is a copy of $H$. Is it true that $D_H(n)=o(2^{n \choose 2})$ for any such $H$? We discuss this problem, compute the value of $D_H(n)$ up to a constant factor for stars and matchings, and discuss several variants of the problem including ones that have been considered in earlier work.
Submitted 6 February, 2023; v1 submitted 30 January, 2023;
originally announced January 2023.
-
Diagonalization Games
Authors:
Noga Alon,
Olivier Bousquet,
Kasper Green Larsen,
Shay Moran,
Shlomo Moran
Abstract:
We study several variants of a combinatorial game which is based on Cantor's diagonal argument.
The game is between two players called Kronecker and Cantor. The names of the players are motivated by the known fact that Leopold Kronecker did not appreciate Georg Cantor's arguments about the infinite, and even referred to him as a "scientific charlatan". In the game Kronecker maintains a list of m binary vectors, each of length n, and Cantor's goal is to produce a new binary vector which is different from each of Kronecker's vectors, or prove that no such vector exists. Cantor does not see Kronecker's vectors but he is allowed to ask queries of the form "What is bit number j of vector number i?" What is the minimal number of queries with which Cantor can achieve his goal? How much better can Cantor do if he is allowed to pick his queries \emph{adaptively}, based on Kronecker's previous replies? The case when m=n is solved by diagonalization using n (non-adaptive) queries. We study this game more generally, and prove an optimal bound in the adaptive case and nearly tight upper and lower bounds in the non-adaptive case.
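The m=n case mentioned in the abstract can be illustrated with a minimal Python sketch. The names `diagonalize`, `query` and `hidden` are hypothetical and chosen only for this illustration; the sketch covers the classical diagonal argument, not the paper's bounds for general m and n.

```python
def diagonalize(query, n):
    """Return a binary vector of length n that differs from each of n hidden vectors.

    query(i, j) is assumed to return bit j of hidden vector i. Only the n
    diagonal positions (i, i) are queried, and they are fixed in advance,
    so the strategy is non-adaptive.
    """
    return [1 - query(i, i) for i in range(n)]


# Illustrative hidden list kept by "Kronecker" (an assumption for the demo).
hidden = [[0, 1, 1],
          [1, 1, 0],
          [0, 0, 0]]
calls = []


def query(i, j):
    calls.append((i, j))  # record every query that "Cantor" asks
    return hidden[i][j]


new_vector = diagonalize(query, 3)
print(new_vector, "found with", len(calls), "queries")
```

The output vector differs from vector number i in position i, which is why n queries suffice when m=n.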
Submitted 22 January, 2023; v1 submitted 5 January, 2023;
originally announced January 2023.
-
Invertibility of digraphs and tournaments
Authors:
Noga Alon,
Emil Powierski,
Michael Savery,
Alex Scott,
Elizabeth Wilmer
Abstract:
For an oriented graph $D$ and a set $X\subseteq V(D)$, the inversion of $X$ in $D$ is the digraph obtained by reversing the orientations of the edges of $D$ with both endpoints in $X$. The inversion number of $D$, $\textrm{inv}(D)$, is the minimum number of inversions which can be applied in turn to $D$ to produce an acyclic digraph. Answering a recent question of Bang-Jensen, da Silva, and Havet we show that, for each $k\in\mathbb{N}$ and tournament $T$, the problem of deciding whether $\textrm{inv}(T)\leq k$ is solvable in time $O_k(|V(T)|^2)$, which is tight for all $k$. In particular, the problem is fixed-parameter tractable when parameterised by $k$. On the other hand, we build on their work to prove their conjecture that for $k\geq 1$ the problem of deciding whether a general oriented graph $D$ has $\textrm{inv}(D)\leq k$ is NP-complete. We also construct oriented graphs with inversion number equal to twice their cycle transversal number, confirming another conjecture of Bang-Jensen, da Silva, and Havet, and we provide a counterexample to their conjecture concerning the inversion number of so-called 'dijoin' digraphs while proving that it holds in certain cases. Finally, we asymptotically solve the natural extremal question in this setting, improving on previous bounds of Belkhechine, Bouaziz, Boudabbous, and Pouzet to show that the maximum inversion number of an $n$-vertex tournament is $(1+o(1))n$.
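To make the definition of inversion concrete, here is a small brute-force sketch in Python. It is exponential in the number of vertices and is not the $O_k(|V(T)|^2)$ algorithm of the paper; the function names are assumptions made for this illustration. It relies on the observation that the final orientation of an edge depends only on how many of the chosen sets contain both of its endpoints, so the order of the inversions does not matter and repeating a set is never useful.

```python
from itertools import combinations


def invert(edges, X):
    """Reverse every edge (u, v) that has both endpoints in X."""
    X = set(X)
    return {(v, u) if u in X and v in X else (u, v) for (u, v) in edges}


def is_acyclic(vertices, edges):
    """Kahn's algorithm: the digraph is acyclic iff every vertex can be peeled off."""
    indeg = {v: 0 for v in vertices}
    out = {v: [] for v in vertices}
    for u, v in edges:
        out[u].append(v)
        indeg[v] += 1
    stack = [v for v in vertices if indeg[v] == 0]
    peeled = 0
    while stack:
        u = stack.pop()
        peeled += 1
        for v in out[u]:
            indeg[v] -= 1
            if indeg[v] == 0:
                stack.append(v)
    return peeled == len(indeg)


def inversion_number(vertices, edges):
    """Smallest number of inversions that makes the oriented graph acyclic (brute force)."""
    vertices = list(vertices)
    subsets = [c for r in range(2, len(vertices) + 1)
               for c in combinations(vertices, r)]
    k = 0
    while True:
        # Try every family of k distinct vertex sets.
        for family in combinations(subsets, k):
            current = set(edges)
            for X in family:
                current = invert(current, X)
            if is_acyclic(vertices, current):
                return k
        k += 1


directed_triangle = {(0, 1), (1, 2), (2, 0)}
transitive_triangle = {(0, 1), (1, 2), (0, 2)}
print(inversion_number(range(3), directed_triangle))
print(inversion_number(range(3), transitive_triangle))
```

The search always terminates, because reversing single edges (inverting two-element sets) can turn any oriented graph into an acyclic one.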
Submitted 22 January, 2024; v1 submitted 22 December, 2022;
originally announced December 2022.
-
New bounds on the maximum number of neighborly boxes in R^d
Authors:
Noga Alon,
Jarosław Grytczuk,
Andrzej P. Kisielewicz,
Krzysztof Przesławski
Abstract:
A family of axis-aligned boxes in $\mathbb{R}^d$ is \emph{$k$-neighborly} if the intersection of every two of them has dimension at least $d-k$ and at most $d-1$. Let $n(k,d)$ denote the maximum size of such a family. It is known that $n(k,d)$ can be equivalently defined as the maximum number of vertices in a complete graph whose edges can be covered by $d$ complete bipartite graphs, with each edge covered at most $k$ times.
We derive a new upper bound on $n(k,d)$, which implies, in particular, that $n(k,d)\leqslant (2-δ)^d$ if $k\leqslant (1-\varepsilon)d$, where $δ>0$ depends on arbitrarily chosen $\varepsilon>0$. The proof applies a classical result of Kleitman, concerning the maximum size of sets with a given diameter in discrete hypercubes. By an explicit construction we obtain also a new lower bound for $n(k,d)$, which implies that $n(k,d)\geqslant (1-o(1))\frac{d^k}{k!}$. We also study $k$-neighborly families of boxes with additional structural properties. Families called \emph{total laminations}, that split in a tree-like fashion, turn out to be particularly useful for explicit constructions. We pose a few conjectures based on these constructions and some computational experiments.
Submitted 3 March, 2023; v1 submitted 9 December, 2022;
originally announced December 2022.
-
Cats in cubes
Authors:
Noga Alon,
Noah Kravitz
Abstract:
Answering a recent question of Patchell and Spiro, we show that when a $d$-dimensional cube of side length $n$ is filled with letters, the word $\mathsf{CAT}$ can appear contiguously at most $(3^{d-1}/2)n^d$ times (allowing diagonals); we also characterize when equality occurs and extend our results to words other than $\mathsf{CAT}$.
Submitted 27 November, 2022;
originally announced November 2022.
-
Turán graphs with bounded matching number
Authors:
Noga Alon,
Peter Frankl
Abstract:
We determine the maximum possible number of edges of a graph with $n$ vertices, matching number at most $s$ and clique number at most $k$ for all admissible values of the parameters.
Submitted 26 October, 2022;
originally announced October 2022.
-
Largest subgraph from a hereditary property in a random graph
Authors:
Noga Alon,
Michael Krivelevich,
Wojciech Samotij
Abstract:
We prove that for every non-trivial hereditary family of graphs ${\cal P}$ and for every fixed $p \in (0,1)$, the maximum possible number of edges in a subgraph of the random graph $G(n,p)$ which belongs to ${\cal P}$ is, with high probability, $$ \left(1-\frac{1}{k-1}+o(1)\right)p{n \choose 2}, $$ where $k$ is the minimum chromatic number of a graph that does not belong to ${\cal P}$.
Submitted 23 October, 2022;
originally announced October 2022.
-
Logarithmically larger deletion codes of all distances
Authors:
Noga Alon,
Gabriela Bourla,
Ben Graham,
Xiaoyu He,
Noah Kravitz
Abstract:
The deletion distance between two binary words $u,v \in \{0,1\}^n$ is the smallest $k$ such that $u$ and $v$ share a common subsequence of length $n-k$. A set $C$ of binary words of length $n$ is called a $k$-deletion code if every pair of distinct words in $C$ has deletion distance greater than $k$. In 1965, Levenshtein initiated the study of deletion codes by showing that, for $k\ge 1$ fixed and $n$ going to infinity, a $k$-deletion code $C\subseteq \{0,1\}^n$ of maximum size satisfies $Ω_k(2^n/n^{2k}) \leq |C| \leq O_k( 2^n/n^k)$. We make the first asymptotic improvement to these bounds by showing that there exist $k$-deletion codes with size at least $Ω_k(2^n \log n/n^{2k})$. Our proof is inspired by Jiang and Vardy's improvement to the classical Gilbert--Varshamov bounds. We also establish several related results on the number of longest common subsequences and shortest common supersequences of a pair of words with given length and deletion distance.
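The definitions in this abstract translate directly into a longest-common-subsequence computation. The following Python sketch uses assumed function names and the standard quadratic dynamic program; it only illustrates the definitions and does not construct the large codes obtained in the paper.

```python
from itertools import combinations


def lcs_length(u, v):
    """Length of a longest common subsequence of u and v (row-by-row dynamic program)."""
    prev = [0] * (len(v) + 1)
    for a in u:
        cur = [0]
        for j, b in enumerate(v, 1):
            if a == b:
                cur.append(prev[j - 1] + 1)
            else:
                cur.append(max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def deletion_distance(u, v):
    """Smallest k such that u and v share a common subsequence of length n - k."""
    if len(u) != len(v):
        raise ValueError("words must have the same length")
    return len(u) - lcs_length(u, v)


def is_k_deletion_code(words, k):
    """Check that every pair of distinct words has deletion distance greater than k."""
    return all(deletion_distance(u, v) > k for u, v in combinations(words, 2))


print(deletion_distance("0101", "1010"))
print(is_k_deletion_code(["0000", "1111"], 3))
```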
Submitted 17 October, 2023; v1 submitted 23 September, 2022;
originally announced September 2022.
-
Hitting a prime in 2.43 dice rolls (on average)
Authors:
Noga Alon,
Yaakov Malinovsky
Abstract:
What is the number of rolls of fair 6-sided dice until the first time the total sum of all rolls is a prime? We compute the expectation and the variance of this random variable up to an additive error of less than 10^{-4}. This is a solution to a puzzle suggested by DasGupta (2017) in the Bulletin of the Institute of Mathematical Statistics, where the published solution is incomplete. The proof is simple, combining a basic dynamic programming algorithm with a quick Matlab computation and basic facts about the distribution of primes.
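The dynamic programming step can be sketched as follows. This is a Python illustration with assumed names (the authors report using Matlab), and it simply truncates the process after `max_rolls` rolls. It therefore gives a numerical approximation from below and does not reproduce the rigorous error bound, which in the paper rests on facts about the distribution of primes.

```python
def prime_sieve(limit):
    """is_prime[m] is True exactly when m is prime, for 0 <= m <= limit."""
    is_prime = [False, False] + [True] * (limit - 1)
    for p in range(2, int(limit ** 0.5) + 1):
        if is_prime[p]:
            for q in range(p * p, limit + 1, p):
                is_prime[q] = False
    return is_prime


def expected_rolls_until_prime(max_rolls=200):
    """Approximate E[T], where T is the first time the running sum is prime.

    Uses E[T] = sum over k >= 0 of P(T > k), truncated at k = max_rolls - 1.
    """
    is_prime = prime_sieve(6 * max_rolls)
    # alive[s] = probability that the running sum equals s and no prime was hit so far
    alive = {0: 1.0}
    expectation = 0.0
    for _ in range(max_rolls):
        expectation += sum(alive.values())  # adds P(T > k) for the current k
        nxt = {}
        for s, p in alive.items():
            for face in range(1, 7):
                t = s + face
                if not is_prime[t]:
                    nxt[t] = nxt.get(t, 0.0) + p / 6
        alive = nxt
    return expectation


print(round(expected_rolls_until_prime(), 2))
```

The first terms of the sum are easy to check by hand: P(T > 1) = 1/2, since the first roll is prime exactly when it shows 2, 3 or 5.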
Submitted 23 January, 2023; v1 submitted 15 September, 2022;
originally announced September 2022.
-
The success probability in Levine's hat problem, and independent sets in graphs
Authors:
Noga Alon,
Ehud Friedgut,
Gil Kalai,
Guy Kindler
Abstract:
Lionel Levine's hat challenge has $t$ players, each with a (very large, or infinite) stack of hats on their head, each hat independently colored at random black or white. The players are allowed to coordinate before the random colors are chosen, but not after. Each player sees all hats except for those on her own head. They then proceed to simultaneously try and each pick a black hat from their respective stacks. They are proclaimed successful only if they are all correct. Levine's conjecture is that the success probability tends to zero when the number of players grows. We prove that this success probability is strictly decreasing in the number of players, and present some connections to problems in graph theory: relating the size of the largest independent set in a graph and in a random induced subgraph of it, and bounding the size of a set of vertices intersecting every maximum-size independent set in a graph.
Submitted 21 August, 2023; v1 submitted 14 August, 2022;
originally announced August 2022.
-
Counting Dope Matrices
Authors:
Noga Alon,
Noah Kravitz,
Kevin O'Bryant
Abstract:
For a polynomial $P$ of degree $n$ and an $m$-tuple $Λ=(λ_1,\dots,λ_m)$ of distinct complex numbers, the dope matrix of $P$ with respect to $Λ$ is $D_P(Λ)=(δ_{ij})_{i\in [1,m],j\in[0,n]}$, where $δ_{ij}=1$ if $P^{(j)}(λ_i)=0$, and $δ_{ij}=0$ otherwise. Our first result is a combinatorial characterization of the $2$-row dope matrices (for all pairs $Λ$); using this characterization, we solve the associated enumeration problem. We also give upper bounds on the number of $m\times(n+1)$ dope matrices, and we show that the number of $m \times (n+1)$ dope matrices for a fixed $m$-tuple $Λ$ is maximized when $Λ$ is generic. Finally, we resolve an ``extension'' problem of Nathanson and present several open problems.
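As an illustration of the definition, the sketch below computes a dope matrix in Python for a polynomial with rational coefficients evaluated at rational points. Restricting to rationals is an assumption made here so that the zero tests are exact; the abstract allows arbitrary distinct complex numbers. The function names are likewise hypothetical.

```python
from fractions import Fraction


def derivative(coeffs):
    """Derivative of a polynomial given as coeffs[k] = coefficient of x^k."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def evaluate(coeffs, x):
    """Horner evaluation with exact rational arithmetic."""
    result = Fraction(0)
    for c in reversed(coeffs):
        result = result * x + c
    return result


def dope_matrix(coeffs, points):
    """Row i, column j is 1 if the j-th derivative of P vanishes at points[i], else 0."""
    n = len(coeffs) - 1  # degree of P, assuming a nonzero leading coefficient
    rows = [[] for _ in points]
    current = list(coeffs)
    for _ in range(n + 1):  # columns j = 0, 1, ..., n
        for row, lam in zip(rows, points):
            row.append(1 if evaluate(current, Fraction(lam)) == 0 else 0)
        current = derivative(current)
    return rows


# P(x) = (x - 1)^2 = 1 - 2x + x^2, with the pair of points (1, 0)
print(dope_matrix([1, -2, 1], [1, 0]))
```

In the example, $P$ has a double root at $1$, so the first row starts with two ones, while none of $P$, $P'$, $P''$ vanishes at $0$.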
Submitted 10 December, 2022; v1 submitted 18 May, 2022;
originally announced May 2022.
-
EFX Allocations: Simplifications and Improvements
Authors:
Hannaneh Akrami,
Noga Alon,
Bhaskar Ray Chaudhury,
Jugal Garg,
Kurt Mehlhorn,
Ruta Mehta
Abstract:
The existence of EFX allocations is a fundamental open problem in discrete fair division. Given a set of agents and indivisible goods, the goal is to determine the existence of an allocation where no agent envies another following the removal of any single good from the other agent's bundle. Since the general problem has been elusive, progress is made on two fronts: $(i)$ proving existence when the number of agents is small, $(ii)$ proving existence of relaxations of EFX. In this paper, we improve results on both fronts (and simplify in one of the cases).
We prove the existence of EFX allocations with three agents, restricting only one agent to have an MMS-feasible valuation function (a strict generalization of nice-cancelable valuation functions introduced by Berger et al. which subsumes additive, budget-additive and unit demand valuation functions). The other agents may have any monotone valuation functions. Our proof technique is significantly simpler and shorter than the proof by Chaudhury et al. on existence of EFX allocations when there are three agents with additive valuation functions and therefore more accessible.
Secondly, we consider relaxations of EFX allocations, namely, approximate-EFX allocations and EFX allocations with few unallocated goods (charity). Chaudhury et al. showed the existence of $(1-ε)$-EFX allocation with $O((n/ε)^{\frac{4}{5}})$ charity by establishing a connection to a problem in extremal combinatorics. We improve their result and prove the existence of $(1-ε)$-EFX allocations with $\tilde{O}((n/ ε)^{\frac{1}{2}})$ charity. In fact, some of our techniques can be used to prove improved upper-bounds on a problem in zero-sum combinatorics introduced by Alon and Krivelevich.
Submitted 23 December, 2022; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Identifying the Deviator
Authors:
Noga Alon,
Benjamin Gunby,
Xiaoyu He,
Eran Shmaya,
Eilon Solan
Abstract:
A group of players are supposed to follow a prescribed profile of strategies. If they follow this profile, they will reach a given target. We show that if the target is not reached because some player deviates, then an outside observer can identify the deviator. We also construct identification methods in two nontrivial cases.
Submitted 1 April, 2024; v1 submitted 7 March, 2022;
originally announced March 2022.
-
On a random model of forgetting
Authors:
Noga Alon,
Dor Elboim,
Allan Sly
Abstract:
Georgiou, Katkov and Tsodyks considered the following random process. Let $x_1,x_2,\ldots $ be an infinite sequence of independent, identically distributed, uniform random points in $[0,1]$. Starting with $S=\{0\}$, the elements $x_k$ join $S$ one by one, in order. When an entering element is larger than the current minimum element of $S$, this minimum leaves $S$. Let $S(1,n)$ denote the content of $S$ after the first $n$ elements $x_k$ join. Simulations suggest that the size $|S(1,n)|$ of $S$ at time $n$ is typically close to $n/e$. Here we first give a rigorous proof that this is indeed the case, and that in fact the symmetric difference of $S(1,n)$ and the set $\{x_k\ge 1-1/e: 1 \leq k \leq n \}$ is of size at most $\tilde{O}(\sqrt n)$ with high probability. Our main result is a more accurate description of the process implying, in particular, that as $n$ tends to infinity $ n^{-1/2}\big( |S(1,n)|-n/e \big) $ converges to a normal random variable with variance $3e^{-2}-e^{-1}$. We further show that the dynamics of the symmetric difference of $S(1,n)$ and the set $\{x_k\ge 1-1/e: 1 \leq k \leq n \}$ converges with proper scaling to a three dimensional Bessel process.
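The process described above is easy to simulate, which gives a quick numerical check of the $n/e$ behaviour. The Python sketch below keeps $S$ in a min-heap; the function name and the seed are illustrative assumptions, and a simulation of course proves nothing.

```python
import heapq
import math
import random


def simulate_forgetting(n, seed=0):
    """Return |S(1, n)| for one run of the process, starting from S = {0}."""
    rng = random.Random(seed)
    heap = [0.0]  # min-heap holding the current content of S
    for _ in range(n):
        x = rng.random()
        if x > heap[0]:
            # x is larger than the current minimum: the minimum leaves, x joins
            heapq.heapreplace(heap, x)
        else:
            # x becomes the new minimum and nothing leaves
            heapq.heappush(heap, x)
    return len(heap)


n = 20000
size = simulate_forgetting(n, seed=1)
print(size / n, "compared with 1/e =", math.exp(-1))
```

The size of $S$ never decreases: each step either swaps the minimum for the new element or adds one element.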
Submitted 15 December, 2023; v1 submitted 4 March, 2022;
originally announced March 2022.
-
Complete minors and average degree -- a short proof
Authors:
Noga Alon,
Michael Krivelevich,
Benny Sudakov
Abstract:
We provide a short and self-contained proof of the classical result of Kostochka and of Thomason, ensuring that every graph of average degree $d$ has a complete minor of order $d/\sqrt{\log d}$.
Submitted 28 November, 2022; v1 submitted 17 February, 2022;
originally announced February 2022.
-
Structured Codes of Graphs
Authors:
Noga Alon,
Anna Gujgiczer,
János Körner,
Aleksa Milojević,
Gábor Simonyi
Abstract:
We investigate the maximum size of graph families on a common vertex set of cardinality $n$ such that the symmetric difference of the edge sets of any two members of the family satisfies some prescribed condition. We solve the problem completely for infinitely many values of $n$ when the prescribed condition is connectivity or $2$-connectivity, Hamiltonicity or the containment of a spanning star. We also investigate local conditions that can be certified by looking at only a subset of the vertex set. In these cases a capacity-type asymptotic invariant is defined and when the condition is to contain a certain subgraph this invariant is shown to be a simple function of the chromatic number of this required subgraph. This is proven using classical results from extremal graph theory. Several variants are considered and the paper ends with a collection of open problems.
Submitted 1 April, 2022; v1 submitted 14 February, 2022;
originally announced February 2022.
-
Implicit representation of sparse hereditary families
Authors:
Noga Alon
Abstract:
For a hereditary family of graphs $\mathcal{F}$, let $\mathcal{F}_n$ denote the set of all members of $\mathcal{F}$ on $n$ vertices. The speed of $\mathcal{F}$ is the function $f(n)=|\mathcal{F}_n|$. An implicit representation of size $\ell(n)$ for $\mathcal{F}_n$ is a function assigning a label of $\ell(n)$ bits to each vertex of any given graph $G \in \mathcal{F}_n$, so that the adjacency between any pair of vertices can be determined by their labels. Bonamy, Esperet, Groenland and Scott proved that the minimum possible size of an implicit representation of $\mathcal{F}_n$ for any hereditary family $\mathcal{F}$ with speed $2^{Ω(n^2)}$ is $(1+o(1)) \log_2 |\mathcal{F}_n|/n~(=Θ(n))$. A recent result of Hatami and Hatami shows that the situation is very different for very sparse hereditary families. They showed that for every $δ>0$ there are hereditary families of graphs with speed $2^{O(n \log n)}$ that do not admit implicit representations of size smaller than $n^{1/2-δ}$. In this note we show that even a mild speed bound ensures an implicit representation of size $O(n^c)$ for some $c<1$. Specifically we prove that for every $\varepsilon>0$ there is an integer $d \geq 1$ so that if $\mathcal{F}$ is a hereditary family with speed $f(n) \leq 2^{(1/4-\varepsilon)n^2}$ then $\mathcal{F}_n$ admits an implicit representation of size $O(n^{1-1/d} \log n)$. Moreover, for every integer $d>1$ there is a hereditary family for which this is tight up to the logarithmic factor.
Submitted 2 January, 2022;
originally announced January 2022.
-
Random necklaces require fewer cuts
Authors:
Noga Alon,
Dor Elboim,
János Pach,
Gábor Tardos
Abstract:
It is known that any open necklace with beads of $t$ types in which the number of beads of each type is divisible by $k$, can be partitioned by at most $(k-1)t$ cuts into intervals that can be distributed into $k$ collections, each containing the same number of beads of each type. This is tight for all values of $k$ and $t$.
Here, we consider the case of random necklaces, where the number of beads of each type is $km$. Then the minimum number of cuts required for a ``fair'' partition with the above property is a random variable $X(k,t,m)$. We prove that for fixed $k,t,$ and large $m$, this random variable is at least $(k-1)(t+1)/2$ with high probability. For $k=2$, fixed $t$, and large $m$, we determine the asymptotic behavior of the probability that $X(2,t,m)=s$ for all values of $s\le t $. We show that this probability is polynomially small when $s<(t+1)/2$, it is bounded away from zero when $s>(t+1)/2$, and decays like $Θ( 1/\log m)$ when $s=(t+1)/2$.
We also show that for large $t$, $X(2,t,1)$ is at most $(0.4+o(1))t$ with high probability and that for large $t$ and large ratio $k/\log t$, $X(k,t,1)$ is $o(kt)$ with high probability.
Submitted 29 December, 2021;
originally announced December 2021.
-
Rank of matrices with entries from a multiplicative group
Authors:
Noga Alon,
Jozsef Solymosi
Abstract:
We establish lower bounds on the rank of matrices in which all but the diagonal entries lie in a multiplicative group of small rank. Applying these bounds we show that the distance sets of finite pointsets in $\mathbb{R}^d$ generate high rank multiplicative groups and that multiplicative groups of small rank cannot contain large sumsets.
Submitted 1 September, 2021;
originally announced September 2021.
-
Irregular Subgraphs
Authors:
Noga Alon,
Fan Wei
Abstract:
We suggest two related conjectures dealing with the existence of spanning irregular subgraphs of graphs. The first asserts that any $d$-regular graph on $n$ vertices contains a spanning subgraph in which the number of vertices of each degree between $0$ and $d$ deviates from $\frac{n}{d+1}$ by at most $2$. The second is that every graph on $n$ vertices with minimum degree $δ$ contains a spanning subgraph in which the number of vertices of each degree does not exceed $\frac{n}{δ+1}+2$. Both conjectures remain open, but we prove several asymptotic relaxations for graphs with a large number of vertices $n$. In particular we show that if $d^3 \log n \leq o(n)$ then every $d$-regular graph with $n$ vertices contains a spanning subgraph in which the number of vertices of each degree between $0$ and $d$ is $(1+o(1))\frac{n}{d+1}$. We also prove that any graph with $n$ vertices and minimum degree $δ$ contains a spanning subgraph in which no degree is repeated more than $(1+o(1))\frac{n}{δ+1}+2$ times.
Submitted 6 August, 2021; v1 submitted 5 August, 2021;
originally announced August 2021.
-
Partitioning all $k$-subsets into $r$-wise intersecting families
Authors:
Noga Alon
Abstract:
Let $r \geq 2$, $n$ and $k$ be integers satisfying $k \leq \frac{r-1}{r}n$. In the original arXiv version of this note we suggested a conjecture that the family of all $k$-subsets of an $n$-set cannot be partitioned into fewer than $\lceil n-\frac{r}{r-1}(k-1) \rceil$ $r$-wise intersecting families. We noted that if true this is tight for all values of the parameters, that the case $r=2$ is Kneser's conjecture, proved by Lovász, and observed that the assertion also holds provided $r$ is either a prime number or a power of $2$. We have recently learned, however, that the assertion of the conjecture for all values of the parameters follows from a recent result of Azarpendar and Jafari.
Submitted 23 September, 2021; v1 submitted 27 July, 2021;
originally announced July 2021.
-
A Theory of PAC Learnability of Partial Concept Classes
Authors:
Noga Alon,
Steve Hanneke,
Ron Holzman,
Shay Moran
Abstract:
We extend the theory of PAC learning in a way that allows modeling a rich variety of learning tasks where the data satisfy special properties that ease the learning process, for example tasks where the distance of the data from the decision boundary is bounded away from zero. The basic and simple idea is to consider partial concepts: these are functions that can be undefined on certain parts of the space. When learning a partial concept, we assume that the source distribution is supported only on points where the partial concept is defined.
This way, one can naturally express assumptions on the data such as lying on a lower dimensional surface or margin conditions. In contrast, it is not at all clear that such assumptions can be expressed by the traditional PAC theory. In fact we exhibit easy-to-learn partial concept classes which provably cannot be captured by the traditional PAC theory. This also resolves a question posed by Attias, Kontorovich, and Mansour 2019.
We characterize PAC learnability of partial concept classes and reveal an algorithmic landscape which is fundamentally different from the classical one. For example, in the classical PAC model, learning boils down to Empirical Risk Minimization (ERM). In stark contrast, we show that the ERM principle fails in explaining learnability of partial concept classes. In fact, we demonstrate classes that are incredibly easy to learn, but such that any algorithm that learns them must use a hypothesis space with unbounded VC dimension. We also find that the sample compression conjecture fails in this setting.
Thus, this theory features problems that can be neither represented nor solved in the traditional way. We view this as evidence that it might provide insights on the nature of learnability in realistic scenarios which the classical theory fails to explain.
Submitted 20 July, 2021; v1 submitted 18 July, 2021;
originally announced July 2021.
-
On the Hat Guessing Number of Graphs
Authors:
Noga Alon,
Jeremy Chizewer
Abstract:
The hat guessing number $HG(G)$ of a graph $G$ on $n$ vertices is defined in terms of the following game: $n$ players are placed on the $n$ vertices of $G$, each wearing a hat whose color is arbitrarily chosen from a set of $q$ possible colors. Each player can see the hat colors of his neighbors, but not his own hat color. All of the players are asked to guess their own hat colors simultaneously, according to a predetermined guessing strategy and the hat colors they see, where no communication between them is allowed. The hat guessing number $HG(G)$ is the largest integer $q$ such that there exists a guessing strategy guaranteeing at least one correct guess for any hat assignment of $q$ possible colors.
In this note we construct a planar graph $G$ satisfying $HG(G)=12$, settling a problem raised in \cite{BDFGM}. We also improve the known lower bound of $(2-o(1))\log_2 n$ for the typical hat guessing number of the random graph $G=G(n,1/2)$, showing that it is at least $n^{1-o(1)}$ with probability tending to $1$ as $n$ tends to infinity. Finally, we consider the linear hat guessing number of complete multipartite graphs.
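For intuition about the game, the following Python sketch exhaustively checks the classical modular-sum strategy on the complete graph with $q$ vertices and $q$ colors, where every player sees all other hats. This standard example is not taken from the abstract, and the function names are assumptions. It shows that $q$ colors can be handled on $K_q$, and says nothing about the planar or random graphs treated in the note.

```python
from itertools import product


def guess(player, seen_sum, q):
    """Player i guesses the color that would make the total of all hats equal i mod q."""
    return (player - seen_sum) % q


def count_correct(hats, q):
    """Number of players guessing correctly on the complete graph for one assignment."""
    total = sum(hats)
    return sum(1 for i, h in enumerate(hats) if guess(i, total - h, q) == h)


q = 3
results = [count_correct(hats, q) for hats in product(range(q), repeat=q)]
print(all(r == 1 for r in results))
```

Player $i$ is correct exactly when the total of all hats is congruent to $i$ modulo $q$, so exactly one player guesses correctly for every assignment.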
Submitted 21 July, 2021; v1 submitted 13 July, 2021;
originally announced July 2021.
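The game in the abstract above can be made concrete with the classical modular-sum strategy on the complete graph $K_q$, a folklore example that is not taken from this note. The sketch below assumes colors are the integers $0,\ldots,q-1$ and that player $i$ sees all other hats; the function names `guess` and `strategy_always_wins` are hypothetical. Player $i$ guesses the color that would make the total sum congruent to $i$ modulo $q$, so exactly one player is right for every assignment, which shows $HG(K_q)\ge q$.

```python
from itertools import product


def guess(i, seen_sum, q):
    """Player i guesses the color that makes the total sum equal i mod q."""
    return (i - seen_sum) % q


def strategy_always_wins(q):
    """Brute-force check over all q**q hat assignments on K_q.

    Returns True if at least one player guesses correctly in every assignment.
    """
    for hats in product(range(q), repeat=q):
        total = sum(hats)
        # On K_q, player i sees every hat except their own.
        if not any(guess(i, total - hats[i], q) == hats[i] for i in range(q)):
            return False
    return True
```

The brute-force loop is only feasible for small $q$; it is meant as a check of the definition, not as a method for computing $HG(G)$ on general graphs.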
-
The runsort permuton
Authors:
Noga Alon,
Colin Defant,
Noah Kravitz
Abstract:
Suppose we choose a permutation $π$ uniformly at random from $S_n$. Let $\mathsf{runsort}(π)$ be the permutation obtained by sorting the ascending runs of $π$ into lexicographic order. Alexandersson and Nabawanda recently asked if the plot of $\mathsf{runsort}(π)$, when scaled to the unit square $[0,1]^2$, converges to a limit shape as $n\to\infty$. We answer their question by showing that the measures corresponding to the scaled plots of these permutations $\mathsf{runsort}(π)$ converge with probability $1$ to a permuton (limiting probability distribution) that we describe explicitly. In particular, the support of this permuton is $\{(x,y)\in[0,1]^2:x\leq ye^{1-y}\}$.
Submitted 28 June, 2021;
originally announced June 2021.
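The $\mathsf{runsort}$ map described in the abstract above can be sketched in a few lines. This is a minimal illustration under the definition stated there (split into maximal ascending runs, sort the runs lexicographically, concatenate); the function name `runsort` and the list representation of a permutation are assumptions made for the example.

```python
def runsort(perm):
    """Sort the maximal ascending runs of perm into lexicographic order."""
    runs = []
    for x in perm:
        if runs and runs[-1][-1] < x:
            runs[-1].append(x)   # x extends the current ascending run
        else:
            runs.append([x])     # a descent (or the start) opens a new run
    runs.sort()                  # Python compares lists lexicographically
    return [x for run in runs for x in run]
```

For example, the permutation `[3, 1, 4, 2, 5]` has runs `[3]`, `[1, 4]`, `[2, 5]`, so `runsort` returns `[1, 4, 2, 5, 3]`. Because the runs of a permutation have distinct first entries, sorting lexicographically amounts to sorting by first entry.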
-
Dominance Solvability in Random Games
Authors:
Noga Alon,
Kirill Rudov,
Leeat Yariv
Abstract:
We study the effectiveness of iterated elimination of strictly-dominated actions in random games. We show that the probability that a game is dominance solvable becomes vanishingly small as the number of actions of at least one player grows. Furthermore, conditional on dominance solvability, the number of iterations required to converge to Nash equilibrium grows rapidly as action sets grow. Nonetheless, when games are highly imbalanced, iterated elimination simplifies the game substantially by ruling out a sizable fraction of actions. Technically, we illustrate the usefulness of recent combinatorial methods for the analysis of general games.
Submitted 22 May, 2021;
originally announced May 2021.
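Iterated elimination of strictly dominated actions, the procedure studied in the abstract above, can be sketched as follows. The sketch assumes a two-player game given by two payoff matrices and considers dominance by pure actions only; both are simplifying assumptions for illustration, and the name `iterated_elimination` is hypothetical.

```python
def iterated_elimination(payoff_row, payoff_col):
    """Return the row and column actions that survive iterated elimination.

    payoff_row[r][c] and payoff_col[r][c] are the payoffs of the row and
    column player when the row player plays r and the column player plays c.
    """
    rows = list(range(len(payoff_row)))
    cols = list(range(len(payoff_row[0])))
    changed = True
    while changed:
        changed = False
        # Rows strictly dominated by another surviving row.
        dominated = [r for r in rows if any(
            all(payoff_row[r2][c] > payoff_row[r][c] for c in cols)
            for r2 in rows if r2 != r)]
        if dominated:
            rows = [r for r in rows if r not in dominated]
            changed = True
        # Columns strictly dominated by another surviving column.
        dominated = [c for c in cols if any(
            all(payoff_col[r][c2] > payoff_col[r][c] for r in rows)
            for c2 in cols if c2 != c)]
        if dominated:
            cols = [c for c in cols if c not in dominated]
            changed = True
    return rows, cols
```

A game is dominance solvable in this sense when a single action survives for each player. In the Prisoner's Dilemma with `payoff_row = [[3, 0], [5, 1]]` and `payoff_col = [[3, 5], [0, 1]]`, only the second action of each player survives, whereas in Matching Pennies nothing is eliminated.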
-
On Sums of Monotone Random Integer Variables
Authors:
Anders Aamand,
Noga Alon,
Jakob Bæk Tejs Knudsen,
Mikkel Thorup
Abstract:
We say that a random integer variable $X$ is monotone if the modulus of the characteristic function of $X$ is decreasing on $[0,π]$. This is the case for many commonly encountered variables, e.g., Bernoulli, Poisson and geometric random variables. In this note, we provide estimates for the probability that the sum of independent monotone integer variables attains precisely a specific value. We do not assume that the variables are identically distributed. Our estimates are sharp when the specific value is close to the mean, but they are not useful further out in the tail. By combining with the trick of \emph{exponential tilting}, we obtain sharp estimates for the point probabilities in the tail under a slightly stronger assumption on the random integer variables which we call strong monotonicity.
Submitted 13 April, 2021; v1 submitted 8 April, 2021;
originally announced April 2021.
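As a short worked check of the monotonicity definition in the abstract above, consider a Bernoulli variable $X$ with parameter $p$ (the computation is a standard one and is given here only as an illustration). Its characteristic function is $\varphi_X(t) = 1-p+pe^{it}$, so
$$|\varphi_X(t)|^2 = (1-p)^2 + p^2 + 2p(1-p)\cos t = 1 - 2p(1-p)(1-\cos t).$$
Since $\cos t$ is decreasing on $[0,π]$ and $2p(1-p)\ge 0$, the modulus $|\varphi_X(t)|$ is non-increasing on $[0,π]$, and strictly decreasing when $0<p<1$.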
-
Arithmetic Progressions in Sumsets of Sparse Sets
Authors:
Noga Alon,
Ryan Alweiss,
Yang P. Liu,
Anders Martinsson,
Shyam Narayanan
Abstract:
A set of positive integers $A \subset \mathbb{Z}_{> 0}$ is \emph{log-sparse} if there is an absolute constant $C$ so that for any positive integer $x$ the set $A$ contains at most $C$ elements in the interval $[x,2x)$. In this note we study arithmetic progressions in sums of log-sparse subsets of $\mathbb{Z}_{> 0}$. We prove that for any log-sparse subsets $S_1, \dots, S_n$ of $\mathbb{Z}_{> 0}$, the sumset $S = S_1 + \cdots + S_n$ cannot contain an arithmetic progression of size greater than $n^{(1+o(1))n}$. We also show that this is nearly tight by proving that there exist log-sparse sets $S_1, \dots, S_n$ such that $S_1 + \cdots + S_n$ contains an arithmetic progression of size $n^{(1-o(1)) n}$.
Submitted 18 April, 2021; v1 submitted 4 April, 2021;
originally announced April 2021.
-
Hitting all maximum independent sets
Authors:
Noga Alon
Abstract:
We describe an infinite family of graphs $G_n$, where $G_n$ has $n$ vertices, independence number at least $n/4$, and no set of fewer than $\sqrt{n}/2$ vertices intersects all its maximum independent sets. This is motivated by a question of Bollobás, Erdős and Tuza, and disproves a recent conjecture of Friedgut, Kalai and Kindler. Motivated by a related question of the last authors, we show that for every graph $G$ on $n$ vertices with independence number $(1/4+\varepsilon)n$, the average independence number of an induced subgraph of $G$ on a uniform random subset of the vertices is at most $(1/4+\varepsilon-Ω(\varepsilon^2)) n$.
Submitted 4 April, 2021; v1 submitted 10 March, 2021;
originally announced March 2021.
-
Adversarial Laws of Large Numbers and Optimal Regret in Online Classification
Authors:
Noga Alon,
Omri Ben-Eliezer,
Yuval Dagan,
Shay Moran,
Moni Naor,
Eylon Yogev
Abstract:
Laws of large numbers guarantee that given a large enough sample from some population, the measure of any fixed sub-population is well-estimated by its frequency in the sample. We study laws of large numbers in sampling processes that can affect the environment they are acting upon and interact with it. Specifically, we consider the sequential sampling model proposed by Ben-Eliezer and Yogev (2020), and characterize the classes which admit a uniform law of large numbers in this model: these are exactly the classes that are \emph{online learnable}. Our characterization may be interpreted as an online analogue to the equivalence between learnability and uniform convergence in statistical (PAC) learning.
The sample-complexity bounds we obtain are tight for many parameter regimes, and as an application, we determine the optimal regret bounds in online learning, stated in terms of \emph{Littlestone's dimension}, thus resolving the main open question from Ben-David, Pál, and Shalev-Shwartz (2009), which was also posed by Rakhlin, Sridharan, and Tewari (2015).
Submitted 22 January, 2021;
originally announced January 2021.
-
Divisible subdivisions
Authors:
Noga Alon,
Michael Krivelevich
Abstract:
We prove that for every graph $H$ of maximum degree at most $3$ and for every positive integer $q$ there is a finite $f=f(H,q)$ such that every $K_f$-minor contains a subdivision of $H$ in which every edge is replaced by a path whose length is divisible by $q$. For the case of cycles we show that for $f=O(q \log q)$ every $K_f$-minor contains a cycle of length divisible by $q$, and observe that this settles a recent problem of Friedman and the second author about cycles in (weakly) expanding graphs.
Submitted 29 June, 2021; v1 submitted 9 December, 2020;
originally announced December 2020.