(Translated by https://www.hiragana.jp/)
Fair Secretaries with Unfair Predictions
\addauthor

Willblue \addauthorAndreasolive

Fair Secretaries with Unfair Predictions

Eric Balkanski
Columbia University
eb3224@columbia.edu
&Will Ma
Columbia University
wm2428@gsb.columbia.edu
Andreas Maggiori
Columbia University
am6292@columbia.edu
Abstract

Algorithms with predictions is a recent framework for decision-making under uncertainty that leverages the power of machine-learned predictions without making any assumption about their quality. The goal in this framework is for algorithms to achieve an improved performance when the predictions are accurate while maintaining acceptable guarantees when the predictions are erroneous. A serious concern with algorithms that use predictions is that these predictions can be biased and, as a result, cause the algorithm to make decisions that are deemed unfair. We show that this concern manifests itself in the classical secretary problem in the learning-augmented setting—the state-of-the-art algorithm can have zero probability of accepting the best candidate, which we deem unfair, despite promising to accept a candidate whose expected value is at least max{Ω(1),1O(ε)}Ω11𝑂𝜀\max\{\Omega(1),1-O(\operatorname{\varepsilon})\}roman_max { roman_Ω ( 1 ) , 1 - italic_O ( italic_ε ) } times the optimal value, where ε𝜀\operatorname{\varepsilon}italic_ε is the prediction error. We show how to preserve this promise while also guaranteeing to accept the best candidate with probability Ω(1)Ω1\Omega(1)roman_Ω ( 1 ). Our algorithm and analysis are based on a new “pegging” idea that diverges from existing works and simplifies/unifies some of their results. Finally, we extend to the k𝑘kitalic_k-secretary problem and complement our theoretical analysis with experiments.

1 Introduction

As machine learning algorithms are increasingly used in socially impactful decision-making applications, the fairness of those algorithms has become a primary concern. Many algorithms deployed in recent years have been shown to be explicitly unfair or reflect bias that is present in training data. Applications where automated decision-making algorithms have been used and fairness is of central importance include loan/credit-risk evaluation (Mukerjee et al., 2002; Khandani et al., 2010; Malhotra and Malhotra, 2003), hiring (Bogen and Rieke, 2018; Cohen et al., 2019), recidivism evaluation Mustard (2003); Angwin et al. (2016); Dressel and Farid (2018); Chouldechova (2017); COMPAS () (software), childhood welfare systems Chouldechova et al. (2018), job recommendations Lambrecht and Tucker (2019), price discrimination Cohen et al. (2022), resource allocation Manshadi et al. (2023), and others Hern (2016); Grossman (2010); Howard and Borenstein (2018); Balseiro et al. (2021); Ma et al. (2022). A lot of work in recent years has been devoted to formally defining different notions of fairness Luong et al. (2011); Kamishima et al. (2012); Feldman et al. (2015); Dwork et al. (2012); Corbett-Davies et al. (2017); Kleinberg et al. (2017); Kleinberg and Raghavan (2017), designing algorithms that satisfy these different definitions Kamishima et al. (2011); Joseph et al. (2016); Celis et al. (2018); Chierichetti et al. (2019); Yang and Stoyanovich (2017), and investigating trade-offs between fairness and other optimization objectives Bertsimas et al. (2011, 2012).

While most fairness work concentrates on classification problems where the instance is known offline, we explore the problem of making fair decisions when the input is revealed in an online manner. Although fairness in online algorithms is an interesting line of research per se, fairness considerations have become increasingly important due to the recent interest in incorporating (possibly biased) machine learning predictions into the design of classical online algorithms. This framework, usually referred to as learning-augmented algorithms or algorithms with predictions, was first formalized in Lykouris and Vassilvitskii (2018). In contrast to classical online algorithms problems where it is assumed that no information is known about the future, learning-augmented online algorithms are given as input, possibly erroneous, predictions about the future. The main challenge is to simultaneously achieve an improved performance when the predictions are accurate and a robust performance when the predictions are arbitrarily inaccurate. A long list of online problems have been considered in this setting and we point to Lindermayr and Megow for an up-to-date list of papers. We enrich this active area of research by investigating how potentially biased predictions affect the fairness of decisions made by learning-augmented algorithms, and ask the following question: {mdframed}[hidealllines=true, backgroundcolor=gray!15]

Can we design fair algorithms that take advantage of unfair predictions?

In this paper, we study this question on a parsimonious formulation of the secretary problem with predictions, motivated by fairness in hiring candidates.

The problem. In the classical secretary problem, there are n𝑛nitalic_n candidates who each have a value and arrive in a random order. Upon arrival of a candidate, the algorithm observes the value of that candidate and must irrevocably decide whether to accept or reject that candidate. It can only accept one candidate and the goal is to maximize the probability of accepting the candidate with maximum value. In the classical formulation, only the ordinal ranks of candidates matter, and the algorithm of Dynkin (1963) accepts the best candidate with a constant probability, that equals the best-possible 1/e1𝑒1/e1 / italic_e.

In the learning-augmented formulation of the problem proposed by Fujii and Yoshida (2023), the algorithm is initially given a predicted value about each candidate and the authors focus on comparing the expected cardinal value accepted by the algorithm to the maximum cardinal value. The authors derive an algorithm that obtains expected value at least max{Ω(1),1O(ε)}Ω11𝑂𝜀\max\{\Omega(1),1-O(\operatorname{\varepsilon})\}roman_max { roman_Ω ( 1 ) , 1 - italic_O ( italic_ε ) } times the maximum value, where ε0𝜀0\operatorname{\varepsilon}\geq 0italic_ε ≥ 0 is the prediction error. The strength of this guarantee is that it approaches 1111 as the prediction error decreases and it is a positive constant even when the error is arbitrarily large.

However, because the algorithm is now using predictions that could be biased, the best candidate may no longer have any probability of being accepted. We view this as a form of unfairness, and aim to derive algorithms that are fair to the best candidate by guaranteeing them a constant probability of being accepted (we contrast with other notions of fairness in stopping problems in Section 1.1). Of course, a simple way to be fair by this metric is to ignore the predictions altogether and run the classical algorithm of Dynkin. However, this approach would ignore potentially valuable information and lose the improved guarantee of Fujii and Yoshida (2023) that approaches 1 when the prediction error is low.

Outline of results. We first formally show that the algorithm of Fujii and Yoshida (2023) may in fact accept the best candidate with 0 probability. Our main result is then a new algorithm for secretary with predictions that: obtains expected value at least max{Ω(1),1O(ε)}Ω11𝑂𝜀\max\{\Omega(1),1-O(\operatorname{\varepsilon})\}roman_max { roman_Ω ( 1 ) , 1 - italic_O ( italic_ε ) } times the maximum value, like Fujii and Yoshida (2023); and ensures that, under any predictions, the best candidate is hired with Ω(1)Ω1\Omega(1)roman_Ω ( 1 ) probability. This result takes advantage of potentially biased predictions to achieve a guarantee on expected value that approaches 1111 when the prediction error is small, while also providing a fairness guarantee for the true best candidate irrespective of the predictions. We note that Antoniadis et al. (2020b) also derive an algorithm for secretary with predictions, where the prediction is of the maximum value (a less informative form of prediction). This algorithm accepts the best candidate with constant probability but it does not provide a guarantee on the expected value accepted that approaches 1111 as the prediction error approaches 00. Similarly, Dynkin’s algorithm for the classical secretary problem accepts the best candidate with constant probability but does not make use of predictions at all.

Our algorithm is fundamentally different from existing algorithms for secretary with predictions, as our “pegging” idea, i.e., the idea not to accept a possibly suboptimal candidate if there is a future candidate with high enough predicted value, is important to achieve our fairness desideratum. We also note that the definitions of the prediction error ε𝜀\operatorname{\varepsilon}italic_ε differ in (Fujii and Yoshida, 2023) and (Antoniadis et al., 2020b); the former error definition uses the maximum ratio over all candidates between their predicted and true value while the latter uses the absolute difference. Our techniques present an arguably simpler analysis and extend to a general family of prediction error measures that includes both of these error definitions.

We then extend our approach to the multiple choice or k𝑘kitalic_k-secretary problem where the goal is to accept at most k𝑘kitalic_k candidates and maximize the total of their values, which is the most technical part of the paper. We design an algorithm that obtains expected total value at least max{Ω(1),1O(ε)}Ω11𝑂𝜀\max\{\Omega(1),1-O(\operatorname{\varepsilon})\}roman_max { roman_Ω ( 1 ) , 1 - italic_O ( italic_ε ) } times the optimum (which is the sum of the k𝑘kitalic_k highest values), while simultaneously guaranteeing the k𝑘kitalic_k highest-valued candidates a constant probability of being accepted. We also have a refined guarantee that provides a higher acceptance probability for the (1δ)k1𝛿𝑘(1-\delta)k( 1 - italic_δ ) italic_k highest-valued candidates, for any δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ).

Finally, we simulate our algorithms in the exact experimental setup of Fujii and Yoshida (2023). We find that they perform well both in terms of expected value accepted and fairness, whereas benchmark algorithms compromise on one of these desiderata.

1.1 Related work

The secretary problem. After Gardner (1960) introduced the secretary problem, Dynkin (1963) developed a simple and optimal stopping rule algorithm that, with probability at least 1/e1𝑒1/e1 / italic_e, accepts the candidate with maximum value. Due to its general and simple formulation, the problem has received a lot of attention (see, e.g., Lindley (1961); Gilbert and Mosteller (1966) and references therein) and it was later extended to more general versions such as k𝑘kitalic_k-secretary (Kleinberg, 2005), matroid-secretary (Babaioff et al., 2007b) and knapsack-secretary (Babaioff et al., 2007a).

Secretaries with predictions. The two works which are closest to our paper are those of Antoniadis et al. (2020b) and Fujii and Yoshida (2023). Both works design algorithms that use predictions regarding the values of the candidates to improve the performance guarantee of Dynkin’s algorithm when the predictions are accurate while also maintaining robustness guarantees when the predictions are arbitrarily wrong. Antoniadis et al. (2020b) uses as prediction only the maximum value and defines the prediction error as the additive difference between the predicted and true maximum value while Fujii and Yoshida (2023) receives a prediction for each candidate and defines the error as the maximum multiplicative difference between true and predicted value among all candidates. Very recently, Choo and Ling (2024) showed that any secretary algorithm that is 1111-consistent cannot achieve robustness better than 1/3+o(1)13𝑜11/3+o(1)1 / 3 + italic_o ( 1 ), even with predictions for each candidate. This result implies that, if we wish to maintain the 1O(ε)1𝑂𝜀1-O(\operatorname{\varepsilon})1 - italic_O ( italic_ε ) competitive ratio guarantee from (Fujii and Yoshida, 2023), then the probability of accepting the best candidate cannot be better than 1/3+o(1)13𝑜11/3+o(1)1 / 3 + italic_o ( 1 ).

Secretaries with distributional advice. Another active line of work is to explore how distributional advice can be used to surpass the 1/e1𝑒1/e1 / italic_e barrier of the classical secretary problem. Examples of this line of work include the prophet secretary problems where each candidate draws its valuation from a known distribution Esfandiari et al. (2017); Correa et al. (2021b, c); Azar et al. (2018) and the sample secretary problem where the algorithm designer has only sample access to this distribution Kaplan et al. (2020); Correa et al. (2021a). We note that in the former models, predictions are either samples from distributions or distributions themselves which are assumed to be perfectly correct, while in the learning-augmented setting, we receive point predictions that could be completely incorrect. Dütting et al. (2021) investigate a general model for advice where both values and advice are revealed upon a candidate’s arrival and are drawn from a joint distribution \mathcal{F}caligraphic_F. For example, their advice can be a noisy binary prediction about whether the current candidate is the best overall. Their main result uses linear programming to design optimal algorithms for a broad family of advice that satisfies two conditions. However, these two conditions are not satisfied by the predictions we consider. Additionally, we do not assume any prior knowledge of the prediction quality, whereas their noisy binary prediction setting assumes that the error probability of the binary advice is known.

Fairness in stopping algorithms. We say that a learning-augmented algorithm for the secretary problem is F𝐹Fitalic_F-fair if it accepts the candidate with the maximum true value with probability at least F𝐹Fitalic_F. In that definition, we do not quantify unfairness as a prediction property but as an algorithmic one, since the algorithm has to accept the best candidate with probability at least F𝐹Fitalic_F no matter how biased predictions are our fairness notion is a challenging one. That notion can be characterized as an individual fairness notion similar to the identity-independent fairness (IIF) and time-independent fairness (TIF) introduced in Arsenis and Kleinberg (2022). In the context of the secretary problem, IIF and TIF try to mitigate discrimination due to a person’s identity and arrival time respectively. While these are very appealing fairness notions, the fair algorithms designed in Arsenis and Kleinberg (2022) fall in the classical online algorithms setting as they do not make any assumptions about the future. Consequently, their performance is upper bound by the performance of the best algorithm in the classical worst-case analysis setting. It is also interesting to note the similarities with the poset secretary problem in Salem and Gupta (2023). In the latter work the set of candidates is split into several groups and candidates belonging to different groups cannot be compared due to different biases in the evaluation. In some sense, we try to do the same; different groups of candidates may have predictions that are affected by different biases making the comparison difficult before the true value of each candidate is revealed. Again, in Salem and Gupta (2023) no information about the values of future candidates is available and the performance of their algorithms is upper bounded by the best possible performance in the worst-case analysis setting.

2 Preliminaries

Secretary problem with predictions. Candidates i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n have true values uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and predicted values u^isubscript^𝑢𝑖\hat{u}_{i}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The number of candidates n𝑛nitalic_n and their predicted values are known in advance. The candidates arrive in a uniformly random order. Every time a new candidate arrives their true value is revealed and the algorithm must immediately decide whether to accept the current candidate or reject them irrevocably and wait for the next arrival. We let i=argmaxiuisuperscript𝑖subscriptargmax𝑖subscript𝑢𝑖{i^{*}}=\operatorname*{argmax}_{i}u_{i}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_argmax start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and ı^=argmaxiu^i^italic-ısubscriptargmax𝑖subscript^𝑢𝑖{\hat{\imath}}=\operatorname*{argmax}_{i}\hat{u}_{i}over^ start_ARG italic_ı end_ARG = roman_argmax start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denote the indices of the candidates with the maximum true and predicted value respectively. An instance \mathcal{I}caligraphic_I consists of the 2n2𝑛2n2 italic_n values u1,,un,u^1,,u^nsubscript𝑢1subscript𝑢𝑛subscript^𝑢1subscript^𝑢𝑛u_{1},\ldots,u_{n},\hat{u}_{1},\ldots,\hat{u}_{n}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT which, for convenience, are assumed to be positive111Our results for additive error allow negative values, but our extension to multiplicative error in Section A.2 requires positive values. and mutually distinct222This is without loss as adding an arbitrarily small perturbation to each true and predicted value does not change the performance of our algorithms. This allows for a unique argmaxargmax\operatorname*{argmax}roman_argmax in the definitions of isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG.. We let ε()𝜀\operatorname{\varepsilon}(\mathcal{I})italic_ε ( caligraphic_I ) denote its prediction error. For simplicity, we focus on the additive prediction error ε()=maxi|u^iui|𝜀subscript𝑖subscript^𝑢𝑖subscript𝑢𝑖\operatorname{\varepsilon}(\mathcal{I})=\max_{i}|\hat{u}_{i}-u_{i}|italic_ε ( caligraphic_I ) = roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT |, but we consider an abstract generalization that includes the multiplicative prediction error of Fujii and Yoshida (2023) in Appendix A.2.

Objectives. We let 𝒜𝒜\mathcal{A}caligraphic_A be a random variable denoting the candidate accepted by a given algorithm on a fixed instance, which depends on both the arrival order and any internal randomness in the algorithm. We consider the following desiderata for a given algorithm:

𝐄[u𝒜]𝐄delimited-[]subscriptu𝒜\displaystyle{{\bf E}{\left[u_{\mathcal{A}}\right]}}bold_E [ roman_u start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ] uiCε(),absentsubscript𝑢superscript𝑖𝐶𝜀for-all\displaystyle\geq u_{{i^{*}}}-C\cdot\operatorname{\varepsilon}(\mathcal{I}),\;% \forall\mathcal{I}≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_C ⋅ italic_ε ( caligraphic_I ) , ∀ caligraphic_I (smoothness)
P[𝒜=i]𝑃delimited-[]𝒜superscripti\displaystyle{P{\left[\mathcal{A}={i^{*}}\right]}}italic_P [ caligraphic_A = roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ] F,.absent𝐹for-all\displaystyle\geq F,\;\forall\mathcal{I}.≥ italic_F , ∀ caligraphic_I . (fairness)

Since the prediction error ε()𝜀\operatorname{\varepsilon}(\mathcal{I})italic_ε ( caligraphic_I ) is an additive prediction error, we define smoothness to provide an additive approximation guarantee that depends on ε()𝜀\operatorname{\varepsilon}(\mathcal{I})italic_ε ( caligraphic_I ). When considering the multiplicative prediction error of Fujii and Yoshida (2023), smoothness is defined to provide an approximation guarantee that is multiplicative instead of additive (see Theorem 4).

We aim to derive algorithms that can satisfy smoothness and fairness with constants C,F>0𝐶𝐹0C,F>0italic_C , italic_F > 0 that do not depend on the instance \mathcal{I}caligraphic_I or the number of candidates n𝑛nitalic_n. Existing algorithms for secretary with predictions do not simultaneously satisfy these desiderata, as shown by our examples in Appendix A.1.

Comparison to other objectives. Existing algorithms for secretary with predictions do satisfy a weaker notion called R𝑅Ritalic_R-robustness, where 𝐄[u𝒜]Rui𝐄delimited-[]subscriptu𝒜𝑅subscript𝑢superscript𝑖{{\bf E}{\left[u_{\mathcal{A}}\right]}}\geq R\cdot u_{{i^{*}}}bold_E [ roman_u start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ] ≥ italic_R ⋅ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT for some constant R>0𝑅0R>0italic_R > 0. Our desideratum of fairness implies F𝐹Fitalic_F-robustness and aligns with the classical secretary formulation where one is only rewarded for accepting the best candidate. Another notion of interest in existing literature is consistency, which is how 𝐄[u𝒜]𝐄delimited-[]subscriptu𝒜{{\bf E}{\left[u_{\mathcal{A}}\right]}}bold_E [ roman_u start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ] compares to uisubscript𝑢superscript𝑖u_{{i^{*}}}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT when ε()=0𝜀0\operatorname{\varepsilon}(\mathcal{I})=0italic_ε ( caligraphic_I ) = 0. Our smoothness desideratum implies 1111-consistency, the best possible consistency result, and guarantees a smooth degradation as ε()𝜀\varepsilon(\mathcal{I})italic_ε ( caligraphic_I ) increases beyond 00.

3 Algorithm and Analysis

We first present and analyze Additive-Pegging in Algorithm 1 which achieves the desiderata from Section 2. Then, we mention how using a more abstract prediction error and an almost identical analysis, permits us to generalize Additive-Pegging to Pegging which achieves comparable guarantees for a more general class of error functions that includes the multiplicative error.

Our algorithms assume that each candidate i𝑖iitalic_i arrives at an independently random arrival time tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT drawn uniformly from [0,1]01[0,1][ 0 , 1 ]. The latter continuous-time arrival model is equivalent to candidates arriving in a uniformly random order and simplifies the algorithm description and analysis. We also write ϵisubscriptitalic-ϵ𝑖\epsilon_{i}italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as shorthand for |uiu^i|subscript𝑢𝑖subscript^𝑢𝑖\lvert u_{i}-\hat{u}_{i}\rvert| italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT |, ε𝜀\operatorname{\varepsilon}italic_ε as shorthand for ε()𝜀\operatorname{\varepsilon}(\mathcal{I})italic_ε ( caligraphic_I ) (so that ε=maxiϵi𝜀subscript𝑖subscriptitalic-ϵ𝑖\operatorname{\varepsilon}=\max_{i}\epsilon_{i}italic_ε = roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT) and ijprecedes𝑖𝑗i\prec jitalic_i ≺ italic_j if ti<tjsubscript𝑡𝑖subscript𝑡𝑗t_{i}<t_{j}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT.

Description of Additive-Pegging. Additive-Pegging ensures smoothness by always accepting a candidate whose value is close to uı^subscript𝑢^italic-ıu_{\hat{\imath}}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT which, as we argue, is at least ui2εsubscript𝑢superscript𝑖2𝜀u_{i^{*}}-2\operatorname{\varepsilon}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - 2 italic_ε. To see this, note that uı^u^ı^ϵı^u^iϵı^uiϵiϵı^ui2εsubscript𝑢^italic-ısubscript^𝑢^italic-ısubscriptitalic-ϵ^italic-ısubscript^𝑢superscript𝑖subscriptitalic-ϵ^italic-ısubscript𝑢superscript𝑖subscriptitalic-ϵsuperscript𝑖subscriptitalic-ϵ^italic-ısubscript𝑢superscript𝑖2𝜀u_{\hat{\imath}}\geq\hat{u}_{\hat{\imath}}-\epsilon_{\hat{\imath}}\geq\hat{u}_% {i^{*}}-\epsilon_{\hat{\imath}}\geq u_{i^{*}}-\epsilon_{i^{*}}-\epsilon_{\hat{% \imath}}\geq u_{i^{*}}-2\operatorname{\varepsilon}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - 2 italic_ε, where we used that u^ı^u^isubscript^𝑢^italic-ısubscript^𝑢superscript𝑖\hat{u}_{\hat{\imath}}\geq\hat{u}_{i^{*}}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT (by definition of ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG) and εmax{ϵi,ϵı^}𝜀subscriptitalic-ϵsuperscript𝑖subscriptitalic-ϵ^italic-ı\operatorname{\varepsilon}\geq\max\{\epsilon_{i^{*}},\epsilon_{\hat{\imath}}\}italic_ε ≥ roman_max { italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT } (by definition of ε𝜀\operatorname{\varepsilon}italic_ε). Consequently, for smoothness, our algorithm defines the literal 𝒞=(i=ı^)𝒞𝑖^italic-ı\mathcal{C}=(i={\hat{\imath}})caligraphic_C = ( italic_i = over^ start_ARG italic_ı end_ARG ) at each new arrival, which is true if and only if i𝑖iitalic_i has the highest predicted value. Accepting while 𝒞𝒞\mathcal{C}caligraphic_C holds would maintain smoothness.

For the fairness desideratum, we note that Dynkin’s algorithm (Dynkin, 1963) for the classical secretary problem relies on the observation that if a constant fraction of the candidates have arrived and the candidate who just arrived has the maximum true value so far, then this candidate has a constant probability of being the best overall. The same high-level intuition is used in our algorithm. Every time a new candidate i𝑖iitalic_i arrives, we check if i𝑖iitalic_i is the maximum so far and if ti>1/2subscript𝑡𝑖12t_{i}>1/2italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 1 / 2; namely, we compute the literal \mathcal{F}caligraphic_F. Accepting when \mathcal{F}caligraphic_F is true, which is what Dynkin’s algorithm does, would ensure fairness.

However, there are two crucial situations where Additive-Pegging differs from Dynkin’s algorithm. The first such situation is when the candidate ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG with maximum predicted value arrives and we have that ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG is not the maximum so far or tı^1/2subscript𝑡^italic-ı12t_{{\hat{\imath}}}\leq 1/2italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≤ 1 / 2, i.e., 𝒞¯𝒞¯\mathcal{C}\land\overline{\mathcal{F}}caligraphic_C ∧ over¯ start_ARG caligraphic_F end_ARG is true. In this case, we cannot always reject ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG, as Dynkin’s algorithm would, because that would not guarantee smoothness. Instead, we reject ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG only if there is a future candidate whose prediction is sufficiently high compared to uı^subscript𝑢^italic-ıu_{{\hat{\imath}}}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT. We call I𝗉𝖾𝗀𝗀𝖾𝖽superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽I^{\mathsf{pegged}}italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT the set of those candidates. The main idea behind the pegged set I𝗉𝖾𝗀𝗀𝖾𝖽superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽I^{\mathsf{pegged}}italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT is that it contains the last candidate to arrive who can guarantee the smoothness property, which is why we accept that candidate when they arrive. The second situation where our algorithm departs from Dynkin’s algorithm is when a candidate i𝑖iitalic_i arrives with iı^,ii𝗉𝖾𝗀𝗀𝖾𝖽formulae-sequence𝑖^italic-ı𝑖superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽i\neq{\hat{\imath}},i\neq{i^{\mathsf{pegged}}}italic_i ≠ over^ start_ARG italic_ı end_ARG , italic_i ≠ italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT and we have that \mathcal{F}caligraphic_F is true, in which case Algorithm 1 executes the if statement under the case 𝒞¯¯𝒞\overline{\mathcal{C}}\land\mathcal{F}over¯ start_ARG caligraphic_C end_ARG ∧ caligraphic_F. In this situation, we cannot always accept i𝑖iitalic_i as Dynkin’s algorithm would, because that would again violate smoothness. Instead, we accept i𝑖iitalic_i only if uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT can be lower bounded by u^ı^εtisubscript^𝑢^italic-ısubscript𝜀subscript𝑡𝑖\hat{u}_{\hat{\imath}}-\varepsilon_{t_{i}}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT, noting that if conversely ui+εtisubscript𝑢𝑖subscript𝜀subscript𝑡𝑖u_{i}+\varepsilon_{t_{i}}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT is smaller than u^ı^subscript^𝑢^italic-ı\hat{u}_{\hat{\imath}}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT, then accepting i𝑖iitalic_i might be detrimental to our quest of ensuring smoothness.

Algorithm 1 Additive-Pegging
//* The algorithm stops when it accepts a candidate by executing 𝒜i𝒜𝑖\mathcal{A}\leftarrow icaligraphic_A ← italic_i. *//
Initialization: I𝗉𝖾𝗀𝗀𝖾𝖽superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽I^{\mathsf{pegged}}\leftarrow\emptysetitalic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT ← ∅
while agent i𝑖iitalic_i arrives at time tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
     if iI𝗉𝖾𝗀𝗀𝖾𝖽𝑖superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽i\in I^{\mathsf{pegged}}italic_i ∈ italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT then
         if |I𝗉𝖾𝗀𝗀𝖾𝖽|=1superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽1|I^{\mathsf{pegged}}|=1| italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT | = 1 then
              𝒜i𝒜𝑖\mathcal{A}\leftarrow icaligraphic_A ← italic_i
         else
              I𝗉𝖾𝗀𝗀𝖾𝖽I𝗉𝖾𝗀𝗀𝖾𝖽{i}superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽𝑖I^{\mathsf{pegged}}\leftarrow I^{\mathsf{pegged}}\setminus\{i\}italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT ← italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT ∖ { italic_i }               
     (ui>maxjiuj)(ti>1/2),𝒞(i=ı^),εtimaxj:tjti|u^juj|formulae-sequencesubscript𝑢𝑖subscriptprecedes𝑗𝑖subscript𝑢𝑗subscript𝑡𝑖12formulae-sequence𝒞𝑖^italic-ısubscript𝜀subscript𝑡𝑖subscript:𝑗subscript𝑡𝑗subscript𝑡𝑖subscript^𝑢𝑗subscript𝑢𝑗\mathcal{F}\leftarrow\left(u_{i}>\max_{j\prec i}u_{j}\right)\land\left(t_{i}>1% /2\right),\mathcal{C}\leftarrow\left(i={\hat{\imath}}\right),\varepsilon_{t_{i% }}\leftarrow\max_{j:t_{j}\leq t_{i}}\lvert\hat{u}_{j}-u_{j}\rvertcaligraphic_F ← ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > roman_max start_POSTSUBSCRIPT italic_j ≺ italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∧ ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 1 / 2 ) , caligraphic_C ← ( italic_i = over^ start_ARG italic_ı end_ARG ) , italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ← roman_max start_POSTSUBSCRIPT italic_j : italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT | over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |
     if 𝒞𝒞\mathcal{C}\land\mathcal{F}caligraphic_C ∧ caligraphic_F then
         𝒜i𝒜𝑖\mathcal{A}\leftarrow icaligraphic_A ← italic_i
     else if 𝒞¯𝒞¯\mathcal{C}\land\overline{\mathcal{F}}caligraphic_C ∧ over¯ start_ARG caligraphic_F end_ARG then
         I𝗉𝖾𝗀𝗀𝖾𝖽{jı^:uı^<u^j+εtı^}superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽conditional-setsucceeds𝑗^italic-ısubscript𝑢^italic-ısubscript^𝑢𝑗subscript𝜀subscript𝑡^italic-ıI^{\mathsf{pegged}}\leftarrow\{j\succ\hat{\imath}:u_{\hat{\imath}}<\hat{u}_{j}% +\varepsilon_{t_{{\hat{\imath}}}}\}italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT ← { italic_j ≻ over^ start_ARG italic_ı end_ARG : italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT } (note that ı^=i^italic-ı𝑖{\hat{\imath}}=iover^ start_ARG italic_ı end_ARG = italic_i)
         if I𝗉𝖾𝗀𝗀𝖾𝖽=superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽I^{\mathsf{pegged}}=\emptysetitalic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT = ∅ then
              𝒜i𝒜𝑖\mathcal{A}\leftarrow icaligraphic_A ← italic_i          
     else if 𝒞¯¯𝒞\overline{\mathcal{C}}\land\mathcal{F}over¯ start_ARG caligraphic_C end_ARG ∧ caligraphic_F then
         if ui>u^ı^εtisubscript𝑢𝑖subscript^𝑢^italic-ısubscript𝜀subscript𝑡𝑖u_{i}>\hat{u}_{\hat{\imath}}-\varepsilon_{t_{i}}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT then
              𝒜i𝒜𝑖\mathcal{A}\leftarrow icaligraphic_A ← italic_i               

Analysis of the Additive-Pegging algorithm.

Lemma 1.

Additive-Pegging satisfies u𝒜ui4ε(),subscript𝑢𝒜subscript𝑢superscript𝑖4𝜀for-allu_{\mathcal{A}}\ \geq u_{i^{*}}-4\operatorname{\varepsilon}(\mathcal{I}),\;% \forall\mathcal{I}italic_u start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - 4 italic_ε ( caligraphic_I ) , ∀ caligraphic_I with probability 1.

Proof.

Let i𝗉𝖾𝗀𝗀𝖾𝖽superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽{i^{\mathsf{pegged}}}italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT denote the last arriving candidate in I𝗉𝖾𝗀𝗀𝖾𝖽superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽I^{\mathsf{pegged}}italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT.

We first argue that Pegging always accepts a candidate irrespective of the random arrival times of the candidates. We focus on any instance where Additive-Pegging does not accept a candidate until time tı^subscript𝑡^italic-ıt_{\hat{\imath}}italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT. At time tı^subscript𝑡^italic-ıt_{\hat{\imath}}italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT either 𝒞𝒞\mathcal{C}\land\mathcal{F}caligraphic_C ∧ caligraphic_F or 𝒞¯𝒞¯\mathcal{C}\land\overline{\mathcal{F}}caligraphic_C ∧ over¯ start_ARG caligraphic_F end_ARG are true. Since in the former case candidate ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG is accepted, we focus on the latter case and in particular whenever the set I𝗉𝖾𝗀𝗀𝖾𝖽superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽I^{\mathsf{pegged}}italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT which is computed is non-empty (otherwise, candidate ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG is accepted). In that case, it is guaranteed that by time ti𝗉𝖾𝗀𝗀𝖾𝖽subscript𝑡superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽t_{{i^{\mathsf{pegged}}}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT Additive-Pegging will accept a candidate.

We now argue that in all cases Additive-Pegging maintains smoothness. Using ε𝜀\operatorname{\varepsilon}italic_ε, ϵisubscriptitalic-ϵ𝑖\epsilon_{i}italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT definitions and the fact that ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG is the candidate with the maximum predicted value we have: u^ı^u^iuiϵiuiεsubscript^𝑢^italic-ısubscript^𝑢superscript𝑖subscript𝑢superscript𝑖subscriptitalic-ϵsuperscript𝑖subscript𝑢superscript𝑖𝜀\hat{u}_{\hat{\imath}}\geq\hat{u}_{i^{*}}\geq u_{i^{*}}-\epsilon_{i^{*}}\geq u% _{i^{*}}-\operatorname{\varepsilon}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ε. If candidate ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG is accepted then using the latter lower bound we get uı^u^ı^ϵı^uiεϵı^ui2εsubscript𝑢^italic-ısubscript^𝑢^italic-ısubscriptitalic-ϵ^italic-ısubscript𝑢superscript𝑖𝜀subscriptitalic-ϵ^italic-ısubscript𝑢superscript𝑖2𝜀u_{\hat{\imath}}\geq\hat{u}_{\hat{\imath}}-\epsilon_{\hat{\imath}}\geq u_{i^{*% }}-\operatorname{\varepsilon}-\epsilon_{\hat{\imath}}\geq u_{i^{*}}-2% \operatorname{\varepsilon}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ε - italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - 2 italic_ε. If we accept iı^𝑖^italic-ıi\neq{\hat{\imath}}italic_i ≠ over^ start_ARG italic_ı end_ARG and the if statement of 𝒞¯¯𝒞\overline{\mathcal{C}}\land{\mathcal{F}}over¯ start_ARG caligraphic_C end_ARG ∧ caligraphic_F is executed at time tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT then we have ui>u^ı^εtiuiεεtiui2εsubscript𝑢𝑖subscript^𝑢^italic-ısubscript𝜀subscript𝑡𝑖subscript𝑢superscript𝑖𝜀subscript𝜀subscript𝑡𝑖subscript𝑢superscript𝑖2𝜀u_{i}>\hat{u}_{\hat{\imath}}-\varepsilon_{t_{i}}\geq u_{i^{*}}-\operatorname{% \varepsilon}-\varepsilon_{t_{i}}\geq u_{i^{*}}-2\operatorname{\varepsilon}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ε - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - 2 italic_ε. Finally, we need to lower bound the value ui𝗉𝖾𝗀𝗀𝖾𝖽subscript𝑢superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽u_{{i^{\mathsf{pegged}}}}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT in case our algorithm terminates accepting i𝗉𝖾𝗀𝗀𝖾𝖽superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽{i^{\mathsf{pegged}}}italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT. Note that from the way the pegged set I𝗉𝖾𝗀𝗀𝖾𝖽superscript𝐼𝗉𝖾𝗀𝗀𝖾𝖽I^{\mathsf{pegged}}italic_I start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT is updated when 𝒞¯𝒞¯\mathcal{C}\land\overline{\mathcal{F}}caligraphic_C ∧ over¯ start_ARG caligraphic_F end_ARG is true we always have uı^<u^i𝗉𝖾𝗀𝗀𝖾𝖽+εtı^subscript𝑢^italic-ısubscript^𝑢superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽subscript𝜀subscript𝑡^italic-ıu_{\hat{\imath}}<\hat{u}_{i^{\mathsf{pegged}}}+\varepsilon_{t_{{\hat{\imath}}}}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Since ui𝗉𝖾𝗀𝗀𝖾𝖽u^i𝗉𝖾𝗀𝗀𝖾𝖽ϵi𝗉𝖾𝗀𝗀𝖾𝖽subscript𝑢superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽subscript^𝑢superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽subscriptitalic-ϵsuperscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽u_{i^{\mathsf{pegged}}}\geq\hat{u}_{i^{\mathsf{pegged}}}-\epsilon_{i^{\mathsf{% pegged}}}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT we can conclude that ui𝗉𝖾𝗀𝗀𝖾𝖽>uı^εtı^ϵi𝗉𝖾𝗀𝗀𝖾𝖽ui4εsubscript𝑢superscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽subscript𝑢^italic-ısubscript𝜀subscript𝑡^italic-ısubscriptitalic-ϵsuperscript𝑖𝗉𝖾𝗀𝗀𝖾𝖽subscript𝑢superscript𝑖4𝜀u_{i^{\mathsf{pegged}}}>u_{\hat{\imath}}-\varepsilon_{t_{{\hat{\imath}}}}-% \epsilon_{i^{\mathsf{pegged}}}\geq u_{i^{*}}-4\operatorname{\varepsilon}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT sansserif_pegged end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - 4 italic_ε. ∎

Lemma 2.

Additive-Pegging satisfies P[𝒜=i]1/16,𝑃delimited-[]𝒜superscripti116for-all{P{\left[\mathcal{A}={i^{*}}\right]}}\geq 1/16,\;\forall\mathcal{I}italic_P [ caligraphic_A = roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ] ≥ 1 / 16 , ∀ caligraphic_I.

Proof.

In the following, we assume that the number of candidates is larger or equal to 3333. The proof for the case where n=2𝑛2n=2italic_n = 2 is almost identical while the fairness guarantee in that case is 1/4141/41 / 4. We denote by ı~~italic-ı{\tilde{\imath}}over~ start_ARG italic_ı end_ARG the index of the candidate with the highest true value except isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG, i.e., ı~=argmaxii,ı^ui~italic-ısubscriptargmax𝑖superscript𝑖^italic-ısubscript𝑢𝑖{\tilde{\imath}}=\operatorname*{argmax}_{i\neq{i^{*}},{\hat{\imath}}}u_{i}over~ start_ARG italic_ı end_ARG = roman_argmax start_POSTSUBSCRIPT italic_i ≠ italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Note that depending on the value of uı^subscript𝑢^italic-ıu_{\hat{\imath}}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT, ı~~italic-ı{\tilde{\imath}}over~ start_ARG italic_ı end_ARG might denote the index of the candidate with the second or third highest true value. To prove fairness we distinguish between two cases: either ı^=i^italic-ısuperscript𝑖{\hat{\imath}}={i^{*}}over^ start_ARG italic_ı end_ARG = italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT or ı^i^italic-ısuperscript𝑖{\hat{\imath}}\neq{i^{*}}over^ start_ARG italic_ı end_ARG ≠ italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. For each of those cases, we define an event and argue that: (1) the event happens with constant probability, and (2) if that event happens then Additive-Pegging accepts isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.

If i=ı^superscript𝑖^italic-ı{i^{*}}={\hat{\imath}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = over^ start_ARG italic_ı end_ARG we define event E={tı~<1/2<ti}𝐸subscript𝑡~italic-ı12subscript𝑡superscript𝑖E=\{t_{\tilde{\imath}}<1/2<t_{i^{*}}\}italic_E = { italic_t start_POSTSUBSCRIPT over~ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < 1 / 2 < italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } for which P[E]=1/4𝑃delimited-[]E14{P{\left[E\right]}}=1/4italic_P [ roman_E ] = 1 / 4. E𝐸Eitalic_E implies that our algorithm does not accept any candidate until time tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. Indeed, note that at any point in time before tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, both literals \mathcal{F}caligraphic_F and 𝒞𝒞\mathcal{C}caligraphic_C are simultaneously false. On the contrary, at time tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, both 𝒞𝒞\mathcal{C}caligraphic_C and \mathcal{F}caligraphic_F are true and our algorithm accepts isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.

On the other hand, if iı^superscript𝑖^italic-ı{i^{*}}\neq{\hat{\imath}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ≠ over^ start_ARG italic_ı end_ARG we distinguish between two sub-cases. First, we show that either uı^<u^i+ϵı^subscript𝑢^italic-ısubscript^𝑢superscript𝑖subscriptitalic-ϵ^italic-ıu_{\hat{\imath}}<\hat{u}_{i^{*}}+\epsilon_{\hat{\imath}}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT or ui>u^ı^ϵisubscript𝑢superscript𝑖subscript^𝑢^italic-ısubscriptitalic-ϵsuperscript𝑖u_{i^{*}}>\hat{u}_{\hat{\imath}}-\epsilon_{i^{*}}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT is true. By contradiction, assume that both inequalities do not hold, then

uı^u^i+ϵı^ui>uı^ui>u^i+ϵı^uiu^i>ϵı^ϵiuiu^iϵi>ϵı^subscript𝑢^italic-ısubscript^𝑢superscript𝑖subscriptitalic-ϵ^italic-ısubscript𝑢superscript𝑖subscript𝑢^italic-ısubscript𝑢superscript𝑖subscript^𝑢superscript𝑖subscriptitalic-ϵ^italic-ıabsentsubscript𝑢superscript𝑖subscript^𝑢superscript𝑖subscriptitalic-ϵ^italic-ısubscriptitalic-ϵsuperscript𝑖subscript𝑢superscript𝑖subscript^𝑢superscript𝑖subscriptitalic-ϵsuperscript𝑖subscriptitalic-ϵ^italic-ı\displaystyle\vspace{-1mm}u_{\hat{\imath}}\geq\hat{u}_{i^{*}}+\epsilon_{\hat{% \imath}}\xRightarrow{u_{i^{*}}>u_{\hat{\imath}}}u_{i^{*}}>\hat{u}_{i^{*}}+% \epsilon_{\hat{\imath}}\xRightarrow{}u_{i^{*}}-\hat{u}_{i^{*}}>\epsilon_{\hat{% \imath}}\xRightarrow{\epsilon_{i^{*}}\geq u_{i^{*}}-\hat{u}_{i^{*}}}\epsilon_{% i^{*}}>\epsilon_{\hat{\imath}}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT start_ARROW start_OVERACCENT italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT end_OVERACCENT ⇒ end_ARROW italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT start_ARROW start_OVERACCENT end_OVERACCENT ⇒ end_ARROW italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT start_ARROW start_OVERACCENT italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_OVERACCENT ⇒ end_ARROW italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT
uiu^ı^ϵiui>uı^uı^<u^ı^ϵiϵi<u^ı^uı^ϵı^uı^u^ı^ϵi<ϵı^subscript𝑢superscript𝑖subscript^𝑢^italic-ısubscriptitalic-ϵsuperscript𝑖subscript𝑢superscript𝑖subscript𝑢^italic-ısubscript𝑢^italic-ısubscript^𝑢^italic-ısubscriptitalic-ϵsuperscript𝑖absentsubscriptitalic-ϵsuperscript𝑖subscript^𝑢^italic-ısubscript𝑢^italic-ısubscriptitalic-ϵ^italic-ısubscript𝑢^italic-ısubscript^𝑢^italic-ısubscriptitalic-ϵsuperscript𝑖subscriptitalic-ϵ^italic-ı\displaystyle u_{i^{*}}\leq\hat{u}_{\hat{\imath}}-\epsilon_{i^{*}}\xRightarrow% {u_{i^{*}}>u_{\hat{\imath}}}u_{\hat{\imath}}<\hat{u}_{\hat{\imath}}-\epsilon_{% i^{*}}\xRightarrow{}\epsilon_{i^{*}}<\hat{u}_{\hat{\imath}}-u_{\hat{\imath}}% \xRightarrow{\epsilon_{\hat{\imath}}\geq u_{\hat{\imath}}-\hat{u}_{\hat{\imath% }}}\epsilon_{i^{*}}<\epsilon_{\hat{\imath}}\vspace{-1mm}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≤ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_ARROW start_OVERACCENT italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT end_OVERACCENT ⇒ end_ARROW italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_ARROW start_OVERACCENT end_OVERACCENT ⇒ end_ARROW italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT start_ARROW start_OVERACCENT italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT end_OVERACCENT ⇒ end_ARROW italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT < italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT

which is a contradiction. We now define two events E1subscript𝐸1E_{1}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and E2subscript𝐸2E_{2}italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT which imply that isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is always accepted whenever {uı^<u^i+ϵı^}subscript𝑢^italic-ısubscript^𝑢superscript𝑖subscriptitalic-ϵ^italic-ı\{u_{\hat{\imath}}<\hat{u}_{i^{*}}+\epsilon_{\hat{\imath}}\}{ italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT } and {ui>u^ı^ϵi}subscript𝑢superscript𝑖subscript^𝑢^italic-ısubscriptitalic-ϵsuperscript𝑖\{u_{i^{*}}>\hat{u}_{\hat{\imath}}-\epsilon_{i^{*}}\}{ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } are true respectively.

If uı^<u^i+ϵı^subscript𝑢^italic-ısubscript^𝑢superscript𝑖subscriptitalic-ϵ^italic-ıu_{\hat{\imath}}<\hat{u}_{i^{*}}+\epsilon_{\hat{\imath}}italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT, then we define event E1={tı~<1/2}{tı^<1/2}{1/2<ti}subscript𝐸1subscript𝑡~italic-ı12subscript𝑡^italic-ı1212subscript𝑡superscript𝑖E_{1}=\{t_{\tilde{\imath}}<1/2\}\land\{t_{\hat{\imath}}<1/2\}\land\{1/2<t_{i^{% *}}\}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = { italic_t start_POSTSUBSCRIPT over~ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < 1 / 2 } ∧ { italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < 1 / 2 } ∧ { 1 / 2 < italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } which is composed by 3333 independent events and it happens with probability P[E1]=1/23=1/8𝑃delimited-[]subscriptE11superscript2318{P{\left[E_{1}\right]}}=1/2^{3}=1/8italic_P [ roman_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ] = 1 / 2 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT = 1 / 8. E1subscript𝐸1E_{1}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT implies that tı^<tiεtiϵı^subscript𝑡^italic-ısubscript𝑡superscript𝑖subscript𝜀subscript𝑡superscript𝑖subscriptitalic-ϵ^italic-ıt_{\hat{\imath}}<t_{i^{*}}\Rightarrow\varepsilon_{t_{{i^{*}}}}\geq\epsilon_{% \hat{\imath}}italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⇒ italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT, thus we can deduce that ui>uı^u^ı^ϵı^u^ı^εtisubscript𝑢superscript𝑖subscript𝑢^italic-ısubscript^𝑢^italic-ısubscriptitalic-ϵ^italic-ısubscript^𝑢^italic-ısubscript𝜀subscript𝑡superscript𝑖u_{i^{*}}>u_{\hat{\imath}}\geq\hat{u}_{\hat{\imath}}-\epsilon_{\hat{\imath}}% \geq\hat{u}_{\hat{\imath}}-\varepsilon_{t_{{i^{*}}}}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT ≥ over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Consequently, if until time tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT all candidates are rejected, E1subscript𝐸1E_{1}italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT implies that 𝒞¯{ui>u^ı^ϵi}¯𝒞subscript𝑢superscript𝑖subscript^𝑢^italic-ısubscriptitalic-ϵsuperscript𝑖\overline{\mathcal{C}}\land\mathcal{F}\land\left\{u_{i^{*}}>\hat{u}_{\hat{% \imath}}-\epsilon_{{i^{*}}}\right\}over¯ start_ARG caligraphic_C end_ARG ∧ caligraphic_F ∧ { italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT } is true at time tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT and candidate isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is hired. To argue that no candidate is accepted before time tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, note that \mathcal{F}caligraphic_F is false at all times before tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT and at time tı^subscript𝑡^italic-ıt_{\hat{\imath}}italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT (when literal 𝒞𝒞\mathcal{C}caligraphic_C is true) the set {jı^:uı^<u^j+εtı^}{jı^:uı^<u^j+ϵı^}conditional-setsucceeds𝑗^italic-ısubscript𝑢^italic-ısubscript^𝑢𝑗subscriptitalic-ϵ^italic-ıconditional-setsucceeds𝑗^italic-ısubscript𝑢^italic-ısubscript^𝑢𝑗subscript𝜀subscript𝑡^italic-ı\{j\succ{\hat{\imath}}:u_{\hat{\imath}}<\hat{u}_{j}+\varepsilon_{t_{{\hat{% \imath}}}}\}\supseteq\{j\succ{\hat{\imath}}:u_{\hat{\imath}}<\hat{u}_{j}+% \epsilon_{\hat{\imath}}\}{ italic_j ≻ over^ start_ARG italic_ı end_ARG : italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT } ⊇ { italic_j ≻ over^ start_ARG italic_ı end_ARG : italic_u start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT } contains isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.

If ui>u^ı^ϵisubscript𝑢superscript𝑖subscript^𝑢^italic-ısubscriptitalic-ϵsuperscript𝑖u_{i^{*}}>\hat{u}_{\hat{\imath}}-\epsilon_{i^{*}}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, then we define E2={tı~<1/2<ti<tı^}subscript𝐸2subscript𝑡~italic-ı12subscript𝑡superscript𝑖subscript𝑡^italic-ıE_{2}=\{t_{\tilde{\imath}}<1/2<t_{i^{*}}<t_{\hat{\imath}}\}italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { italic_t start_POSTSUBSCRIPT over~ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT < 1 / 2 < italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT < italic_t start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT } which happens with probability

P[E2]𝑃delimited-[]subscriptE2\displaystyle{P{\left[E_{2}\right]}}italic_P [ roman_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ] =P[tı~<1/2]P[1/2<ti<tı^]absent𝑃delimited-[]subscriptt~ı12𝑃delimited-[]12subscripttsuperscriptisubscriptt^ı\displaystyle={P{\left[t_{\tilde{\imath}}<1/2\right]}}\cdot{P{\left[1/2<t_{i^{% *}}<t_{\hat{\imath}}\right]}}= italic_P [ roman_t start_POSTSUBSCRIPT over~ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT < 1 / 2 ] ⋅ italic_P [ 1 / 2 < roman_t start_POSTSUBSCRIPT roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT < roman_t start_POSTSUBSCRIPT over^ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT ]
=P[tı~<1/2]P[1/2<min{ti,tı^}min{ti,tı^}=ti]absent𝑃delimited-[]subscriptt~ı12𝑃delimited-[]12subscripttsuperscriptisubscriptt^ısubscripttsuperscriptisubscriptt^ısubscripttsuperscripti\displaystyle={P{\left[t_{\tilde{\imath}}<1/2\right]}}\cdot{P{\left[1/2<\min\{% t_{i^{*}},t_{\hat{\imath}}\}\land\min\{t_{i^{*}},t_{\hat{\imath}}\}=t_{i^{*}}% \right]}}= italic_P [ roman_t start_POSTSUBSCRIPT over~ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT < 1 / 2 ] ⋅ italic_P [ 1 / 2 < roman_min { roman_t start_POSTSUBSCRIPT roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , roman_t start_POSTSUBSCRIPT over^ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT } ∧ roman_min { roman_t start_POSTSUBSCRIPT roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , roman_t start_POSTSUBSCRIPT over^ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT } = roman_t start_POSTSUBSCRIPT roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ]
=P[tı~<1/2]P[1/2<min{ti,tı^}]P[min{ti,tı^}=ti]absent𝑃delimited-[]subscriptt~ı12𝑃delimited-[]12subscripttsuperscriptisubscriptt^ı𝑃delimited-[]subscripttsuperscriptisubscriptt^ısubscripttsuperscripti\displaystyle={P{\left[t_{\tilde{\imath}}<1/2\right]}}\cdot{P{\left[1/2<\min\{% t_{i^{*}},t_{\hat{\imath}}\}\right]}}\cdot{P{\left[\min\{t_{i^{*}},t_{\hat{% \imath}}\}=t_{i^{*}}\right]}}= italic_P [ roman_t start_POSTSUBSCRIPT over~ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT < 1 / 2 ] ⋅ italic_P [ 1 / 2 < roman_min { roman_t start_POSTSUBSCRIPT roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , roman_t start_POSTSUBSCRIPT over^ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT } ] ⋅ italic_P [ roman_min { roman_t start_POSTSUBSCRIPT roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , roman_t start_POSTSUBSCRIPT over^ start_ARG roman_ı end_ARG end_POSTSUBSCRIPT } = roman_t start_POSTSUBSCRIPT roman_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ]
=(1/2)(1/4)(1/2)=1/16absent121412116\displaystyle=(1/2)\cdot(1/4)\cdot(1/2)=1/16= ( 1 / 2 ) ⋅ ( 1 / 4 ) ⋅ ( 1 / 2 ) = 1 / 16

Note that until time tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT no candidate is accepted since 𝒞𝒞\mathcal{C}caligraphic_C and \mathcal{F}caligraphic_F are both false at all times. Indeed, between times 00 and 1/2121/21 / 2 only ı^^italic-ı{\hat{\imath}}over^ start_ARG italic_ı end_ARG could have been accepted but its arrival time is after tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, and between times 1/2121/21 / 2 and tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT no candidate has a true value larger than uı~subscript𝑢~italic-ıu_{\tilde{\imath}}italic_u start_POSTSUBSCRIPT over~ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT. Finally, note that at time tisubscript𝑡superscript𝑖t_{i^{*}}italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT we have εtiϵisubscript𝜀subscript𝑡superscript𝑖subscriptitalic-ϵsuperscript𝑖\varepsilon_{t_{{i^{*}}}}\geq\epsilon_{i^{*}}italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ italic_ϵ start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT and consequently 𝒞¯{ui>u^ı^εti}¯𝒞subscript𝑢superscript𝑖subscript^𝑢^italic-ısubscript𝜀subscript𝑡superscript𝑖\overline{\mathcal{C}}\land\mathcal{F}\land\{u_{i^{*}}>\hat{u}_{\hat{\imath}}-% \varepsilon_{t_{{i^{*}}}}\}over¯ start_ARG caligraphic_C end_ARG ∧ caligraphic_F ∧ { italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_ı end_ARG end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT } is true and isuperscript𝑖{i^{*}}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT gets accepted. ∎

Theorem 3.

Additive-Pegging satisfies smoothness and fairness with C=4𝐶4C=4italic_C = 4 and F=1/16𝐹116F=1/16italic_F = 1 / 16.

Theorem 3 follows directly from Lemmas 1 and 2. We note that Lemma 1 actually implies a stronger notion of smoothness that holds with probability 1.

The general Pegging algorithm. In Section A.2 we generalize the Additive-Pegging algorithm to the Pegging algorithm to provide fair and smooth algorithms for different prediction error definitions. Additive-Pegging is an instantiation of Pegging when the prediction error is defined as the maximum absolute difference between true and predicted values among candidates. To further demonstrate the generality of Pegging, we also instantiate it over the same prediction error definition ε()=maxi|1u^i/ui|𝜀subscript𝑖1subscript^𝑢𝑖subscript𝑢𝑖\operatorname{\varepsilon}(\mathcal{I})=\max_{i}\left|1-\hat{u}_{i}/u_{i}\right|italic_ε ( caligraphic_I ) = roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | 1 - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | as in Fujii and Yoshida (2023) and recover similar smoothness bounds while also ensuring fairness. We name the latter instantiation Multiplicative-Pegging and present its guarantees in Theorem 4.

Theorem 4.

Let ε()=maxi|1u^i/ui|𝜀subscript𝑖1subscript^𝑢𝑖subscript𝑢𝑖\operatorname{\varepsilon}(\mathcal{I})=\max_{i}\left|1-\hat{u}_{i}/u_{i}\right|italic_ε ( caligraphic_I ) = roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | 1 - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | and assume ui,u^i>0i[n]subscript𝑢𝑖subscript^𝑢𝑖0for-all𝑖delimited-[]𝑛u_{i},\hat{u}_{i}>0\ \forall i\in[n]italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 ∀ italic_i ∈ [ italic_n ]. Then Multiplicative-Pegging satisfies fairness with F=1/16𝐹116F=1/16italic_F = 1 / 16 and selects a candidate 𝒜𝒜\mathcal{A}caligraphic_A such that u𝒜ui(14ε())subscript𝑢𝒜subscript𝑢superscript𝑖14𝜀u_{\mathcal{A}}\geq u_{{i^{*}}}\cdot\left(1-4\cdot\operatorname{\varepsilon}(% \mathcal{I})\right)italic_u start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ≥ italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⋅ ( 1 - 4 ⋅ italic_ε ( caligraphic_I ) ) with probability 1.

Fujii and Yoshida (2023) define the prediction error as in Theorem 4 and design an algorithm that accepts a candidate with expected value at least uimax{(1ε)/(1+ε),0.215}subscript𝑢superscript𝑖1𝜀1𝜀0.215u_{i^{*}}\cdot\max\left\{(1-\operatorname{\varepsilon})/(1+\operatorname{% \varepsilon}),0.215\right\}italic_u start_POSTSUBSCRIPT italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⋅ roman_max { ( 1 - italic_ε ) / ( 1 + italic_ε ) , 0.215 }. Since (1ε)/(1+ε)12ε1𝜀1𝜀12𝜀(1-\operatorname{\varepsilon})/(1+\operatorname{\varepsilon})\geq 1-2% \operatorname{\varepsilon}( 1 - italic_ε ) / ( 1 + italic_ε ) ≥ 1 - 2 italic_ε their algorithm satisfies a smoothness desideratum similar to the one in Theorem 4, but as we prove in Section A.1, it violates the fairness desideratum.

4 Extension: k𝑘kitalic_k-Secretary problem with predictions

We consider the generalization to the k𝑘kitalic_k-secretary problem, where k1𝑘1k\geq 1italic_k ≥ 1 candidates can be accepted. To simplify notation we label the candidates in decreasing order of predicted value, so that u^1>>u^nsubscript^𝑢1subscript^𝑢𝑛\hat{u}_{1}>\cdots>\hat{u}_{n}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > ⋯ > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and denote rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT to be the index of the candidate with the \ellroman_ℓ’th highest true value so that ur1>>urnsubscript𝑢subscript𝑟1subscript𝑢subscript𝑟𝑛u_{r_{1}}>\cdots>u_{r_{n}}italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT > ⋯ > italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. The prediction error is again defined as ε():=maxi|uiu^i|assign𝜀subscript𝑖subscript𝑢𝑖subscript^𝑢𝑖\operatorname{\varepsilon}(\mathcal{I}):=\max_{i}|u_{i}-\hat{u}_{i}|italic_ε ( caligraphic_I ) := roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | and we let S𝑆Sitalic_S denote the random set of candidates accepted by a given algorithm on a fixed instance. The extension of our two objectives to this setting is

𝐄[iSui]𝐄delimited-[]subscriptiSsubscriptui\displaystyle{{\bf E}{\left[\sum_{i\in S}u_{i}\right]}}bold_E [ ∑ start_POSTSUBSCRIPT roman_i ∈ roman_S end_POSTSUBSCRIPT roman_u start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ] =1kurCε(),absentsuperscriptsubscript1𝑘subscript𝑢subscript𝑟𝐶𝜀for-all\displaystyle\geq\sum_{\ell=1}^{k}u_{r_{\ell}}-C\cdot\operatorname{\varepsilon% }(\mathcal{I}),\;\forall\mathcal{I}≥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_C ⋅ italic_ε ( caligraphic_I ) , ∀ caligraphic_I (smoothness for k𝑘kitalic_k-secretary)
P[rS]𝑃delimited-[]subscriptrS\displaystyle{P{\left[r_{\ell}\in S\right]}}italic_P [ roman_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∈ roman_S ] F,=1,,k,.formulae-sequenceabsentsubscript𝐹for-all1𝑘for-all\displaystyle\geq F_{\ell},\;\forall\ell=1,\ldots,k,\;\;\forall\mathcal{I}.≥ italic_F start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , ∀ roman_ℓ = 1 , … , italic_k , ∀ caligraphic_I . (fairness for k𝑘kitalic_k-secretary)

The smoothness desideratum compares the expected sum of true values accepted by the algorithm to the sum of the k𝑘kitalic_k highest true values that could have been accepted. The fairness desideratum guarantees each of the candidates ranked =1,,k1𝑘\ell=1,\ldots,kroman_ℓ = 1 , … , italic_k to be accepted with probability Fsubscript𝐹F_{\ell}italic_F start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. The k𝑘kitalic_k-secretary problem with predictions has been studied by Fujii and Yoshida (2023), who derive an algorithm satisfying 𝐄[iSui]max{1O(logk/k),1O(maxi|1u^i/ui|)}=1kur𝐄delimited-[]subscriptiSsubscriptui1𝑂𝑘𝑘1𝑂subscript𝑖1subscript^𝑢𝑖subscript𝑢𝑖superscriptsubscript1𝑘subscript𝑢subscript𝑟{{\bf E}{\left[\sum_{i\in S}u_{i}\right]}}\geq\max\{1-O(\log k/\sqrt{k}),1-O(% \max_{i}\left|1-\hat{u}_{i}/u_{i}\right|)\}\sum_{\ell=1}^{k}u_{r_{\ell}}bold_E [ ∑ start_POSTSUBSCRIPT roman_i ∈ roman_S end_POSTSUBSCRIPT roman_u start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ] ≥ roman_max { 1 - italic_O ( roman_log italic_k / square-root start_ARG italic_k end_ARG ) , 1 - italic_O ( roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | 1 - over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ) } ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT but without any fairness guarantees. We derive an algorithm k𝑘kitalic_k-Pegging that satisfies the following.

Theorem 5.

k𝑘kitalic_k-Pegging satisfies smoothness and fairness for k𝑘kitalic_k-secretary with C=4k𝐶4𝑘C=4kitalic_C = 4 italic_k and F=max{(1/3)k+5,1(+13)/k256}subscript𝐹superscript13𝑘5113𝑘256F_{\ell}=\max\left\{(1/3)^{k+5},\frac{1-(\ell+13)/k}{256}\right\}italic_F start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = roman_max { ( 1 / 3 ) start_POSTSUPERSCRIPT italic_k + 5 end_POSTSUPERSCRIPT , divide start_ARG 1 - ( roman_ℓ + 13 ) / italic_k end_ARG start_ARG 256 end_ARG } for all =1,,k1𝑘\ell=1,\ldots,kroman_ℓ = 1 , … , italic_k.

We note that the algorithm of Kleinberg (2005) for k𝑘kitalic_k-Secretary (without predictions) obtains in expectation at least

(15k)=1kur=1kur5kmaxiui,15𝑘superscriptsubscript1𝑘subscript𝑢subscript𝑟superscriptsubscript1𝑘subscript𝑢subscript𝑟5𝑘subscript𝑖subscript𝑢𝑖\left(1-\frac{5}{\sqrt{k}}\right)\sum_{\ell=1}^{k}u_{r_{\ell}}\geq\sum_{\ell=1% }^{k}u_{r_{\ell}}-5\sqrt{k}\max_{i}u_{i},( 1 - divide start_ARG 5 end_ARG start_ARG square-root start_ARG italic_k end_ARG end_ARG ) ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT - 5 square-root start_ARG italic_k end_ARG roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ,

which has a better asymptotic dependence on k𝑘kitalic_k than our smoothness constant C=4k𝐶4𝑘C=4kitalic_C = 4 italic_k if the prediction error is relatively large, i.e., ε()=ω(maxiui/k)𝜀𝜔subscript𝑖subscript𝑢𝑖𝑘\varepsilon(\mathcal{I})=\omega\left(\max_{i}u_{i}/\sqrt{k}\right)italic_ε ( caligraphic_I ) = italic_ω ( roman_max start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / square-root start_ARG italic_k end_ARG ). On the other hand, regarding our fairness guarantee, if one only considers our fairness desideratum for k𝑘kitalic_k-secretary, then a simple algorithm suffices to achieve F=1/4subscript𝐹14F_{\ell}=1/4italic_F start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = 1 / 4 for all =1,,k1𝑘\ell=1,\ldots,kroman_ℓ = 1 , … , italic_k, namely: reject all candidates i𝑖iitalic_i with ti<1/2subscript𝑡𝑖12t_{i}<1/2italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < 1 / 2; accept any candidate i𝑖iitalic_i with ti>1/2subscript𝑡𝑖12t_{i}>1/2italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 1 / 2 whose true value uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is among the k𝑘kitalic_k highest true values observed so far, space permitting.

For any of the top k𝑘kitalic_k candidates, i.e., candidate rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT with [k]delimited-[]𝑘\ell\in[k]roman_ℓ ∈ [ italic_k ], their value ursubscript𝑢subscript𝑟u_{r_{\ell}}italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT is always greater than the threshold τ𝜏\tauitalic_τ, which our algorithm recomputes upon the arrival of each candidate. Consequently, candidate rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is added to our solution if the following conditions are satisfied: (1) tr>1/2subscript𝑡subscript𝑟12t_{r_{\ell}}>1/2italic_t start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT > 1 / 2; and (2) there is space available in the solution when rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT arrives. For condition (2) to hold, it suffices that at least k𝑘kitalic_k of the 2k12𝑘12k-12 italic_k - 1 candidates with the highest values other than rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT arrive before time 1/2121/21 / 2. This ensures that at most k1𝑘1k-1italic_k - 1 candidates other than rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT can be accepted after time 1/2121/21 / 2. The probability of both conditions (1) and (2) being satisfied is at least P[tr>1/2Binom(2k1,12)]=P[tr>1/2]P[Binom(2k1,12)k]=1212=14𝑃delimited-[]subscripttsubscriptr12Binom2k112𝑃delimited-[]subscripttsubscriptr12𝑃delimited-[]Binom2k112k121214{P{\left[t_{r_{\ell}}>1/2\land\mathrm{Binom}(2k-1,\frac{1}{2})\right]}}={P{% \left[t_{r_{\ell}}>1/2\right]}}\cdot{P{\left[\mathrm{Binom}(2k-1,\frac{1}{2})% \geq k\right]}}=\frac{1}{2}\cdot\frac{1}{2}=\frac{1}{4}italic_P [ roman_t start_POSTSUBSCRIPT roman_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT > 1 / 2 ∧ roman_Binom ( 2 roman_k - 1 , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) ] = italic_P [ roman_t start_POSTSUBSCRIPT roman_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT > 1 / 2 ] ⋅ italic_P [ roman_Binom ( 2 roman_k - 1 , divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) ≥ roman_k ] = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ⋅ divide start_ARG 1 end_ARG start_ARG 2 end_ARG = divide start_ARG 1 end_ARG start_ARG 4 end_ARG, establishing that F=14subscript𝐹14F_{\ell}=\frac{1}{4}italic_F start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 4 end_ARG for all =1,,k1𝑘\ell=1,\ldots,kroman_ℓ = 1 , … , italic_k.

Assuming k𝑘kitalic_k is a constant, C𝐶Citalic_C and F1,,Fksubscript𝐹1subscript𝐹𝑘F_{1},\ldots,F_{k}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in Theorem 5 are constants that do not depend on n𝑛nitalic_n or the instance \mathcal{I}caligraphic_I. For large values of k𝑘kitalic_k the first term in Fsubscript𝐹F_{\ell}italic_F start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is exponentially decaying, but the second term still guarantees candidate rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT a probability of acceptance that is independent of k𝑘kitalic_k as long as (+13)/k13𝑘(\ell+13)/k( roman_ℓ + 13 ) / italic_k is bounded away from 1. More precisely, for k52𝑘52k\geq 52italic_k ≥ 52 and lk/2𝑙𝑘2l\leq k/2italic_l ≤ italic_k / 2 we have that candidate rsubscript𝑟r_{\ell}italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT is accepted with probability at least 1102411024\frac{1}{1024}divide start_ARG 1 end_ARG start_ARG 1024 end_ARG, i.e., every candidate among the top k/2𝑘2k/2italic_k / 2 is accepted with probability at least 1102411024\frac{1}{1024}divide start_ARG 1 end_ARG start_ARG 1024 end_ARG, thus 𝐄[iSui]11024=1k/2ur12048=1kur𝐄delimited-[]subscriptiSsubscriptui11024superscriptsubscript1𝑘2subscript𝑢subscript𝑟12048superscriptsubscript1𝑘subscript𝑢subscript𝑟{{\bf E}{\left[\sum_{i\in S}u_{i}\right]}}\geq\frac{1}{1024}\sum_{\ell=1}^{k/2% }u_{r_{\ell}}\geq\frac{1}{2048}\sum_{\ell=1}^{k}u_{r_{\ell}}bold_E [ ∑ start_POSTSUBSCRIPT roman_i ∈ roman_S end_POSTSUBSCRIPT roman_u start_POSTSUBSCRIPT roman_i end_POSTSUBSCRIPT ] ≥ divide start_ARG 1 end_ARG start_ARG 1024 end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k / 2 end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ divide start_ARG 1 end_ARG start_ARG 2048 end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_POSTSUBSCRIPT. This implies a multiplicative guarantee on total value that does not depend on k𝑘kitalic_k when k𝑘kitalic_k is large.

The algorithm. While we defer the proof of Theorem 5 to Appendix B, we present the intuition and the main technical difficulties in the design of k𝑘kitalic_k-Pegging. The algorithm maintains in an online manner the following sets: (1) the solution set S𝑆Sitalic_S which contains all the candidates that have already been accepted; (2) a set H𝐻Hitalic_H that we call the “Hopefuls” and contains the k|S|𝑘𝑆k-|S|italic_k - | italic_S | future candidates with highest predicted values; (3) a set B𝐵Bitalic_B that we call the blaming set, which contains a subset of already arrived candidates that pegged a future candidate; and (4) the set P𝑃Pitalic_P of pegged elements which contains all candidates that have been pegged by a candidate in B𝐵Bitalic_B. In addition, we use function pegpeg\mathrm{peg}roman_peg to store the “pegging responsibility”, i.e., if peg(i)=jpeg𝑖𝑗\mathrm{peg}\left(i\right)=jroman_peg ( italic_i ) = italic_j, for some candidates i,j𝑖𝑗i,jitalic_i , italic_j where i𝑖iitalic_i had one of the k𝑘kitalic_k highest predicted values, then i𝑖iitalic_i was not accepted at the time of its arrival and pegged j𝑗jitalic_j. We use peg1(j)=isuperscriptpeg1𝑗𝑖\mathrm{peg^{-1}}\left(j\right)=iroman_peg start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_j ) = italic_i to denote that j𝑗jitalic_j was pegged by i𝑖iitalic_i.

Algorithm 2 k𝑘kitalic_k-Pegging
//* The algorithm stops when it accepts k𝑘kitalic_k candidates, i.e., when |S|=k𝑆𝑘\lvert S\rvert=k| italic_S | = italic_k. *//
Initialization: H[k],S,P,Bformulae-sequence𝐻delimited-[]𝑘formulae-sequence𝑆formulae-sequence𝑃𝐵H\leftarrow[k],S\leftarrow\emptyset,P\leftarrow\emptyset,B\leftarrow\emptysetitalic_H ← [ italic_k ] , italic_S ← ∅ , italic_P ← ∅ , italic_B ← ∅
while agent i𝑖iitalic_i arrives at time tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
     if iP𝑖𝑃i\in Pitalic_i ∈ italic_P then \triangleright Case 1
         Add i𝑖iitalic_i to S𝑆Sitalic_S, remove i𝑖iitalic_i from P𝑃Pitalic_P, and remove peg1(i)superscriptpeg1𝑖\mathrm{peg^{-1}}\left(i\right)roman_peg start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_i ) from B𝐵Bitalic_B      
     τkth highest true value strictly before ti𝜏superscript𝑘𝑡 highest true value strictly before subscript𝑡𝑖\tau\leftarrow k^{th}\text{ highest true value strictly before }t_{i}italic_τ ← italic_k start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT highest true value strictly before italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
     (ui>τ)(ti>1/2),𝒞(iH)formulae-sequencesubscript𝑢𝑖𝜏subscript𝑡𝑖12𝒞𝑖𝐻\mathcal{F}\leftarrow\left(u_{i}>\tau\right)\land\left(t_{i}>1/2\right),% \mathcal{C}\leftarrow\left(i\in H\right)caligraphic_F ← ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > italic_τ ) ∧ ( italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 1 / 2 ) , caligraphic_C ← ( italic_i ∈ italic_H ), εtimaxj:tjti|u^juj|subscript𝜀subscript𝑡𝑖subscript:𝑗subscript𝑡𝑗subscript𝑡𝑖subscript^𝑢𝑗subscript𝑢𝑗\varepsilon_{t_{i}}\leftarrow\max_{j:t_{j}\leq t_{i}}\lvert\hat{u}_{j}-u_{j}\rvertitalic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ← roman_max start_POSTSUBSCRIPT italic_j : italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT | over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |
     if 𝒞𝒞\mathcal{C}\land\mathcal{F}caligraphic_C ∧ caligraphic_F then \triangleright Case 2
         Add i𝑖iitalic_i to S𝑆Sitalic_S and remove i𝑖iitalic_i from H𝐻Hitalic_H
     else if 𝒞¯𝒞¯\mathcal{C}\land\overline{\mathcal{F}}caligraphic_C ∧ over¯ start_ARG caligraphic_F end_ARG then \triangleright Case 3
         if {ji:ui<u^j+εti}(P[k])=conditional-setsucceeds𝑗𝑖subscript𝑢𝑖subscript^𝑢𝑗subscript𝜀subscript𝑡𝑖𝑃delimited-[]𝑘\{j\succ i:u_{i}<\hat{u}_{j}+\varepsilon_{t_{i}}\}\setminus(P\cup[k])=\emptyset{ italic_j ≻ italic_i : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT } ∖ ( italic_P ∪ [ italic_k ] ) = ∅ then \triangleright subcase a
              Add i𝑖iitalic_i to S𝑆Sitalic_S and remove i𝑖iitalic_i from H𝐻Hitalic_H
         else\triangleright subcase b
              jAn arbitrary candidate from {ji:ui<u^j+εti}(P[k])superscript𝑗An arbitrary candidate from conditional-setsucceeds𝑗𝑖subscript𝑢𝑖subscript^𝑢𝑗subscript𝜀subscript𝑡𝑖𝑃delimited-[]𝑘j^{\prime}\leftarrow\text{An arbitrary candidate from }\{j\succ i:u_{i}<\hat{u% }_{j}+\varepsilon_{t_{i}}\}\setminus(P\cup[k])italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ← An arbitrary candidate from { italic_j ≻ italic_i : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT } ∖ ( italic_P ∪ [ italic_k ] )
              Add jsuperscript𝑗j^{\prime}italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT to P𝑃Pitalic_P, add i𝑖iitalic_i to B𝐵Bitalic_B, remove i𝑖iitalic_i from H𝐻Hitalic_H, and set peg(i)=jpeg𝑖superscript𝑗\mathrm{peg}\left(i\right)=j^{\prime}roman_peg ( italic_i ) = italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT          
     else if 𝒞¯¯𝒞\overline{\mathcal{C}}\land\mathcal{F}over¯ start_ARG caligraphic_C end_ARG ∧ caligraphic_F then \triangleright Case 4
         if {jB:ui>uj}conditional-set𝑗𝐵subscript𝑢𝑖subscript𝑢𝑗\{j\in B:u_{i}>u_{j}\}\neq\emptyset{ italic_j ∈ italic_B : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } ≠ ∅ then \triangleright subcase a
              jAn arbitrary candidate from {jB:ui>uj}superscript𝑗An arbitrary candidate from conditional-set𝑗𝐵subscript𝑢𝑖subscript𝑢𝑗j^{\prime}\leftarrow\text{An arbitrary candidate from }\{j\in B:u_{i}>u_{j}\}italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ← An arbitrary candidate from { italic_j ∈ italic_B : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT }
              Add i𝑖iitalic_i to S𝑆Sitalic_S, remove jsuperscript𝑗j^{\prime}italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT from B𝐵Bitalic_B, and remove peg(j)pegsuperscript𝑗\mathrm{peg}\left(j^{\prime}\right)roman_peg ( italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) from P𝑃Pitalic_P
         else if {jH:ui>u^jεti}conditional-set𝑗𝐻subscript𝑢𝑖subscript^𝑢𝑗subscript𝜀subscript𝑡𝑖\{j\in H:u_{i}>\hat{u}_{j}-\varepsilon_{t_{i}}\}\neq\emptyset{ italic_j ∈ italic_H : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT } ≠ ∅ then \triangleright subcase b
              jAn arbitrary candidate from {jH:ui>u^jεti}superscript𝑗An arbitrary candidate from conditional-set𝑗𝐻subscript𝑢𝑖subscript^𝑢𝑗subscript𝜀subscript𝑡𝑖j^{\prime}\leftarrow\text{An arbitrary candidate from }\{j\in H:u_{i}>\hat{u}_% {j}-\varepsilon_{t_{i}}\}italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ← An arbitrary candidate from { italic_j ∈ italic_H : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT }
              Add i𝑖iitalic_i to S𝑆Sitalic_S and remove jsuperscript𝑗j^{\prime}italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT from H𝐻Hitalic_H               

To satisfy the fairness property, we check if the current candidate i𝑖iitalic_i has arrived at time ti>1/2subscript𝑡𝑖12t_{i}>1/2italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 1 / 2 and if uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is larger than the kthsuperscript𝑘𝑡k^{th}italic_k start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT highest value seen so far. We refer to these two conditions as the fairness conditions. If iP𝑖𝑃i\in Pitalic_i ∈ italic_P (case 1) or iH𝑖𝐻i\in Hitalic_i ∈ italic_H and the fairness conditions hold (case 2), then we accept i𝑖iitalic_i. If the fairness conditions hold but iH𝑖𝐻i\not\in Hitalic_i ∉ italic_H then we accept if there is a past candidate in B𝐵Bitalic_B with lower true value than uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (subcase 4a), or a future candidate in H𝐻Hitalic_H with low predicted value compared to uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (subcase 4b). The main technical challenge in generalizing the pegging idea to k>1𝑘1k>1italic_k > 1 arises when a candidate iH𝑖𝐻i\in Hitalic_i ∈ italic_H arrives, but the fairness conditions do not hold (case 3). In this situation, it is unclear whether to reject i𝑖iitalic_i and peg a future candidate, or accept i𝑖iitalic_i. For instance, consider a scenario where the prediction error is consistently large (i.e., εtisubscript𝜀subscript𝑡𝑖\varepsilon_{t_{i}}italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT is always large), such that when i𝑖iitalic_i arrives, the set {ji:ui<u^j+εti}(P[k])conditional-setsucceeds𝑗𝑖subscript𝑢𝑖subscript^𝑢𝑗subscript𝜀subscript𝑡𝑖𝑃delimited-[]𝑘\{j\succ i:u_{i}<\hat{u}_{j}+\varepsilon_{t_{i}}\}\setminus(P\cup[k]){ italic_j ≻ italic_i : italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT } ∖ ( italic_P ∪ [ italic_k ] ) is always non-empty. If ti<1/2subscript𝑡𝑖12t_{i}<1/2italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < 1 / 2 and we accept i𝑖iitalic_i, we risk depleting our budget too quickly before time 1/2121/21 / 2, leaving insufficient capacity to accept candidates not in [k]delimited-[]𝑘[k][ italic_k ] who arrive later. Conversely, if we reject i𝑖iitalic_i, we deny it the possibility of acceptance in the first half of the time horizon, potentially reducing its overall acceptance probability. k𝑘kitalic_k-Pegging balances this tradeoff while achieving smoothness. To establish smoothness, we demonstrate that the k𝑘kitalic_k candidates with the highest predicted values can be mapped to the solution set S𝑆Sitalic_S, ensuring that the true values within our solution set are pairwise “close” to the values of candidates in {1,2,,k}12𝑘\{1,2,\dots,k\}{ 1 , 2 , … , italic_k }. This is proven in Lemma 8 by constructing an injective function m()m\mathrm{m(\cdot)}roman_m ( ⋅ ) from set S𝑆Sitalic_S to {1,2,,k}12𝑘\{1,2,\dots,k\}{ 1 , 2 , … , italic_k } such that for each jS𝑗𝑆j\in Sitalic_j ∈ italic_S, ujum(j)subscript𝑢𝑗subscript𝑢m𝑗u_{j}\approx u_{\mathrm{m}(j)}italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≈ italic_u start_POSTSUBSCRIPT roman_m ( italic_j ) end_POSTSUBSCRIPT.

5 Experiments

We simulate our Additive-Pegging and Multiplicative-Pegging algorithms in the exact experimental setup of Fujii and Yoshida (2023), to test its average-case performance.

Experimental Setup. Fujii and Yoshida (2023) generate various types of instances. We follow their Almost-constant, Uniform, and Adversarial types of instances, and also create the Unfair type of instance to further highlight how slightly biased predictions can lead to very unfair outcomes. Both true and predicted values of candidates in all these instance types are parameterized by a scalar ε[0,1)𝜀01\varepsilon\in[0,1)italic_ε ∈ [ 0 , 1 ) which controls the prediction error. Setting ε=0𝜀0\varepsilon=0italic_ε = 0 creates instances with perfect predictions and setting a higher value of ε𝜀\varepsilonitalic_ε creates instances with more erroneous predictions. Almost-constant models a situation where one candidate has a true value of 1/(1ε)11𝜀1/(1-\varepsilon)1 / ( 1 - italic_ε ) and the rest of the candidates have a value of 1111. All predictions are set to 1111. In Uniform, we sample each uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT independently from the exponential distribution with parameter 1111. The exponential distribution generates a large value with a small probability and consequently models a situation where one candidate is significantly better than the rest. All predicted values are generated by perturbing the actual value with the uniform distribution, i.e., u^i=δiuisubscript^𝑢𝑖subscript𝛿𝑖subscript𝑢𝑖\hat{u}_{i}=\delta_{i}\cdot u_{i}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where δisubscript𝛿𝑖\delta_{i}italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is sampled uniformly and independently from [1ε,1+ε]1𝜀1𝜀[1-\varepsilon,1+\varepsilon][ 1 - italic_ε , 1 + italic_ε ]. In Adversarial, the true values are again independent samples from the exponential distribution with parameter 1111. The predictions are “adversarially” perturbed while maintaining the error to be at most ε𝜀\varepsilonitalic_ε in the following manner: if i𝑖iitalic_i belongs to the top half of candidates in terms of true value, then u^i=(1ε)uisubscript^𝑢𝑖1𝜀subscript𝑢𝑖\hat{u}_{i}=(1-\varepsilon)\cdot u_{i}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( 1 - italic_ε ) ⋅ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT; if i𝑖iitalic_i belongs to the bottom half, then u^i=(1+ε)uisubscript^𝑢𝑖1𝜀subscript𝑢𝑖\hat{u}_{i}=(1+\varepsilon)\cdot u_{i}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( 1 + italic_ε ) ⋅ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Finally, in Unfair all candidates have values that are at most a (1+ε)1𝜀(1+\varepsilon)( 1 + italic_ε ) multiplicative factor apart. Formally, uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a uniform value in [1ε/4,1+ε/4]1𝜀41𝜀4[1-\varepsilon/4,1+\varepsilon/4][ 1 - italic_ε / 4 , 1 + italic_ε / 4 ], and since (1+ε/4)/(1ε/4)(1+ε)1𝜀41𝜀41𝜀(1+\varepsilon/4)/(1-\varepsilon/4)\leq(1+\varepsilon)( 1 + italic_ε / 4 ) / ( 1 - italic_ε / 4 ) ≤ ( 1 + italic_ε ) we have that the smallest and largest value are indeed very close. We set u^i=unr(i)+1subscript^𝑢𝑖subscript𝑢𝑛𝑟𝑖1\hat{u}_{i}=u_{n-r(i)+1}over^ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_u start_POSTSUBSCRIPT italic_n - italic_r ( italic_i ) + 1 end_POSTSUBSCRIPT where r(i)𝑟𝑖r(i)italic_r ( italic_i ) is the rank of uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i.e., predictions create a completely inverted order.

We compare Additive-Pegging and Multiplicative-Pegging against Learned-Dynkin Fujii and Yoshida (2023), Highest-prediction which always accepts the candidate with the highest prediction, and the classical Dynkin algorithm which does not use the predictions. Following Fujii and Yoshida (2023), we set the number of candidates to be n=100𝑛100n=100italic_n = 100. We experiment with all values of ε𝜀\varepsilonitalic_ε in {0,1/20,2/20,,19/20}01202201920\{0,1/20,2/20,\dots,19/20\}{ 0 , 1 / 20 , 2 / 20 , … , 19 / 20 }. For each type of instance and value of ε𝜀\varepsilonitalic_ε in this set, we randomly generate 10000 instances, and then run each algorithm on each instance. For each algorithm, we consider instance-wise the ratio of the true value it accepted to the maximum true value, calling the average of this ratio across the 10000 instances its competitive ratio. For each algorithm, we consider the fraction of the 10000 instances on which it successfully accepted the candidate with the highest true value, calling this fraction its fairness. We report the competitive ratio and fairness of each algorithm, for each type of instance and each value of ε𝜀\varepsilonitalic_ε, in Figure 1. Our code is written in Python 3.11.5 and we conduct experiments on an M3 Pro CPU with 18 GB of RAM. The total runtime is less than 5555 minutes.

Results. The results are summarized in figure 1. Since Additive-Pegging and Multiplicative-Pegging achieve almost the same competitive ratio and fairness for all instance types and values of ε𝜀\varepsilonitalic_ε we only present Additive-Pegging in figure 1 but include the code of both in the supplementary material. Our algorithms are consistently either the best or close to the best in terms of both competitive ratio and fairness for all different pairs of instance types and ε𝜀\varepsilonitalic_ε values. Before discussing the results of each instance type individually it is instructive to mention some characteristics of our benchmarks. While Dynkin does not use predictions and is therefore bound to suboptimal competitive ratios when predictions are accurate, we note that it accepts the maximum value candidate with probability at least 1/e1𝑒1/e1 / italic_e, i.e., it is 1/e1𝑒1/e1 / italic_e-fair. When predictions are non-informative this is an upper bound on the attainable fairness for any algorithm whether it uses predictions or not. Highest-prediction is expected to perform well when the highest prediction matches the true highest value candidate and poorly when the latter is not true. In Almost-constant for small values of ε𝜀\varepsilonitalic_ε all candidates have very close true values and all algorithms except Dynkin have a competitive ratio close to 1111. Dynkin may not accept any candidate and this is why its performance is poorer than the rest of the algorithms. Note that as ε𝜀\varepsilonitalic_ε increases both our algorithms perform significantly better than all other benchmarks.

In terms of fairness, predictions do not offer any information regarding the ordinal comparison between candidates’ true values and this is why for small values of ε𝜀\varepsilonitalic_ε the probability of Highest-prediction and Learned-Dynkin of accepting the best candidate is close to 1/100=1/n11001𝑛1/100=1/n1 / 100 = 1 / italic_n, i.e., random. Here, the fairness of our algorithms and Dynkin is similar and close to 1/e1𝑒1/e1 / italic_e. In both Uniform and Adversarial we observe that for small values of ε𝜀\varepsilonitalic_ε the highest predicted candidate is the true highest and Additive-Pegging, Learned-Dynkin and Highest-prediction all accept that candidate having a very close performance both in terms of fairness and competitive ratio. For higher values of ε𝜀\varepsilonitalic_ε the fairness of those algorithms deteriorates similarly and it approaches again 0.371/esimilar-to-or-equals0.371𝑒0.37\simeq 1/e0.37 ≃ 1 / italic_e. In Unfair our algorithms outperform all other benchmarks in terms of competitive ratio for all values of ε𝜀\varepsilonitalic_ε and achieve a close to optimal fairness. This is expected as our algorithms are particularly suited for cases where predictions may be accurate but unfair.

Refer to caption
Figure 1: Competitive ratio and fairness of different algorithms, for each instance type and level of ε𝜀\operatorname{\varepsilon}italic_ε.

Overall, our algorithms are the best-performing and most robust. The Highest-prediction algorithm does perform slightly better on Uniform instances and Adversarial instances under most values of ε𝜀\varepsilonitalic_ε, but performs consistently worse on Almost-constant and Unfair instances, especially in terms of fairness. Our algorithms perform better than Learned-Dynkin in almost all situations.

6 Limitations and Future Work

We study a notion of fairness that is tailored to the secretary problem with predictions and build our algorithms based on this notion. However, there are alternative notions of fairness one could consider in applications such as hiring, as well as variations of the secretary problem that capture other features in these applications. While our model allows for arbitrary bias in the predictions we assume that the true value of a candidate is fully discovered upon arrival, and define fairness based on hiring the best candidate (who has the highest true value) with a reasonable probability. Thus, we ignore considerations such as bias in how we get the true value of a candidate (e.g., via an interview process). In addition, as noted in Section 1, we use an individual fairness notion which does not model other natural desiderata like hiring from underprivileged populations or balance the hiring probabilities across different populations. These are considerations with potentially high societal impact which our algorithms do not consider and are interesting directions for future work on fair selection with predictions.

Regarding trade-offs in our guarantees: for the single-secretary problem, we can improve the fairness guarantee from 1/161161/161 / 16 to 0.0740.0740.0740.074 by optimizing the constants in our algorithm. However, we choose not to do so, as the performance increase is marginal, and we aim to keep the proof as simple as possible. Additionally, as we noted in Section 1.1 any constant C𝐶Citalic_C for smoothness implies an upper bound of F=1/3+o(1)𝐹13𝑜1F=1/3+o(1)italic_F = 1 / 3 + italic_o ( 1 ) for fairness. Finding the Pareto-optimal curve in terms of smoothness and fairness is an interesting direction. The main challenge in achieving a smooth trade-off between fairness and smoothness is as follows: any bound on C𝐶Citalic_C for smoothness implies a competitive ratio of 1Cϵ1𝐶italic-ϵ1-C\epsilon1 - italic_C italic_ϵ, which reaches a ratio of 1 when the predictions are exactly correct. Thus, regardless of the smoothness guarantee, we must achieve a competitive ratio of 1 when predictions are fully accurate. This constraint makes it challenging to improve the fairness guarantee F𝐹Fitalic_F, even at the cost of a less favorable smoothness constant C𝐶Citalic_C.

References

  • Angwin et al. [2016] Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. Machine bias: There’s software used across the country to predict future criminals. and it’s biased against blacks. ProPublica, May 23 2016. URL https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing.
  • Antoniadis et al. [2020a] Antonios Antoniadis, Christian Coester, Marek Elias, Adam Polak, and Bertrand Simon. Online metric algorithms with untrusted predictions. In Proceedings of the 37th International Conference on Machine Learning (ICML), pages 345–355, 2020a.
  • Antoniadis et al. [2020b] Antonios Antoniadis, Themis Gouleakis, Pieter Kleer, and Pavel Kolev. Secretary and online matching problems with machine learned advice. In Proceedings of the 33rd Annual Conference on Neural Information Processing Systems (NeurIPS), pages 7933–7944, 2020b.
  • Arsenis and Kleinberg [2022] Makis Arsenis and Robert Kleinberg. Individual fairness in prophet inequalities. In Proceedings of the 23rd ACM Conference on Economics and Computation (EC), page 245, 2022.
  • Azar et al. [2018] Yossi Azar, Ashish Chiplunkar, and Haim Kaplan. Prophet secretary: Surpassing the 11/e11𝑒1-1/e1 - 1 / italic_e barrier. In Proceedings of the 19th ACM Conference on Economics and Computation (EC), pages 303–318, 2018.
  • Babaioff et al. [2007a] Moshe Babaioff, Nicole Immorlica, David Kempe, and Robert Kleinberg. A knapsack secretary problem with applications. In Proceedings of the 10th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems (APPROX), pages 16–28, 2007a.
  • Babaioff et al. [2007b] Moshe Babaioff, Nicole Immorlica, and Robert Kleinberg. Matroids, secretary problems, and online mechanisms. In Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), page 434–443, 2007b.
  • Balseiro et al. [2021] Santiago Balseiro, Haihao Lu, and Vahab Mirrokni. Regularized online allocation problems: Fairness and beyond. In Proceedings of the 38th International Conference on Machine Learning (ICML), pages 630–639, 2021.
  • Bertsimas et al. [2011] Dimitris Bertsimas, Vivek F. Farias, and Nikolaos Trichakis. The price of fairness. Operations Research, 59(1):17–31, 2011.
  • Bertsimas et al. [2012] Dimitris Bertsimas, Vivek F. Farias, and Nikolaos Trichakis. On the efficiency-fairness trade-off. Management Science, 12:2234–2250, 2012.
  • Bogen and Rieke [2018] Miranda Bogen and Aaron Rieke. Help wanted: an examination of hiring algorithms. Equity, and Bias, Upturn (December 2018), 2018.
  • Celis et al. [2018] L. Elisa Celis, Damian Straszak, and Nisheeth K. Vishnoi. Ranking with fairness constraints. In Proceedings of the 45th International Colloquium on Automata, Languages and Programming (ICALP), 2018.
  • Chierichetti et al. [2019] Flavio Chierichetti, Ravi Kumar, Silvio Lattanzi, and Sergei Vassilvtiskii. Matroids, matchings, and fairness. In Proceedings of the 22ndInternational Conference on Artificial Intelligence and Statistics (AISTATS), pages 2212–2220, 2019.
  • Choo and Ling [2024] Davin Choo and Chun Kai Ling. A short note about the learning-augmented secretary problem. arXiv preprint arXiv:2410.06583, 2024.
  • Chouldechova [2017] Alexandra Chouldechova. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2):153–163, June 2017.
  • Chouldechova et al. [2018] Alexandra Chouldechova, Diana Benavides-Prado, Oleksandr Fialko, and Rhema Vaithianathan. A case study of algorithm-assisted decision making in child maltreatment hotline screening decisions. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (FAccT), pages 134–148, 2018.
  • Cohen et al. [2019] Lee Cohen, Zachary C. Lipton, and Yishay Mansour. Efficient candidate screening under multiple tests and implications for fairness. In Proceedings of the 1st Symposium on the foundations of responsible computing (FORC), page 1–20, 2019.
  • Cohen et al. [2022] Maxime C. Cohen, Adam N. Elmachtoub, and Xiao Lei. Price discrimination with fairness constraints. Management Science, 68(12):8536–8552, 2022.
  • COMPAS () [software] COMPAS (software). https://en.wikipedia.org/wiki/COMPAS_(software).
  • Corbett-Davies et al. [2017] Sam Corbett-Davies, Emma Pierson, Avi Feller, Sharad Goel, and Aziz Huq. Algorithmic decision making and the cost of fairness. In Proceedings of the 23rd International Conference on Knowledge Discovery and Data Mining (KDD), page 797–806, 2017.
  • Correa et al. [2021a] José R. Correa, Andrés Cristi, Laurent Feuilloley, Tim Oosterwijk, and Alexandros Tsigonias-Dimitriadis. The secretary problem with independent sampling. In Proceedings of the 32nd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2047–2058, 2021a.
  • Correa et al. [2021b] José R. Correa, Patricio Foncea, Ruben Hoeksma, Tim Oosterwijk, and Tjark Vredeveld. Posted price mechanisms and optimal threshold strategies for random arrivals. Mathematics of Operations Research, 46(4):1452–1478, 2021b.
  • Correa et al. [2021c] José R. Correa, Raimundo Saona, and Bruno Ziliotto. Prophet secretary through blind strategies. Mathematical Programming, 190(1):483–521, 2021c.
  • Dressel and Farid [2018] Julia Dressel and Hany Farid. The accuracy, fairness, and limits of predicting recidivism. Science Advances, 4(1), 2018.
  • Dütting et al. [2021] Paul Dütting, Silvio Lattanzi, Renato Paes Leme, and Sergei Vassilvitskii. Secretaries with advice. In Proceedings of the 22nd ACM Conference on Economics and Computation (EC), page 409–429, 2021.
  • Dwork et al. [2012] Cynthia Dwork, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Richard Zemel. Fairness through awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (ITCS), page 214–226, 2012.
  • Dynkin [1963] E. B. Dynkin. The optimum choice of the instant for stopping a markov process. Soviet Mathematics Doklady, 4:627–629, 1963.
  • Esfandiari et al. [2017] Hossein Esfandiari, MohammadTaghi Hajiaghayi, Vahid Liaghat, and Morteza Monemizadeh. Prophet secretary. SIAM Journal on Discrete Mathematics (SIDMA), 31(3):1685–1701, 2017.
  • Feldman et al. [2015] Michael Feldman, Sorelle A. Friedler, John Moeller, Carlos Scheidegger, and Suresh Venkatasubramanian. Certifying and removing disparate impact. In Proceedings of the 21th International Conference on Knowledge Discovery and Data Mining (KDD), page 259–268, 2015.
  • Fujii and Yoshida [2023] Kaito Fujii and Yuichi Yoshida. The secretary problem with predictions. In Mathematics of Operations Research, 2023.
  • Gardner [1960] Martin Gardner. Mathematical games. Scientific American, 202(3):172–186, 1960.
  • Gilbert and Mosteller [1966] John P. Gilbert and Frederick Mosteller. Recognizing the maximum of a sequence. Journal of the American Statistical Association, 61:35–73, 1966.
  • Grossman [2010] Lev Grossman. Are face-detection cameras racist? Time, 2010.
  • Hern [2016] Alex Hern. Artificial intelligence beauty contest doesn’t like black people. The Guardian, September 8 2016. URL https://www.theguardian.com/technology/2016/sep/08/artificial-intelligence-beauty-contest-doesnt-like-black-people.
  • Howard and Borenstein [2018] A Howard and J Borenstein. The ugly truth about ourselves and our robot creations: The problem of bias and social inequity. Science and Engineering Ethics, 24(5):1521–1536, October 2018.
  • Joseph et al. [2016] Matthew Joseph, Michael Kearns, Jamie H Morgenstern, and Aaron Roth. Fairness in learning: Classic and contextual bandits. In Proceedings of the 29th Annual Conference on Neural Information Processing Systems (NeurIPS), 2016.
  • Kamishima et al. [2011] Toshihiro Kamishima, Shotaro Akaho, and Jun Sakuma. Fairness-aware learning through regularization approach. In Proceedings of the 11thInternational Conference on Data Mining Workshops (ICDMW), pages 643–650, 2011.
  • Kamishima et al. [2012] Toshihiro Kamishima, Shotaro Akaho, Hideki Asoh, and Jun Sakuma. Fairness-aware classifier with prejudice remover regularizer. In Peter A. Flach, Tijl De Bie, and Nello Cristianini, editors, Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), volume 7524, 2012.
  • Kaplan et al. [2020] Haim Kaplan, David Naori, and Danny Raz. Competitive analysis with a sample and the secretary problem. In Proceedings of the 31st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2082–2095, 2020.
  • Khandani et al. [2010] Amir E. Khandani, Adlar J. Kim, and Andrew W. Lo. Consumer credit-risk models via machine-learning algorithms. Journal of Banking & Finance, 34(11):2767–2787, 2010. URL https://www.sciencedirect.com/science/article/pii/S0378426610002372.
  • Kleinberg and Raghavan [2017] Jon Kleinberg and Manish Raghavan. Inherent trade-offs in the fair determination of risk scores. In Proceedings of the 8th Innovations in Theoretical Computer Science Conference (ITCS), 2017.
  • Kleinberg et al. [2017] Jon Kleinberg, Himabindu Lakkaraju, Jure Leskovec, Jens Ludwig, and Sendhil Mullainathan. Human Decisions and Machine Predictions. The Quarterly Journal of Economics, 133(1):237–293, 2017.
  • Kleinberg [2005] Robert D. Kleinberg. A multiple-choice secretary algorithm with applications to online auctions. In Proceedings of the 16th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 630–631, 2005.
  • Lambrecht and Tucker [2019] Anja Lambrecht and Catherine Tucker. Algorithmic bias? an empirical study of apparent gender-based discrimination in the display of stem career ads. Management Science, 65(7):2966–2981, 2019.
  • [45] Alexander Lindermayr and Nicole Megow. Algorithms with predictions. https://algorithms-with-predictions.github.io/.
  • Lindley [1961] D. V. Lindley. Dynamic programming and decision theory. Journal of the Royal Statistical Society: Series C (Applied Statistics), 10(1):39–51, 1961.
  • Luong et al. [2011] Binh Thanh Luong, Salvatore Ruggieri, and Franco Turini. k-nn as an implementation of situation testing for discrimination discovery and prevention. In Proceedings of the 17th International Conference on Knowledge Discovery and Data Mining (KDD), page 502–510, 2011.
  • Lykouris and Vassilvitskii [2018] Thodoris Lykouris and Sergei Vassilvitskii. Competitive caching with machine learned advice. In Proceedings of the 35th International Conference on Machine Learning (ICML), pages 3302–3311, 2018.
  • Ma et al. [2022] Will Ma, Pan Xu, and Yifan Xu. Group-level fairness maximization in online bipartite matching. In Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, page 1687–1689, 2022.
  • Malhotra and Malhotra [2003] Rashmi Malhotra and D. K. Malhotra. Evaluating con