(Translated by https://www.hiragana.jp/)
1 INTRODUCTION

 

To Give or Not to Give? The Impacts of Strategically Withheld Recourse


 


Yatong Chen                        Andrew Estornell MPI for Intelligent Systems, Tübingen AI Center, Tübingen, Germany                        Bytedance Research

Yevgeniy Vorobeychik                        Yang Liu

Washington University in Saint Louis                        University of California, Santa Cruz

Abstract

Individuals often aim to reverse undesired outcomes in interactions with automated systems, like loan denials, by either implementing system-recommended actions (recourse), or manipulating their features. While providing recourse benefits users and enhances system utility, it also provides information about the decision process that can be used for more effective strategic manipulation, especially when the individuals collectively share such information with each other. We show that this tension leads rational utility-maximizing systems to frequently withhold recourse, resulting in decreased population utility, particularly impacting sensitive groups. To mitigate these effects, we explore the role of recourse subsidies, finding them effective in increasing the provision of recourse actions by rational systems, as well as lowering the potential social cost and mitigating unfairness caused by recourse withholding.

1 INTRODUCTION

When individuals interacting with automated systems are denied a desired outcome (e.g., loan approval), they may seek a means of reversing this decision to obtain the desired outcome. This procedure is commonly referred to as recourse (Ustun et al., 2019). In cases where the system’s decision rule is opaque (e.g., lending), the system itself is responsible for supplying individuals with recourse, i.e., a recommended feature modification that is feasible and will result in that individual being approved.

When the feature modification changes an agent’s true qualification rate (e.g., paying off debt increases one’s creditworthiness), providing recourse can benefit the system. However, offering recourse actions also exposes information about the system’s decision rule, as each action leads to a positively classified feature vector close to the decision boundary. This added transparency creates opportunities for strategic individuals to exploit the system’s decision rule by manipulating their features, especially when they share their knowledge about the decision rule with one another. For example, platforms like GradCafe for graduate school admissions and LendingClub for loan applications allow agents to see other applicants’ features. This enables them to potentially misreport their features to mimic those of others, thereby leveraging publicly available information to their advantage (Bechavod et al., 2022; Chen et al., 2020; Estornell et al., 2023b; Hardt et al., 2016; Vorobeychik, 2023). Such feature manipulation can often reduce both system and social utility since it will increase the false positive rate. This creates a tension in providing recourse, where the utility gained from increased qualifications must be balanced against the utility lost due to manipulation that exploits the counterfactual information in recourse recommendations The consequence of this tension is that in many settings, providing recourse to all, or even most, of the agents may be suboptimal from a system’s perspective. This sharply contrasts with the common assumption in the algorithmic recourse literature, which typically considers agents taking recourse actions without the possibility of manipulation.

On the other hand, we can consider subsidies as a means to incentivize systems to offer recourse actions. Subsidy (Hu et al., 2019), or government incentive, is a type of government expenditure to financially help individuals, households, and businesses in various settings. Consider the small business administration (SBA) microloan program 111https://www.hud.gov/program-offices/housing/fhahistory in the United States as a motivating example. This program provides small loans to startups and small businesses and offers technical assistance and financial training to help borrowers succeed. In this work, we model subsidies that lower each individual’s recourse costs, requiring the agent to pay only a fraction of the original amount. Compared with using penalty to disincentivize manipulation (Blocki et al., 2013) or using auditing to incentivize recourse taking (Estornell et al., 2023a), the main benefit of subsidies is that it requires no verification power from the system, reducing the potential harm caused by unintentionally impose large fines on truthful agents. We add a more detailed discussion in Section 2.

High-Level Overview of Our Model

There are two parties in our setting: a utility-maximizing recourse system and a set of agents. Each agent is represented by a feature vector 𝐱X𝐱X\mathbf{x}\in\pazocal{X}bold_x ∈ roman_X. The system trained a fixed, potentially opaque function f:X[0,1]:𝑓X01f:\pazocal{X}\rightarrow[0,1]italic_f : roman_X → [ 0 , 1 ] to decide who to provide a resource (e.g., loan) based on 𝐱𝐱\mathbf{x}bold_x. For negatively classified agents, the system decides whether to provide a recourse action or not. The central tension comes from the fact that agents can both (1) lie about their features and manipulate them to some publicly known positively classified features and (2) take the recommended recourse actions that change their true features. Only the latter leads to an increase in the system’s utility. The publicly known features come from either agents who are already classified positively, or agents who successfully obtain a recourse action from the system. The latter is more within the system’s control and could be an easier target for manipulation, as they are more likely to be closer to the decision boundary. Thus, the system’s main tool is to strategically withhold recourse actions from some agents to maximize their utility. Based on the relative cost of recourse and manipulation, agents choose to take the recommended action or manipulate known positively classified features. See Figure 1 for a demonstration of our modeling framework.

Main Results

We show that in many cases, the system is incentivized to strategically withhold recourse from most if not all, agents to prevent manipulation. To our knowledge, this is the first work to challenge the assumption that a utility-maximizing recourse system will naturally provide recourse without third-party intervention (e.g., government regulation). As fewer agents receive recourse, the social cost—the average cost to achieve positive classification—rises. Withholding recourse also limits legitimate paths to positive classification, pushing more individuals toward manipulation. This burden often falls disproportionately on disadvantaged groups, worsening existing inequalities. To address this, we explore recourse subsidies, a third-party payment that reduces recourse costs, and find them effective in increasing recourse providing, reducing social costs, and mitigating unfairness.

The details for reproducing our experimental results can be found at https://github.com/UCSC-REAL/Strategic-withheld-recourse.

2 RELATED WORKS

Our work is closely related to the literature on algorithmic recourse, strategic classification, and fairness in general. Due to the page limit, additional related work on fairness and social cost in strategic classification and recourse (Gupta et al., 2019; von Kügelgen et al., 2022; Ehyaei et al., 2023; Estornell et al., 2023b), transparency (Barsotti et al., 2022; Akyol et al., 2016) and others can be found in Appendix B.

Recourse

Much of the line of algorithmic recourse (Ustun et al., 2019; Venkatasubramanian & Alfano, 2020; Karimi et al., 2020a; Gupta et al., 2019; Karimi et al., 2020b; von Kügelgen et al., 2020; Chen et al., 2020; Harris et al., 2022) focuses on the setting where the requested recourse is guaranteed to be provided out of ethical consideration (Venkatasubramanian & Alfano, 2020). Our work is the first to challenge this fundamental assumption and argue that without a third-party’s intervention, a utility-maximizing algorithmic recourse system may be incentivized to withhold recourse from some agents to prevent manipulations strategically. We point the reader to Karimi et al. (2020a) for a more detailed discussion of the concepts and recent development of algorithmic recourse.

Strategic Classification

Strategic classification focuses on the problem of how to effectively make predictions in the presence of agents who behave strategically to obtain desirable outcomes (Hardt et al., 2016; Chen et al., 2018; Tsirtsis et al., 2019; Levanon & Rosenfeld, 2021; Dong et al., 2018; Chen et al., 2018; Zrnic et al., 2021). In this work, we use the standard game-theoretic Stackelberg model proposed in Hardt et al. (2016) to simulate the agent’s best response actions when choosing between recourse and manipulation. Our work considers the imitation-based manipulations: agents do not know the classifier f𝑓fitalic_f but are aware of a set of positively classified features and can misreport their feature by imitating another agent’s feature that is positively classified. Such copycat behavior has been well-known in the literature of game theory, the behavioral economy, and strategic classification, e.g., (Bechavod et al., 2022; Barsotti et al., 2022). While most of this line of work focuses on agents being strategic and could potentially modify their features to get a favorable prediction outcome, our work focuses on when the system is being strategic and potentially withholds recourse to the agents.

Subsidy, Penalty, and Auditing

Our work relates to interventions aimed at (dis)incentivizing strategic behaviors. Most relevant is Hu et al. (2019), who also studies strategic behavior using subsidies. Penalties for misreporting (Hardt et al., 2016; Blocki et al., 2013) offer another way to discourage manipulation, encouraging agents to pursue recourse instead. Both subsidies and penalties can be viewed as tools to shift the balance between the cost of recourse and manipulation — penalties raise the cost of manipulation, while subsidies lower the cost of recourse. Estornell et al. (2023a) explores auditing as an intervention to promote recourse, assuming universal recourse availability. The implementation of penalties requires verification power, such as in tax systems where cross-checking reported income deters misreporting. Subsidies, however, could be financed by third-party entities like governments or financial institutions. Incentivizing recourse through penalties is not ideal, as verification can lead to false positives, unfairly penalizing truthful agents. Audit-based systems (Estornell et al., 2023a) typically impose large fines, which can harm innocent agents if they are wrongly identified as manipulators. Subsidies avoid this issue. If the system controls audits and subsidies alone, it will prioritize its utility, which may not always align with the population’s best interests.

3 PRELIMINARIES

Let XdXsuperscriptd\pazocal{X}\subset\mathbb{R}^{d}roman_X ⊂ blackboard_R start_POSTSUPERSCRIPT roman_d end_POSTSUPERSCRIPT and Y{0,1}Y01\pazocal{Y}\equiv\{0,1\}roman_Y ≡ { 0 , 1 } be a domain of features and labels respectively. Let f:XY:𝑓XYf:\pazocal{X}\rightarrow\pazocal{Y}italic_f : roman_X → roman_Y be a fixed binary classifier. A population of agents with features X={x:xX}Xconditional-set𝑥𝑥X\textbf{X}=\{x:x\in\pazocal{X}\}X = { italic_x : italic_x ∈ roman_X } and labels Y={y:yY}𝑌conditional-set𝑦𝑦YY=\{y:y\in\pazocal{Y}\}italic_Y = { italic_y : italic_y ∈ roman_Y } are classified by f𝑓fitalic_f, which is unknown to the agents; all agents desired to be positively classified (e.g., all loan applicants desire approval). Denote the domain of negatively classified features as XdsubscriptXsuperscriptd\pazocal{X}_{-}\subseteq\mathbb{R}^{d}roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ⊆ blackboard_R start_POSTSUPERSCRIPT roman_d end_POSTSUPERSCRIPT and the domain of positively classified features as X+dsubscriptXsuperscriptd\pazocal{X}_{+}\in\mathbb{R}^{d}roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT roman_d end_POSTSUPERSCRIPT, i.e. f(𝐱)=0𝑓𝐱0f(\mathbf{x})=0italic_f ( bold_x ) = 0 for all 𝐱X𝐱subscriptX\mathbf{x}\in\pazocal{X}_{-}bold_x ∈ roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT and f(𝐱)=1𝑓𝐱1f(\mathbf{x})=1italic_f ( bold_x ) = 1 for all 𝐱X+.𝐱subscriptX\mathbf{x}\in\pazocal{X}_{+}.bold_x ∈ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT . All agents prefer positive classification over negative classification. Agents who have features 𝐱X𝐱subscriptX\mathbf{x}\in\pazocal{X}_{-}bold_x ∈ roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT have two means of obtaining positive classification in the next step, recourse and manipulation, which are defined next.

Recourse

Recourse provides agents who received undesirable outcomes with recommended actions to genuinely improve their outcome by modifying their attributes (Ustun et al., 2019). Let cR:X×X+:subscript𝑐𝑅XXsubscriptc_{R}:\pazocal{X}\times\pazocal{X}\rightarrow\mathbb{R}_{+}italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT : roman_X × roman_X → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT be the cost of recourse, i.e. an agent with true features 𝐱𝐱\mathbf{x}bold_x pays cost cR(𝐱,𝐱)subscript𝑐𝑅𝐱superscript𝐱c_{R}(\mathbf{x},\mathbf{x}^{\prime})italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) when modifying their features to be 𝐱superscript𝐱\mathbf{x}^{\prime}bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. An agent with true feature 𝐱X𝐱subscriptX\mathbf{x}\in\pazocal{X}_{-}bold_x ∈ roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT has an optimal recourse action 222Throughout the paper, we will interchangeably use the terms ’recourse action’ and ’recourse feature.’ They both refer to the feature vector that will be classified positively after the agent’s taking a particular recourse action. In other words, we assume that whenever an agent reveals their recourse action, it also reveals their original feature vector, which is equivalent to revealing the feature vector that corresponds to the vectorafter the agent performs recourse.,

𝐱R(𝐱)=argmin𝐱X+cR(𝐱,𝐱)subscript𝐱𝑅𝐱subscriptargminsuperscript𝐱subscriptXsubscript𝑐𝑅𝐱superscript𝐱\displaystyle\mathbf{x}_{R}(\mathbf{x})=\operatorname{argmin}_{\mathbf{x}^{% \prime}\in\pazocal{X}_{+}}c_{R}(\mathbf{x},\mathbf{x}^{\prime})bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) = roman_argmin start_POSTSUBSCRIPT bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) (1)
s.t.f(𝐱)=1,𝐱A(𝐱)formulae-sequences.t.𝑓superscript𝐱1superscript𝐱𝐴𝐱\displaystyle\text{s.t.}~{}f(\mathbf{x}^{\prime})=1,~{}~{}\mathbf{x}^{\prime}% \in A(\mathbf{x})s.t. italic_f ( bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = 1 , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_A ( bold_x )

where A(𝐱)𝐴𝐱A(\mathbf{x})italic_A ( bold_x ) represents the set of features an agent with true features 𝐱𝐱\mathbf{x}bold_x can feasibly obtain, i.e., the actionable recourse actions provided by the system. When agents perform recourse, both their true features and true qualification rate change, i.e., their true features become 𝐱R(𝐱)subscript𝐱𝑅𝐱\mathbf{x}_{R}(\mathbf{x})bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ), and their true qualification rate changes from Pr(y=1|𝐱)Pr𝑦conditional1𝐱\Pr(y=1|\mathbf{x})roman_Pr ( italic_y = 1 | bold_x ) to Pr(y=1|𝐱R(𝐱))Pr𝑦conditional1subscript𝐱𝑅𝐱\Pr(y=1|\mathbf{x}_{R}(\mathbf{x}))roman_Pr ( italic_y = 1 | bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) ).

Manipulation

In addition to recourse, agents can also perform manipulations. Following Barsotti et al. (2022), we focus on imitation-based manipulations: agents do not know the classifier f𝑓fitalic_f, but are aware of a set of publically revealed positively classified features ZX+ZsubscriptX\textbf{Z}\subseteq\pazocal{X}_{+}Z ⊆ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT (defined below) and can misreport their feature by imitating another agent’s feature that is positively classified and is publically revealed. For a manipulation cost function cM:X×X+:subscript𝑐𝑀XXsubscriptc_{M}:\pazocal{X}\times\pazocal{X}\rightarrow\mathbb{R}_{+}italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT : roman_X × roman_X → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT the optimal imitation-based manipulation for an agent with true feature 𝐱𝐱\mathbf{x}bold_x is

𝐱M(𝐱)=argmin𝐱ZcM(𝐱,𝐱)subscript𝐱𝑀𝐱subscriptsuperscript𝐱Zsubscript𝑐𝑀𝐱superscript𝐱\displaystyle\mathbf{x}_{M}(\mathbf{x})=\arg\min_{\mathbf{x}^{\prime}\in% \textbf{Z}}c_{M}(\mathbf{x},\mathbf{x}^{\prime})bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) = roman_arg roman_min start_POSTSUBSCRIPT bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ Z end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) (2)

Different from recourse, manipulation is simply a misreport rather than a change of one’s features, thus it does not change Pr[y=1|𝐱]Pr𝑦conditional1𝐱\Pr[y=1|\mathbf{x}]roman_Pr [ italic_y = 1 | bold_x ]. However, since the system only observes the reported features before classification, it does not know whether a report is truthful.

Feature Disclosure and Publicaly Revealed Set Z

We model the set of publicly revealed features ZX+ZsubscriptX\textbf{Z}\subseteq\pazocal{X}_{+}Z ⊆ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT resulting from agents sharing information with each other. In particular, Z consists of features that may come from two sets: 1) the revealed recourse actions recommended by the system (i.e., 𝐳XR𝐳subscriptX𝑅\mathbf{z}\in\textbf{X}_{R}bold_z ∈ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT where XR={𝐱R(𝐱),𝐱X}subscriptX𝑅subscript𝐱𝑅𝐱𝐱subscriptX\textbf{X}_{R}=\{\mathbf{x}_{R}(\mathbf{x}),\mathbf{x}\in\textbf{X}_{-}\}X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT = { bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) , bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT }), and 2) the set of initial positively classified features (i.e., 𝐳X+𝐳subscriptX\mathbf{z}\in\textbf{X}_{+}bold_z ∈ X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT). Each element is made public with a fixed probability p[0,1]𝑝01p\in[0,1]italic_p ∈ [ 0 , 1 ], and all publicly revealed elements make the reveal set Z. We represent the set of recourse actions that are actually revealed as ZR={𝐳XR:Reveal(𝐳)=1}subscriptZ𝑅conditional-set𝐳subscriptX𝑅Reveal𝐳1\textbf{Z}_{R}=\{\mathbf{z}\in\textbf{X}_{R}:\text{Reveal}(\mathbf{z})=1\}Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT = { bold_z ∈ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT : Reveal ( bold_z ) = 1 }. Here, Reveal(𝐳)Reveal𝐳\text{Reveal}(\mathbf{z})Reveal ( bold_z ) is a random indicator function that equals 1 with probability p𝑝pitalic_p (indicating that feature 𝐳𝐳\mathbf{z}bold_z is revealed) and 0 otherwise. Similarly, let Z+subscriptZ\textbf{Z}_{+}Z start_POSTSUBSCRIPT + end_POSTSUBSCRIPT represent the positively classified features that are actually revealed: Z+={𝐳X+:Reveal(𝐳)=1}subscriptZconditional-set𝐳subscriptXReveal𝐳1\textbf{Z}_{+}=\{\mathbf{z}\in\textbf{X}_{+}:\text{Reveal}(\mathbf{z})=1\}Z start_POSTSUBSCRIPT + end_POSTSUBSCRIPT = { bold_z ∈ X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT : Reveal ( bold_z ) = 1 }. As a result, Z=ZRZ+ZsubscriptZ𝑅subscriptZ\textbf{Z}=\textbf{Z}_{R}\cup\textbf{Z}_{+}Z = Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ∪ Z start_POSTSUBSCRIPT + end_POSTSUBSCRIPT.

This captures real-life scenarios where negatively classified agents collectively gather information about classifier f𝑓fitalic_f by observing positively classified peers or those who obtained recourse. Revealed recourse features are particularly crucial as they lie near the decision boundary, making them more likely targets for manipulation than general positive features.

4 INTERACTION BETWEEN AGENTS AND THE SYSTEM

Refer to caption
Figure 1: Demonstration of our modeling framework. Agents arrive simultaneously, and the system trains a classifier f:XY:𝑓XYf:\pazocal{X}\rightarrow\pazocal{Y}italic_f : roman_X → roman_Y for maximum prediction accuracy. Negatively classified agents request recourse, and the system selects agents for recourse provision to maximize utility (Equation 3). Positively classified agents and those provided recourse have a probability p[0,1]𝑝01p\in[0,1]italic_p ∈ [ 0 , 1 ] to reveal features, contributing to the publicly revealed set ZX+ZsubscriptX\textbf{Z}\subseteq\pazocal{X}_{+}Z ⊆ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT. Upon observing Z, agents execute final actions based on Equation 5.

Unlike the traditional recourse setting, where the system is expected to provide recourse to any individual upon request, without external regulation (e.g., government mandates requiring banks to offer recourse), a utility-maximizing system may have incentives to withhold recourse to prevent strategic manipulation by agents. In this section, we introduce our modeling framework to capture these dynamics.

A Motivating Example

A bank publishes a classifier to determine who qualifies for a credit card. Each applicant (with feature vector 𝐱𝐱\mathbf{x}bold_x) is approved if the bank’s model f𝑓fitalic_f predicts they can repay their loan. For applicants denied a card, the bank may offer recourse, i.e., a plan to improve their creditworthiness, such as paying off debt or increasing their income. These recourse actions are provided through specific programs, such as financial classes. Agents also have access to an online forum where some applicants share their approved loan or recourse features. With knowledge of both recourse actions and the forum, some agents may misreport their features to match positively classified ones in an attempt to gain approval without actually taking the recommended recourse actions. As a result, the bank may have an incentive to limit recourse to individuals whose features are harder to manipulate (e.g., features that are easier for the bank to verify).

We now formalize the dynamics between the recourse system and the agents.

System: The system trains a classifier f:XY:𝑓XYf:\pazocal{X}\rightarrow\pazocal{Y}italic_f : roman_X → roman_Y to maximize the prediction accuracy:

f=argmaxfxX𝟙[f(𝐱)=y]𝑓subscriptargmax𝑓subscript𝑥X1delimited-[]𝑓𝐱𝑦f=\operatorname*{arg\,max}_{f\in\cal{F}}\sum_{x\in\textbf{X}}\mathbbm{1}[f(% \mathbf{x})=y]italic_f = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_x ∈ X end_POSTSUBSCRIPT blackboard_1 [ italic_f ( bold_x ) = italic_y ]

A collection of negatively classified agents with features XXsubscriptXsubscriptX\textbf{X}_{-}\subseteq\pazocal{X}_{-}X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ⊆ roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT will request recourse actions from the system after receiving their prediction outcome. The system first computes optimal recourse actions for all negatively classified agents but only chooses to release a subset of those recourse actions ZRXRsubscriptZ𝑅subscriptX𝑅\textbf{Z}_{R}\subset\textbf{X}_{R}Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⊂ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT to the public to maximize its utility, i.e., TPFPTPFP\text{TP}-\text{FP}TP - FP:

maxZRXRsubscriptsubscriptZ𝑅subscriptX𝑅\displaystyle\max_{\textbf{Z}_{R}\subset\textbf{X}_{R}}roman_max start_POSTSUBSCRIPT Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⊂ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT TP(SS)FP(SS)system’s utilitysubscriptTPSSFPSSsystem’s utility\displaystyle~{}~{}~{}\underbrace{\textsf{TP}({\SS})-\textsf{FP}(\SS)}_{\text{% system's utility}}under⏟ start_ARG TP ( roman_SS ) - FP ( roman_SS ) end_ARG start_POSTSUBSCRIPT system’s utility end_POSTSUBSCRIPT (3)
s.t. SS={𝐳(𝐱,Z):𝐱X}agent’s reported features (Eq 5)subscriptSSconditional-set𝐳𝐱Z𝐱Xagent’s reported features (Eq 5)\displaystyle~{}~{}\underbrace{\SS=\{\mathbf{z}(\mathbf{x},\textbf{Z}):\mathbf% {x}\in\textbf{X}\}}_{\begin{subarray}{c}\text{agent's reported features (Eq % \ref{eqn:final-action})}\end{subarray}}under⏟ start_ARG roman_SS = { bold_z ( bold_x , Z ) : bold_x ∈ X } end_ARG start_POSTSUBSCRIPT start_ARG start_ROW start_CELL agent’s reported features (Eq ) end_CELL end_ROW end_ARG end_POSTSUBSCRIPT (4)
Z=ZRZ+all publicly revealed featuressubscriptZsubscriptZ𝑅subscriptZall publicly revealed features\displaystyle\underbrace{\textbf{Z}=\textbf{Z}_{R}\cup\textbf{Z}_{+}}_{\begin{% subarray}{c}\text{all publicly revealed features}\end{subarray}}under⏟ start_ARG Z = Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ∪ Z start_POSTSUBSCRIPT + end_POSTSUBSCRIPT end_ARG start_POSTSUBSCRIPT start_ARG start_ROW start_CELL all publicly revealed features end_CELL end_ROW end_ARG end_POSTSUBSCRIPT

Here, TP(SS)TPSS\textsf{TP}(\SS)TP ( roman_SS ) and FP(SS)FPSS\textsf{FP}(\SS)FP ( roman_SS ) are the true positive and false positive rates on the set of features after the agent’s final actions. We assume that the system either knows cRsubscript𝑐𝑅c_{R}italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT and cMsubscript𝑐𝑀c_{M}italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT, or can reasonably approximate these cost functions when optimizing their objective. Intuitively, this definition of system utility reflects a bank gaining a utility of 1111 for each repaid loan and 11-1- 1 for each defaulted loan.

Agents: Agents who are negatively classified will request a recourse action from the system. Upon seeing the publically revealed features Z defined in Section 3, agents who are provided with a recourse action adapt their features from 𝐱𝐱\mathbf{x}bold_x to 𝐳=𝐱M(𝐱)𝐳subscript𝐱𝑀𝐱\mathbf{z}=\mathbf{x}_{M}(\mathbf{x})bold_z = bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) or 𝐳=𝐱R(𝐱)𝐳subscript𝐱𝑅𝐱\mathbf{z}=\mathbf{x}_{R}(\mathbf{x})bold_z = bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) such that f(𝐳)=1𝑓𝐳1f(\mathbf{z})=1italic_f ( bold_z ) = 1, while minimizing the cost of the corresponding action. When both the recourse and manipulation actions are greater than 1111333The strategic agent’s utility for adapting their feature from x𝑥xitalic_x to xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is determined by the standard utility function in the literature of strategic classification (see, e.g., Hardt et al. (2016)), which is U(x,x)=f(x)c(x,x)𝑈𝑥superscript𝑥𝑓superscript𝑥𝑐𝑥superscript𝑥U(x,x^{\prime})=f(x^{\prime})-c(x,x^{\prime})italic_U ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_f ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - italic_c ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). Thus, when the cost of adaptation c(x,x)1𝑐𝑥superscript𝑥1c(x,x^{\prime})\geq 1italic_c ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≥ 1, the utility will be less than 0, in which case, the agent does nothing., the agents will choose to stay with their original features 𝐱𝐱\mathbf{x}bold_x, which corresponds to the do-nothing action. Agents who are not provided with a recourse action will choose to manipulate or do nothing. The final action for already positively classified agents is always the do-nothing action.

Agent’s best response: Denote ζ𝐱{0,1}subscript𝜁𝐱01\zeta_{\mathbf{x}}\in\{0,1\}italic_ζ start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT ∈ { 0 , 1 } as an indicator for whether agent 𝐱𝐱\mathbf{x}bold_x is provided with a recourse or not (i.e., ζ(𝐱)=1𝜁𝐱1\zeta(\mathbf{x})=1italic_ζ ( bold_x ) = 1 when provided with a recourse action). Then for all agents with f(𝐱)=0𝑓𝐱0f(\mathbf{x})=0italic_f ( bold_x ) = 0, their final action is:

𝐳(𝐱,Z)={𝐱R(𝐱)ζ𝐱=1 and cR(𝐱,𝐱R(𝐱))<min(1,cM(𝐱,𝐱)),𝐱Z𝐱M(𝐱)ζ𝐱=1 and cM(𝐱,𝐱M(𝐱))<min(1,cR(𝐱,𝐱)),𝐱Z, or ζ𝐱=0 and cM(𝐱,𝐱M(𝐱))<1𝐱ζ𝐱=1 and cR(𝐱,𝐱R(𝐱)),cM(𝐱,𝐱R(𝐱))1,𝐱Z, or ζ𝐱=0 and cM(𝐱,𝐱M(𝐱))1𝐳𝐱Zcasessubscript𝐱𝑅𝐱subscript𝜁𝐱1 and subscript𝑐𝑅𝐱subscript𝐱𝑅𝐱1subscript𝑐𝑀𝐱superscript𝐱otherwisefor-allsuperscript𝐱Zsubscript𝐱𝑀𝐱subscript𝜁𝐱1 and subscript𝑐𝑀𝐱subscript𝐱𝑀𝐱1subscript𝑐𝑅𝐱superscript𝐱otherwiseformulae-sequencefor-allsuperscript𝐱Z or subscript𝜁𝐱0 and subscript𝑐𝑀𝐱subscript𝐱𝑀𝐱1𝐱formulae-sequencesubscript𝜁𝐱1 and subscript𝑐𝑅𝐱subscript𝐱𝑅𝐱subscript𝑐𝑀𝐱subscript𝐱𝑅𝐱1otherwiseformulae-sequencefor-allsuperscript𝐱Z or subscript𝜁𝐱0 and subscript𝑐𝑀𝐱subscript𝐱𝑀𝐱1\displaystyle\mathbf{z}(\mathbf{x},\textbf{Z})=\begin{cases}\mathbf{x}_{R}(% \mathbf{x})&\zeta_{\mathbf{x}}=1\text{ and }c_{R}(\mathbf{x},\mathbf{x}_{R}(% \mathbf{x}))<\min(1,c_{M}(\mathbf{x},\mathbf{x}^{\prime})),\\ &\forall\mathbf{x}^{\prime}\in\textbf{Z}\\ \mathbf{x}_{M}(\mathbf{x})&\zeta_{\mathbf{x}}=1\text{ and }c_{M}(\mathbf{x},% \mathbf{x}_{M}(\mathbf{x}))<\min(1,c_{R}(\mathbf{x},\mathbf{x}^{\prime})),\\ &\forall\mathbf{x}^{\prime}\in\textbf{Z},\text{ or }\zeta_{\mathbf{x}}=0\text{% and }~{}c_{M}(\mathbf{x},\mathbf{x}_{M}(\mathbf{x}))<1\\ \mathbf{x}&\zeta_{\mathbf{x}}=1\text{ and }c_{R}(\mathbf{x},\mathbf{x}_{R}(% \mathbf{x})),c_{M}(\mathbf{x},\mathbf{x}_{R}(\mathbf{x}))\geq 1,\\ &\forall\mathbf{x}^{\prime}\in\textbf{Z},\text{ or }\zeta_{\mathbf{x}}=0\text{% and }c_{M}(\mathbf{x},\mathbf{x}_{M}(\mathbf{x}))\geq 1\end{cases}bold_z ( bold_x , Z ) = { start_ROW start_CELL bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) end_CELL start_CELL italic_ζ start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT = 1 and italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) ) < roman_min ( 1 , italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∀ bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ Z end_CELL end_ROW start_ROW start_CELL bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) end_CELL start_CELL italic_ζ start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT = 1 and italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) ) < roman_min ( 1 , italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∀ bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ Z , or italic_ζ start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT = 0 and italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) ) < 1 end_CELL end_ROW start_ROW start_CELL bold_x end_CELL start_CELL italic_ζ start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT = 1 and italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) ) , italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) ) ≥ 1 , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ∀ bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ Z , or italic_ζ start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT = 0 and italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) ) ≥ 1 end_CELL end_ROW (5)

Summary of System-Agent Interaction:

  1. 1.

    Agents arrive simultaneously, and the system trains a classifier f:XY:𝑓XYf:\pazocal{X}\rightarrow\pazocal{Y}italic_f : roman_X → roman_Y for maximum prediction accuracy.

  2. 2.

    Negatively classified agents request recourse, and the system selects agents for recourse provision to maximize utility (Equation 3).

  3. 3.

    Positively classified agents and those provided recourse have a probability p[0,1]𝑝01p\in[0,1]italic_p ∈ [ 0 , 1 ] to reveal features, contributing to the publicly revealed set ZX+ZsubscriptX\textbf{Z}\subseteq\pazocal{X}_{+}Z ⊆ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT.

  4. 4.

    Upon observing Z, agents execute final actions based on Equation 5.

Our framework is intended to capture settings where black box models are used for decision-making. Any agent subjected to the decision rules will not have direct access to the model but will still act in their best interest. In these opaque settings, recourse proposed by the system naturally offers a way for agents to learn more about the decision rule, thus increasing their ability to game the system.

The following two definitions introduce key metrics that will be used throughout this paper – the recourse rate quantifies the proportion of negatively classified agents who opt to take recourse actions when presented with a disclosed feature set. The manipulation rate captures the fraction of negatively classified agents who choose to manipulate their features under the same conditions:

Definition 1

(Recourse Rate) Let XsubscriptX\textbf{X}_{-}X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT be the set of features of negatively classified agents. For a given set of disclosed features (i.e., recourse actions) Z, the recourse rate rec(Z,X)recZsubscriptX\text{rec}(\textbf{Z},\textbf{X}_{-})rec ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) is defined as the fraction of agents who choose to perform recourse when shown Z:

Definition 2

(Manipulation Rate) Let XsubscriptX\textbf{X}_{-}X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT be the set of features of the negatively classified agents. For a given set of disclosed features (i.e., recourse actions) Z, the manipulation rate manip(Z,X)manipZsubscriptX\text{manip}(\textbf{Z},\textbf{X}_{-})manip ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) is defined as the fraction of the n𝑛nitalic_n agents which choose to manipulate when shown features Z:

5 SYSTEM UTILITY

Recall from the previous section, the system aims to select a set ZRXRsubscriptZ𝑅subscriptX𝑅{\textbf{Z}}_{R}\subseteq\textbf{X}_{R}Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⊆ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT to reveal as recourse recommendations simultaneously to maximize its utility (Equation 3). We can first show that this problem is NP-hard (Theorem 6 in Section C.2). Despite the hardness of this objective, the system’s utility is submodular in the set of provided recourse actions (Theorem 7 in Section C.3). This characteristic enables the system to employ standard submodular optimization techniques to approximately get the optimal recourse actions to disclose to k𝑘kitalic_k agents.

We can show that in expectation, the system benefits from agents taking recourse actions:

Theorem 1

(System’s Expected Utility Changes) The system’s expected utility (defined in Eq. 3) increases for each recourse action taken by agents and decreases for every manipulation action taken by agents. When the classifier used by the system is better than random guessing, which means that f(x)=1𝑓𝑥1f(x)=1italic_f ( italic_x ) = 1 implies Pr[y(x)=1|X=x]0.5Pr𝑦𝑥conditional1𝑋𝑥0.5\Pr[y(x)=1|X=x]\geq 0.5roman_Pr [ italic_y ( italic_x ) = 1 | italic_X = italic_x ] ≥ 0.5, then the system’s utility is monotonically increasing in each recourse action taken by an agent in expectation but will be monotonically decreasing in each manipulation action taken by an agent.

However, this does not imply that the system is always incentivized to provide as many recourse actions as possible, since agents might not always take them if they collude, which creates a natural misalignment between the system’s utility and recourse offering for the system.

6 COST OF STRATEGICALLY WITHHOLDING RECOURSE SYSTEM

Having shown that the system could be incentivized to withhold recourse from the agents, we now study the consequence of such withholdings by examining the social cost and unfairness as a result of the system’s strategic actions.

Definition 3

(Social Cost of a Strategically Withhold Recourse) Given a publically revealed set ZX+ZsubscriptX\textbf{Z}\subseteq\pazocal{X}_{+}Z ⊆ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT, the social cost refers to the additional cost agents must pay as a result of the system withholding recourse. Denote 𝐱R(𝐱)subscript𝐱𝑅𝐱\mathbf{x}_{R}(\mathbf{x})bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) as the optimal recourse action provided by a non-strategic system, and 𝐳R(𝐱,Z)subscript𝐳𝑅𝐱Z\mathbf{z}_{R}(\mathbf{x},\textbf{Z})bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) as the recourse action that the agent takes given the revealed set Z, then the social cost of a strategically withholding recourse system is defined as:

cost(Z,X)=𝐱X(cR(𝐱,𝐳R(𝐱,Z))cR(𝐱,𝐱R(𝐱)))costZsubscriptXsubscript𝐱subscriptXsubscript𝑐𝑅𝐱subscript𝐳𝑅𝐱Zsubscript𝑐𝑅𝐱subscript𝐱𝑅𝐱\displaystyle\text{cost}(\textbf{Z},\textbf{X}_{-})=\sum_{\mathbf{x}\in\textbf% {X}_{-}}\big{(}c_{R}(\mathbf{x},\mathbf{z}_{R}(\mathbf{x},\textbf{Z}))-c_{R}(% \mathbf{x},\mathbf{x}_{R}(\mathbf{x}))\big{)}cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) ) - italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) ) )

where 𝐳R(𝐱,Z)=argmin𝐳ZcR(𝐱,𝐳)subscript𝐳𝑅𝐱Zsubscriptargmin𝐳Zsubscript𝑐𝑅𝐱𝐳\mathbf{z}_{R}(\mathbf{x},\textbf{Z})=\operatorname*{arg\,min}_{\mathbf{z}\in% \textbf{Z}}c_{R}(\mathbf{x},\mathbf{z})bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT bold_z ∈ Z end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z ). For the remainder of our results, we focus on univariate classifiers, i.e., the feature 𝐱𝐱\mathbf{x}bold_x is one-dimensional. There is a natural correspondence between univariate and multivariate classifiers in the sense that one can imagine the space of single-dimensional features as the scores produced a multi-dimensional classifier f(𝐱)𝑓𝐱f(\mathbf{x})italic_f ( bold_x ) 444This follows similarly to Lemma 3.1 in Milli et al. (2019).. That is, in the case when f(𝐱)=[h(𝐱)θ]𝑓𝐱delimited-[]𝐱𝜃f(\mathbf{x})=[h(\mathbf{x})\geq\theta]italic_f ( bold_x ) = [ italic_h ( bold_x ) ≥ italic_θ ] for some score function hhitalic_h and threshold theta, we can view f𝑓fitalic_f as a single dimensional classifier acting on the space of scores produced by hhitalic_h.

We also measure the disparities of different social groups in terms of their differences in 1) recourse ratios (defined in Definition 1), and 2) social cost (defined in Definition 3). Understanding the disparities in terms of recourse rate and social cost among different groups is crucial for addressing issues of unfairness in an algorithmic recourse system Gupta et al. (2019); von Kügelgen et al. (2022). These disparities often reflect systemic biases and inequalities, impacting marginalized communities disproportionately. In particular, assume there are two groups of agents X(g0)superscriptXsubscript𝑔0\textbf{X}^{(g_{0})}X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT and X(g1)superscriptXsubscript𝑔1\textbf{X}^{(g_{1})}X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT, where g0,g1subscript𝑔0subscript𝑔1g_{0},g_{1}italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT represents their group memberships, we are interested in the following quantities:

Definition 4

(Disparity in Social Cost and Recourse Ratio) The disparity in social cost and recourse ratio for two groups g0,g1subscript𝑔0subscript𝑔1g_{0},g_{1}italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT are defined as:

Diff(cost)(Z,X(g0),X(g1)):=|cost(Z,X(g1))cost(Z,X(g0))|,assignsuperscriptDiffcostZsuperscriptXsubscript𝑔0superscriptXsubscript𝑔1costZsuperscriptsubscriptXsubscript𝑔1costZsuperscriptsubscriptXsubscript𝑔0\displaystyle\leavevmode\resizebox{443.92024pt}{}{$\textit{Diff}^{(\text{cost}% )}(\textbf{Z},\textbf{X}^{(g_{0})},\textbf{X}^{(g_{1})}):=\left|\text{cost}(% \textbf{Z},\textbf{X}_{-}^{(g_{1})})-\text{cost}(\textbf{Z},\textbf{X}_{-}^{(g% _{0})})\right|$},Diff start_POSTSUPERSCRIPT ( cost ) end_POSTSUPERSCRIPT ( Z , X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT , X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) := | cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) - cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) | ,

In the experiments section, we demonstrate that these disparities can be quite common across different datasets (see Figure 3). By quantifying and illuminating these disparities, we gain crucial insights into the specific mechanisms of inequity and injustice within algorithmic recourse systems.

7 THE EFFECT OF SUBSIDIES

To remedy the adverse population- and group-level impacts previously observed, we investigate the use of subsidies (rigorously defined next) and their impact on recourse rate, social cost, and unfairness we defined in the previous section. Subsidies correspond to a global decrease in the cost of recourse. For example, free educational material on financial literacy distributed to any agent petitioning the bank for recourse will increase the ease at which that agent can perform recourse actions.

Definition 5

(Subsidies)(Hu et al., 2019) A subsidy 0α10𝛼10\leq\alpha\leq 10 ≤ italic_α ≤ 1 is a scalar decrease to the cost of recourse. For subsidy α𝛼\alphaitalic_α, agents performing recourse pay only (1α)cR(𝐱,𝐱)1𝛼subscript𝑐𝑅𝐱superscript𝐱(1-\alpha)\cdot c_{R}(\mathbf{x},\mathbf{x}^{\prime})( 1 - italic_α ) ⋅ italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) instead of the full cost of cR(𝐱,𝐱)subscript𝑐𝑅𝐱superscript𝐱c_{R}(\mathbf{x},\mathbf{x}^{\prime})italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). We denote cR(𝐱,𝐱;α)=(1α)cR(𝐱,𝐱)subscript𝑐𝑅𝐱superscript𝐱𝛼1𝛼subscript𝑐𝑅𝐱superscript𝐱c_{R}(\mathbf{x},\mathbf{x}^{\prime};\alpha)=(1-\alpha)\cdot c_{R}(\mathbf{x},% \mathbf{x}^{\prime})italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_α ) = ( 1 - italic_α ) ⋅ italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) as the new recourse cost at subsidy level α𝛼\alphaitalic_α.

Next, we demonstrate how subsidies can help increase the recourse rate (Theorem 1) and system’s utility (Theorem 3). Additionally, subsidies can mitigate disparities in recourse rate differences (Theorem 5) and social cost differences (Theorem 4) among various groups.

We first show how subsidies influence the recourse rate. Recall that subsidy reduces the cost of recourse from cR(𝐱,𝐱)subscript𝑐𝑅𝐱superscript𝐱c_{R}(\mathbf{x},\mathbf{x}^{\prime})italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) to cR(𝐱,𝐱;α)subscript𝑐𝑅𝐱superscript𝐱𝛼c_{R}(\mathbf{x},\mathbf{x}^{\prime};\alpha)italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_α ). With that, the recourse rate becomes:

rec(Z,X;α)recZsubscriptX𝛼\displaystyle\text{rec}(\textbf{Z},\textbf{X}_{-};\alpha)rec ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ; italic_α )
=\displaystyle== 𝐱X𝟙[min𝐳ZcR(𝐱,𝐳;α)<min(1,min𝐳′′ZcM(𝐱,𝐳′′))]|X|.𝐱subscriptX1delimited-[]superscript𝐳Zsubscript𝑐𝑅𝐱superscript𝐳𝛼1superscript𝐳′′Zsubscript𝑐𝑀𝐱superscript𝐳′′subscriptX\displaystyle\frac{\underset{\mathbf{x}\in\textbf{X}_{-}}{\sum}\mathbbm{1}% \bigg{[}\underset{\mathbf{z}^{\prime}\in\textbf{Z}}{\min}~{}c_{R}(\mathbf{x},% \mathbf{z}^{\prime};\alpha)<\min\big{(}1,~{}\underset{\mathbf{z}^{\prime\prime% }\in\textbf{Z}}{\min}~{}c_{M}(\mathbf{x},\mathbf{z}^{\prime\prime})\big{)}% \bigg{]}}{|\textbf{X}_{-}|}.divide start_ARG start_UNDERACCENT bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT end_UNDERACCENT start_ARG ∑ end_ARG blackboard_1 [ start_UNDERACCENT bold_z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ Z end_UNDERACCENT start_ARG roman_min end_ARG italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_α ) < roman_min ( 1 , start_UNDERACCENT bold_z start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ∈ Z end_UNDERACCENT start_ARG roman_min end_ARG italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ) ) ] end_ARG start_ARG | X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT | end_ARG .

The key observation here is that with subsidy α𝛼\alphaitalic_α, the recourse cost reduces, but the manipulation cost remains the same. Both optimal recourse actions 𝐱R(𝐱)subscript𝐱𝑅𝐱\mathbf{x}_{R}(\mathbf{x})bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) and the optimal manipulation action 𝐱M(𝐱)subscript𝐱𝑀𝐱\mathbf{x}_{M}(\mathbf{x})bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) remain the same. With that, we can show that the recourse rate is a monotonic function in subsidy – as the subsidy level increases, the recourse rate will also increase:

Theorem 1

(Subsidy Influence on Recourse Rate) Given a reveal set Z, the recourse rate rec(Z,X,α)recZsubscriptX𝛼\text{rec}(\textbf{Z},\textbf{X}_{-},\alpha)rec ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT , italic_α ) is a monotonically increasing function of subsidies α𝛼\alphaitalic_α.

With subsidy α𝛼\alphaitalic_α, the social cost for a given revealed set Z becomes:

where 𝐳R(𝐱,Z;α)=argmin𝐳Z(1α)cR(𝐱,𝐳)subscript𝐳𝑅𝐱Z𝛼subscriptargmin𝐳Z1𝛼subscript𝑐𝑅𝐱𝐳\mathbf{z}_{R}(\mathbf{x},\textbf{Z};\alpha)=\operatorname*{arg\,min}_{\mathbf% {z}\in\textbf{Z}}(1-\alpha)c_{R}(\mathbf{x},\mathbf{z})bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ; italic_α ) = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT bold_z ∈ Z end_POSTSUBSCRIPT ( 1 - italic_α ) italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z ) is the optimal recourse action given revealed set Z and a particular subsidy level α𝛼\alphaitalic_α, and 𝐱Rsubscript𝐱𝑅\mathbf{x}_{R}bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT is the optimal default recourse action provided by the system without any strategic withholding. We can show that the social cost is also a monotonic non-increasing function in the subsidy level:

Theorem 2

(Subsidy Influence on Social Cost) Given a revealed set Z, the social cost cost(Z,X;α)costZsubscriptX𝛼\text{cost}(\textbf{Z},\textbf{X}_{-};\alpha)cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ; italic_α ) is monotonically decreasing in subsidies.

Subsidies also help improve the system’s utility; under some assumptions on the cost functions (i.e., monotonic in the distance and only cross once), the system’s utility is monotonic in subsidies as well:

Theorem 3

(Subsidy’s Influence on System’s Utility) Given a revealed set Z, when both cR(𝐱,𝐱)subscript𝑐𝑅𝐱superscript𝐱c_{R}(\mathbf{x},\mathbf{x}^{\prime})italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) and cM(𝐱,𝐱)subscript𝑐𝑀𝐱superscript𝐱c_{M}(\mathbf{x},\mathbf{x}^{\prime})italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) are monotonic in 𝐱𝐱norm𝐱superscript𝐱\|\mathbf{x}-\mathbf{x}^{\prime}\|∥ bold_x - bold_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥ and only cross once, the system utility is monotonically increasing in subsidies.

Next we examine the difference in social cost between groups as a function of subsidies. We find that subsidies are an effective tool to mitigate disparities caused by strategically withheld recourse.

Theorem 4

(Subsidy Influence on Social Cost Disparity) With subsidy α𝛼\alphaitalic_α, the disparity in social cost for two group g0,g1subscript𝑔0subscript𝑔1g_{0},g_{1}italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT becomes: Diff(cost)(Z,X(g0),X(g1);α):=|cost(Z,X(g1);α)cost(Z,X(g0);α)|assignsuperscriptDiffcostZsuperscriptXsubscript𝑔0superscriptXsubscript𝑔1𝛼costZsuperscriptsubscriptXsubscript𝑔1𝛼costZsuperscriptsubscriptXsubscript𝑔0𝛼\textit{Diff}^{(\text{cost})}(\textbf{Z},\textbf{X}^{(g_{0})},\textbf{X}^{(g_{% 1})};\alpha):=\left|\text{cost}(\textbf{Z},\textbf{X}_{-}^{(g_{1})};\alpha)-% \text{cost}(\textbf{Z},\textbf{X}_{-}^{(g_{0})};\alpha)\right|Diff start_POSTSUPERSCRIPT ( cost ) end_POSTSUPERSCRIPT ( Z , X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT , X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ; italic_α ) := | cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ; italic_α ) - cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ; italic_α ) | Given a revealed set Z, the social cost difference monotonically decreases in subsidies.

Intuitively, as we increase the subsidy level, the cost of recourse decreases linearly as a function of the subsidy level, making it increasingly cheaper to perform the optimal recourse action. For both social groups, their social cost approaches 0 as we increase the subsidy level; as a result, the disparity in social cost between the two groups also decreases to 0.

With subsidy α𝛼\alphaitalic_α, for a given a revealed set Z, the disparity in recourse ratio for groups g0,g1subscript𝑔0subscript𝑔1g_{0},g_{1}italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is:

Diff(rec)(Z,X(g0),X(g1);α):=|rec(Z,X(g1);α)rec(Z,X(g0);α)|assignsuperscriptDiffrecZsuperscriptXsubscript𝑔0superscriptXsubscript𝑔1𝛼recZsuperscriptsubscriptXsubscript𝑔1𝛼recZsuperscriptsubscriptXsubscript𝑔0𝛼\textit{Diff}^{(\text{rec})}(\textbf{Z},\textbf{X}^{(g_{0})},\textbf{X}^{(g_{1% })};\alpha):=\left|\text{rec}(\textbf{Z},\textbf{X}_{-}^{(g_{1})};\alpha)-% \text{rec}(\textbf{Z},\textbf{X}_{-}^{(g_{0})};\alpha)\right|Diff start_POSTSUPERSCRIPT ( rec ) end_POSTSUPERSCRIPT ( Z , X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT , X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ; italic_α ) := | rec ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ; italic_α ) - rec ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ; italic_α ) |

where rec(Z,X(gi))recZsuperscriptsubscriptXsubscript𝑔𝑖\text{rec}(\textbf{Z},\textbf{X}_{-}^{(g_{i})})rec ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) is the recourse rate for a particular subgroup gisubscript𝑔𝑖g_{i}italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We show that when subsidies are sufficiently large, the recourse rate difference is monotonically decreasing in subsidies:

Theorem 5

(Subsidy’s Influence on Recourse Rate Disparity) Given two groups g0subscript𝑔0g_{0}italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and g1subscript𝑔1g_{1}italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT of relatively equal negatively classified agents size |X(g0)||X(g1)|superscriptsubscriptXsubscript𝑔0superscriptsubscriptXsubscript𝑔1|\textbf{X}_{-}^{(g_{0})}|\approx|\textbf{X}_{-}^{(g_{1})}|| X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT | ≈ | X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT |, there exists a subsidy level 0α10superscript𝛼10\leq\alpha^{*}\leq 10 ≤ italic_α start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ≤ 1, such that ααfor-all𝛼superscript𝛼\forall\alpha\geq\alpha^{*}∀ italic_α ≥ italic_α start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, the recourse rate difference monotonically decreases.

This result follows that when recourse is free, i.e., subsidies are maximized, all agents can perform recourse, and the recourse rate difference is 00. Thus, as subsidies increase, there must exist a point (namely αsuperscript𝛼\alpha^{*}italic_α start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT) when both groups can take advantage of subsidies at proportional rates, thus decreasing the gap between the number of agents performing recourse in both groups. We also verify empirically that for recourse rate difference, there indeed exists a peak subsidy value αsuperscript𝛼\alpha^{*}italic_α start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT where the recourse rate difference increases before and then decreases afterward (see Figure 4).

8 EMPIRICAL STUDIES

Setup

We conduct experiments using three datasets: 1) Law School Wightman & Council (1998) dataset, in which the objective is to predict whether a student will pass the bar exam on the first attempt, Adult Income Dua et al. (2017) in which the objective is to predict whether an individual earns more than 50K50𝐾50K50 italic_K annually, and German Credit Yeh & Lien (2009) in which the objective is to predict whether a given individual will not default on their credit. In each dataset, agents have constant utility over approved features, i.e., the conventional recourse setting where ua(𝐱)=1subscript𝑢𝑎𝐱1u_{a}(\mathbf{x})=1italic_u start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ( bold_x ) = 1 for all 𝐱𝐱\mathbf{x}bold_x; the principal (system) has utility up(𝐱)=1subscript𝑢𝑝𝐱1u_{p}(\mathbf{x})=1italic_u start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( bold_x ) = 1 when the agent is a true positive (y=1𝑦1y=1italic_y = 1, f(x)=1𝑓𝑥1f(x)=1italic_f ( italic_x ) = 1) and up(𝐱)=1subscript𝑢𝑝𝐱1u_{p}(\mathbf{x})=-1italic_u start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( bold_x ) = - 1 when the agent is a false positive (y=1𝑦1y=-1italic_y = - 1, f(x)=1𝑓𝑥1f(x)=1italic_f ( italic_x ) = 1). Qualification is predicted via Logistic Regression (shown in this section) or Gradient Boosting Trees (shown in the Supplement Appendix F).

Recourse and manipulation both carry an 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT cost, namely cR(𝐱,𝐳)=wR(𝐱𝐳)2subscript𝑐𝑅𝐱𝐳subscriptnormsubscript𝑤𝑅𝐱𝐳2c_{R}(\mathbf{x},\mathbf{z})=\|w_{R}\cdot(\mathbf{x}-\mathbf{z})\|_{2}italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z ) = ∥ italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ ( bold_x - bold_z ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, and cM(𝐱,𝐳)=wM(𝐱𝐳)2subscript𝑐𝑀𝐱𝐳subscriptnormsubscript𝑤𝑀𝐱𝐳2c_{M}(\mathbf{x},\mathbf{z})=\|w_{M}\cdot(\mathbf{x}-\mathbf{z})\|_{2}italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_z ) = ∥ italic_w start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ⋅ ( bold_x - bold_z ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT,where wRsubscript𝑤𝑅w_{R}italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT and wMsubscript𝑤𝑀w_{M}italic_w start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT are the weight vectors for the cost functions. In our experiments, we report outcomes over 100100100100 runs using randomly initialized wRsubscript𝑤𝑅w_{R}italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT and wMsubscript𝑤𝑀w_{M}italic_w start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT and resampled subsets of positive and negative agents in the dataset in each run. We set the probability that the agent discloses their feature publicly at p=0.7𝑝0.7p=0.7italic_p = 0.7 for all experiments. When varying this value, we observe similar results.

Refer to caption
Figure 2: Fraction of the population performing recourse (top row) or manipulation (bottom row). Each line corresponds to a different subsidy ratio “sub”, i.e., the cost reduction applied to recourse.
Refer to caption
Figure 3: Difference in recourse rate (top row) and social cost (bottom row) between different sensitive attribute groups. Each line corresponds to a different subsidy ratio “subs”, i.e., the cost reduction applied to recourse.

Recourse Rate and Manipulation Rate

We begin by examining the relationship between the fraction of the population choosing to perform recourse and the fraction choosing to perform manipulation as a function of the fraction of agents given a recourse action. In Figure 2, we see that in general, as the percentage of agents who are provided a recourse action increases, the recourse rate decreases while the manipulation rate increases (this trend holds for each subsidy value). Thus, when agents themselves can strategically select between recourse and manipulation, the increased model transparency, created by providing more agents with recourse actions, results in more agents selecting to perform manipulation. Providing more recourse actions to agents, does not necessarily result in more agents performing recourse. Despite this general trend, we also observe the effectiveness of subsidies. As subsidies converge to 1111 (meaning recourse carries no cost), the fraction of agents choosing recourse converges to 1111, while the fraction of agents choosing manipulation converges to 00. While it may be expensive in general to provide such subsidies, and the question of how to balance this expense against the system’s own utility remains open, these results indicate that subsidies are an effective avenue for broadly promoting recourse and disincentivizing manipulation.

Disparity in Recourse and Social Cost

Lastly, we investigate how strategic system behavior causes disparate impacts among sensitive groups. In our experiments, groups are taken to be binary and are defined by race in the Law School dataset (White and Non-White), by gender in the Adult Income dataset (Male and Female), and by age in the German Credit dataset (Young and Old). In Figure 3, we see the difference in the number of agents performing recourse and social cost between groups. Higher values in these plots indicate higher rates of recourse, or lower cost, for White individuals in the Law School dataset, Male individuals in the Adult income dataset, and Young individuals in the Credit dataset. First, strong subsidies (particularly subs 0.4absent0.4\leq 0.4≤ 0.4) result in a large decrease in the disparities between groups for both recourse rate and social cost. For less strong subsidies (subs 0.6absent0.6\geq 0.6≥ 0.6), we see that the gap in recourse rate between groups can increase. This is due to the fact that when subsidies are less strong, only agents with already low costs of recourse (primarily from the advantaged group) can benefit from those subsidies.

9 CONCLUSION

In scenarios where agents can manipulate a system, there is a reduced incentive for the system to provide recourse due to increased model transparency. Consequently, the system strategically withholds recourse from some, leading to higher social costs, and disproportionately impacting disadvantaged groups. Despite the inherent tension between the system’s utility and its provision of recourse, subsidies emerge as a viable tool to boost recourse-providing rates and alleviate group-wise disparities resulting from recourse withholding.

Acknowledgements

This work is partially supported by the National Science Foundation (NSF) under grants IIS-2214141, IIS-1905558, CNS-2310470, IIS-2143895, IIS-2040800, and CCF-2023495; the Office of Naval Research (ONR) under grant N00014-24-1-2663, and Amazon.

References

  • Akyol et al. (2016) Emrah Akyol, Cedric Langbort, and Tamer Basar. Price of transparency in strategic machine learning. arXiv preprint arXiv:1610.08210, 2016.
  • Barsotti et al. (2022) Flavia Barsotti, Rüya Gökhan Koçer, and Fernando P Santos. Transparency, detection and imitation in strategic classification. In Proceedings of the 31st International Joint Conference on Artificial Intelligence, IJCAI 2022. International Joint Conferences on Artificial Intelligence (IJCAI), 2022.
  • Bechavod et al. (2022) Yahav Bechavod, Chara Podimata, Steven Wu, and Juba Ziani. Information discrepancy in strategic learning. In International Conference on Machine Learning, pp.  1691–1715. PMLR, 2022.
  • Blocki et al. (2013) Jeremiah Blocki, Nicolas Christin, Anupam Datta, Ariel D Procaccia, and Arunesh Sinha. Audit games. arXiv preprint arXiv:1303.0356, 2013.
  • Chen et al. (2020) Yatong Chen, Jialu Wang, and Yang Liu. Linear classifiers that encourage constructive adaptation. arXiv preprint arXiv:2011.00355, 2020.
  • Chen et al. (2023) Yatong Chen, Zeyu Tang, Kun Zhang, and Yang Liu. Model transferability with responsive decision subjects. In International Conference on Machine Learning, pp.  4921–4952. PMLR, 2023.
  • Chen et al. (2018) Yiling Chen, Chara Podimata, Ariel D Procaccia, and Nisarg Shah. Strategyproof linear regression in high dimensions. In Proceedings of the 2018 ACM Conference on Economics and Computation, pp.  9–26, 2018.
  • Dong et al. (2018) Jinshuo Dong, Aaron Roth, Zachary Schutzman, Bo Waggoner, and Zhiwei Steven Wu. Strategic classification from revealed preferences. In Proceedings of the 2018 ACM Conference on Economics and Computation, pp.  55–70, 2018.
  • Dua et al. (2017) Dheeru Dua, Casey Graff, et al. Uci machine learning repository. 2017.
  • Ehyaei et al. (2023) Ahmad-Reza Ehyaei, Amir-Hossein Karimi, Bernhard Schölkopf, and Setareh Maghsudi. Robustness implies fairness in causal algorithmic recourse. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, pp.  984–1001, 2023.
  • Estornell et al. (2023a) Andrew Estornell, Yatong Chen, Sanmay Das, Yang Liu, and Yevgeniy Vorobeychik. Incentivizing recourse through auditing in strategic classification. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, pp.  400–408, 08 2023a. doi: 10.24963/ijcai.2023/45.
  • Estornell et al. (2023b) Andrew Estornell, Sanmay Das, Yang Liu, and Yevgeniy Vorobeychik. Group-fair classification with strategic agents. In ACM Conference on Fairness, Accountability, and Transparency, pp.  389–399, 2023b.
  • Fokkema et al. (2024) Hidde Fokkema, Damien Garreau, and Tim van Erven. The risks of recourse in binary classification. In International Conference on Artificial Intelligence and Statistics, pp.  550–558. PMLR, 2024.
  • Gupta et al. (2019) Vivek Gupta, Pegah Nokhiz, Chitradeep Dutta Roy, and Suresh Venkatasubramanian. Equalizing recourse across groups. arXiv preprint arXiv:1909.03166, 2019.
  • Hardt et al. (2016) Moritz Hardt, Nimrod Megiddo, Christos Papadimitriou, and Mary Wootters. Strategic classification. In Proceedings of the 2016 ACM conference on innovations in theoretical computer science, pp.  111–122, 2016.
  • Harris et al. (2022) Keegan Harris, Valerie Chen, Joon Kim, Ameet Talwalkar, Hoda Heidari, and Steven Z Wu. Bayesian persuasion for algorithmic recourse. Advances in Neural Information Processing Systems, 35:11131–11144, 2022.
  • Hu et al. (2019) Lily Hu, Nicole Immorlica, and Jennifer Wortman Vaughan. The disparate effects of strategic manipulation. In Proceedings of the Conference on Fairness, Accountability, and Transparency, pp.  259–268, 2019.
  • Karimi et al. (2020a) Amir-Hossein Karimi, Gilles Barthe, Bernhard Schölkopf, and Isabel Valera. A survey of algorithmic recourse: definitions, formulations, solutions, and prospects, 2020a.
  • Karimi et al. (2020b) Amir-Hossein Karimi, Julius von Kügelgen, Bernhard Schölkopf, and Isabel Valera. Algorithmic recourse under imperfect causal knowledge: a probabilistic approach, 2020b.
  • Levanon & Rosenfeld (2021) Sagi Levanon and Nir Rosenfeld. Strategic classification made practical. In International Conference on Machine Learning, pp.  6243–6253. PMLR, 2021.
  • Milli et al. (2019) Smitha Milli, John Miller, Anca D Dragan, and Moritz Hardt. The social cost of strategic classification. In Proceedings of the Conference on Fairness, Accountability, and Transparency, pp.  230–239, 2019.
  • Olckers & Walsh (2023) Matthew Olckers and Toby Walsh. Incentives to offer algorithmic recourse, 2023.
  • Orso et al. (2015) Andrew Orso, Jon Lee, and Siqian Shen. Submodular minimization in the context of modern lp and milp methods and solvers. In Proceedings of the 14th International Symposium on Experimental Algorithms - Volume 9125, pp.  193–204, Berlin, Heidelberg, 2015. Springer-Verlag. ISBN 9783319200859.
  • Tsirtsis et al. (2019) Stratis Tsirtsis, Behzad Tabibian, Moein Khajehnejad, Adish Singla, Bernhard Schölkopf, and Manuel Gomez-Rodriguez. Optimal Decision Making Under Strategic Behavior. arXiv e-prints, 2019.
  • Ustun et al. (2019) Berk Ustun, Alexander Spangher, and Yang Liu. Actionable recourse in linear classification. In Proceedings of the conference on fairness, accountability, and transparency, pp.  10–19, 2019.
  • Venkatasubramanian & Alfano (2020) Suresh Venkatasubramanian and Mark Alfano. The philosophical basis of algorithmic recourse. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pp.  284–293, 2020.
  • von Kügelgen et al. (2020) Julius von Kügelgen, Umang Bhatt, Amir-Hossein Karimi, Isabel Valera, Adrian Weller, and Bernhard Schölkopf. On the fairness of causal algorithmic recourse, 2020.
  • von Kügelgen et al. (2022) Julius von Kügelgen, Amir-Hossein Karimi, Umang Bhatt, Isabel Valera, Adrian Weller, and Bernhard Schölkopf. On the fairness of causal algorithmic recourse, 2022.
  • Vorobeychik (2023) Yevgeniy Vorobeychik. The many faces of adversarial machine learning. In AAAI Conference on Artificial Intelligence, volume 37, pp.  15402–15409, 2023.
  • Wightman & Council (1998) L.F. Wightman and Law School Admission Council. LSAC National Longitudinal Bar Passage Study. LSAC research report series. Law School Admission Council, 1998. URL https://books.google.com/books?id=O9A7AQAAIAAJ.
  • Yeh & Lien (2009) I-Cheng Yeh and Che-hui Lien. The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients. Expert systems with applications, 36(2):2473–2480, 2009.
  • Zrnic et al. (2021) Tijana Zrnic, Eric Mazumdar, Shankar Sastry, and Michael Jordan. Who leads and who follows in strategic classification? Advances in Neural Information Processing Systems, 34:15257–15269, 2021.

 

To Give or Not to Give? The Impacts of Strategically Withheld Recourse


 


Appendix A Notation Table

Symbol Usage
XdXsuperscriptd\pazocal{X}\subset\mathbb{R}^{d}roman_X ⊂ blackboard_R start_POSTSUPERSCRIPT roman_d end_POSTSUPERSCRIPT The domain of the feature 𝐱𝐱\mathbf{x}bold_x
Y{0,1}Y01\pazocal{Y}\equiv\{0,1\}roman_Y ≡ { 0 , 1 } The domain of labels
Xn×dXsuperscript𝑛𝑑\textbf{X}\in\mathbb{R}^{n\times d}X ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_d end_POSTSUPERSCRIPT A set of features of n𝑛nitalic_n agents
Y{0,1}|X|𝑌superscript01XY\in\{0,1\}^{|\textbf{X}|}italic_Y ∈ { 0 , 1 } start_POSTSUPERSCRIPT | X | end_POSTSUPERSCRIPT The labels for the set of features X
𝐱X𝐱X\mathbf{x}\in\pazocal{X}bold_x ∈ roman_X A random variable representing an example’s features
yY𝑦Yy\in\pazocal{Y}italic_y ∈ roman_Y A random variable representing an example’s ground truth label
f:XY:𝑓XYf:\pazocal{X}\rightarrow\pazocal{Y}italic_f : roman_X → roman_Y a binary classifier, unknown to the agents
X,X(0)XsubscriptXsuperscriptX0X\pazocal{X}_{-},\pazocal{X}^{(0)}\subseteq\pazocal{X}roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT , roman_X start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ⊆ roman_X The domain of negatively classified features, i.e. 𝐱Xfor-all𝐱subscriptX\forall\mathbf{x}\in\pazocal{X}_{-}∀ bold_x ∈ roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT, f(𝐱)=0𝑓𝐱0f(\mathbf{x})=0italic_f ( bold_x ) = 0
X+XsubscriptXX\pazocal{X}_{+}\subseteq\pazocal{X}roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ⊆ roman_X The domain of positively classified features, i.e., 𝐱X+,f(𝐱)=1formulae-sequencefor-all𝐱subscriptXf𝐱1\forall\mathbf{x}\in\pazocal{X}_{+},f(\mathbf{x})=1∀ bold_x ∈ roman_X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT , roman_f ( bold_x ) = 1
XXsubscriptXX\textbf{X}_{-}\subseteq\pazocal{X}X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ⊆ roman_X The set of negatively classified features, i.e. 𝐱Xfor-all𝐱subscriptX\forall\mathbf{x}\in\textbf{X}_{-}∀ bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT, f(𝐱)=0𝑓𝐱0f(\mathbf{x})=0italic_f ( bold_x ) = 0
X+XsubscriptXX\textbf{X}_{+}\subseteq\pazocal{X}X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT ⊆ roman_X The set of positively classified features, i.e., 𝐱X+,f(𝐱)=1formulae-sequencefor-all𝐱subscriptX𝑓𝐱1\forall\mathbf{x}\in\textbf{X}_{+},f(\mathbf{x})=1∀ bold_x ∈ X start_POSTSUBSCRIPT + end_POSTSUBSCRIPT , italic_f ( bold_x ) = 1
X(gi)XsuperscriptXsubscript𝑔𝑖X\textbf{X}^{(g_{i})}\subseteq\textbf{X}X start_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ⊆ X The subset of features belongs to group G=gi𝐺subscript𝑔𝑖G=g_{i}italic_G = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
cR:X×X+:subscript𝑐𝑅XXsubscriptc_{R}:\pazocal{X}\times\pazocal{X}\rightarrow\mathbb{R}_{+}italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT : roman_X × roman_X → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT The cost function of recourse
cM:X×X+:subscript𝑐𝑀XXsubscriptc_{M}:\pazocal{X}\times\pazocal{X}\rightarrow\mathbb{R}_{+}italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT : roman_X × roman_X → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT The cost function of recourse
XRsubscriptX𝑅\textbf{X}_{R}X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT The set of all possible recourse actions
ZRsubscriptZ𝑅\textbf{Z}_{R}Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT The set of revealed recourse actions
Z+subscriptZ\textbf{Z}_{+}Z start_POSTSUBSCRIPT + end_POSTSUBSCRIPT The set of revealed positively classified features
Z=ZRZ+ZsubscriptZ𝑅subscriptZ\textbf{Z}=\textbf{Z}_{R}\cup\textbf{Z}_{+}Z = Z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ∪ Z start_POSTSUBSCRIPT + end_POSTSUBSCRIPT A publicly revealed feature set
𝐱R(𝐱)subscript𝐱𝑅𝐱\mathbf{x}_{R}(\mathbf{x})bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x ) The optimal recourse action for agent with feature 𝐱𝐱\mathbf{x}bold_x
𝐱M(𝐱)subscript𝐱𝑀𝐱\mathbf{x}_{M}(\mathbf{x})bold_x start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x ) The optimal manipulation action for agent with feature 𝐱𝐱\mathbf{x}bold_x
𝐳(𝐱,Z)𝐳𝐱Z\mathbf{z}(\mathbf{x},\textbf{Z})bold_z ( bold_x , Z ) The agent’s final action
rec(Z,X)recZX\text{rec}(\textbf{Z},\textbf{X})rec ( Z , X ) The recourse ratio for feature sets X given revealed set is Z
α[0,1]𝛼01\alpha\in[0,1]italic_α ∈ [ 0 , 1 ] A subsidy level
u0subscript𝑢0u_{0}\in\mathbb{R}italic_u start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ blackboard_R The initial utility of a system without providing recourse
Table 1: Primary Notation

Appendix B Additional Related Work

Recourse

Recourse focuses on providing agents with the ability to contest or improve their outcome via a modification to their attributes in a genuine manner (e.g., paying off debt to increase creditworthiness) Ustun et al. (2019); Venkatasubramanian & Alfano (2020); Karimi et al. (2020a); Gupta et al. (2019); Karimi et al. (2020b); von Kügelgen et al. (2020); Chen et al. (2020); Harris et al. (2022). Much of this line of work focuses on the setting where the requested recourse is guaranteed to be provided. As far as we know, our work is the first to challenge this fundamental assumption and argue that without a third-party’s intervention (e.g., the government regulation on the system’s recourse providing), a utility-maximizing algorithmic recourse system may be incentivized to strategically withhold recourse from some agents to prevent manipulations. We point the reader to Karimi et al. (2020a) for a more detailed discussion of the concepts and recent development of algorithmic recourse. To our knowledge, even though the literature has previously introduced the concepts of recourse, strategic manipulation, and subsidy analysis, we have yet to find any studies that explicitly highlight how a recourse system might act strategically by withholding recourse to enhance its own utility. The originality of our work, thus, is to address this gap. Some works have investigated the relationship between incentives/utility and recourse, such as Fokkema et al. (2024), which finds that providing recourse can decrease classifier accuracy, Estornell et al. (2023a), which investigates ways to ensure that given recourse actions are taken by agents, and Olckers & Walsh (2023) which investigates the incentive compatibility of recourse.

Strategic Classification

Strategic Classification focuses on the problem of how to effectively make predictions in the presence of agents who behave strategically to obtain desirable outcomes Hardt et al. (2016); Chen et al. (2018); Tsirtsis et al. (2019); Levanon & Rosenfeld (2021); Dong et al. (2018); Chen et al. (2018); Zrnic et al. (2021); Chen et al. (2023). Our work considers a specific type of strategic behavior, namely the imitation-based manipulations: agents do not know the classifier f𝑓fitalic_f but are aware of a set of positively classified features and can misreport their feature by imitating another agent’s feature that is positively classified. Such copycat behavior has been well-known in the literature of game theory, the behavioral economy, and strategic classification, e.g., Bechavod et al. (2022); Barsotti et al. (2022). While most of this line of work focuses on agents being strategic and could potentially modify their features to get a favorable prediction outcome, our work focuses on when the system is being strategic and potentially withholds recourse to the agents.

Fairness and Social Cost in Recourse and Strategic Classification

Fairness has been explored in the literature algorithmic recourse and strategic classification. For example, existing works on fairness in recourse emphasize the importance of equitable recourse and explore various remedying unfair recourse decisions Gupta et al. (2019); von Kügelgen et al. (2022); Ehyaei et al. (2023). Fairness with the presence of strategic behavior has featured studies that highlight the inequity that results from strategic behavior by individuals Hu et al. (2019), as well as inequity (e.g., social cost) resulting from making classifiers robust to strategic behavior Milli et al. (2019); Estornell et al. (2023b). Unlike previous work that primarily focuses on proposing fair classifiers with the presence of strategic agents, our work uniquely demonstrates how the system’s strategic withholding impacts the fairness and social cost for different societal groups.

Transparency

Also related is work on transparency in machine learning. In particular, Barsotti et al. (2022) find that the risks of transparent explanations are alleviated if effective methods to detect faking behaviors are in place. Unlike our modeling framework, they model transparency as how much noise is in the threshold of a threshold classifier. Akyol et al. (2016) examines the impact of users’ strategic behavior on the design and performance of transparent machine learning algorithms, quantifying the ”price of transparency” as the cost ratio for the algorithm designer when users exploit transparency compared to when the algorithm is opaque.

Comparison with three closely related papers

  • Comparison with Estornell et al. (2023a): the key distinction between these our work and Estornell et al. (2023a) is that Estornell et al. (2023a) presumes the system will provide any agent with an optimal recourse action and examines how auditing can dissuade agents from manipulating. In contrast, we do not consider auditing, and instead focus on how a system may be incentivized to withhold recourse from certain agents. While both papers examine the use of subsidies, our focus and model are distinct.

  • Comparison with Fokkema et al. (2024): Fokkema et al. (2024) focuses on discussing the accuracy drop as a result of the system providing recourse, because it pushes users to regions of higher class uncertainty and therefore leads to more mistakes. Our work, on the other hand, focuses on the incentive-compatibility problem in an algorithmic recourse system. of the population

  • Comparison with Olckers & Walsh (2023): similar to our work, Olckers & Walsh (2023) also studies when it is incentive-compatible for a decision-maker to offer recourse. Unlike our setting, however, they primarily operate on a simple toy model that assumes the applicant’s profitability is fixed.

Appendix C Proofs for Section 5

C.1 ILP for system when p = 1

We provide the ILP formula for the system to find optimal recourse actions when the revealing probability p=1𝑝1p=1italic_p = 1:

max𝐚{0,1}|Zmax|,𝐛{0,1}|X|subscriptformulae-sequence𝐚superscript01subscriptZ𝐛superscript01subscriptX\displaystyle\max_{\mathbf{a}\in\{0,1\}^{|\textbf{Z}_{\max}|},\mathbf{b}\in\{0% ,1\}^{|\textbf{X}_{-}|}}roman_max start_POSTSUBSCRIPT bold_a ∈ { 0 , 1 } start_POSTSUPERSCRIPT | Z start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT , bold_b ∈ { 0 , 1 } start_POSTSUPERSCRIPT | X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT end_POSTSUBSCRIPT j=1|X|bjsuperscriptsubscript𝑗1subscriptXsubscript𝑏𝑗\displaystyle~{}\sum_{j=1}^{|\textbf{X}_{-}|}b_{j}∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT (maximize the number of agents performing recourse)
s.t. bjcR(𝐱j,𝐳R)aicM(𝐱j,𝐳i)+(1ai)subscript𝑏𝑗subscript𝑐𝑅subscript𝐱𝑗subscript𝐳𝑅subscript𝑎𝑖subscript𝑐𝑀subscript𝐱𝑗subscript𝐳𝑖1subscript𝑎𝑖\displaystyle~{}b_{j}c_{R}(\mathbf{x}_{j},\mathbf{z}_{R})\leq a_{i}c_{M}(% \mathbf{x}_{j},\mathbf{z}_{i})+(1-a_{i})italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) ≤ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + ( 1 - italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (only do recourse if all manipulation costs are greater)
bjajRsubscript𝑏𝑗subscript𝑎subscript𝑗𝑅\displaystyle~{}b_{j}\leq a_{j_{R}}italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≤ italic_a start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT (the optimal recourse action 𝐳jRsubscript𝐳subscript𝑗𝑅\mathbf{z}_{j_{R}}bold_z start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT for agent j𝑗jitalic_j must be revealed)
bjcR(xj,zR)1subscript𝑏𝑗subscript𝑐𝑅subscript𝑥𝑗subscript𝑧𝑅1\displaystyle~{}b_{j}c_{R}(x_{j},z_{R})\leq 1italic_b start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) ≤ 1 (the optimal recourse action 𝐳jRsubscript𝐳subscript𝑗𝑅\mathbf{z}_{j_{R}}bold_z start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT for agent j𝑗jitalic_j must be less than 1)
h=1|Z|ah=ksuperscriptsubscript1Zsubscript𝑎𝑘\displaystyle\sum_{h=1}^{|\textbf{Z}|}a_{h}=k∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | Z | end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = italic_k (the total number of revealed recourse action is k𝑘kitalic_k)

C.2 NP hardness of the System’s Optimal Recourse Providing Problem

Theorem 6

The problem of selecting the optimal set of recourse actions to recommend, such that the system’s utility is maximized (Equation 3), is NP-hard, even when the probability of diclosure p=1𝑝1p=1italic_p = 1.

Proof 1

To demonstrate the intractability of this objective, we reduce from the known NP-hard problem Minimum k𝑘kitalic_k-Union (Mk𝑘kitalic_kU), an instance of which is defined via a universe of n𝑛nitalic_n elements U={s1,,sn}𝑈subscript𝑠1subscript𝑠𝑛U=\{s_{1},\ldots,s_{n}\}italic_U = { italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_s start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT },a collection of n𝑛nitalic_n sets S={S1,Sm}Ssubscript𝑆1subscript𝑆𝑚\textbf{S}=\{S_{1},\ldots S_{m}\}S = { italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } with elements in U𝑈Uitalic_U, and a budget k𝑘kitalic_k. The objective in Mk𝑘kitalic_kU is to select an index set I𝐼Iitalic_I of size exactly k𝑘kitalic_k such that jISjsubscript𝑗𝐼subscript𝑆𝑗\cup_{j\in I}S_{j}∪ start_POSTSUBSCRIPT italic_j ∈ italic_I end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is minimized. Given an instance of Mk𝑘kitalic_kU can be mapped to an instance of simultaneous recourse as follows. Let X(0)×Z={(𝐱,𝐳j):siU and Sj𝐒}superscriptX0Zconditional-set𝐱subscript𝐳𝑗subscript𝑠𝑖𝑈 and subscript𝑆𝑗𝐒\textbf{X}^{(0)}\times\textbf{Z}=\big{\{}(\mathbf{x},\mathbf{z}_{j}):s_{i}\in U% \text{ and }S_{j}\in\mathbf{S}\big{\}}X start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT × Z = { ( bold_x , bold_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) : italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_U and italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ bold_S }, and define cRsubscript𝑐𝑅c_{R}italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT and cMsubscript𝑐𝑀c_{M}italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT as follows,

cR(𝐱,𝐳j)={1 if ij0 if i=jcM(𝐱,𝐳j)={1 if siSj1/2 if siSjformulae-sequencesubscript𝑐𝑅𝐱subscript𝐳𝑗cases1 if 𝑖𝑗0 if 𝑖𝑗subscript𝑐𝑀𝐱subscript𝐳𝑗cases1 if subscript𝑠𝑖subscript𝑆𝑗12 if subscript𝑠𝑖subscript𝑆𝑗\displaystyle c_{R}(\mathbf{x},\mathbf{z}_{j})=\begin{cases}1&\text{ if }~{}i% \neq j\\ 0&\text{ if }~{}i=j\end{cases}\qquad c_{M}(\mathbf{x},\mathbf{z}_{j})=\begin{% cases}1&\text{ if }~{}s_{i}\notin S_{j}\\ 1/2&\text{ if }~{}s_{i}\in S_{j}\end{cases}italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = { start_ROW start_CELL 1 end_CELL start_CELL if italic_i ≠ italic_j end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if italic_i = italic_j end_CELL end_ROW italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = { start_ROW start_CELL 1 end_CELL start_CELL if italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∉ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL 1 / 2 end_CELL start_CELL if italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_CELL end_ROW

Under this construction of the cost functions, each agent 𝐱𝐱\mathbf{x}bold_x will perform recourse if and only if 𝐳isubscript𝐳𝑖\mathbf{z}_{i}bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is revealed, and the disclosure probability p=1𝑝1p=1italic_p = 1. In the case that 𝐳isubscript𝐳𝑖\mathbf{z}_{i}bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is not revealed, the agent will elect to perform manipulation when any 𝐳jsubscript𝐳𝑗\mathbf{z}_{j}bold_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is revealed where ji𝑗𝑖j\neq iitalic_j ≠ italic_i and siSjsubscript𝑠𝑖subscript𝑆𝑗s_{i}\in S_{j}italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. If neither criterion is met, the agent will elect to do nothing (remaining negatively classified). Combining these cases, we see that revealing each 𝐳jsubscript𝐳𝑗\mathbf{z}_{j}bold_z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT causes exactly one agent to perform recourse, namely 𝐱jsubscript𝐱𝑗\mathbf{x}_{j}bold_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, and causes all 𝐱𝐱\mathbf{x}bold_x (with siSjsubscript𝑠𝑖subscript𝑆𝑗s_{i}\in S_{j}italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT) to manipulate. Let I={j1,,jk}𝐼subscript𝑗1subscript𝑗𝑘I=\{j_{1},\ldots,j_{k}\}italic_I = { italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_j start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } be the index set of the revealed features, then the number of agents manipulating is equal to |jISj|ksubscript𝑗𝐼subscript𝑆𝑗𝑘\big{|}\cup_{j\in I}S_{j}\big{|}-k| ∪ start_POSTSUBSCRIPT italic_j ∈ italic_I end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | - italic_k. Therefore providing k𝑘kitalic_k recourse actions to agents while minimizing the number of agents manipulating is equivalent to minimizing |jISj|subscript𝑗𝐼subscript𝑆𝑗\big{|}\cup_{j\in I}S_{j}\big{|}| ∪ start_POSTSUBSCRIPT italic_j ∈ italic_I end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT |.

C.3 Submodularity of the System’s Utility

Theorem 7

The system’s objective function is submodular with respect to the size of the set of revealed features.

Proof 2

Given a revealed set ZXRZsubscriptX𝑅\textbf{Z}\subseteq\textbf{X}_{R}Z ⊆ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT, for agent with feature 𝐱X𝐱subscriptX\mathbf{x}\in\textbf{X}_{-}bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT, let Sm(𝐱,Z):={zZ:cM(𝐱,z)cR(𝐱,𝐳R(𝐱,Z))}assignsubscript𝑆𝑚𝐱Zconditional-set𝑧Zsubscript𝑐𝑀𝐱𝑧subscript𝑐𝑅𝐱subscript𝐳𝑅𝐱ZS_{m}(\mathbf{x},\textbf{Z}):=\{z\in\textbf{Z}:c_{M}(\mathbf{x},z)\leq c_{R}(% \mathbf{x},\mathbf{z}_{R}(\mathbf{x},\textbf{Z}))\}italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , Z ) := { italic_z ∈ Z : italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( bold_x , italic_z ) ≤ italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) ) } be the set of manipulation features that are cheaper than the minimum recourse action 𝐳R(x,Z)subscript𝐳𝑅𝑥Z\mathbf{z}_{R}(x,\textbf{Z})bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( italic_x , Z ) given the revealed set Z. Then the agent will perform recourse if and only if Sm(𝐱,Z)=subscript𝑆𝑚𝐱ZS_{m}(\mathbf{x},\textbf{Z})=\emptysetitalic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , Z ) = ∅. Given the cost function cMsubscript𝑐𝑀c_{M}italic_c start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT and cRsubscript𝑐𝑅c_{R}italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT, the principal can pre-compute each agent’s manipulation set Sm(𝐱,Z)subscript𝑆𝑚𝐱ZS_{m}(\mathbf{x},\textbf{Z})italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , Z ).

The probability for the manipulation set Sm(𝐱,Z)subscript𝑆𝑚𝐱ZS_{m}(\mathbf{x},\textbf{Z})italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , Z ) to overlap with a given revealed set Z is P(𝐱;Z)=ΠzZm(𝐱,Z)(1p)𝑃𝐱ZsubscriptΠ𝑧subscriptZ𝑚𝐱Z1𝑝P(\mathbf{x};\textbf{Z})=\Pi_{z\in\textbf{Z}_{m}(\mathbf{x},\textbf{Z})}(1-p)italic_P ( bold_x ; Z ) = roman_Π start_POSTSUBSCRIPT italic_z ∈ Z start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , Z ) end_POSTSUBSCRIPT ( 1 - italic_p ), where p𝑝pitalic_p is the disclosure probability for any criteria 𝐳𝐳\mathbf{z}bold_z.

The goal for the system is to select a disclosure set ZXRZsubscriptX𝑅\textbf{Z}\subseteq\textbf{X}_{R}Z ⊆ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT to minimize the overlap between Z and Sm(𝐱,Z)subscript𝑆𝑚𝐱ZS_{m}(\mathbf{x},\textbf{Z})italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , Z ) for all agents, namely:

minZXRu(Z,X)subscriptZsubscriptX𝑅𝑢ZsubscriptX\displaystyle\min_{\textbf{Z}\subseteq\textbf{X}_{R}}u(\textbf{Z},\textbf{X}_{% -})roman_min start_POSTSUBSCRIPT Z ⊆ X start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_u ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) :=𝐱X(1P(𝐱;Z))=𝐱X(1ΠzZm(𝐱,Z)(1p))assignabsentsubscript𝐱subscriptX1𝑃𝐱Zsubscript𝐱subscriptX1subscriptΠ𝑧subscriptZ𝑚𝐱Z1𝑝\displaystyle:=\sum_{\mathbf{x}\in\textbf{X}_{-}}\left(1-P(\mathbf{x};\textbf{% Z})\right)=\sum_{\mathbf{x}\in\textbf{X}_{-}}\left(1-\Pi_{z\in\textbf{Z}_{m}(% \mathbf{x},\textbf{Z})}(1-p)\right):= ∑ start_POSTSUBSCRIPT bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 1 - italic_P ( bold_x ; Z ) ) = ∑ start_POSTSUBSCRIPT bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 1 - roman_Π start_POSTSUBSCRIPT italic_z ∈ Z start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , Z ) end_POSTSUBSCRIPT ( 1 - italic_p ) ) (6)

To ease the notation, we use u(Z)𝑢Zu(\textbf{Z})italic_u ( Z ) to shorthand u(Z,X)𝑢ZsubscriptXu(\textbf{Z},\textbf{X}_{-})italic_u ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) since XsubscriptX\textbf{X}_{-}X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT is fixed in our setting. To show that Equation 6 is submodular, it is equivalent to prove that the objective function u(Z,X)𝑢ZsubscriptXu(\textbf{Z},\pazocal{X}_{-})italic_u ( Z , roman_X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) satisfies the diminishing returns property, which means A,BZfor-all𝐴𝐵Z\forall A,B\subseteq\textbf{Z}∀ italic_A , italic_B ⊆ Z with ABZ𝐴𝐵ZA\subseteq B\subseteq\textbf{Z}italic_A ⊆ italic_B ⊆ Z, and any criteria zZ\B𝑧\Z𝐵z\in\textbf{Z}\backslash Bitalic_z ∈ Z \ italic_B, we want to show

u(A{z})u(A)u(B{z})u(B)𝑢𝐴𝑧𝑢𝐴𝑢𝐵𝑧𝑢𝐵\displaystyle u(A\cup\{z\})-u(A)\geq u(B\cup\{z\})-u(B)italic_u ( italic_A ∪ { italic_z } ) - italic_u ( italic_A ) ≥ italic_u ( italic_B ∪ { italic_z } ) - italic_u ( italic_B )

Only four types of agents could potentially contribute to the marginal gain for U𝑈Uitalic_U when the revealed sets are A{z}𝐴𝑧A\cup\{z\}italic_A ∪ { italic_z } v.s. B{z}𝐵𝑧B\cup\{z\}italic_B ∪ { italic_z }:

  1. 1.

    when Sm(𝐱,B{z})=B{z}subscript𝑆𝑚𝐱𝐵𝑧𝐵𝑧S_{m}(\mathbf{x},B\cup\{z\})=B\cup\{z\}italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , italic_B ∪ { italic_z } ) = italic_B ∪ { italic_z }

  2. 2.

    when Sm(𝐱,B{z})=A{z}subscript𝑆𝑚𝐱𝐵𝑧𝐴𝑧S_{m}(\mathbf{x},B\cup\{z\})=A\cup\{z\}italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , italic_B ∪ { italic_z } ) = italic_A ∪ { italic_z }

  3. 3.

    when Sm(𝐱,B{z})={z}subscript𝑆𝑚𝐱𝐵𝑧𝑧S_{m}(\mathbf{x},B\cup\{z\})=\{z\}italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , italic_B ∪ { italic_z } ) = { italic_z }

  4. 4.

    when Sm(𝐱,B{z})=B\A{z}subscript𝑆𝑚𝐱𝐵𝑧\𝐵𝐴𝑧S_{m}(\mathbf{x},B\cup\{z\})=B\backslash A\cup\{z\}italic_S start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x , italic_B ∪ { italic_z } ) = italic_B \ italic_A ∪ { italic_z }

For the first three cases, we can verify that the two marginal gains are the same. For the last case, the two marginal gains are:

u(A+{z})u(A)𝑢𝐴𝑧𝑢𝐴\displaystyle u(A+\{z\})-u(A)italic_u ( italic_A + { italic_z } ) - italic_u ( italic_A ) =[1(1p)]0=pabsentdelimited-[]11𝑝0𝑝\displaystyle=\left[1-(1-p)\right]-0=p= [ 1 - ( 1 - italic_p ) ] - 0 = italic_p
u(B+{z})u(B)𝑢𝐵𝑧𝑢𝐵\displaystyle u(B+\{z\})-u(B)italic_u ( italic_B + { italic_z } ) - italic_u ( italic_B ) =[1Πt{B\A{z}}(1p)][1Πt{B\A}(1p)]absentdelimited-[]1subscriptΠ𝑡\𝐵𝐴𝑧1𝑝delimited-[]1subscriptΠ𝑡\𝐵𝐴1𝑝\displaystyle=[1-\Pi_{t\in\{B\backslash A\cup\{z\}\}}(1-p)]-[1-\Pi_{t\in\{B% \backslash A\}}(1-p)]= [ 1 - roman_Π start_POSTSUBSCRIPT italic_t ∈ { italic_B \ italic_A ∪ { italic_z } } end_POSTSUBSCRIPT ( 1 - italic_p ) ] - [ 1 - roman_Π start_POSTSUBSCRIPT italic_t ∈ { italic_B \ italic_A } end_POSTSUBSCRIPT ( 1 - italic_p ) ]
=p×Πt{B\A}(1p)absent𝑝subscriptΠ𝑡\𝐵𝐴1𝑝\displaystyle=p\times\Pi_{t\in\{B\backslash A\}}(1-p)= italic_p × roman_Π start_POSTSUBSCRIPT italic_t ∈ { italic_B \ italic_A } end_POSTSUBSCRIPT ( 1 - italic_p )
pabsent𝑝\displaystyle\leq p≤ italic_p

Since this holds for all agents, we show that adding a criterion z𝑧zitalic_z to a larger set B𝐵Bitalic_B provides an equal or smaller marginal gain in the objective function compared to adding it to a smaller set A𝐴Aitalic_A, satisfying the diminishing returns property. Therefore, the objective function defined in Equation 6 is submodular.

C.4 Proof for Theorem 1

Proof 3

Notice that only agents 𝐱X(0)𝐱superscript𝑋0\mathbf{x}\in X^{(0)}bold_x ∈ italic_X start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT who are originally negatively classified would request a recourse from the system in the first place, and both the recourse action and the manipulation actions that they are potentially going to take will be positively classified by the system. From the system’s perspective, when the classifier is non-trivial (better than random guessing), all positively classified 𝐱𝐱\mathbf{x}bold_x are more likely to have true label y=1𝑦1y=1italic_y = 1, and all negatively classified 𝐱𝐱\mathbf{x}bold_x are more likely to have true label 00. When an agent with feature 𝐱𝐱\mathbf{x}bold_x takes recourse, the expected system utility change is:

Δ(System’s Utility)(𝐱𝐳R)ΔSystem’s Utility𝐱subscript𝐳𝑅\displaystyle\Delta(\text{System's Utility})(\mathbf{x}\rightarrow\mathbf{z}_{% R})roman_Δ ( System’s Utility ) ( bold_x → bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) =(𝟙[y(𝐳R)=1,f(𝐳R)=1]𝟙[y(𝐳R)=1,f(𝐳R)=1])0absent1delimited-[]formulae-sequence𝑦subscript𝐳𝑅1𝑓subscript𝐳𝑅11delimited-[]formulae-sequence𝑦subscript𝐳𝑅1𝑓subscript𝐳𝑅10\displaystyle=\big{(}\mathbbm{1}[y(\mathbf{z}_{R})=1,f(\mathbf{z}_{R})=1]-% \mathbbm{1}[y(\mathbf{z}_{R})=-1,f(\mathbf{z}_{R})=1]\big{)}-0= ( blackboard_1 [ italic_y ( bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) = 1 , italic_f ( bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) = 1 ] - blackboard_1 [ italic_y ( bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) = - 1 , italic_f ( bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) = 1 ] ) - 0
=2Pr[y(𝐳R)=1|X=𝐳R]10absent2Pr𝑦subscript𝐳𝑅conditional1𝑋subscript𝐳𝑅10\displaystyle=2\Pr[y(\mathbf{z}_{R})=1|X=\mathbf{z}_{R}]-1\geq 0= 2 roman_Pr [ italic_y ( bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) = 1 | italic_X = bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ] - 1 ≥ 0 (f is a non-trivial classifier, and f(𝐳R)=1𝑓subscript𝐳𝑅1f(\mathbf{z}_{R})=1italic_f ( bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) = 1 )

Similarly, when the agent takes manipulation, the expected system utility change is:

Δ(System’s Utility)(𝐱𝐳M)ΔSystem’s Utility𝐱subscript𝐳𝑀\displaystyle\Delta(\text{System's Utility})(\mathbf{x}\rightarrow\mathbf{z}_{% M})roman_Δ ( System’s Utility ) ( bold_x → bold_z start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) =(𝟙[y(𝐱)=1,f(𝐳M)=1]𝟙[y(𝐱)=1,f(𝐳M)=1])0absent1delimited-[]formulae-sequence𝑦𝐱1𝑓subscript𝐳𝑀11delimited-[]formulae-sequence𝑦𝐱1𝑓subscript𝐳𝑀10\displaystyle=\big{(}\mathbbm{1}[y(\mathbf{x})=1,f(\mathbf{z}_{M})=1]-\mathbbm% {1}[y(\mathbf{x})=-1,f(\mathbf{z}_{M})=1]\big{)}-0= ( blackboard_1 [ italic_y ( bold_x ) = 1 , italic_f ( bold_z start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) = 1 ] - blackboard_1 [ italic_y ( bold_x ) = - 1 , italic_f ( bold_z start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) = 1 ] ) - 0
=2Pr[y(𝐱)=1|X=𝐱]10absent2Pr𝑦𝐱conditional1𝑋𝐱10\displaystyle=2\Pr[y(\mathbf{x})=1|X=\mathbf{x}]-1\leq 0= 2 roman_Pr [ italic_y ( bold_x ) = 1 | italic_X = bold_x ] - 1 ≤ 0 (Since f is a non-trivial classifier, and f(𝐱)=0𝑓𝐱0f(\mathbf{x})=0italic_f ( bold_x ) = 0)

When the agent performs do-nothing, the system utility remains the same.

Appendix D Proof for Section 6

We first prove a theorem on the monotonicity of social cost.

Theorem 2 (Monotonicity of Social Cost)

When the recourse cost cR(x,x)subscript𝑐𝑅𝑥superscript𝑥c_{R}(x,x^{\prime})italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) is monotonic in xxnorm𝑥superscript𝑥\|x-x^{\prime}\|∥ italic_x - italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥, and consider a linear threshold classifier. The social cost monotonically decreases in the easiest obtained recourse action.

Proof 4

Consider a 1-dimensional setting, where the system uses a linear threshold classifier f(x)=𝟙[xτ]𝑓𝑥1delimited-[]𝑥𝜏f(x)=\mathbbm{1}[x\geq\tau]italic_f ( italic_x ) = blackboard_1 [ italic_x ≥ italic_τ ]. In this case, the optimal recourse action for any agent is always the minimum recourse action that has been revealed so far, namely zmin=minzZzsubscript𝑧subscript𝑧Z𝑧z_{\min}=\min_{z\in\textbf{Z}}zitalic_z start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT italic_z ∈ Z end_POSTSUBSCRIPT italic_z. Recall the definition of the social cost:

cost(Z,X)=𝐱X(cR(𝐱,𝐳R(𝐱,Z)cR(𝐱,𝐱R)),where𝐳R(𝐱,Z)=argmin𝐳ZcR(𝐱,𝐳)\displaystyle\text{cost}(\textbf{Z},\textbf{X}_{-})=\sum_{\mathbf{x}\in\textbf% {X}_{-}}\left(c_{R}(\mathbf{x},\mathbf{z}_{R}(\mathbf{x},\textbf{Z})-c_{R}(% \mathbf{x},\mathbf{x}_{R})\right),\text{where}~{}\mathbf{z}_{R}(\mathbf{x},% \textbf{Z})=\operatorname*{arg\,min}_{\mathbf{z}\in\textbf{Z}}c_{R}(\mathbf{x}% ,\mathbf{z})cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) - italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) ) , where bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT bold_z ∈ Z end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z )

When the cost function is monotonic in the 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm, e.g., cR(x,x)=wRxxsubscript𝑐𝑅𝑥superscript𝑥subscript𝑤𝑅norm𝑥superscript𝑥c_{R}(x,x^{\prime})=w_{R}\cdot\|x-x^{\prime}\|italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( italic_x , italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ ∥ italic_x - italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥, we have

cR(𝐱,𝐳R(𝐱,Z))subscript𝑐𝑅𝐱subscript𝐳𝑅𝐱Z\displaystyle c_{R}(\mathbf{x},\mathbf{z}_{R}(\mathbf{x},\textbf{Z}))italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) ) =wRx𝐳R(𝐱,Z)=wRminzZxz=wR(minzZzx)absentsubscript𝑤𝑅norm𝑥subscript𝐳𝑅𝐱Zsubscript𝑤𝑅subscript𝑧Znorm𝑥𝑧subscript𝑤𝑅subscript𝑧Z𝑧𝑥\displaystyle=w_{R}\cdot\|x-\mathbf{z}_{R}(\mathbf{x},\textbf{Z})\|=w_{R}\cdot% \min_{z\in\textbf{Z}}\|x-z\|=w_{R}\cdot\left(\min_{z\in\textbf{Z}}z-x\right)= italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ ∥ italic_x - bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) ∥ = italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ roman_min start_POSTSUBSCRIPT italic_z ∈ Z end_POSTSUBSCRIPT ∥ italic_x - italic_z ∥ = italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ ( roman_min start_POSTSUBSCRIPT italic_z ∈ Z end_POSTSUBSCRIPT italic_z - italic_x )
cR(𝐱,𝐱R)subscript𝑐𝑅𝐱subscript𝐱𝑅\displaystyle c_{R}(\mathbf{x},\mathbf{x}_{R})italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) =wR𝐱𝐱R=wR𝐱τ=wR(τx)absentsubscript𝑤𝑅norm𝐱subscript𝐱𝑅subscript𝑤𝑅norm𝐱𝜏subscript𝑤𝑅𝜏𝑥\displaystyle=w_{R}\cdot\|\mathbf{x}-\mathbf{x}_{R}\|=w_{R}\cdot\|\mathbf{x}-% \tau\|=w_{R}\cdot\left(\tau-x\right)= italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ ∥ bold_x - bold_x start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ∥ = italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ ∥ bold_x - italic_τ ∥ = italic_w start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ⋅ ( italic_τ - italic_x )

Thus,

cost(Z,X)costZsubscriptX\displaystyle\text{cost}(\textbf{Z},\textbf{X}_{-})cost ( Z , X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT ) =𝐱X(cR(𝐱,𝐳R(𝐱,Z))cR(𝐱,𝐳R))absentsubscript𝐱subscriptXsubscript𝑐𝑅𝐱subscript𝐳𝑅𝐱Zsubscript𝑐𝑅𝐱subscript𝐳𝑅\displaystyle=\sum_{\mathbf{x}\in\textbf{X}_{-}}\left(c_{R}(\mathbf{x},\mathbf% {z}_{R}(\mathbf{x},\textbf{Z}))-c_{R}(\mathbf{x},\mathbf{z}_{R})\right)= ∑ start_POSTSUBSCRIPT bold_x ∈ X start_POSTSUBSCRIPT - end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , Z ) ) - italic_c start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ( bold_x , bold_z start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT ) )
=𝐱X[wR