-
Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
S. Afanasiev,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
H. Al-Bataineh,
J. Alexander,
M. Alfred,
K. Aoki,
N. Apadula,
L. Aphecetche,
J. Asai,
H. Asano,
E. T. Atomssa,
R. Averbeck,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
G. Baksay,
L. Baksay,
A. Baldisseri
, et al. (510 additional authors not shown)
Abstract:
High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is observed in the yield of high-momentum jet fragments opposite the trigger particle, which indicates jet suppression stemming from in-medium partonic energy loss, while enhancement is observed for low-momentum particles. The ratio and differences between the yield in Au$+$Au collisions and $p$$+$$p$ collisions, $I_{AA}$ and $Δ_{AA}$, as a function of the trigger-hadron azimuthal separation, $Δφ$, are measured for the first time at the Relativistic Heavy Ion Collider. These results better quantify how the yield of low-$p_T$ associated hadrons is enhanced at wide angle, which is crucial for studying energy loss as well as medium-response effects.
Submitted 12 June, 2024;
originally announced June 2024.
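For reference, the two observables named in the abstract above can be written out as follows. This is a sketch based only on the abstract's wording: the symbol $Y$ for the associated-hadron yield is an assumed notation, and the paper may define the yields with additional kinematic binning.

```latex
I_{AA}(\Delta\phi) = \frac{Y_{\mathrm{Au+Au}}(\Delta\phi)}{Y_{p+p}(\Delta\phi)},
\qquad
\Delta_{AA}(\Delta\phi) = Y_{\mathrm{Au+Au}}(\Delta\phi) - Y_{p+p}(\Delta\phi)
```

Under these definitions, $I_{AA}<1$ (equivalently $\Delta_{AA}<0$) corresponds to suppression relative to $p$$+$$p$ collisions, and $I_{AA}>1$ (equivalently $\Delta_{AA}>0$) to enhancement.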
-
Neyman Meets Causal Machine Learning: Experimental Evaluation of Individualized Treatment Rules
Authors:
Michael Lingzhi Li,
Kosuke Imai
Abstract:
A century ago, Neyman showed how to evaluate the efficacy of treatment using a randomized experiment under a minimal set of assumptions. This classical repeated sampling framework serves as a basis of routine experimental analyses conducted by today's scientists across disciplines. In this paper, we demonstrate that Neyman's methodology can also be used to experimentally evaluate the efficacy of individualized treatment rules (ITRs), which are derived by modern causal machine learning algorithms. In particular, we show how to account for additional uncertainty resulting from a training process based on cross-fitting. The primary advantage of Neyman's approach is that it can be applied to any ITR regardless of the properties of machine learning algorithms that are used to derive the ITR. We also show, somewhat surprisingly, that for certain metrics, it is more efficient to conduct this ex-post experimental evaluation of an ITR than to conduct an ex-ante experimental evaluation that randomly assigns some units to the ITR. Our analysis demonstrates that Neyman's repeated sampling framework is as relevant for causal inference today as it has been since its inception.
Submitted 25 April, 2024;
originally announced April 2024.
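As background for the abstract above, Neyman's classical analysis of a completely randomized experiment can be sketched in a few lines. This is the textbook difference-in-means estimator with its conservative variance estimate, not the paper's ITR evaluation estimator; the function name `neyman_estimate` and the toy outcomes are hypothetical.

```python
from statistics import mean, variance


def neyman_estimate(treated, control):
    """Return (difference in means, conservative variance estimate).

    treated, control: observed outcomes of the randomly assigned
    treatment and control groups (hypothetical inputs).
    """
    n1, n0 = len(treated), len(control)
    if n1 < 2 or n0 < 2:
        raise ValueError("each group needs at least two units")
    # Unbiased estimate of the average treatment effect under randomization.
    tau_hat = mean(treated) - mean(control)
    # Sum of within-group sample variances scaled by group size;
    # conservative because the unit-level effect variance is not identified.
    var_hat = variance(treated) / n1 + variance(control) / n0
    return tau_hat, var_hat


# Toy data, made up for illustration only.
tau_hat, var_hat = neyman_estimate([3, 5, 7], [1, 2, 3])
```

The variance estimate relies only on the randomization of treatment, which is the sense in which the framework needs minimal assumptions.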
-
Does AI help humans make better decisions? A methodological framework for experimental evaluation
Authors:
Eli Ben-Michael,
D. James Greiner,
Melody Huang,
Kosuke Imai,
Zhichao Jiang,
Sooahn Shin
Abstract:
The use of Artificial Intelligence (AI) based on data-driven algorithms has become ubiquitous in today's society. Yet, in many cases and especially when stakes are high, humans still make final decisions. The critical question, therefore, is whether AI helps humans make better decisions as compared to a human alone or AI alone. We introduce a new methodological framework that can be used to answer this question experimentally with no additional assumptions. We measure a decision maker's ability to make correct decisions using standard classification metrics based on the baseline potential outcome. We consider a single-blinded experimental design, in which the provision of AI-generated recommendations is randomized across cases with a human making final decisions. Under this experimental design, we show how to compare the performance of three alternative decision-making systems--human-alone, human-with-AI, and AI-alone. We apply the proposed methodology to the data from our own randomized controlled trial of a pretrial risk assessment instrument. We find that AI recommendations do not improve the classification accuracy of a judge's decision to impose cash bail. Our analysis also shows that AI-alone decisions generally perform worse than human decisions with or without AI assistance. Finally, AI recommendations tend to impose cash bail on non-white arrestees more often than necessary when compared to white arrestees.
Submitted 17 March, 2024;
originally announced March 2024.
-
The Cram Method for Efficient Simultaneous Learning and Evaluation
Authors:
Zeyang Jia,
Kosuke Imai,
Michael Lingzhi Li
Abstract:
We introduce the "cram" method, a general and efficient approach to simultaneous learning and evaluation using a generic machine learning (ML) algorithm. In a single pass of batched data, the proposed method repeatedly trains an ML algorithm and tests its empirical performance. Because it utilizes the entire sample for both learning and evaluation, cramming is significantly more data-efficient than sample-splitting. The cram method also naturally accommodates online learning algorithms, making its implementation computationally efficient. To demonstrate the power of the cram method, we consider the standard policy learning setting where cramming is applied to the same data to both develop an individualized treatment rule (ITR) and estimate the average outcome that would result if the learned ITR were to be deployed. We show that under a minimal set of assumptions, the resulting crammed evaluation estimator is consistent and asymptotically normal. While our asymptotic results require a relatively weak stabilization condition on the ML algorithm, we develop a simple, generic method that can be used with any policy learning algorithm to satisfy this condition. Our extensive simulation studies show that, when compared to sample-splitting, cramming reduces the evaluation standard error by more than 40% while improving the performance of the learned policy. We also apply the cram method to a randomized clinical trial to demonstrate its applicability to real-world problems. Finally, we briefly discuss future extensions of the cram method to other learning and evaluation settings.
Submitted 11 March, 2024;
originally announced March 2024.
-
Identified charged-hadron production in $p$$+$Al, $^3$He$+$Au, and Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and in U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
M. Alfred,
V. Andrieux,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
V. Baublis
, et al. (456 additional authors not shown)
Abstract:
The PHENIX experiment has performed a systematic study of identified charged-hadron ($π^\pm$, $K^\pm$, $p$, $\bar{p}$) production at midrapidity in $p$$+$Al, $^3$He$+$Au, and Cu$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U collisions at $\sqrt{s_{_{NN}}}=193$ GeV. Identified charged-hadron invariant transverse-momentum ($p_T$) and transverse-mass ($m_T$) spectra are presented and interpreted in terms of radially expanding thermalized systems. The particle ratios of $K/π$ and $p/π$ have been measured in different centrality ranges of large (Cu$+$Au, U$+$U) and small ($p$$+$Al, $^3$He$+$Au) collision systems. The values of $K/π$ ratios measured in all considered collision systems were found to be consistent with those measured in $p$$+$$p$ collisions. However, the $p/π$ ratios measured in large collision systems reach values of $\approx0.6$, which is $\approx2$ times larger than in $p$$+$$p$ collisions. These results can be qualitatively understood in terms of the baryon enhancement expected from hadronization by recombination. Identified charged-hadron nuclear-modification factors ($R_{AB}$) are also presented. Enhancement of proton $R_{AB}$ values over meson $R_{AB}$ values was observed in central $^3$He$+$Au, Cu$+$Au, and U$+$U collisions. The proton $R_{AB}$ values measured in the $p$$+$Al collision system were found to be consistent with $R_{AB}$ values of $φ$, $π^\pm$, $K^\pm$, and $π^0$ mesons, which may indicate that the size of the system produced in $p$$+$Al collisions is too small for recombination to cause a noticeable increase in proton production.
Submitted 22 May, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Design-based inference for generalized network experiments with stochastic interventions
Authors:
Ambarish Chattopadhyay,
Kosuke Imai,
Jose R. Zubizarreta
Abstract:
A growing number of scholars and data scientists are conducting randomized experiments to analyze causal relationships in network settings where units influence one another. A dominant methodology for analyzing these network experiments has been design-based, leveraging randomization of treatment assignment as the basis for inference. In this paper, we generalize this design-based approach so that it can be applied to more complex experiments with a variety of causal estimands with different target populations. An important special case of such generalized network experiments is a bipartite network experiment, in which the treatment assignment is randomized among one set of units and the outcome is measured for a separate set of units. We propose a broad class of causal estimands based on stochastic intervention for generalized network experiments. Using a design-based approach, we show how to estimate the proposed causal quantities without bias, and develop conservative variance estimators. We apply our methodology to a randomized experiment in education where a group of selected students in middle schools are eligible for the anti-conflict promotion program, and the program participation is randomized within this group. In particular, our analysis estimates the causal effects of treating each student or his/her close friends, for different target populations in the network. We find that while the treatment improves the overall awareness against conflict among students, it does not significantly reduce the total number of conflicts.
Submitted 5 December, 2023;
originally announced December 2023.
-
Individualized Policy Evaluation and Learning under Clustered Network Interference
Authors:
Yi Zhang,
Kosuke Imai
Abstract:
While there now exists a large literature on policy evaluation and learning, much of prior work assumes that the treatment assignment of one unit does not affect the outcome of another unit. Unfortunately, ignoring interference may lead to biased policy evaluation and ineffective learned policies. For example, treating influential individuals who have many friends can generate positive spillover effects, thereby improving the overall performance of an individualized treatment rule (ITR). We consider the problem of evaluating and learning an optimal ITR under clustered network interference (also known as partial interference) where clusters of units are sampled from a population and units may influence one another within each cluster. Unlike previous methods that impose strong restrictions on spillover effects, the proposed methodology only assumes a semiparametric structural model where each unit's outcome is an additive function of individual treatments within the cluster. Under this model, we propose an estimator that can be used to evaluate the empirical performance of an ITR. We show that this estimator is substantially more efficient than the standard inverse probability weighting estimator, which does not impose any assumption about spillover effects. We derive the finite-sample regret bound for a learned ITR, showing that the use of our efficient evaluation estimator leads to the improved performance of learned policies. Finally, we conduct simulation and empirical studies to illustrate the advantages of the proposed methodology.
Submitted 4 February, 2024; v1 submitted 4 November, 2023;
originally announced November 2023.
-
Statistical Performance Guarantee for Subgroup Identification with Generic Machine Learning
Authors:
Michael Lingzhi Li,
Kosuke Imai
Abstract:
Across a wide array of disciplines, many researchers use machine learning (ML) algorithms to identify a subgroup of individuals who are likely to benefit from a treatment the most (``exceptional responders'') or those who are harmed by it. A common approach to this subgroup identification problem consists of two steps. First, researchers estimate the conditional average treatment effect (CATE) using an ML algorithm. Next, they use the estimated CATE to select those individuals who are predicted to be most affected by the treatment, either positively or negatively. Unfortunately, CATE estimates are often biased and noisy. In addition, utilizing the same data to both identify a subgroup and estimate its group average treatment effect results in a multiple testing problem. To address these challenges, we develop uniform confidence bands for estimation of the group average treatment effect sorted by generic ML algorithm (GATES). Using these uniform confidence bands, researchers can identify, with a statistical guarantee, a subgroup whose GATES exceeds a certain effect size, regardless of how this effect size is chosen. The validity of the proposed methodology depends solely on randomization of treatment and random sampling of units. Importantly, our method does not require modeling assumptions and avoids a computationally intensive resampling procedure. A simulation study shows that the proposed uniform confidence bands are reasonably informative and have an appropriate empirical coverage even when the sample size is as small as 100. We analyze a clinical trial of late-stage prostate cancer and find a relatively large proportion of exceptional responders.
Submitted 20 December, 2023; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Bayesian Safe Policy Learning with Chance Constrained Optimization: Application to Military Security Assessment during the Vietnam War
Authors:
Zeyang Jia,
Eli Ben-Michael,
Kosuke Imai
Abstract:
Algorithmic decisions and recommendations are used in many high-stakes decision-making settings such as criminal justice, medicine, and public policy. We investigate whether it would have been possible to improve a security assessment algorithm employed during the Vietnam War, using outcomes measured immediately after its introduction in late 1969. This empirical application raises several methodological challenges that frequently arise in high-stakes algorithmic decision-making. First, before implementing a new algorithm, it is essential to characterize and control the risk of yielding worse outcomes than the existing algorithm. Second, the existing algorithm is deterministic, and learning a new algorithm requires transparent extrapolation. Third, the existing algorithm involves discrete decision tables that are difficult to optimize over.
To address these challenges, we introduce the Average Conditional Risk (ACRisk), which first quantifies the risk that a new algorithmic policy leads to worse outcomes for subgroups of individual units and then averages this over the distribution of subgroups. We also propose a Bayesian policy learning framework that maximizes the posterior expected value while controlling the posterior expected ACRisk. This framework separates the estimation of heterogeneous treatment effects from policy optimization, enabling flexible estimation of effects and optimization over complex policy classes. We characterize the resulting chance-constrained optimization problem as a constrained linear programming problem. Our analysis shows that compared to the actual algorithm used during the Vietnam War, the learned algorithm assesses most regions as more secure and emphasizes economic and political factors over military factors.
Submitted 27 May, 2024; v1 submitted 17 July, 2023;
originally announced July 2023.
-
Evaluating Bias and Noise Induced by the U.S. Census Bureau's Privacy Protection Methods
Authors:
Christopher T. Kenny,
Cory McCartan,
Shiro Kuriwaki,
Tyler Simko,
Kosuke Imai
Abstract:
The United States Census Bureau faces a difficult trade-off between the accuracy of Census statistics and the protection of individual information. We conduct the first independent evaluation of bias and noise induced by the Bureau's two main disclosure avoidance systems: the TopDown algorithm employed for the 2020 Census and the swapping algorithm implemented for the three previous Censuses. Our evaluation leverages the Noisy Measurement File (NMF) as well as two independent runs of the TopDown algorithm applied to the 2010 decennial Census. We find that the NMF contains too much noise to be directly useful, especially for Hispanic and multiracial populations. TopDown's post-processing dramatically reduces the NMF noise and produces data whose accuracy is similar to that of swapping. While the estimated errors for both TopDown and swapping algorithms are generally no greater than other sources of Census error, they can be relatively substantial for geographies with small total populations.
Submitted 10 February, 2024; v1 submitted 12 June, 2023;
originally announced June 2023.
-
Priming bias versus post-treatment bias in experimental designs
Authors:
Matthew Blackwell,
Jacob R. Brown,
Sophie Hill,
Kosuke Imai,
Teppei Yamamoto
Abstract:
Conditioning on variables affected by treatment can induce post-treatment bias when estimating causal effects. Although this suggests that researchers should measure potential moderators before administering the treatment in an experiment, doing so may also bias causal effect estimation if the covariate measurement primes respondents to react differently to the treatment. This paper formally analyzes this trade-off between post-treatment and priming biases in three experimental designs that vary when moderators are measured: pre-treatment, post-treatment, or a randomized choice between the two. We derive nonparametric bounds for interactions between the treatment and the moderator under each design and show how to use substantive assumptions to narrow these bounds. These bounds allow researchers to assess the sensitivity of their empirical findings to either source of bias. We then apply the proposed methodology to a survey experiment on electoral messaging.
Submitted 28 June, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
Making Differential Privacy Work for Census Data Users
Authors:
Cory McCartan,
Tyler Simko,
Kosuke Imai
Abstract:
The U.S. Census Bureau collects and publishes detailed demographic data about Americans which are heavily used by researchers and policymakers. The Bureau has recently adopted the framework of differential privacy in an effort to improve confidentiality of individual census responses. A key output of this privacy protection system is the Noisy Measurement File (NMF), which is produced by adding random noise to tabulated statistics. The NMF is critical to understanding any errors introduced in the data, and performing valid statistical inference on published census data. Unfortunately, the current release format of the NMF is difficult to access and work with. We describe the process we use to transform the NMF into a usable format, and provide recommendations to the Bureau for how to release future versions of the NMF. These changes are essential for ensuring transparency of privacy measures and reproducibility of scientific research built on census data.
Submitted 7 October, 2023; v1 submitted 11 May, 2023;
originally announced May 2023.
-
A Statistical Model of Bipartite Networks: Application to Cosponsorship in the United States Senate
Authors:
Adeline Lo,
Santiago Olivella,
Kosuke Imai
Abstract:
Many networks in political and social research are bipartite, with edges connecting exclusively across two distinct types of nodes. A common example includes cosponsorship networks, in which legislators are connected indirectly through the bills they support. Yet most existing network models are designed for unipartite networks, where edges can arise between any pair of nodes. However, using a unipartite network model to analyze bipartite networks, as often done in practice, can result in aggregation bias and artificially high clustering -- a particularly insidious problem when studying the role groups play in network formation. To address these methodological problems, we develop a statistical model of bipartite networks theorized to be generated through group interactions by extending the popular mixed-membership stochastic blockmodel. Our model allows researchers to identify the groups of nodes, within each node type in the bipartite structure, that share common patterns of edge formation. The model also incorporates both node and dyad-level covariates as the predictors of group membership and of observed dyadic relations. We develop an efficient computational algorithm for fitting the model, and apply it to cosponsorship data from the United States Senate. We show that legislators in a Senate that was perfectly split along party lines were able to remain productive and pass major legislation by forming non-partisan, power-brokering coalitions that found common ground through their collaboration on low-stakes bills. We also find evidence for norms of reciprocity, and uncover the substantial role played by policy expertise in the formation of cosponsorships between senators and legislation. We make an open-source software package available that makes it possible for other researchers to uncover similar insights from bipartite networks.
Submitted 27 June, 2024; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Transverse single-spin asymmetry of charged hadrons at forward and backward rapidity in polarized $p$+$p$, $p$+Al, and $p$+Au collisions at $\sqrt{s_{NN}}=200$ GeV
Authors:
N. J. Abdulameer,
U. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj
, et al. (297 additional authors not shown)
Abstract:
Reported here are transverse single-spin asymmetries ($A_{N}$) in the production of charged hadrons as a function of transverse momentum ($p_T$) and Feynman-$x$ ($x_F$) in polarized $p^{\uparrow}$+$p$, $p^{\uparrow}$+Al, and $p^{\uparrow}$+Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV. The measurements have been performed at forward and backward rapidity ($1.4<|η|<2.4$) over the range of $1.5<p_{T}<7.0~{\rm GeV}/c$ and $0.04<|x_{F}|<0.2$. A nonzero asymmetry is observed for positively charged hadrons at forward rapidity ($x_F>0$) in $p^{\uparrow}$+$p$ collisions, whereas the $p^{\uparrow}$+Al and $p^{\uparrow}$+Au results show smaller asymmetries. This finding provides new opportunities to investigate the origin of transverse single-spin asymmetries and a tool to study nuclear effects in $p$+$A$ collisions.
Submitted 31 October, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Transverse single-spin asymmetry of midrapidity $π^{0}$ and $η$ mesons in $p$+Au and $p$+Al collisions at $\sqrt{s_{_{NN}}}=$ 200 GeV
Authors:
N. J. Abdulameer,
U. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj
, et al. (297 additional authors not shown)
Abstract:
Presented are the first measurements of the transverse single-spin asymmetries ($A_N$) for neutral pions and eta mesons in $p$+Au and $p$+Al collisions at $\sqrt{s_{_{NN}}}=200$ GeV in the pseudorapidity range $|η|<$0.35 with the PHENIX detector at the Relativistic Heavy Ion Collider. The asymmetries are consistent with zero, similar to those for midrapidity neutral pions and eta mesons produced in $p$+$p$ collisions. These measurements show no evidence of additional effects that could potentially arise from the more complex partonic environment present in proton-nucleus collisions.
Submitted 6 June, 2023; v1 submitted 13 March, 2023;
originally announced March 2023.
-
Estimating Racial Disparities When Race is Not Observed
Authors:
Cory McCartan,
Robin Fisher,
Jacob Goldin,
Daniel E. Ho,
Kosuke Imai
Abstract:
The estimation of racial disparities in various fields is often hampered by the lack of individual-level racial information. In many cases, the law prohibits the collection of such information to prevent direct racial discrimination. As a result, analysts have frequently adopted Bayesian Improved Surname Geocoding (BISG) and its variants, which combine individual names and addresses with Census data to predict race. Unfortunately, the residuals of BISG are often correlated with the outcomes of interest, generally attenuating estimates of racial disparities. To correct this bias, we propose an alternative identification strategy under the assumption that surname is conditionally independent of the outcome given (unobserved) race, residence location, and other observed characteristics. We introduce a new class of models, Bayesian Instrumental Regression for Disparity Estimation (BIRDiE), that take BISG probabilities as inputs and produce racial disparity estimates by using surnames as an instrumental variable for race. Our estimation method is scalable, making it possible to analyze large-scale administrative data. We also show how to address potential violations of the key identification assumptions. A validation study based on the North Carolina voter file shows that BISG+BIRDiE reduces error by up to 84% when estimating racial differences in party registration. Finally, we apply the proposed methodology to estimate racial differences in who benefits from the home mortgage interest deduction using individual-level tax data from the U.S. Internal Revenue Service. Open-source software is available which implements the proposed methodology.
Submitted 16 April, 2024; v1 submitted 4 March, 2023;
originally announced March 2023.
-
Comment: The Essential Role of Policy Evaluation for the 2020 Census Disclosure Avoidance System
Authors:
Christopher T. Kenny,
Shiro Kuriwaki,
Cory McCartan,
Evan T. R. Rosenman,
Tyler Simko,
Kosuke Imai
Abstract:
In "Differential Perspectives: Epistemic Disconnects Surrounding the US Census Bureau's Use of Differential Privacy," boyd and Sarathy argue that empirical evaluations of the Census Disclosure Avoidance System (DAS), including our published analysis, failed to recognize how the benchmark data against which the 2020 DAS was evaluated is never a ground truth of population counts. In this commentary, we explain why policy evaluation, which was the main goal of our analysis, is still meaningful without access to a perfect ground truth. We also point out that our evaluation leveraged features specific to the decennial Census and redistricting data, such as block-level population invariance under swapping and voter file racial identification, better approximating a comparison with the ground truth. Lastly, we show that accurate statistical predictions of individual race based on the Bayesian Improved Surname Geocoding, while not a violation of differential privacy, substantially increase the disclosure risk of private information the Census Bureau sought to protect. We conclude by arguing that policy makers must confront a key trade-off between data utility and privacy protection, and an epistemic disconnect alone is insufficient to explain disagreements between policy choices.
Submitted 15 October, 2022;
originally announced October 2022.
-
Distributionally Robust Causal Inference with Observational Data
Authors:
Dimitris Bertsimas,
Kosuke Imai,
Michael Lingzhi Li
Abstract:
We consider the estimation of average treatment effects in observational studies and propose a new framework of robust causal inference with unobserved confounders. Our approach is based on distributionally robust optimization and proceeds in two steps. We first specify the maximal degree to which the distribution of unobserved potential outcomes may deviate from that of observed outcomes. We then derive sharp bounds on the average treatment effects under this assumption. Our framework encompasses the popular marginal sensitivity model as a special case, and we demonstrate how the proposed methodology can address a primary challenge of the marginal sensitivity model that it produces uninformative results when unobserved confounders substantially affect treatment and outcome. Specifically, we develop an alternative sensitivity model, called the distributional sensitivity model, under the assumption that heterogeneity of treatment effect due to unobserved variables is relatively small. Unlike the marginal sensitivity model, the distributional sensitivity model allows for potential lack of overlap and often produces informative bounds even when unobserved variables substantially affect both treatment and outcome. Finally, we show how to extend the distributional sensitivity model to difference-in-differences designs and settings with instrumental variables. Through simulation and empirical studies, we demonstrate the applicability of the proposed methodology.
Submitted 2 February, 2023; v1 submitted 15 October, 2022;
originally announced October 2022.
-
Safe Policy Learning under Regression Discontinuity Designs with Multiple Cutoffs
Authors:
Yi Zhang,
Eli Ben-Michael,
Kosuke Imai
Abstract:
The regression discontinuity (RD) design is widely used for program evaluation with observational data. The primary focus of the existing literature has been the estimation of the local average treatment effect at the existing treatment cutoff. In contrast, we consider policy learning under the RD design. Because the treatment assignment mechanism is deterministic, learning better treatment cutoffs requires extrapolation. We develop a robust optimization approach to finding optimal treatment cutoffs that improve upon the existing ones. We first decompose the expected utility into point-identifiable and unidentifiable components. We then propose an efficient doubly-robust estimator for the identifiable parts. To account for the unidentifiable components, we leverage the existence of multiple cutoffs that are common under the RD design. Specifically, we assume that the heterogeneity in the conditional expectations of potential outcomes across different groups varies smoothly along the running variable. Under this assumption, we minimize the worst-case utility loss relative to the status quo policy. The resulting new treatment cutoffs have a safety guarantee that they will not yield a worse overall outcome than the existing cutoffs. Finally, we establish the asymptotic regret bounds for the learned policy using semi-parametric efficiency theory. We apply the proposed methodology to empirical and simulated data sets.
Submitted 8 July, 2023; v1 submitted 28 August, 2022;
originally announced August 2022.
-
Race and ethnicity data for first, middle, and last names
Authors:
Evan T. R. Rosenman,
Santiago Olivella,
Kosuke Imai
Abstract:
We provide the largest compiled publicly available dictionaries of first, middle, and last names for the purpose of imputing race and ethnicity using, for example, Bayesian Improved Surname Geocoding (BISG). The dictionaries are based on the voter files of six Southern states that collect self-reported racial data upon voter registration. Our data cover a much larger scope of names than any comparable dataset, containing roughly one million first names, 1.1 million middle names, and 1.4 million surnames. Individuals are categorized into five mutually exclusive racial and ethnic groups -- White, Black, Hispanic, Asian, and Other -- and racial/ethnic counts by name are provided for every name in each dictionary. Counts can then be normalized row-wise or column-wise to obtain conditional probabilities of race given name or name given race. These conditional probabilities can then be deployed for imputation in a data analytic task for which ground truth racial and ethnic data is not available.
Submitted 26 August, 2022;
originally announced August 2022.
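The row-wise and column-wise normalization mentioned in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the names, the counts, and the helper functions `race_given_name` and `name_given_race` are hypothetical and are not taken from the published dictionaries or any accompanying software.

```python
# Five groups as listed in the abstract; the counts below are made up.
GROUPS = ["White", "Black", "Hispanic", "Asian", "Other"]

# Hypothetical name-by-group counts: one row per name, one column per group.
counts = {
    "name_a": [80, 10, 5, 3, 2],
    "name_b": [5, 5, 85, 3, 2],
}


def race_given_name(table):
    """Row-wise normalization: each row becomes P(race | name)."""
    result = {}
    for name, row in table.items():
        total = sum(row)
        # A name with no recorded individuals gets all-zero probabilities.
        result[name] = [c / total if total else 0.0 for c in row]
    return result


def name_given_race(table):
    """Column-wise normalization: each column becomes P(name | race)."""
    n_groups = len(GROUPS)
    col_totals = [sum(row[j] for row in table.values()) for j in range(n_groups)]
    return {
        name: [row[j] / col_totals[j] if col_totals[j] else 0.0
               for j in range(n_groups)]
        for name, row in table.items()
    }


p_race = race_given_name(counts)
p_name = name_given_race(counts)
for name in counts:
    print(name, [round(p, 3) for p in p_race[name]])
```

Each row of `p_race` sums to one across groups, while each column of `p_name` sums to one across names; which of the two is needed depends on how the imputation model is written.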
-
Widespread Partisan Gerrymandering Mostly Cancels Nationally, but Reduces Electoral Competition
Authors:
Christopher T. Kenny,
Cory McCartan,
Tyler Simko,
Shiro Kuriwaki,
Kosuke Imai
Abstract:
Congressional district lines in many U.S. states are drawn by partisan actors, raising concerns about gerrymandering. To separate the partisan effects of redistricting from the effects of other factors including geography and redistricting rules, we compare possible party compositions of the U.S. House under the enacted plan to those under a set of alternative simulated plans that serve as a non-partisan baseline. We find that partisan gerrymandering is widespread in the 2020 redistricting cycle, but most of the electoral bias it creates cancels at the national level, giving Republicans two additional seats on average. Geography and redistricting rules separately contribute a moderate pro-Republican bias. Finally, we find that partisan gerrymandering reduces electoral competition and makes the partisan composition of the U.S. House less responsive to shifts in the national vote.
Submitted 13 April, 2023; v1 submitted 14 August, 2022;
originally announced August 2022.
-
Measurement of $φ$-meson production in Cu$+$Au at $\sqrt{s_{_{NN}}}=200$ GeV and U$+$U at $\sqrt{s_{_{NN}}}=193$ GeV
Authors:
N. J. Abdulameer,
U. Acharya,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
J. Alexander,
M. Alfred,
M. Alibordi,
K. Aoki,
N. Apadula,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
X. Bai,
B. Bannier,
K. N. Barish,
S. Bathe,
V. Baublis,
C. Baumann,
S. Baumgart,
A. Bazilevsky
, et al. (387 additional authors not shown)
Abstract:
The PHENIX experiment reports systematic measurements at the Relativistic Heavy Ion Collider of $φ$-meson production in asymmetric Cu$+$Au collisions at $\sqrt{s_{_{NN}}}$=200 GeV and in U$+$U collisions at $\sqrt{s_{_{NN}}}$=193 GeV. Measurements were performed via the $φ\rightarrow K^{+}K^{-}$ decay channel at midrapidity $|η|<0.35$. Features of $φ$-meson production measured in Cu$+$Cu, Cu$+$Au, Au$+$Au, and U$+$U collisions were found to not depend on the collision geometry, which was expected because the yields are averaged over the azimuthal angle and follow the expected scaling with nuclear-overlap size. The elliptic flow of the $φ$ meson in Cu$+$Au, Au$+$Au, and U$+$U collisions scales with second-order-participant eccentricity and the length scale of the nuclear-overlap region (estimated with the number of participating nucleons). At moderate $p_T$, $φ$-meson production measured in Cu$+$Au and U$+$U collisions is consistent with coalescence-model predictions, whereas at high $p_T$ the production is in agreement with expectations for in-medium energy loss of parent partons prior to their fragmentation. The elliptic flow for $φ$ mesons measured in Cu$+$Au and U$+$U collisions is well described by a (2+1)D viscous-hydrodynamic model with specific-shear viscosity $η/s=1/4π$.
Submitted 13 January, 2023; v1 submitted 21 July, 2022;
originally announced July 2022.
-
Simulated redistricting plans for the analysis and evaluation of redistricting in the United States
Authors:
Cory McCartan,
Christopher T. Kenny,
Tyler Simko,
George Garcia III,
Kevin Wang,
Melissa Wu,
Shiro Kuriwaki,
Kosuke Imai
Abstract:
This article introduces the 50stateSimulations, a collection of simulated congressional districting plans and underlying code developed by the Algorithm-Assisted Redistricting Methodology (ALARM) Project. The 50stateSimulations allow for the evaluation of enacted and other congressional redistricting plans in the United States. While the use of redistricting simulation algorithms has become standard in academic research and court cases, any simulation analysis requires non-trivial efforts to combine multiple data sets, identify state-specific redistricting criteria, implement complex simulation algorithms, and summarize and visualize simulation outputs. We have developed a complete workflow that facilitates this entire process of simulation-based redistricting analysis for the congressional districts of all 50 states. The resulting 50stateSimulations include ensembles of simulated 2020 congressional redistricting plans and necessary replication data. We also provide the underlying code, which serves as a template for customized analyses. All data and code are free and publicly available. This article details the design, creation, and validation of the data.
Submitted 20 October, 2022; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Policy Learning with Asymmetric Counterfactual Utilities
Authors:
Eli Ben-Michael,
Kosuke Imai,
Zhichao Jiang
Abstract:
Data-driven decision making plays an important role even in high stakes settings like medicine and public policy. Learning optimal policies from observed data requires a careful formulation of the utility function whose expected value is maximized across a population. Although researchers typically use utilities that depend on observed outcomes alone, in many settings the decision maker's utility function is more properly characterized by the joint set of potential outcomes under all actions. For example, the Hippocratic principle to "do no harm" implies that the cost of causing death to a patient who would otherwise survive without treatment is greater than the cost of forgoing life-saving treatment. We consider optimal policy learning with asymmetric counterfactual utility functions of this form that consider the joint set of potential outcomes. We show that asymmetric counterfactual utilities lead to an unidentifiable expected utility function, and so we first partially identify it. Drawing on statistical decision theory, we then derive minimax decision rules by minimizing the maximum expected utility loss relative to different alternative policies. We show that one can learn minimax loss decision rules from observed data by solving intermediate classification problems, and establish that the finite sample excess expected utility loss of this procedure is bounded by the regret of these intermediate classifiers. We apply this conceptual framework and methodology to the decision about whether or not to use right heart catheterization for patients with possible pulmonary hypertension.
Submitted 28 November, 2023; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Addressing Census data problems in race imputation via fully Bayesian Improved Surname Geocoding and name supplements
Authors:
Kosuke Imai,
Santiago Olivella,
Evan T. R. Rosenman
Abstract:
Prediction of an individual's race and ethnicity plays an important role in social science and public health research. Examples include studies of racial disparity in health and voting. Recently, Bayesian Improved Surname Geocoding (BISG), which uses Bayes' rule to combine information from Census surname files with the geocoding of an individual's residence, has emerged as a leading methodology for this prediction task. Unfortunately, BISG suffers from two Census data problems that contribute to unsatisfactory predictive performance for minorities. First, the decennial Census often contains zero counts for minority racial groups in the Census blocks where some members of those groups reside. Second, because the Census surname files only include frequent names, many surnames -- especially those of minorities -- are missing from the list. To address the zero counts problem, we introduce a fully Bayesian Improved Surname Geocoding (fBISG) methodology that accounts for potential measurement error in Census counts by extending the naive Bayesian inference of the BISG methodology to full posterior inference. To address the missing surname problem, we supplement the Census surname data with additional data on last, first, and middle names taken from the voter files of six Southern states where self-reported race is available. Our empirical validation shows that the fBISG methodology and name supplements significantly improve the accuracy of race imputation across all racial groups, and especially for Asians. The proposed methodology, together with additional name data, is available via the open-source software WRU.
Submitted 31 August, 2022; v1 submitted 12 May, 2022;
originally announced May 2022.
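The Bayes' rule combination described in the abstract above can be illustrated with a minimal sketch. It assumes the usual BISG factorization, in which surname and residence location are conditionally independent given race, so that P(race | surname, location) is proportional to P(race | surname) times P(location | race). The function `bisg_posterior` and all numbers are hypothetical; this is not the WRU implementation and it does not implement the fBISG extension.

```python
GROUPS = ["White", "Black", "Hispanic", "Asian", "Other"]


def bisg_posterior(p_race_given_surname, p_geo_given_race):
    """Combine surname and location information with Bayes' rule.

    Assumes surname and location are conditionally independent given race:
    P(race | surname, geo) is proportional to P(race | surname) * P(geo | race).
    """
    unnormalized = [ps * pg for ps, pg in zip(p_race_given_surname, p_geo_given_race)]
    total = sum(unnormalized)
    if total == 0:
        raise ValueError("all groups have zero probability; cannot normalize")
    return [u / total for u in unnormalized]


# Hypothetical inputs for a single individual.
p_race_given_surname = [0.60, 0.20, 0.10, 0.05, 0.05]  # from a surname table
p_geo_given_race = [0.001, 0.004, 0.002, 0.0, 0.001]   # share of each group living in the block

posterior = bisg_posterior(p_race_given_surname, p_geo_given_race)
for group, p in zip(GROUPS, posterior):
    print(f"{group}: {p:.3f}")
```

In this made-up example the block has a recorded count of zero for one group, so that group's posterior probability is exactly zero regardless of the surname evidence. This is the zero-count problem the abstract refers to, which the authors address by treating Census counts as measured with error.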
-
Improving constraints on gluon spin-momentum correlations in transversely polarized protons via midrapidity open-heavy-flavor electrons in $p^{\uparrow}+p$ collisions at $\sqrt{s}=200$ GeV
Authors:
N. J. Abdulameer,
U. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj
, et al. (299 additional authors not shown)
Abstract:
Polarized proton-proton collisions provide leading-order access to gluons, presenting an opportunity to constrain gluon spin-momentum correlations within transversely polarized protons and enhance our understanding of the three-dimensional structure of the proton. Midrapidity open-heavy-flavor production at $\sqrt{s}=200$ GeV is dominated by gluon-gluon fusion, providing heightened sensitivity to gluon dynamics relative to other production channels. Transverse single-spin asymmetries of positrons and electrons from heavy-flavor hadron decays are measured at midrapidity using the PHENIX detector at the Relativistic Heavy Ion Collider. These charge-separated measurements are sensitive to gluon correlators that can in principle be related to gluon orbital angular momentum via model calculations. Explicit constraints on gluon correlators are extracted for two separate models, one of which had not been constrained previously.
Submitted 7 March, 2023; v1 submitted 27 April, 2022;
originally announced April 2022.
-
Nonprompt direct-photon production in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
U. A. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok
, et al. (311 additional authors not shown)
Abstract:
The measurement of the direct-photon spectrum from Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV is presented by the PHENIX collaboration using the external-photon-conversion technique for 0\%--93\% central collisions in a transverse-momentum ($p_T$) range of 0.8--10 GeV/$c$. An excess of direct photons, above prompt-photon production from hard-scattering processes, is observed for $p_T<6$ GeV/$c$. Nonprompt direct photons are measured by subtracting the prompt component, which is estimated as $N_{\rm coll}$-scaled direct photons from $p$$+$$p$ collisions at 200 GeV, from the direct-photon spectrum. Results are obtained for $0.8<p_T<6.0$ GeV/$c$ and suggest that the spectrum has an increasing inverse slope from ${\approx}0.2$ to 0.4 GeV/$c$ with increasing $p_T$, which indicates a possible sensitivity of the measurement to photons from earlier stages of the evolution of the collision. In addition, like the direct-photon production, the $p_T$-integrated nonprompt direct-photon yields also follow a power-law scaling behavior as a function of collision-system size. The exponent, $α$, for the nonprompt component is found to be consistent with 1.1 with no apparent $p_T$ dependence.
Submitted 19 April, 2024; v1 submitted 31 March, 2022;
originally announced March 2022.
-
Charm- and Bottom-Quark Production in Au$+$Au Collisions at $\sqrt{s_{_{NN}}}$ = 200 GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship
, et al. (321 additional authors not shown)
Abstract:
The invariant yield of electrons from open-heavy-flavor decays for $1<p_T<8$ GeV/$c$ at midrapidity $|y|<0.35$ in Au$+$Au collisions at $\sqrt{s_{_{NN}}}$ = 200 GeV has been measured by the PHENIX experiment at the Relativistic Heavy Ion Collider. A displaced-vertex analysis with the PHENIX silicon-vertex detector enables extraction of the fraction of charm and bottom hadron decays and unfolding of the invariant yield of parent charm and bottom hadrons. The nuclear-modification factors $R_{AA}$ for electrons from charm and bottom hadron decays and heavy-flavor hadrons show both a centrality and a quark-mass dependence, indicating suppression in the quark-gluon plasma produced in these collisions that is medium sized and quark-mass dependent.
Submitted 11 April, 2024; v1 submitted 31 March, 2022;
originally announced March 2022.
-
Statistical Inference for Heterogeneous Treatment Effects Discovered by Generic Machine Learning in Randomized Experiments
Authors:
Kosuke Imai,
Michael Lingzhi Li
Abstract:
Researchers are increasingly turning to machine learning (ML) algorithms to investigate causal heterogeneity in randomized experiments. Despite their promise, ML algorithms may fail to accurately ascertain heterogeneous treatment effects under practical settings with many covariates and small sample size. In addition, the quantification of estimation uncertainty remains a challenge. We develop a general approach to statistical inference for heterogeneous treatment effects discovered by a generic ML algorithm. We apply Neyman's repeated sampling framework to a common setting, in which researchers use an ML algorithm to estimate the conditional average treatment effect and then divide the sample into several groups based on the magnitude of the estimated effects. We show how to estimate the average treatment effect within each of these groups, and construct a valid confidence interval. In addition, we develop nonparametric tests of treatment effect homogeneity across groups, and rank-consistency of within-group average treatment effects. The validity of our methodology does not rely on the properties of ML algorithms because it is solely based on the randomization of treatment assignment and random sampling of units. Finally, we generalize our methodology to the cross-fitting procedure by accounting for the additional uncertainty induced by the random splitting of data.
Submitted 20 April, 2024; v1 submitted 28 March, 2022;
originally announced March 2022.
-
Measurements of second-harmonic Fourier coefficients from azimuthal anisotropies in $p$$+$$p$, $p$$+$Au, $d$$+$Au, and $^3$He$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
V. Andrieux,
K. Aoki,
N. Apadula,
H. Asano,
C. Ayuso,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov
, et al. (368 additional authors not shown)
Abstract:
Recently, the PHENIX Collaboration has published second- and third-harmonic Fourier coefficients $v_2$ and $v_3$ for midrapidity ($|η|<0.35$) charged hadrons in 0\%--5\% central $p$$+$Au, $d$$+$Au, and $^3$He$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV utilizing three sets of two-particle correlations for two detector combinations with different pseudorapidity acceptance [Phys. Rev. C {\bf 105}, 024901 (2022)]. This paper extends these measurements of $v_2$ to all centralities in $p$$+$Au, $d$$+$Au, and $^3$He$+$Au collisions, as well as $p$$+$$p$ collisions, as a function of transverse momentum ($p_T$) and event multiplicity. The kinematic dependence of $v_2$ is quantified as the ratio $R$ of $v_2$ between the two detector combinations as a function of event multiplicity for $0.5$$<$$p_T$$<$$1$ and $2$$<$$p_T$$<$$2.5$ GeV/$c$. A multiphase-transport (AMPT) model can reproduce the observed $v_2$ in most-central to midcentral $d$$+$Au and $^3$He$+$Au collisions. However, the AMPT model systematically overestimates the measurements in $p$$+$$p$, $p$$+$Au, and peripheral $d$$+$Au and $^3$He$+$Au collisions, indicating a higher nonflow contribution in AMPT than in the experimental data. The AMPT model fails to describe the observed $R$ for $0.5$$<$$p_T$$<$$1$ GeV/$c$, but there is qualitative agreement with the measurements for $2$$<$$p_T$$<$$2.5$ GeV/$c$.
Submitted 4 March, 2023; v1 submitted 18 March, 2022;
originally announced March 2022.
-
Measurement of differential cross sections for $Σ^+p$ elastic scattering in the momentum range 0.44-0.80 GeV/c
Authors:
J-PARC E40 Collaboration,
T. Nanamura,
K. Miwa,
J. K. Ahn,
Y. Akazawa,
T. Aramaki,
S. Ashikaga,
S. Callier,
N. Chiga,
S. W. Choi,
H. Ekawa,
P. Evtoukhovitch,
N. Fujioka,
M. Fujita,
T. Gogami,
T. K. Harada,
S. Hasegawa,
S. H. Hayakawa,
R. Honda,
S. Hoshino,
K. Hosomi,
M. Ichikawa,
Y. Ichikawa,
M. Ieiri
, et al. (48 additional authors not shown)
Abstract:
We performed a novel $Σ^+ p$ scattering experiment at the J-PARC Hadron Experimental Facility. Approximately 2400 $Σ^+ p$ elastic scattering events were identified from $4.9 \times 10^7$ tagged $Σ^+$ particles in the $Σ^+$ momentum range 0.44 -- 0.80 GeV/$c$. The differential cross sections of the $Σ^+ p$ elastic scattering were derived with much better precision than in previous experiments. The obtained differential cross sections were approximately 2 mb/sr or less, which were not as large as those predicted by the fss2 and FSS models based on the quark cluster model in the short-range region. By performing phase-shift analyses for the obtained differential cross sections, we experimentally derived the phase shifts of the $^3 S_1$ and $^1 P_1$ channels for the first time. The phase shift of the $^3 S_1$ channel, where a large repulsive core was predicted owing to the Pauli effect between quarks, was evaluated as $20^\circ<|δ_{^3S_1}|<35^\circ$. If the sign of $δ_{^3S_1}$ is assumed to be negative, the interaction in this channel is moderately repulsive, as the Nijmegen extended-soft-core models predicted.
Submitted 11 July, 2022; v1 submitted 16 March, 2022;
originally announced March 2022.
-
Study of $φ$-meson production in $p$$+$Al, $p$$+$Au, $d$$+$Au, and $^3$He$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau
, et al. (346 additional authors not shown)
Abstract:
Small nuclear collisions are mainly sensitive to cold-nuclear-matter effects; however, the collective behavior observed in these collisions shows a hint of hot-nuclear-matter effects. The identified-particle spectra, especially the $φ$ mesons which contain strange and antistrange quarks and have a relatively small hadronic-interaction cross section, are a good tool to study these effects. The PHENIX experiment has measured $φ$ mesons in a specific set of small collision systems $p$$+$Al, $p$$+$Au, and $^3$He$+$Au, as well as $d$$+$Au [Phys. Rev. C {\bf 83}, 024909 (2011)], at $\sqrt{s_{_{NN}}}=200$ GeV. The transverse-momentum spectra and nuclear-modification factors are presented and compared to theoretical-model predictions. The comparisons with different calculations suggest that quark-gluon plasma may be formed in these small collision systems at $\sqrt{s_{_{NN}}}=200$ GeV. However, the volume and the lifetime of the produced medium may be insufficient for observing strangeness-enhancement and jet-quenching effects. Comparison with calculations suggests that the main production mechanisms of $φ$ mesons at midrapidity may be different in $p$$+$Al versus $p/d/$$^3$He$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV. While thermal quark recombination seems to dominate in $p/d/$$^3$He$+$Au collisions, fragmentation seems to be the main production mechanism in $p$$+$Al collisions.
Submitted 26 July, 2022; v1 submitted 11 March, 2022;
originally announced March 2022.
-
Measurement of Direct-Photon Cross Section and Double-Helicity Asymmetry at $\sqrt{s}=510$ GeV in $\vec{p}+\vec{p}$ Collisions
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
M. Alfred,
N. Apadula,
Y. Aramaki,
H. Asano,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont
, et al. (336 additional authors not shown)
Abstract:
We present measurements of the cross section and double-helicity asymmetry $A_{LL}$ of direct-photon production in $\vec{p}+\vec{p}$ collisions at $\sqrt{s}=510$ GeV. The measurements have been performed at midrapidity ($|η|<0.25$) with the PHENIX detector at the Relativistic Heavy Ion Collider. At relativistic energies, direct photons are dominantly produced from the initial quark-gluon hard scattering and do not interact via the strong force at leading order. Therefore, at $\sqrt{s}=510$ GeV, where leading-order effects dominate, these measurements provide clean and direct access to the gluon helicity in the polarized proton in the gluon-momentum-fraction range $0.02<x<0.08$, with direct sensitivity to the sign of the gluon contribution.
Submitted 6 May, 2023; v1 submitted 16 February, 2022;
originally announced February 2022.
-
Measurement of $ψ(2S)$ nuclear modification at backward and forward rapidity in $p$$+$$p$, $p$$+$Al, and $p$$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV
Authors:
U. A. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj,
V. Bumazhnov
, et al. (291 additional authors not shown)
Abstract:
Suppression of the $J/ψ$ nuclear-modification factor has been seen as a trademark signature of final-state effects in large collision systems for decades. In small systems, the nuclear modification was attributed to cold-nuclear-matter effects until the observation of strong differential suppression of the $ψ(2S)$ state in $p/d$$+$$A$ collisions suggested the presence of final-state effects. Results of $J/ψ$ and $ψ(2S)$ measurements in the dimuon decay channel are presented here for $p$$+$$p$, $p$$+$Al, and $p$$+$Au collision systems at $\sqrt{s_{_{NN}}}=200$ GeV. The results are predominantly shown in the form of the nuclear-modification factor, $R_{pA}$, the ratio of the $ψ(2S)$ invariant yield per nucleon-nucleon collision in collisions of proton on target nucleus to that in $p$$+$$p$ collisions. Measurements of the $J/ψ$ and $ψ(2S)$ nuclear-modification factor are compared with shadowing and transport-model predictions, as well as to complementary measurements at Large-Hadron-Collider energies.
Submitted 30 June, 2022; v1 submitted 8 February, 2022;
originally announced February 2022.
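As a compact restatement of the nuclear-modification factor that the abstract above describes in words, one common way to write it is sketched below. The symbols $N^{pA}$, $N^{pp}$, $y$, $p_T$, and $\langle N_{\rm coll}\rangle$ (the mean number of nucleon-nucleon collisions) are notational assumptions; the listing itself gives no formula.

```latex
R_{pA} = \frac{d^2N^{pA}/dy\,dp_T}{\langle N_{\rm coll}\rangle \; d^2N^{pp}/dy\,dp_T}
```

In this notation, $R_{pA}=1$ would correspond to no nuclear modification, and $R_{pA}<1$ to suppression.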
-
Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis
Authors:
Dae Woong Ham,
Kosuke Imai,
Lucas Janson
Abstract:
Conjoint analysis is a popular experimental design used to measure multidimensional preferences. Researchers examine how varying a factor of interest, while controlling for other relevant factors, influences decision-making. Currently, there exist two methodological approaches to analyzing data from a conjoint experiment. The first focuses on estimating the average marginal effects of each factor while averaging over the other factors. Although this allows for straightforward design-based estimation, the results critically depend on the distribution of other factors and how interaction effects are aggregated. An alternative model-based approach can compute various quantities of interest, but requires researchers to correctly specify the model, a challenging task for conjoint analysis with many factors and possible interactions. In addition, a commonly used logistic regression has poor statistical properties even with a moderate number of factors when incorporating interactions. We propose a new hypothesis testing approach based on the conditional randomization test to answer the most fundamental question of conjoint analysis: Does a factor of interest matter in any way given the other factors? Our methodology is solely based on the randomization of factors, and hence is free from assumptions. Yet, it allows researchers to use any test statistic, including those based on complex machine learning algorithms. As a result, we are able to combine the strengths of the existing design-based and model-based approaches. We illustrate the proposed methodology through conjoint analysis of immigration preferences and political candidate evaluation. We also extend the proposed approach to test for regularity assumptions commonly used in conjoint analysis. An open-source software package is available for implementing the proposed methodology.
Submitted 17 August, 2022; v1 submitted 20 January, 2022;
originally announced January 2022.
-
Estimating Heterogeneous Causal Effects of High-Dimensional Treatments: Application to Conjoint Analysis
Authors:
Max Goplerud,
Kosuke Imai,
Nicole E. Pashley
Abstract:
Estimation of heterogeneous treatment effects is an active area of research. Most of the existing methods, however, focus on estimating the conditional average treatment effects of a single, binary treatment given a set of pre-treatment covariates. In this paper, we propose a method to estimate the heterogeneous causal effects of high-dimensional treatments, which poses unique challenges in terms of estimation and interpretation. The proposed approach finds maximally heterogeneous groups and uses a Bayesian mixture of regularized logistic regressions to identify groups of units who exhibit similar patterns of treatment effects. By directly modeling group membership with covariates, the proposed methodology allows one to explore the unit characteristics that are associated with different patterns of treatment effects. Our motivating application is conjoint analysis, which is a popular type of survey experiment in social science and marketing research and is based on a high-dimensional factorial design. We apply the proposed methodology to the conjoint data, where survey respondents are asked to select one of two immigrant profiles with randomly selected attributes. We find that a group of respondents with a relatively high degree of prejudice appears to discriminate against immigrants from non-European countries like Iraq. An open-source software package is available for implementing the proposed methodology.
Submitted 16 June, 2024; v1 submitted 4 January, 2022;
originally announced January 2022.
-
Transverse-single-spin asymmetries of charged pions at midrapidity in transversely polarized $p{+}p$ collisions at $\sqrt{s}=200$ GeV
Authors:
U. A. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj,
V. Bumazhnov
, et al. (286 additional authors not shown)
Abstract:
In 2015, the PHENIX collaboration measured single-spin asymmetries for charged pions in transversely polarized proton-proton collisions at a center-of-mass energy of $\sqrt{s}=200$ GeV. The pions were detected at central rapidities of $|η|<0.35$. The single-spin asymmetries are consistent with zero for each charge individually, and are also consistent with the previously published neutral-pion asymmetries in the same rapidity range. However, they show a slight indication of charge-dependent differences, which may suggest a flavor dependence in the underlying mechanisms that create these asymmetries.
Submitted 9 February, 2022; v1 submitted 10 December, 2021;
originally announced December 2021.
-
Precise measurement of differential cross sections of the Σ-p --> Λ n reaction in momentum range 470-650 MeV/c
Authors:
J-PARC E40 Collaboration,
K. Miwa,
J. K. Ahn,
Y. Akazawa,
T. Aramaki,
S. Ashikaga,
S. Callier,
N. Chiga,
S. W. Choi,
H. Ekawa,
P. Evtoukhovitch,
N. Fujioka,
M. Fujita,
T. Gogami,
T. Harada,
S. Hasegawa,
S. H. Hayakawa,
R. Honda,
S. Hoshino,
K. Hosomi,
M. Ichikawa,
Y. Ichikawa,
M. Ieiri,
M. Ikeda
, et al. (48 additional authors not shown)
Abstract:
The differential cross sections of the Σ-p --> Λ n reaction were measured accurately for the Σ- momentum (p_{Σ}) ranging from 470 to 650 MeV/c at the J-PARC Hadron Experimental Facility. Precise angular information about the Σ-p --> Λ n reaction was obtained for the first time by detecting approximately 100 reaction events at each angular step of Δcosθ = 0.1. The obtained differential cross sections show a slightly forward-peaking structure in the measured momentum regions. The cross sections integrated for -0.7 < cosθ < 1.0 were obtained as 22.5 ± 0.68 (stat.) ± 0.65 (syst.) mb and 15.8 ± 0.83 (stat.) ± 0.52 (syst.) mb for 470 < p_{Σ} (MeV/c) < 550 and 550 < p_{Σ} (MeV/c) < 650, respectively. These results show a drastic improvement compared to past measurements of the hyperon-proton scattering experiments. They will play essential roles in updating the theoretical models of the baryon-baryon interactions.
Submitted 12 January, 2022; v1 submitted 28 November, 2021;
originally announced November 2021.
-
kmclib: Automated Inference and Verification of Session Types
Authors:
Keigo Imai,
Julien Lange,
Rumyana Neykova
Abstract:
Theories and tools based on multiparty session types offer correctness guarantees for concurrent programs that communicate using message-passing. These guarantees usually come at the cost of an intrinsically top-down approach, which requires the communication behaviour of the entire program to be specified as a global type. This paper introduces kmclib: an OCaml library that supports the development of correct message-passing programs without having to write any types. The library utilises the meta-programming facilities of OCaml to automatically infer the session types of concurrent programs and verify their compatibility (k-MC). Well-typed programs, written with kmclib, do not lead to communication errors and cannot get stuck.
Submitted 26 November, 2021; v1 submitted 23 November, 2021;
originally announced November 2021.
-
Systematic study of nuclear effects in $p$$+$Al, $p$$+$Au, $d$$+$Au, and $^{3}$He$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV using $π^0$ production
Authors:
U. A. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
H. Al-Bataineh,
J. Alexander,
M. Alfred,
V. Andrieux,
A. Angerami,
K. Aoki,
N. Apadula,
Y. Aramaki,
H. Asano,
E. T. Atomssa,
R. Averbeck,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
G. Baksay,
L. Baksay,
N. S. Bandara,
B. Bannier,
K. N. Barish
, et al. (529 additional authors not shown)
Abstract:
The PHENIX collaboration presents a systematic study of $π^0$ production from $p$$+$$p$, $p$$+$Al, $p$$+$Au, $d$$+$Au, and $^{3}$He$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV. Measurements were performed with different centrality selections as well as the total inelastic, 0%--100%, selection for all collision systems. For 0%--100% collisions, the nuclear modification factors, $R_{xA}$, are consistent with unity for $p_T$ above 8 GeV/$c$, but exhibit an enhancement in peripheral collisions and a suppression in central collisions. The enhancement and suppression characteristics are similar for all systems for the same centrality class. It is shown that for high-$p_T$ $π^0$ production, the nucleons in the $d$ and $^3$He interact mostly independently with the Au nucleus and that the counterintuitive centrality dependence is likely due to a physical correlation between multiplicity and the presence of a hard scattering process. These observations disfavor models where parton energy loss has a significant contribution to nuclear modifications in small systems. Nuclear modifications at lower $p_T$ resemble the Cronin effect -- an increase followed by a peak in central or inelastic collisions and a plateau in peripheral collisions. The peak height has a characteristic ordering by system size as $p$$+$Au $>$ $d$$+$Au $>$ $^{3}$He$+$Au $>$ $p$$+$Al. For collisions with Au ions, current calculations based on initial-state cold-nuclear-matter effects result in the opposite order, suggesting the presence of other contributions to nuclear modifications, in particular at lower $p_T$.
Submitted 6 June, 2022; v1 submitted 10 November, 2021;
originally announced November 2021.
-
Measuring and Modeling Neighborhoods
Authors:
Cory McCartan,
Jacob R. Brown,
Kosuke Imai
Abstract:
Granular geographic data present new opportunities to understand how neighborhoods are formed, and how they influence politics. At the same time, the inherent subjectivity of neighborhoods creates methodological challenges in measuring and modeling them. We develop an open-source survey instrument that allows respondents to draw their neighborhoods on a map. We also propose a statistical model to analyze how the characteristics of respondents and local areas determine subjective neighborhoods. We conduct two surveys: collecting subjective neighborhoods from voters in Miami, New York City, and Phoenix, and asking New York City residents to draw a community of interest for inclusion in their city council district. Our analysis shows that, holding other factors constant, White respondents include census blocks with more White residents in their neighborhoods. Similarly, Democrats and Republicans are more likely to include co-partisan areas. Furthermore, our model provides more accurate out-of-sample predictions than standard neighborhood measures.
Submitted 19 January, 2024; v1 submitted 26 October, 2021;
originally announced October 2021.
-
Transverse single spin asymmetries of forward neutrons in $p$$+$$p$, $p$$+$Al, and $p$$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV as a function of transverse and longitudinal momenta
Authors:
U. A. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
V. Borisov,
M. L. Brooks,
J. Bryslawskyj,
V. Bumazhnov
, et al. (286 additional authors not shown)
Abstract:
In 2015 the PHENIX collaboration at the Relativistic Heavy Ion Collider recorded $p$$+$$p$, $p$$+$Al, and $p$$+$Au collision data at center of mass energies of $\sqrt{s_{_{NN}}}=200$ GeV with the proton beam(s) transversely polarized. At very forward rapidities $η>6.8$ relative to the polarized proton beam, neutrons were detected either inclusively or in (anti)correlation with detector activity related to hard collisions. The resulting single spin asymmetries, that were previously reported, have now been extracted as a function of the transverse momentum of the neutron as well as its longitudinal momentum fraction $x_F$. The explicit kinematic dependence, combined with the correlation information allows for a closer look at the interplay of different mechanisms suggested to describe these asymmetries, such as hadronic interactions or electromagnetic interactions in ultra-peripheral collisions, UPC. Events that are correlated with a hard collision indeed display a mostly negative asymmetry that increases in magnitude as a function of transverse momentum with only little dependence on $x_F$. In contrast, events that are not likely to have emerged from a hard collision display positive asymmetries for the nuclear collisions with a kinematic dependence that resembles that of a UPC based model. Because the UPC interaction depends strongly on the charge of the nucleus, those effects are very small for $p$$+$$p$ collisions, moderate for $p$$+$Al collisions, and large for $p$$+$Au collisions.
Submitted 9 February, 2022; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment
Authors:
Eli Ben-Michael,
D. James Greiner,
Kosuke Imai,
Zhichao Jiang
Abstract:
Algorithmic recommendations and decisions have become ubiquitous in today's society. Many of these and other data-driven policies, especially in the realm of public policy, are based on known, deterministic rules to ensure their transparency and interpretability. For example, algorithmic pre-trial risk assessments, which serve as our motivating application, provide relatively simple, deterministic classification scores and recommendations to help judges make release decisions. How can we use the data based on existing deterministic policies to learn new and better policies? Unfortunately, prior methods for policy learning are not applicable because they require existing policies to be stochastic rather than deterministic. We develop a robust optimization approach that partially identifies the expected utility of a policy, and then finds an optimal policy by minimizing the worst-case regret. The resulting policy is conservative but has a statistical safety guarantee, allowing the policy-maker to limit the probability of producing a worse outcome than the existing policy. We extend this approach to common and important settings where humans make decisions with the aid of algorithmic recommendations. Lastly, we apply the proposed methodology to a unique field experiment on pre-trial risk assessment instruments. We derive new classification and recommendation rules that retain the transparency and interpretability of the existing instrument while potentially leading to better overall outcomes at a lower cost.
Submitted 15 February, 2022; v1 submitted 21 September, 2021;
originally announced September 2021.
-
Optimal Covariate Balancing Conditions in Propensity Score Estimation
Authors:
Jianqing Fan,
Kosuke Imai,
Inbeom Lee,
Han Liu,
Yang Ning,
Xiaolin Yang
Abstract:
Inverse probability of treatment weighting (IPTW) is a popular method for estimating the average treatment effect (ATE). However, empirical studies show that the IPTW estimators can be sensitive to the misspecification of the propensity score model. To address this problem, researchers have proposed to estimate propensity score by directly optimizing the balance of pre-treatment covariates. While these methods appear to empirically perform well, little is known about how the choice of balancing conditions affects their theoretical properties. To fill this gap, we first characterize the asymptotic bias and efficiency of the IPTW estimator based on the Covariate Balancing Propensity Score (CBPS) methodology under local model misspecification. Based on this analysis, we show how to optimally choose the covariate balancing functions and propose an optimal CBPS-based IPTW estimator. This estimator is doubly robust; it is consistent for the ATE if either the propensity score model or the outcome model is correct. In addition, the proposed estimator is locally semiparametric efficient when both models are correctly specified. To further relax the parametric assumptions, we extend our method by using a sieve estimation approach. We show that the resulting estimator is globally efficient under a set of much weaker assumptions and has a smaller asymptotic bias than the existing estimators. Finally, we evaluate the finite sample performance of the proposed estimators via simulation and empirical studies. An open-source software package is available for implementing the proposed methods.
Submitted 2 August, 2021;
originally announced August 2021.
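To make the IPTW estimator of the ATE mentioned in the abstract above concrete, here is a minimal sketch in Python. It assumes the propensity scores are already known or estimated, and it implements only the basic inverse-probability-weighted difference in means, not the CBPS-based estimator the paper proposes. The function name `iptw_ate` and its arguments are hypothetical, chosen for illustration.

```python
from typing import Sequence


def iptw_ate(
    outcomes: Sequence[float],
    treatments: Sequence[int],
    propensities: Sequence[float],
) -> float:
    """Basic IPTW estimate of the average treatment effect.

    outcomes:     observed outcome Y_i for each unit
    treatments:   binary treatment indicator T_i (1 = treated, 0 = control)
    propensities: assumed-known probability e_i = P(T_i = 1 | X_i)
    """
    n = len(outcomes)
    if n == 0 or not (n == len(treatments) == len(propensities)):
        raise ValueError("inputs must be non-empty and of equal length")

    treated_sum = 0.0
    control_sum = 0.0
    for y, t, e in zip(outcomes, treatments, propensities):
        if not 0.0 < e < 1.0:
            raise ValueError("propensity scores must lie strictly in (0, 1)")
        # Weight treated units by 1/e and control units by 1/(1 - e).
        treated_sum += t * y / e
        control_sum += (1 - t) * y / (1.0 - e)

    # Difference of the two inverse-probability-weighted means.
    return treated_sum / n - control_sum / n


if __name__ == "__main__":
    # Toy data with constant propensity 0.5 (illustrative values only).
    print(iptw_ate([1, 2, 3, 4], [1, 0, 1, 0], [0.5, 0.5, 0.5, 0.5]))  # -1.0
```

Because each unit is weighted by the inverse of its propensity score, a misspecified propensity model feeds directly into the estimate, which is the sensitivity the abstract describes.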
-
Kinematic dependence of azimuthal anisotropies in $p$$+$Au, $d$$+$Au, $^3$He+Au at $\sqrt{s_{_{NN}}}$ = 200 GeV
Authors:
U. A. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
V. Andrieux,
K. Aoki,
N. Apadula,
H. Asano,
C. Ayuso,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon
, et al. (360 additional authors not shown)
Abstract:
There is strong evidence for the formation of small droplets of quark-gluon plasma in $p/d/^{3}$He+Au collisions at the Relativistic Heavy Ion Collider (RHIC) and in $p$+$p$/Pb collisions at the Large Hadron Collider. In particular, the analysis of data at RHIC for different geometries obtained by varying the projectile size and shape has proven insightful. In the present analysis, we find excellent agreement with the previously published PHENIX at RHIC results on elliptical and triangular flow with an independent analysis via the two-particle correlation method, which has quite different systematic uncertainties and an independent code base. In addition, the results are extended to other detector combinations with different kinematic (pseudorapidity) coverage. These results provide additional constraints on contributions from nonflow and longitudinal decorrelations.
Submitted 3 February, 2022; v1 submitted 14 July, 2021;
originally announced July 2021.
-
The Impact of the U.S. Census Disclosure Avoidance System on Redistricting and Voting Rights Analysis
Authors:
Christopher T. Kenny,
Shiro Kuriwaki,
Cory McCartan,
Evan Rosenman,
Tyler Simko,
Kosuke Imai
Abstract:
The US Census Bureau plans to protect the privacy of 2020 Census respondents through its Disclosure Avoidance System (DAS), which attempts to achieve differential privacy guarantees by adding noise to the Census microdata. By applying redistricting simulation and analysis methods to DAS-protected 2010 Census data, we find that the protected data are not of sufficient quality for redistricting purposes. We demonstrate that the injected noise makes it impossible for states to accurately comply with the One Person, One Vote principle. Our analysis finds that the DAS-protected data are biased against certain areas, depending on voter turnout and partisan and racial composition, and that these biases lead to large and unpredictable errors in the analysis of partisan and racial gerrymanders. Finally, we show that the DAS algorithm does not universally protect respondent privacy. Based on the names and addresses of registered voters, we are able to predict their race as accurately using the DAS-protected data as when using the 2010 Census data. Despite this, the DAS-protected data can still inaccurately estimate the number of majority-minority districts. We conclude with recommendations for how the Census Bureau should proceed with privacy protection for the 2020 Census.
Submitted 20 August, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Measurement of the differential cross sections of the Sigma-p elastic scattering in momentum range of 470 to 850 MeV/c
Authors:
J-PARC E40 Collaboration,
K. Miwa,
J. K. Ahn,
Y. Akazawa,
T. Aramaki,
S. Ashikaga,
S. Callier,
N. Chiga,
S. W. Choi,
H. Ekawa,
P. Evtoukhovitch,
N. Fujioka,
M. Fujita,
T. Gogami,
T. Harada,
S. Hasegawa,
S. H. Hayakawa,
R. Honda,
S. Hoshino,
K. Hosomi,
M. Ichikawa,
Y. Ichikawa,
M. Ieiri,
M. Ikeda
, et al. (48 additional authors not shown)
Abstract:
A high statistics $Σp$ scattering experiment has been performed at the J-PARC Hadron Experimental Facility. Data for momentum-tagged $Σ^{-}$ running in a liquid hydrogen target were accumulated by detecting the $π^{-}p \to K^{+}Σ^{-}$ reaction with a high intensity $π^{-}$ beam of 20 M/spill. Differential cross sections of the $Σ^{-}p$ elastic scattering were derived with a drastically improved accuracy by identifying the largest statistics of about 4,500 events from 1.72 $\times$ $10^{7}$ $Σ^{-}$. The derived differential cross section shows a clear forward-peaking angular distribution for a $Σ^{-}$ momentum range from 470 to 850 MeV/$c$. The accurate data will impose a strong constraint on the theoretical models of the baryon-baryon interactions.
Submitted 24 July, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
First observation of a nuclear $s$-state of $Ξ$ hypernucleus, $^{15}_Ξ{\rm C}$
Authors:
M. Yoshimoto,
J. K. Ahn,
B. Bassalleck,
H. Ekawa,
Y. Endo,
M. Fujita,
Y. Han,
T. Hashimoto,
S. H. Hayakawa,
K. Hicks,
K. Hoshino,
S. Hoshino,
S. H. Hwang,
Y. Ichikawa,
M. Ichikawa,
K. Imai,
Y. Ishikawa,
H. Kanauchi,
A. Kasagi,
S. H. Kim,
S. Kinbara,
P. M. Lin,
T. L. Ma,
K. Miwa,
A. T. Moe
, et al. (24 additional authors not shown)
Abstract:
Bound-systems of $Ξ^-$--$^{14}_{}{\rm N}$ are studied via $Ξ^-$ capture at rest followed by emission of a twin single-$Λ$ hypernucleus in the emulsion detectors. Two events forming extremely deep $Ξ^-$ bound states were obtained by analysis of a hybrid method in the E07 experiment at J-PARC and reanalysis of the E373 experiment at KEK-PS. The decay mode of one event was assigned as $Ξ^-+^{14}_{}{\rm N}\to^{5}_Λ{\rm He}$+$^{5}_Λ{\rm He}$+$^{4}_{}{\rm He}$+n. Since there are no excited states for daughter particles, the binding energy of the $Ξ^-$ hyperon, $B_{Ξ^-}$, in $^{14}_{}{\rm N}$ nucleus was uniquely determined to be 6.27 $\pm$ 0.27 MeV. Another $Ξ^-$--$^{14}_{}{\rm N}$ system via the decay $^{9}_Λ{\rm Be}$ + $^{5}_Λ{\rm He}$ + n brings a $B_{Ξ^-}$ value, 8.00 $\pm$ 0.77 MeV or 4.96 $\pm$ 0.77 MeV, where the two possible values of $B_{Ξ^-}$ correspond to the ground and the excited states of the daughter $^{9}_Λ{\rm Be}$ nucleus, respectively. Because the $B_{Ξ^-}$ values are larger than those of the previously reported events (KISO and IBUKI), which are both interpreted as the nuclear $1p$ state of the $Ξ^-$--$^{14}_{}{\rm N}$ system, these new events give the first indication of the nuclear $1s$ state of the $Ξ$ hypernucleus, $^{15}_Ξ{\rm C}$.
Submitted 26 May, 2021; v1 submitted 15 March, 2021;
originally announced March 2021.
-
Dynamic Stochastic Blockmodel Regression for Network Data: Application to International Militarized Conflicts
Authors:
Santiago Olivella,
Tyler Pratt,
Kosuke Imai
Abstract:
A primary goal of social science research is to understand how latent group memberships predict the dynamic process of network evolution. In the modeling of international militarized conflicts, for instance, scholars hypothesize that membership in geopolitical coalitions shapes the decision to engage in conflict. Such theories explain the ways in which nodal and dyadic characteristics affect the evolution of conflict patterns over time via their effects on group memberships. To aid the empirical testing of these arguments, we develop a dynamic model of network data by combining a hidden Markov model with a mixed-membership stochastic blockmodel that identifies latent groups underlying the network structure. Unlike existing models, we incorporate covariates that predict dynamic node memberships in latent groups as well as the direct formation of edges between dyads. While prior substantive research often assumes the decision to engage in international militarized conflict is independent across states and static over time, we demonstrate that conflict is driven by states' evolving membership in geopolitical blocs. Changes in monadic covariates like democracy shift states between coalitions, generating heterogeneous effects on conflict over time and across states. The proposed methodology, which relies on a variational approximation to a collapsed posterior distribution as well as stochastic optimization for scalability, is implemented through an open-source software package.
Submitted 25 October, 2021; v1 submitted 28 February, 2021;
originally announced March 2021.
-
Probing gluon spin-momentum correlations in transversely polarized protons through midrapidity isolated direct photons in $p^\uparrow+p$ collisions at $\sqrt{s}=200$ GeV
Authors:
U. A. Acharya,
C. Aidala,
Y. Akiba,
M. Alfred,
V. Andrieux,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
N. S. Bandara,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
R. Belmont,
A. Berdnikov,
Y. Berdnikov,
L. Bichon,
B. Blankenship,
D. S. Blau,
J. S. Bok,
M. L. Brooks,
J. Bryslawskyj,
V. Bumazhnov,
S. Campbell
, et al. (286 additional authors not shown)
Abstract:
Studying spin-momentum correlations in hadronic collisions offers a glimpse into a three-dimensional picture of proton structure. The transverse single-spin asymmetry for midrapidity isolated direct photons in $p^\uparrow+p$ collisions at $\sqrt{s}=200$ GeV is measured with the PHENIX detector at the Relativistic Heavy Ion Collider (RHIC). Because direct photons are produced in the hard scattering and do not interact via the strong force, this measurement is a clean probe of initial-state spin-momentum correlations inside the proton and is particularly sensitive to gluon interference effects within the proton. This is the first time direct photons have been used as a probe of spin-momentum correlations at RHIC. The uncertainties on the results are a fifty-fold improvement over those of the only prior measurement of the same observable, from the Fermilab E704 experiment. These results constrain gluon spin-momentum correlations in transversely polarized protons.
Submitted 20 August, 2021; v1 submitted 26 February, 2021;
originally announced February 2021.