Search | arXiv e-print repository

Nonparametric FBST for Validating Linear Models

Authors: Rodrigo F. L. Lassance, Julio M. Stern, Rafael B. Stern

Abstract: The Full Bayesian Significance Test (FBST) possesses many desirable aspects, such as not requiring a non-zero prior probability for hypotheses while also producing a measure of evidence for $H_0$. Still, few attempts have been made to bring the FBST to nonparametric settings, with the main drawback being the need to obtain the highest posterior density (HPD) in a function space. In this work, we use Gaussian processes to provide an analytically tractable FBST for hypotheses of the type $$ H_0: g(\boldsymbol{x}) = \boldsymbol{b}(\boldsymbol{x})\boldsymbol{\beta}, \quad \forall \boldsymbol{x} \in \mathcal{X}, \quad \boldsymbol{\beta} \in \mathbb{R}^k, $$ where $g(\cdot)$ is the regression function, $\boldsymbol{b}(\cdot)$ is a vector of linearly independent linear functions -- such as $\boldsymbol{b}(\boldsymbol{x}) = \boldsymbol{x}'$ -- and $\mathcal{X}$ is the covariates' domain. We also make use of pragmatic hypotheses to verify whether adherence to linear models may be approximately, rather than exactly, true, allowing for the inclusion of valuable information such as measurement errors and utility judgments. This contribution extends the theory of the FBST, allowing its application in nonparametric settings and providing a procedure that easily tests if linear models are adequate for the data and that can automatically perform variable selection.

Submitted 21 June, 2024; originally announced June 2024.

Comments: All code available in https://github.com/rflassance/lmFBST

arXiv:2406.10827 [pdf, other]

Algorithm Selection for Optimal Multi-Agent Path Finding via Graph Embedding

Authors: Carmel Shabalin, Omri Kaduri, Roni Stern

Abstract: Multi-agent path finding (MAPF) is the problem of finding paths for multiple agents such that they do not collide. This problem manifests in numerous real-world applications such as controlling transportation robots in automated warehouses, moving characters in video games, and coordinating self-driving cars in intersections. Finding optimal solutions to MAPF is NP-Hard, yet modern optimal solvers can scale to hundreds of agents and even thousands in some cases. Different solvers employ different approaches, and there is no single state-of-the-art approach for all problems. Furthermore, there are no clear, provable guidelines for choosing when to use each optimal MAPF solver. Prior work employed Algorithm Selection (AS) techniques to learn such guidelines from past data. A major challenge when employing AS for choosing an optimal MAPF algorithm is how to encode the given MAPF problem. Prior work either used hand-crafted features or an image representation of the problem. We explore graph-based encodings of the MAPF problem and show how they can be used on-the-fly with a modern graph embedding algorithm called FEATHER. Then, we show how this encoding can be effectively joined with existing encodings, resulting in a novel AS method we call MAPF Algorithm selection via Graph embedding (MAG). An extensive experimental evaluation of MAG on several MAPF algorithm selection tasks reveals that it is either on par with or significantly better than existing methods.

Submitted 16 June, 2024; originally announced June 2024.

MSC Class: 68T20 ACM Class: I.2.8
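
As a rough illustration of the collision-freeness requirement described in the MAPF abstract above, the following Python sketch checks a set of grid paths for vertex conflicts (two agents in the same cell at the same time) and swap conflicts (two agents exchanging cells in one step). It is not taken from the paper: the function names, the path representation (one cell per timestep), and the assumption that an agent waits at its final cell are illustrative choices.

```python
def position_at(path, t):
    """Cell occupied at timestep t; an agent is assumed to wait at its last cell."""
    return path[min(t, len(path) - 1)]


def first_conflict(paths):
    """Return (kind, agent_i, agent_j, t) for the earliest conflict, or None.

    paths: list of paths, each a list of hashable cells, one cell per timestep.
    kind is "vertex" (same cell at time t) or "edge" (cells swapped between t and t+1).
    """
    horizon = max(len(p) for p in paths)
    for t in range(horizon):
        for i in range(len(paths)):
            for j in range(i + 1, len(paths)):
                a_now = position_at(paths[i], t)
                b_now = position_at(paths[j], t)
                if a_now == b_now:
                    return ("vertex", i, j, t)
                a_next = position_at(paths[i], t + 1)
                b_next = position_at(paths[j], t + 1)
                # The agents occupy different cells here, so this is a true swap.
                if a_now == b_next and b_now == a_next:
                    return ("edge", i, j, t)
    return None


if __name__ == "__main__":
    head_on = [[(0, 0), (1, 0), (2, 0)], [(2, 0), (1, 0), (0, 0)]]
    print(first_conflict(head_on))
```

An optimal solver has to search over paths until a checker of this kind reports no conflict, which is part of why the choice of solver matters so much for runtime.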

arXiv:2404.16379 [pdf, other]

Optimal and Bounded Suboptimal Any-Angle Multi-agent Pathfinding

Authors: Konstantin Yakovlev, Anton Andreychuk, Roni Stern

Abstract: Multi-agent pathfinding (MAPF) is the problem of finding a set of conflict-free paths for a set of agents. Typically, the agents' moves are limited to a pre-defined graph of possible locations and allowed transitions between them, e.g. a 4-neighborhood grid. We explore how to solve MAPF problems when each agent can move between any pair of possible locations as long as traversing the line segment connecting them does not lead to a collision with obstacles. This is known as any-angle pathfinding. We present the first optimal any-angle multi-agent pathfinding algorithm. Our planner is based on the Continuous Conflict-based Search (CCBS) algorithm and an optimal any-angle variant of the Safe Interval Path Planning (TO-AA-SIPP). The straightforward combination of those, however, scales poorly since any-angle path finding induces search trees with a very large branching factor. To mitigate this, we adapt two techniques from classical MAPF to the any-angle setting, namely Disjoint Splitting and Multi-Constraints. Experimental results on different combinations of these techniques show they enable solving over 30% more problems than the vanilla combination of CCBS and TO-AA-SIPP. In addition, we present a bounded-suboptimal variant of our algorithm that enables trading runtime for solution cost in a controlled manner.

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2403.16331 [pdf, other]

Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models

Authors: Hanzhi Yin, Gang Cheng, Christian J. Steinmetz, Ruibin Yuan, Richard M. Stern, Roger B. Dannenberg

Abstract: We describe a novel approach for developing realistic digital models of dynamic range compressors for digital audio production by analyzing their analog prototypes. While realistic digital dynamic compressors are potentially useful for many applications, the design process is challenging because the compressors operate nonlinearly over long time scales. Our approach is based on the structured state space sequence model (S4), as implementing the state-space model (SSM) has proven to be efficient at learning long-range dependencies and is promising for modeling dynamic range compressors. We present in this paper a deep learning model with S4 layers to model the Teletronix LA-2A analog dynamic range compressor. The model is causal, executes efficiently in real time, and achieves roughly the same quality as previous deep-learning models but with fewer parameters.

Submitted 24 March, 2024; originally announced March 2024.

arXiv:2403.15251 [pdf, other]

Safe Learning of PDDL Domains with Conditional Effects -- Extended Version

Authors: Argaman Mordoch, Enrico Scala, Roni Stern, Brendan Juba

Abstract: Powerful domain-independent planners have been developed to solve various types of planning problems. These planners often require a model of the acting agent's actions, given in some planning domain description language. Manually designing such an action model is a notoriously challenging task. An alternative is to automatically learn action models from observation. Such an action model is called safe if every plan created with it is consistent with the real, unknown action model. Algorithms for learning such safe action models exist, yet they cannot handle domains with conditional or universal effects, which are common constructs in many planning problems. We prove that learning non-trivial safe action models with conditional effects may require an exponential number of samples. Then, we identify reasonable assumptions under which such learning is tractable and propose SAM Learning of Conditional Effects (Conditional-SAM), the first algorithm capable of doing so. We analyze Conditional-SAM theoretically and evaluate it experimentally. Our results show that the action models learned by Conditional-SAM can be used to perfectly solve most of the test set problems in most of the experimented domains.

Submitted 22 March, 2024; originally announced March 2024.

arXiv:2403.05655 [pdf, other]

PROTEST: Nonparametric Testing of Hypotheses Enhanced by Experts' Utility Judgements

Authors: Rodrigo F. L. Lassance, Rafael Izbicki, Rafael B. Stern

Abstract: Instead of testing solely a precise hypothesis, it is often useful to enlarge it with alternatives that are deemed to differ from it negligibly. For instance, in a bioequivalence study one might consider the hypothesis that the concentration of an ingredient is exactly the same in two drugs. In such a context, it might be more relevant to test the enlarged hypothesis that the difference in concentration between the drugs is of no practical significance. While this concept is not alien to Bayesian statistics, applications remain confined to parametric settings and strategies on how to effectively harness experts' intuitions are often scarce or nonexistent. To resolve both issues, we introduce PROTEST, an accessible nonparametric testing framework that seamlessly integrates with Markov Chain Monte Carlo (MCMC) methods. We develop expanded versions of the model adherence, goodness-of-fit, quantile and two-sample tests. To demonstrate how PROTEST operates, we make use of examples, simulated studies - such as testing link functions in a binary regression setting, as well as a comparison between the performance of PROTEST and the PTtest (Holmes et al., 2015) - and an application with data on neuron spikes. Furthermore, we address the crucial issue of selecting the threshold - which controls how much a hypothesis is to be expanded - even when intuitions are limited or challenging to quantify.

Submitted 8 March, 2024; originally announced March 2024.

Comments: R package found in https://github.com/rflassance/protest

arXiv:2402.07357 [pdf, other]

Regression Trees for Fast and Adaptive Prediction Intervals

Authors: Luben M. C. Cabezas, Mateus P. Otto, Rafael Izbicki, Rafael B. Stern

Abstract: Predictive models make mistakes. Hence, there is a need to quantify the uncertainty associated with their predictions. Conformal inference has emerged as a powerful tool to create statistically valid prediction regions around point predictions, but its naive application to regression problems yields non-adaptive regions. New conformal scores, often relying upon quantile regressors or conditional density estimators, aim to address this limitation. Although they are useful for creating prediction bands, these scores are detached from the original goal of quantifying the uncertainty around an arbitrary predictive model. This paper presents a new, model-agnostic family of methods to calibrate prediction intervals for regression problems with local coverage guarantees. Our approach is based on pursuing the coarsest partition of the feature space that approximates conditional coverage. We create this partition by training regression trees and Random Forests on conformity scores. Our proposal is versatile, as it applies to various conformity scores and prediction settings and demonstrates superior scalability and performance compared to established baselines in simulated and real-world datasets. We provide a Python package clover that implements our methods using the standard scikit-learn interface.

Submitted 13 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.
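
To make the "non-adaptive regions" remark in the abstract above concrete, here is a minimal sketch of plain split conformal regression with absolute-residual scores. It is not the clover package and does not reproduce the paper's tree-based method; the function names are hypothetical. Because the half-width is a single calibrated quantile, every prediction receives an interval of the same width, regardless of where it falls in feature space.

```python
import math


def conformal_halfwidth(cal_y, cal_pred, alpha=0.1):
    """Half-width of a split conformal interval from held-out calibration data.

    Uses absolute residuals as conformity scores and returns the
    ceil((n + 1) * (1 - alpha))-th smallest score, or infinity when the
    calibration set is too small for the requested coverage level.
    """
    scores = sorted(abs(y - p) for y, p in zip(cal_y, cal_pred))
    n = len(scores)
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        return math.inf
    return scores[rank - 1]


def predict_interval(pred, halfwidth):
    """Symmetric interval around a point prediction; same width for every input."""
    return (pred - halfwidth, pred + halfwidth)


if __name__ == "__main__":
    # Toy calibration set: a hypothetical model that always predicts 0.0.
    cal_y = [float(i) for i in range(1, 13)]
    cal_pred = [0.0] * 12
    h = conformal_halfwidth(cal_y, cal_pred, alpha=0.2)
    for new_pred in (5.0, 50.0):
        print(new_pred, predict_interval(new_pred, h))
```

Methods that partition the feature space, as the abstract describes, would in effect compute a separate quantile of this kind within each region rather than one global value.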

arXiv:2401.06309 [pdf, other]

Cyberattacks on Adaptive Cruise Control Vehicles: An Analytical Characterization

Authors: Shian Wang, Mingfeng Shang, Raphael Stern

Abstract: While automated vehicles (AVs) are expected to revolutionize future transportation systems, emerging AV technologies open a door for malicious actors to compromise intelligent vehicles. As the first generation of AVs, adaptive cruise control (ACC) vehicles are vulnerable to cyberattacks. While recent efforts have been made to understand the impact of attacks on transportation systems, little work has been done to systematically model and characterize the malicious nature of candidate attacks. In this study, we develop a general framework for modeling and synthesizing two types of candidate attacks on ACC vehicles, namely direct attacks on vehicle control commands and false data injection attacks on sensor measurement, with explicit characterization of their adverse effects. Based on linear stability analysis of car-following dynamics, we derive a series of analytical conditions characterizing the malicious nature of potential attacks. This ensures a higher degree of realism in modeling attacks with adverse effects, as opposed to simply considering attacks as constants or random variables. Notably, the conditions derived provide an effective method for strategically synthesizing an array of candidate attacks on ACC vehicles. We conduct extensive simulation to examine the impacts of intelligently designed attacks on microscopic car-following dynamics and macroscopic traffic flow. Numerical results illustrate the mechanism of candidate attacks, offering useful insights into understanding the vulnerability of future transportation systems. The methodology developed allows for further study of the widespread impact of strategically designed attacks on traffic cybersecurity, and is expected to inspire the development of efficient attack detection techniques and advanced vehicle controls.

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2312.15113 [pdf, other]

Understanding driver-pedestrian interactions to predict driver yielding: naturalistic open-source dataset collected in Minnesota

Authors: Tianyi Li, Joshua Klavins, Te Xu, Niaz Mahmud Zafri, Raphael Stern

Abstract: Many factors influence the yielding result of a driver-pedestrian interaction, including traffic volume, vehicle speed, roadway characteristics, etc. While individual aspects of these interactions have been explored, comprehensive, naturalistic studies, particularly those considering the built environment's influence on driver-yielding behavior, are lacking. To address this gap, our study introduces an extensive open-source dataset, compiled from video data at 18 unsignalized intersections across Minnesota. Documenting more than 3000 interactions, this dataset provides a detailed view of driver-pedestrian interactions and over 50 distinct contextual variables. The data, which covers individual driver-pedestrian interactions and contextual factors, is made publicly available at https://github.com/tianyi17/pedestrian_yielding_data_MN. Using logistic regression, we developed a classification model that predicts driver yielding based on the identified variables. Our analysis indicates that vehicle speed, the presence of parking lots, proximity to parks or schools, and the width of major road crossings significantly influence driver yielding at unsignalized intersections. This study contributes to one of the most comprehensive driver-pedestrian datasets in the US, offering valuable insights for traffic safety improvements. By making this information available, our study will support communities across Minnesota and the United States in their ongoing efforts to improve road safety for pedestrians.

Submitted 22 December, 2023; originally announced December 2023.

arXiv:2312.10705 [pdf, other]

Enhancing Numeric-SAM for Learning with Few Observations

Authors: Argaman Mordoch, Shahaf S. Shperberg, Roni Stern, Brendan Juba

Abstract: A significant challenge in applying planning technology to real-world problems lies in obtaining a planning model that accurately represents the problem's dynamics. Numeric Safe Action Models Learning (N-SAM) is a recently proposed algorithm that addresses this challenge. It is an algorithm designed to learn the preconditions and effects of actions from observations in domains that may involve both discrete and continuous state variables. N-SAM has several attractive properties. It runs in polynomial time and is guaranteed to output an action model that is safe, in the sense that plans generated by it are applicable and will achieve their intended goals. To preserve this safety guarantee, N-SAM must observe a substantial number of examples for each action before it is included in the learned action model. We address this limitation of N-SAM and propose N-SAM*, an enhanced version of N-SAM that always returns an action model where every observed action is applicable at least in some state, even if it was only observed once. N-SAM* does so without compromising the safety of the returned action model. We prove that N-SAM* is optimal in terms of sample complexity compared to any other algorithm that guarantees safety. An empirical study on a set of benchmark domains shows that the action models returned by N-SAM* enable solving significantly more problems compared to the action models returned by N-SAM.

Submitted 17 December, 2023; originally announced December 2023.

arXiv:2312.07003 [pdf, other]

RACER: Rational Artificial Intelligence Car-following-model Enhanced by Reality

Authors: Tianyi Li, Alexander Halatsis, Raphael Stern

Abstract: This paper introduces RACER, the Rational Artificial Intelligence Car-following model Enhanced by Reality, a cutting-edge deep learning car-following model, that satisfies partial derivative constraints, designed to predict Adaptive Cruise Control (ACC) driving behavior while staying theoretically feasible. Unlike conventional models, RACER effectively integrates Rational Driving Constraints (RDCs), crucial tenets of actual driving, resulting in strikingly accurate and realistic predictions. Against established models like the Optimal Velocity Relative Velocity (OVRV), a car-following Neural Network (NN), and a car-following Physics-Informed Neural Network (PINN), RACER excels across key metrics, such as acceleration, velocity, and spacing. Notably, it displays a perfect adherence to the RDCs, registering zero violations, in stark contrast to other models. This study highlights the immense value of incorporating physical constraints within AI models, especially for augmenting safety measures in transportation. It also paves the way for future research to test these models against human driving data, with the potential to guide safer and more rational driving behavior. The versatility of the proposed model, including its potential to incorporate additional derivative constraints and broader architectural applications, enhances its appeal and broadens its impact within the scientific community.

Submitted 12 December, 2023; originally announced December 2023.

arXiv:2311.08559 [pdf]

Alternatives to the ROC Curve AUC and C-statistic for Risk Prediction Models

Authors: Ralph H. Stern

Abstract: Assessment of risk prediction models has primarily utilized measures of discrimination, the ROC curve AUC and C-statistic. These derive from the risk distributions of patients and nonpatients, which in turn are derived from a population risk distribution. As greater dispersion of the population risk distribution produces greater separation of patient and nonpatient risks (discrimination), its parameters can be used as alternatives to the ROC curve AUC and C-statistic. Here continuous probability distributions are employed to develop insight into the relationship between their parameters and the ROC curve AUC and C-statistic derived from them. The ROC curve AUC and C-statistic are shown to have a straight-line relationship with the SD for uniform, half-sine, and symmetric triangular probability distributions, with slight differences in the slope: AUC approx 1/2+0.28 SD/(mean(1-mean)). This also characterizes the beta distribution over the same range of SD's. But at larger beta distribution SD's the plot of AUC versus SD deviates downward from this straight-line relationship, approaching the ROC curve AUC and SD of a perfect model (AUC=1, SD= $\sqrt{\rm mean(1-mean)}$). A simpler and more intuitive discrimination metric is the coefficient of discrimination, the difference between the mean risk in patients and nonpatients. This is SD$^2$/(mean(1-mean)), which is also the same for any distribution. Since estimating parameters or metrics discards information, the population risk distribution should always be presented. As the ROC curve AUC and C-statistic are functions of this distribution's parameters, the parameters represent simpler, intuitive alternatives to these discrimination metrics. Among discrimination metrics, the coefficient of discrimination provides a simple, intuitive alternative to the ROC curve AUC and C-statistic.

Submitted 1 December, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

Comments: log likelihood text and figure added
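
The identity quoted in the abstract above (coefficient of discrimination equals the variance of the risk distribution divided by mean(1-mean)) can be checked with a few lines of arithmetic. The sketch below is an illustration rather than the author's code: it assumes a calibrated model whose population risk follows a Beta(a, b) distribution, so that patient risks are distributed in proportion to r f(r) and nonpatient risks in proportion to (1 - r) f(r). Function names are hypothetical.

```python
def beta_moments(a, b):
    """First and second raw moments of a Beta(a, b) risk distribution."""
    mean = a / (a + b)
    second = a * (a + 1) / ((a + b) * (a + b + 1))
    return mean, second


def coefficient_of_discrimination(mean, second):
    """Mean risk in patients minus mean risk in nonpatients.

    Assumes calibrated risks: patients are weighted by r and
    nonpatients by (1 - r) relative to the population distribution.
    """
    mean_patients = second / mean
    mean_nonpatients = (mean - second) / (1 - mean)
    return mean_patients - mean_nonpatients


def variance_ratio(mean, second):
    """SD^2 / (mean * (1 - mean)) for the population risk distribution."""
    variance = second - mean ** 2
    return variance / (mean * (1 - mean))


if __name__ == "__main__":
    m, m2 = beta_moments(2.0, 8.0)
    print(coefficient_of_discrimination(m, m2), variance_ratio(m, m2))
```

Both functions depend only on the first two moments, which is consistent with the abstract's statement that the relationship holds for any distribution.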

arXiv:2310.17091 [pdf, other]

Detecting stealthy cyberattacks on adaptive cruise control vehicles: A machine learning approach

Authors: Tianyi Li, Mingfeng Shang, Shian Wang, Raphael Stern

Abstract: With the advent of vehicles equipped with advanced driver-assistance systems, such as adaptive cruise control (ACC) and other automated driving features, the potential for cyberattacks on these automated vehicles (AVs) has emerged. While overt attacks that force vehicles to collide may be easily identified, more insidious attacks, which only slightly alter driving behavior, can result in network-wide increases in congestion, fuel consumption, and even crash risk without being easily detected. To address the detection of such attacks, we first present a traffic model framework for three types of potential cyberattacks: malicious manipulation of vehicle control commands, false data injection attacks on sensor measurements, and denial-of-service (DoS) attacks. We then investigate the impacts of these attacks at both the individual vehicle (micro) and traffic flow (macro) levels. A novel generative adversarial network (GAN)-based anomaly detection model is proposed for real-time identification of such attacks using vehicle trajectory data. We provide numerical evidence to demonstrate the efficacy of our machine learning approach in detecting cyberattacks on ACC-equipped vehicles. The proposed method is compared against some recently proposed neural network models and observed to have higher accuracy in identifying anomalous driving behaviors of ACC vehicles.

Submitted 25 October, 2023; originally announced October 2023.

arXiv:2309.14460 [pdf, other]

Online Active Learning For Sound Event Detection

Authors: Mark Lindsey, Ankit Shah, Francis Kubala, Richard M. Stern

Abstract: Data collection and annotation is a laborious, time-consuming prerequisite for supervised machine learning tasks. Online Active Learning (OAL) is a paradigm that addresses this issue by simultaneously minimizing the amount of annotation required to train a classifier and adapting to changes in the data over the duration of the data collection process. Prior work has indicated that fluctuating class distributions and data drift are still common problems for OAL. This work presents new loss functions that address these challenges when OAL is applied to Sound Event Detection (SED). Experimental results from the SONYC dataset and two Voice-Type Discrimination (VTD) corpora indicate that OAL can reduce the time and effort required to train SED classifiers by a factor of 5 for SONYC, and that the new methods presented here successfully resolve issues present in existing OAL methods.

Submitted 25 September, 2023; originally announced September 2023.

Comments: Submitted to ICASSP 2024. Publication will belong to IEEE

arXiv:2309.11588 [pdf, other]

Analysis and Applications of Population Flows in a Networked SEIRS Epidemic Process

Authors: Brooks A. Butler, Raphael Stern, Philip E. Paré

Abstract: Transportation networks play a critical part in the spread of infectious diseases between populations. In this work, we define a networked susceptible-exposed-infected-recovered epidemic process with loss of immunity over time (SEIRS) that explicitly models the flow of individuals between sub-populations, which serves as the propagating mechanism for infection. We provide sufficient conditions for local stability and instability of the healthy state of the system and show that no perturbation of population flows can change the local stability of any healthy state. We also provide sufficient conditions for the existence and uniqueness of an endemic state. We then develop tools and methods for applying our model to real-world data, including spreading parameter estimation and disease arrival time prediction, and apply them in a case study using both travel and infection data from counties in Minnesota during the first year of the COVID-19 pandemic.

Submitted 20 September, 2023; originally announced September 2023.

Comments: Submitted to and under revision in IEEE Transactions on Network Science and Engineering
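
For readers unfamiliar with the SEIRS compartments named in the abstract above, the following is a minimal single-population sketch using a forward-Euler update. It deliberately omits the population flows between sub-populations that the paper models, so it should not be read as the authors' networked process. The parameter names (`beta` for infection, `sigma` for incubation, `gamma` for recovery, `omega` for loss of immunity) and all numeric values are assumed conventions for illustration only.

```python
def seirs_step(state, beta, sigma, gamma, omega, dt):
    """One forward-Euler step of a single-population SEIRS model.

    state: (s, e, i, r) population fractions that sum to 1.
    """
    s, e, i, r = state
    new_exposed = beta * s * i * dt      # S -> E through contact with infected
    new_infectious = sigma * e * dt      # E -> I after incubation
    new_recovered = gamma * i * dt       # I -> R
    lost_immunity = omega * r * dt       # R -> S (the final "S" in SEIRS)
    return (
        s - new_exposed + lost_immunity,
        e + new_exposed - new_infectious,
        i + new_infectious - new_recovered,
        r + new_recovered - lost_immunity,
    )


def simulate(state, steps, **params):
    """Apply seirs_step repeatedly and return the final state."""
    for _ in range(steps):
        state = seirs_step(state, **params)
    return state


if __name__ == "__main__":
    # Hypothetical parameter values, chosen only to produce a visible outbreak.
    params = dict(beta=0.5, sigma=0.2, gamma=0.1, omega=0.05, dt=0.1)
    print(simulate((0.99, 0.0, 0.01, 0.0), 500, **params))
```

Each flow is subtracted from one compartment and added to the next, so the total population is conserved, and the healthy state (1, 0, 0, 0) is a fixed point of the update.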

arXiv:2308.09112 [pdf, other]

REACT to NHST: Sensible conclusions to meaningful hypotheses

Authors: Rafael Izbicki, Luben M. C. Cabezas, Fernando A. B. Colugnatti, Rodrigo F. L. Lassance, Altay A. L. de Souza, Rafael B. Stern

Abstract: While Null Hypothesis Significance Testing (NHST) remains a widely used statistical tool, it suffers from several shortcomings, such as conflating statistical and practical significance, sensitivity to sample size, and the inability to distinguish between accepting the null hypothesis and failing to reject it. Recent efforts have focused on developing alternatives to NHST to address these issues. Despite these efforts, conventional NHST remains dominant in scientific research due to its simplicity and perceived ease of interpretation. Our work presents a novel alternative to NHST that is just as accessible and intuitive: REACT. It not only tackles the shortcomings of NHST but also offers additional advantages over existing alternatives. For instance, REACT is easily applicable to multiparametric hypotheses and does not require stringent significance-level corrections when conducting multiple tests. We illustrate the practical utility of REACT through real-world data examples, using criteria aligned with common research practices to distinguish between the absence of evidence and evidence of absence.

Submitted 11 December, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

arXiv:2306.09237 [pdf, other]

SCALE: Scaling up the Complexity for Advanced Language Model Evaluation

Authors: Vishvaksenan Rasiah, Ronja Stern, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus

Abstract: Recent strides in Large Language Models (LLMs) have saturated many NLP benchmarks (even professional domain-specific ones), emphasizing the need for novel, more challenging ones to properly assess LLM capabilities. In this paper, we introduce a novel NLP benchmark that poses challenges to current LLMs across four key dimensions: processing long documents (up to 50K tokens), utilizing domain-specific knowledge (embodied in legal texts), multilingual understanding (covering five languages), and multitasking (comprising legal document to document Information Retrieval, Court View Generation, Leading Decision Summarization, Citation Extraction, and eight challenging Text Classification tasks). Our benchmark comprises diverse legal NLP datasets from the Swiss legal system, allowing for a comprehensive study of the underlying Non-English, inherently multilingual, federal legal system. Despite recent advances, efficiently processing long documents for intense review/analysis tasks remains an open challenge for language models. Also, comprehensive, domain-specific benchmarks requiring high expertise to develop are rare, as are multilingual benchmarks. This scarcity underscores our contribution's value, considering most public models are trained predominantly on English corpora, while other languages remain understudied, particularly for practical domain-specific NLP tasks. Our benchmark allows for testing and advancing the state-of-the-art LLMs. As part of our study, we evaluate several pre-trained multilingual language models on our benchmark to establish strong baselines as a point of reference. Despite the large size of our datasets (tens to hundreds of thousands of examples), existing publicly available models struggle with most tasks, even after in-domain pretraining. We publish all resources (benchmark suite, pre-trained models, code) under a fully permissive open CC BY-SA license.

Submitted 1 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

MSC Class: 68T50 ACM Class: I.2

arXiv:2306.06272 [pdf, other]

A Domain-Independent Agent Architecture for Adaptive Operation in Evolving Open Worlds

Authors: Shiwali Mohan, Wiktor Piotrowski, Roni Stern, Sachin Grover, Sookyung Kim, Jacob Le, Johan De Kleer

Abstract: Model-based reasoning agents are ill-equipped to act in novel situations in which their model of the environment no longer sufficiently represents the world. We propose HYDRA - a framework for designing model-based agents operating in mixed discrete-continuous worlds, that can autonomously detect when the environment has evolved from its canonical setup, understand how it has evolved, and adapt the agents' models to perform effectively. HYDRA is based upon PDDL+, a rich modeling language for planning in mixed, discrete-continuous environments. It augments the planning module with visual reasoning, task selection, and action execution modules for closed-loop interaction with complex environments. HYDRA implements a novel meta-reasoning process that enables the agent to monitor its own behavior from a variety of aspects. The process employs a diverse set of computational methods to maintain expectations about the agent's own behavior in an environment. Divergences from those expectations are useful in detecting when the environment has evolved and identifying opportunities to adapt the underlying models. HYDRA builds upon ideas from diagnosis and repair and uses a heuristics-guided search over model changes such that they become competent in novel conditions. The HYDRA framework has been used to implement novelty-aware agents for three diverse domains - CartPole++ (a higher dimension variant of a classic control problem), Science Birds (an IJCAI competition problem), and PogoStick (a specific problem domain in Minecraft). We report empirical observations from these domains to demonstrate the efficacy of various components in the novelty meta-reasoning process.

Submitted 9 June, 2023; originally announced June 2023.

Comments: Under review in Artificial Intelligence Journal - Open World Learning track

ACM Class: I.2.4; I.2.6

arXiv:2303.16967 [pdf, other]

Heuristic Search For Physics-Based Problems: Angry Birds in PDDL+

Authors: Wiktor Piotrowski, Yoni Sher, Sachin Grover, Roni Stern, Shiwali Mohan

Abstract: This paper studies how a domain-independent planner and combinatorial search can be employed to play Angry Birds, a well established AI challenge problem. To model the game, we use PDDL+, a planning language for mixed discrete/continuous domains that supports durative processes and exogenous events. The paper describes the model and identifies key design decisions that reduce the problem complexity. In addition, we propose several domain-specific enhancements including heuristics and a search technique similar to preferred operators. Together, they alleviate the complexity of combinatorial search. We evaluate our approach by comparing its performance with dedicated domain-specific solvers on a range of Angry Birds levels. The results show that our performance is on par with these domain-specific approaches in most levels, even without using our domain-specific search enhancements.

Submitted 29 March, 2023; originally announced March 2023.

arXiv:2303.14272 [pdf, other]

Learning to Operate in Open Worlds by Adapting Planning Models

Authors: Wiktor Piotrowski, Roni Stern, Yoni Sher, Jacob Le, Matthew Klenk, Johan deKleer, Shiwali Mohan

Abstract: Planning agents are ill-equipped to act in novel situations in which their domain model no longer accurately represents the world. We introduce an approach for such agents operating in open worlds that detects the presence of novelties and effectively adapts their domain models and consequent action selection. It uses observations of action execution and measures their divergence from what is expected, according to the environment model, to infer existence of a novelty. Then, it revises the model through a heuristics-guided search over model changes. We report empirical evaluations on the CartPole problem, a standard Reinforcement Learning (RL) benchmark. The results show that our approach can deal with a class of novelties very quickly and in an interpretable fashion.

Submitted 24 March, 2023; originally announced March 2023.

Comments: To appear in the Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023)

ACM Class: I.2.6; I.2.8

arXiv:2301.09671 [pdf, other]

Flexible conditional density estimation for time series

Authors: Gustavo Grivol, Rafael Izbicki, Alex A. Okuno, Rafael B. Stern

Abstract: This paper introduces FlexCodeTS, a new conditional density estimator for time series. FlexCodeTS is a flexible nonparametric conditional density estimator, which can be based on an arbitrary regression method. It is shown that FlexCodeTS inherits the rate of convergence of the chosen regression method. Hence, FlexCodeTS can adapt its convergence by employing the regression method that best fits the structure of data. From an empirical perspective, FlexCodeTS is compared to NNKCDE and GARCH in both simulated and real data. FlexCodeTS is shown to generally obtain the best performance among the selected methods according to either the CDE loss or the pinball loss.

Submitted 23 January, 2023; originally announced January 2023.

Comments: 19 pages, 7 figures

MSC Class: 00-01; 99-00
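
The pinball loss named in the abstract above is the standard quantile loss. A minimal sketch of it follows; the function names are illustrative and are not taken from the FlexCodeTS code, and how the paper aggregates the loss over quantile levels is not stated in the abstract.

```python
def pinball_loss(y: float, q: float, tau: float) -> float:
    """Pinball (quantile) loss of predicting q as the tau-quantile when y is observed."""
    diff = y - q
    # Under-prediction (y > q) is weighted by tau, over-prediction by (1 - tau).
    return max(tau * diff, (tau - 1.0) * diff)


def mean_pinball_loss(ys, qs, tau: float) -> float:
    """Average pinball loss over paired observations and quantile predictions."""
    if len(ys) != len(qs):
        raise ValueError("ys and qs must have the same length")
    return sum(pinball_loss(y, q, tau) for y, q in zip(ys, qs)) / len(ys)
```

For a high quantile level such as tau = 0.9, predicting too low costs nine times as much as predicting too high by the same amount, which is what makes the loss suitable for scoring quantile forecasts.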

arXiv:2211.15254 [pdf, other]

Learnable Front Ends Based on Temporal Modulation for Music Tagging

Authors: Yinghao Ma, Richard M. Stern

Abstract: While end-to-end systems are becoming popular in auditory signal processing including automatic music tagging, models using raw audio as input need a large amount of data and computational resources without domain knowledge. Inspired by the fact that temporal modulation is regarded as an essential component in auditory perception, we introduce the Temporal Modulation Neural Network (TMNN) that combines Mel-like data-driven front ends and temporal modulation filters with a simple ResNet back end. The structure includes a set of temporal modulation filters to capture long-term patterns in all frequency channels. Experimental results show that the proposed front ends surpass state-of-the-art (SOTA) methods on the MagnaTagATune dataset in automatic music tagging, and they are also helpful for keyword spotting on speech commands. Moreover, the model performance for each tag suggests that genre or instrument tags with complex rhythm and mood tags can especially be improved with temporal modulation.

Submitted 28 November, 2022; originally announced November 2022.

Comments: Submitted to ICASSP 2023

arXiv:2209.01707 [pdf]

Magnetic phase crossover in strongly correlated EuMn2P2

Authors: Tanya Berry, Nicodemos Varnava, Dominic Ryan, Veronica Stewart, Riho Rästa, Ivo Heinmaa, Nitesh Kumar, Walter Schnelle, Rishi Bhandia, Christopher Pasco, N. P. Armitage, Raivo Stern, Claudia Felser, David Vanderbilt, Tyrel M. McQueen

Abstract: Strong electron correlations underlie a plethora of electronic and magnetic components and devices and are often used to identify and probe novel ground states in quantum materials. Herein we report a magnetic phase crossover in EuMn2P2, an insulator which shows Eu antiferromagnetism at TN=17K, but no phase transition attributed to Mn magnetism. The absence of a Mn magnetic phase transition contrasts with the formation of long-range Mn order at T=130K in isoelectronic EuMn2Sb2 and EuMn2As2. Temperature-dependent specific heat and 31P NMR measurements provide evidence for the development of Mn magnetic correlations from T=250-100 K. Density functional theory calculations demonstrate an unusual sensitivity of the band structure to the details of the imposed Mn and Eu magnetic order, with antiferromagnetic Mn order required to recapitulate an insulating state. Our results imply a picture in which long range Mn magnetic order is suppressed by chemical pressure, but that magnetic correlations persist, narrowing bands and producing an insulating state.

Submitted 4 September, 2022; originally announced September 2022.

arXiv:2205.08010 [pdf, other]

The e-value and the Full Bayesian Significance Test: Logical Properties and Philosophical Consequences

Authors: Julio Michael Stern, Carlos Alberto de Braganca Pereira, Marcelo de Souza Lauretto, Luis Gustavo Esteves, Rafael Izbicki, Rafael Bassi Stern, Marcio Alves Diniz, Wagner de Souza Borges

Abstract: This article gives a conceptual review of the e-value, ev(H|X) -- the epistemic value of hypothesis H given observations X. This statistical significance measure was developed in order to allow logically coherent and consistent tests of hypotheses, including sharp or precise hypotheses, via the Full Bayesian Significance Test (FBST). Arguments of analysis allow a full characterization of this statistical test by its logical or compositional properties, showing a mutual complementarity between results of mathematical statistics and the logical desiderata lying at the foundations of this theory.

Submitted 22 June, 2024; v1 submitted 16 May, 2022; originally announced May 2022.

MSC Class: 62A01; 62F15
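
As a rough illustration of the e-value ev(H|X) reviewed above, the sketch below computes it in the simplest closed-form case. The setting is an assumption made here for illustration, not taken from the article: a one-dimensional normal posterior, a flat reference density, and a sharp hypothesis H: theta = theta0. In that case the tangential set (points with higher posterior density than theta0) is an interval centred at the posterior mean, and ev is one minus its posterior mass. The function names are hypothetical.

```python
import math


def normal_cdf(z: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def e_value_point_null(post_mean: float, post_sd: float, theta0: float = 0.0) -> float:
    """ev(H|X) for H: theta = theta0 under a N(post_mean, post_sd^2) posterior.

    Assumes a flat reference density, so the tangential set is
    {theta : |theta - post_mean| < |theta0 - post_mean|}.
    """
    if post_sd <= 0:
        raise ValueError("post_sd must be positive")
    z = abs(post_mean - theta0) / post_sd
    # Posterior mass of the tangential set (a symmetric interval around the mean).
    tangential_mass = 2.0 * normal_cdf(z) - 1.0
    return 1.0 - tangential_mass
```

When the posterior mean coincides with theta0 the tangential set is empty and the e-value is 1; it shrinks toward 0 as theta0 moves into the tails of the posterior.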

arXiv:2205.04921 [pdf, other]

doi 10.1103/PhysRevB.106.075429

Orbital disorder and ordering in NaTiSi2O6: 29Si and 23Na NMR Study

Authors: Ivo Heinmaa, Riho Rästa, Harlyn J. Silverstein, Christopher R. Wiebe, Raivo Stern

Abstract: NaTiSi2O6 is an exemplary compound, showing an orbital assisted spin-Peierls phase transition at Tc = 210 K. We present the results of 29Si and 23Na NMR measurements of NaTiSi2O6. The use of magic angle spinning techniques unambiguously shows that only one dynamically averaged silicon site can be seen at T > Tc. On cooling, the 29Si MAS NMR spectrum shows interesting changes. Immediately below Tc the spectrum gets very broad. Cooling further, it shows two broad lines of unequal intensities which become narrower as the temperature decreases. Below 70 K two narrow lines have chemical shifts that are typical for diamagnetic silicates. The hyperfine field values for the two sites are 7.4 kOe/kB and 4.9 kOe/kB. In the paramagnetic state at high temperature, the spin-lattice relaxation of 29Si was found to be weakly temperature dependent. Below Tc the Arrhenius type temperature dependence of the relaxation rate indicates an energy gap D/kB = 1000(50) K. In the temperature region from 120 to 300 K the relaxation rate was strongly frequency dependent. At room temperature we found a power law dependence on Larmor frequency of -0.65(5). For 70 < T < 120 K the relaxation appeared to be non-exponential which we assigned to a relaxation due to fixed paramagnetic centers. Simulation of the magnetization recovery curve showed a temperature dependence of the concentration of these centers proportional to the magnetic susceptibility. The NMR spectrum of 23Na shows the line with typical shape for the central transition of a quadrupolar nucleus. A small frequency shift of 23Na resonance corresponds to a very small hyperfine field of 0.32 kOe/kB. In addition, at T > Tc the 23Na spectrum shows another Lorentzian shaped resonance which we attribute to the Na sites where the quadrupolar coupling is partly averaged out by ionic motion.

Submitted 10 May, 2022; originally announced May 2022.

Comments: 9 pages, 5 figures

arXiv:2204.01495 [pdf, ps, other]

Logical coherence in Bayesian simultaneous three-way hypothesis tests

Authors: Bernardo F. Reimann, Rafael Izbicki, Julio M. Stern, Rafael B. Stern, Luis G. Esteves

Abstract: This paper studies whether Bayesian simultaneous three-way hypothesis tests can be logically coherent. Two types of results are obtained. First, under the standard error-wise constant loss, only for a limited set of models can a Bayes simultaneous test be logically coherent. Second, if more general loss functions are used, then it is possible to obtain Bayes simultaneous tests that are always logically coherent. An explicit example of such a loss function is provided.

Submitted 4 April, 2022; originally announced April 2022.

Comments: 19 pages, 1 figure

MSC Class: 62C10

arXiv:2203.12499 [pdf, ps, other]

An Example of the SAM+ Algorithm for Learning Action Models for Stochastic Worlds

Authors: Brendan Juba, Roni Stern

Abstract: In this technical report, we provide a complete example of running the SAM+ algorithm, an algorithm for learning stochastic planning action models, on a simplified PPDDL version of the Coffee problem. We provide a very brief description of the SAM+ algorithm and a detailed description of our simplified version of the Coffee domain, and then describe the results of running it on the simplified Coffee domain.

Submitted 23 March, 2022; originally announced March 2022.

MSC Class: 68T05

arXiv:2202.12280 [pdf]

Tactile Materials in Practice: Understanding the Experiences of Teachers of the Visually Impaired

Authors: Mahika Phutane, Julie Wright, Brenda Veronica Castro, Lei Shi, Simone R. Stern, Holly M. Lawson, Shiri Azenkot

Abstract: Teachers of the visually impaired (TVIs) regularly present tactile materials (tactile graphics, 3D models, and real objects) to students with vision impairments. Researchers have been increasingly interested in designing tools to support the use of tactile materials, but we still lack an in-depth understanding of how tactile materials are created and used in practice today. To address this gap, we conducted interviews with 21 TVIs and a 3-week diary study with eight of them. We found that tactile materials were regularly used for academic as well as non-academic concepts like tactile literacy, motor ability, and spatial awareness. Real objects and 3D models served as "stepping stones" to tactile graphics and our participants preferred to teach with 3D models, despite finding them difficult to create, obtain, and modify. Use of certain materials also carried social implications; participants selected materials that fostered student independence and allowed classroom inclusion. We contribute design considerations, encouraging future work on tactile materials to enable student and TVI co-creation, facilitate rapid prototyping, and promote movement and spatial awareness. To support future research in this area, our paper provides a fundamental understanding of current practices. We bridge these practices to established pedagogical approaches and highlight opportunities for growth regarding this important genre of educational materials.

Submitted 24 February, 2022; originally announced February 2022.

Comments: 35 pages, 6 figures, 3 tables, to be published in TACCESS

arXiv:2112.01372 [pdf, other]

Hierarchical clustering: visualization, feature importance and model selection

Authors: Luben M. C. Cabezas, Rafael Izbicki, Rafael B. Stern

Abstract: We propose methods for the analysis of hierarchical clustering that fully use the multi-resolution structure provided by a dendrogram. Specifically, we propose a loss for choosing between clustering methods, a feature importance score and a graphical tool for visualizing the segmentation of features in a dendrogram. Current approaches to these tasks lead to loss of information since they require the user to generate a single partition of the instances by cutting the dendrogram at a specified level. Our proposed methods, instead, use the full structure of the dendrogram. The key insight behind the proposed methods is to view a dendrogram as a phylogeny. This analogy permits the assignment of a feature value to each internal node of a tree through an evolutionary model. Real and simulated datasets provide evidence that our proposed framework has desirable outcomes and gives more insights than state-of-the-art approaches. We provide an R package that implements our methods.

Submitted 27 January, 2023; v1 submitted 30 November, 2021; originally announced December 2021.

Comments: 29 pages, 9 figures, 3 tables

ACM Class: I.5.3

arXiv:2107.04635 [pdf, ps, other]

Playing Angry Birds with a Domain-Independent PDDL+ Planner

Authors: Wiktor Piotrowski, Roni Stern, Matthew Klenk, Alexandre Perez, Shiwali Mohan, Johan de Kleer, Jacob Le

Abstract: This demo paper presents the first system for playing the popular Angry Birds game using a domain-independent planner. Our system models Angry Birds levels using PDDL+, a planning language for mixed discrete/continuous domains. It uses a domain-independent PDDL+ planner to generate plans and executes them. In this demo paper, we present the system's PDDL+ model for this domain, identify key design decisions that reduce the problem complexity, and compare the performance of our system to model-specific methods for this domain. The results show that our system's performance is on par with other domain-specific systems for Angry Birds, suggesting the applicability of domain-independent planning to this benchmark AI challenge.

Submitted 9 July, 2021; originally announced July 2021.

Comments: 2 pages, submitted to ICAPS 2021 Demonstration Track

Journal ref: Proceedings of the International Conference on Automated Planning and Scheduling (2021) Demonstration Track

arXiv:2107.04169 [pdf, ps, other]

Safe Learning of Lifted Action Models

Authors: Brendan Juba, Hai S. Le, Roni Stern

Abstract: Creating a domain model, even for classical, domain-independent planning, is a notoriously hard knowledge-engineering task. A natural approach to solve this problem is to learn a domain model from observations. However, model learning approaches frequently do not provide safety guarantees: the learned model may assume actions are applicable when they are not, and may incorrectly capture actions' effects. This may result in generating plans that will fail when executed. In some domains such failures are not acceptable, due to the cost of failure or inability to replan online after failure. In such settings, all learning must be done offline, based on some observations collected, e.g., by some other agents or a human. Through this learning, the task is to generate a plan that is guaranteed to be successful. This is called the model-free planning problem. Prior work proposed an algorithm for solving the model-free planning problem in classical planning. However, they were limited to learning grounded domains, and thus they could not scale. We generalize this prior work and propose the first safe model-free planning algorithm for lifted domains. We prove the correctness of our approach, and provide a statistical analysis showing that the number of trajectories needed to solve future problems with high probability is linear in the potential size of the domain model. We also present experiments on twelve IPC domains showing that our approach is able to learn the real action model in all cases with at most two trajectories.

Submitted 8 July, 2021; originally announced July 2021.

MSC Class: 68T05

arXiv:2104.07600 [pdf, other]

The Effect of Population Flow on Epidemic Spread: Analysis and Control

Authors: Brooks Butler, Ciyuan Zhang, Ian Walter, Nishant Nair, Raphael Stern, Philip E. Paré

Abstract: In this paper, we present a discrete-time networked SEIR model using population flow, its derivation, and assumptions under which this model is well defined. We identify properties of the system's equilibria, namely the healthy states. We show that the set of healthy states is asymptotically stable, and that the value of the equilibria becomes equal across all sub-populations as a result of the network flow model. Furthermore, we explore closed-loop feedback control of the system by limiting flow between sub-populations as a function of the current infected states. These results are illustrated via simulation based on flight traffic between major airports in the United States. We find that a flow restriction strategy combined with a vaccine roll-out significantly reduces the total number of infections over the course of an epidemic, given that the initial flow restriction response is not delayed.

Submitted 15 April, 2021; originally announced April 2021.

Comments: Submitted to CDC 2021, 14 pages, 5 figures
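
To make the idea of a discrete-time networked SEIR model with population flow concrete, here is a generic sketch. It is not the model derived in the paper above: the update rule, the parameter names (`beta`, `sigma`, `gamma`) and the row-stochastic `flow` matrix are assumptions chosen for illustration. Each step applies local SEIR transitions inside every sub-population and then moves a fixed fraction of every compartment along the network.

```python
def seir_flow_step(S, E, I, R, flow, beta, sigma, gamma):
    """One step of a generic networked SEIR update (illustrative, not the paper's model).

    S, E, I, R: lists of compartment counts, one entry per sub-population.
    flow[i][j]: fraction of sub-population i that moves to j in one step
                (each row is assumed to sum to 1).
    beta, sigma, gamma: infection, incubation and recovery rates, assumed in [0, 1].
    """
    n = len(S)
    S1, E1, I1, R1 = [], [], [], []
    for k in range(n):
        total = S[k] + E[k] + I[k] + R[k]
        new_exposed = beta * S[k] * I[k] / total if total > 0 else 0.0
        new_infected = sigma * E[k]
        new_recovered = gamma * I[k]
        S1.append(S[k] - new_exposed)
        E1.append(E[k] + new_exposed - new_infected)
        I1.append(I[k] + new_infected - new_recovered)
        R1.append(R[k] + new_recovered)

    def move(X):
        # Redistribute a compartment across sub-populations along the flow matrix.
        return [sum(flow[i][j] * X[i] for i in range(n)) for j in range(n)]

    return move(S1), move(E1), move(I1), move(R1)
```

In this sketch the total population is conserved at every step, and any state with no exposed or infected individuals stays healthy regardless of the flow matrix.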

arXiv:2104.04021 [pdf, ps, other]

A universal characterization of noncommutative motives and secondary algebraic K-theory

Authors: Aaron Mazel-Gee, Reuben Stern

Abstract: We provide a universal characterization of the construction taking a scheme $X$ to its stable $\infty$-category $\text{Mot}(X)$ of noncommutative motives, patterned after the universal characterization of algebraic K-theory due to Blumberg--Gepner--Tabuada. As a consequence, we obtain a corepresentability theorem for secondary K-theory. We envision this as a fundamental tool for the construction of trace maps from secondary K-theory. Towards these main goals, we introduce a preliminary formalism of "stable $(\infty, 2)$-categories"; notable examples of these include (quasicoherent or constructible) sheaves of stable $\infty$-categories. We also develop the rudiments of a theory of presentable enriched $\infty$-categories -- and in particular, a theory of presentable $(\infty, n)$-categories -- which may be of independent interest.

Submitted 8 April, 2021; originally announced April 2021.

arXiv:2103.06860 [pdf, other]

doi 10.1103/PhysRevResearch.3.023177

Revealing three-dimensional quantum criticality by Sr-substitution in Han Purple

Authors: Stephan Allenspach, Pascal Puphal, Joosep Link, Ivo Heinmaa, Ekaterina Pomjakushina, Cornelius Krellner, Jakob Lass, Gregory S. Tucker, Christof Niedermayer, Shusaku Imajo, Yoshimitsu Kohama, Koichi Kindo, Steffen Krämer, Mladen Horvatić, Marcelo Jaime, Alexander Madsen, Antonietta Mira, Nicolas Laflorencie, Frédéric Mila, Bruce Normand, Christian Rüegg, Raivo Stern, Franziska Weickert

Abstract: Classical and quantum phase transitions (QPTs), with their accompanying concepts of criticality and universality, are a cornerstone of statistical thermodynamics. An exemplary controlled QPT is the field-induced magnetic ordering of a gapped quantum magnet. Although numerous "quasi-one-dimensional" coupled spin-chain and -ladder materials are known whose ordering transition is three-dimensional (3D), quasi-2D systems are special for several physical reasons. Motivated by the ancient pigment Han Purple (BaCuSi$_{2}$O$_{6}$), a quasi-2D material displaying anomalous critical properties, we present a complete analysis of Ba$_{0.9}$Sr$_{0.1}$CuSi$_{2}$O$_{6}$. We measure the zero-field magnetic excitations by neutron spectroscopy and deduce the magnetic Hamiltonian. We probe the field-induced transition by combining magnetization, specific-heat, torque and magnetocalorimetric measurements with low-temperature nuclear magnetic resonance studies near the QPT. By a Bayesian statistical analysis and large-scale Quantum Monte Carlo simulations, we demonstrate unambiguously that observable 3D quantum critical scaling is restored by the structural simplification arising from light Sr-substitution in Han Purple.

Submitted 9 June, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

Comments: 19 pages, 5 figures

Journal ref: Phys. Rev. Research 3, 023177 (2021)

arXiv:2102.11053 [pdf]

Interpretation of the Area Under the ROC Curve for Risk Prediction Models

Authors: Ralph H. Stern

Abstract: The area under the curve (AUC) of the receiver operating characteristics curve (ROC) evaluates the separation between patients and nonpatients or discrimination. For risk prediction models these risk distributions can be derived from the population risk distribution so are not independent as in diagnosis. A ROC curve AUC formula based on the underlying population risk distribution clarifies how discrimination is defined mathematically and that generation of the equivalent c-statistic effects a Monte Carlo integration of the formula. For a selection of continuous risk distributions, exact analytic formulas or numerical results for the ROC curve AUC and overlap measure are presented and demonstrate a linear or near-linear dependence on their standard deviation. The ROC curve AUC is also shown to be highly dependent on the mean population risk, a distinction from the independence from disease prevalence for diagnostic tests. The converse of discrimination, overlap, has been quantified by the overlap measure, which appears to provide equivalent information. As achieving wider population risk distributions is the goal of risk prediction modeling for clinical risk stratification, interpreting the ROC curve AUC as a measure of dispersion, rather than discrimination, when comparing risk prediction models may be more relevant.

Submitted 22 February, 2021; originally announced February 2021.

arXiv:2102.07330 [pdf, other]

A Modulation-Domain Loss for Neural-Network-based Real-time Speech Enhancement

Authors: Tyler Vuong, Yangyang Xia, Richard M. Stern

Abstract: We describe a modulation-domain loss function for deep-learning-based speech enhancement systems. Learnable spectro-temporal receptive fields (STRFs) were adapted to optimize for a speaker identification task. The learned STRFs were then used to calculate a weighted mean-squared error (MSE) in the modulation domain for training a speech enhancement system. Experiments showed that adding the modulation-domain MSE to the MSE in the spectro-temporal domain substantially improved the objective prediction of speech quality and intelligibility for real-time speech enhancement systems without incurring additional computation during inference.

Submitted 14 February, 2021; originally announced February 2021.

Comments: Accepted IEEE ICASSP 2021

arXiv:2102.07185 [pdf, other]

Partial Disclosure of Private Dependencies in Privacy Preserving Planning

Authors: Rotem Lev Lehman, Guy Shani, Roni Stern

Abstract: In collaborative privacy preserving planning (CPPP), a group of agents jointly creates a plan to achieve a set of goals while preserving each other's privacy. During planning, agents often reveal the private dependencies between their public actions to other agents, that is, which public action facilitates the preconditions of another public action. Previous work in CPPP does not limit the disclosure of such dependencies. In this paper, we explicitly limit the amount of disclosed dependencies, allowing agents to publish only a part of their private dependencies. We investigate different strategies for deciding which dependencies to publish, and how they affect the ability to find solutions. We evaluate the ability of two solvers -- distributed forward search and centralized planning based on a single-agent projection -- to produce plans under this constraint. Experiments over standard CPPP domains show that the proposed dependency-sharing strategies enable generating plans while sharing only a small fraction of all private dependencies.

Submitted 14 February, 2021; originally announced February 2021.

ACM Class: I.2.8; I.2.11

arXiv:2101.09723 [pdf, other]

Improving Continuous-time Conflict Based Search

Authors: Anton Andreychuk, Konstantin Yakovlev, Eli Boyarski, Roni Stern

Abstract: Conflict-Based Search (CBS) is a powerful algorithmic framework for optimally solving classical multi-agent path finding (MAPF) problems, where time is discretized into time steps. Continuous-time CBS (CCBS) is a recently proposed version of CBS that guarantees optimal solutions without the need to discretize time. However, the scalability of CCBS is limited because it does not include any known improvements of CBS. In this paper, we begin to close this gap and explore how to adapt successful CBS improvements, namely, prioritizing conflicts (PC), disjoint splitting (DS), and high-level heuristics, to the continuous time setting of CCBS. These adaptations are not trivial, and require careful handling of different types of constraints, applying a generalized version of the Safe interval path planning (SIPP) algorithm, and extending the notion of cardinal conflicts. We evaluate the effect of the suggested enhancements by running experiments both on general graphs and $2^k$-neighborhood grids. CCBS with these improvements significantly outperforms vanilla CCBS, solving problems with almost twice as many agents in some cases and pushing the limits of multi-agent path finding in continuous-time domains.

Submitted 2 March, 2021; v1 submitted 24 January, 2021; originally announced January 2021.

Comments: This is a pre-print of the paper accepted to AAAI 2021

arXiv:2101.04053

Anomaly Detection for Aggregated Data Using Multi-Graph Autoencoder

Authors: Tomer Meirman, Roni Stern, Gilad Katz

Abstract: In data systems, activities or events are continuously collected in the field to trace their proper executions. Logging, which means recording sequences of events, can be used for analyzing system failures and malfunctions, and identifying the causes and locations of such issues. In our research we focus on creating anomaly detection models for system logs. The task of anomaly detection is identifying unexpected events in a dataset, which differ from the normal behavior. Anomaly detection models also assist in data systems analysis tasks. Modern systems may produce such a large number of events that monitoring every individual event is not feasible. In such cases, the events are often aggregated over a fixed period of time, reporting the number of times every event has occurred in that time period. This aggregation facilitates scaling, but requires a different approach for anomaly detection. In this research, we present a thorough analysis of the aggregated data and the relationships between aggregated events. Based on the initial phase of our research we present graph representations of our aggregated dataset, which represent the different relationships between aggregated instances in the same context. Using the graph representation, we propose the Multiple-graphs autoencoder (MGAE), a novel convolutional graph-autoencoder model which exploits the relationships of the aggregated instances in our unique dataset. MGAE outperforms standard graph-autoencoder models across the different experiments. With our novel MGAE we present a 60% decrease in reconstruction error in comparison to a standard graph autoencoder, which is expressed in reconstructing high-degree relationships.

Submitted 27 June, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

Comments: Mandatory changes for the thesis, post thesis-defense presentation. Changing of title, fixing grammar and vocabulary mistakes, removing unnecessary parts

arXiv:2012.04226 [pdf, other]

A Unifying Framework for Formal Theories of Novelty: Framework, Examples and Discussion

Authors: T. E. Boult, P. A. Grabowicz, D. S. Prijatelj, R. Stern, L. Holder, J. Alspector, M. Jafarzadeh, T. Ahmad, A. R. Dhamija, C. Li, S. Cruz, A. Shrivastava, C. Vondrick, W. J. Scheirer

Abstract: Managing inputs that are novel, unknown, or out-of-distribution is critical as an agent moves from the lab to the open world. Novelty-related problems include being tolerant to novel perturbations of the normal input, detecting when the input includes novel items, and adapting to novel inputs. While significant research has been undertaken in these areas, a noticeable gap exists in the lack of a formalized definition of novelty that transcends problem domains. As a team of researchers spanning multiple research groups and different domains, we have seen, first hand, the difficulties that arise from ill-specified novelty problems, as well as inconsistent definitions and terminology. Therefore, we present the first unified framework for formal theories of novelty and use the framework to formally define a family of novelty types. Our framework can be applied across a wide range of domains, from symbolic AI to reinforcement learning, and beyond to open world image recognition. Thus, it can be used to help kick-start new research efforts and accelerate ongoing work on these important novelty-related problems. This extended version of our AAAI 2021 paper includes more details and examples in multiple domains.

Submitted 8 December, 2020; originally announced December 2020.

Comments: Extended version/preprint of an AAAI 2021 paper

arXiv:2011.11739 [pdf, other]

Analysis and Estimation of Networked SIR & SEIR Models with Transportation Networks

Authors: Damir Vrabac, Raphael Stern, Philip E. Paré

Abstract: In this paper we present the discrete-time networked SIR and SEIR models and present assumptions under which they are well defined. We analyze the limiting behavior of the models and present necessary and sufficient conditions for estimating the spreading parameters from data. We illustrate these results via simulation.

Submitted 23 November, 2020; originally announced November 2020.

arXiv:2010.09151 [pdf, other]

doi 10.21437/Interspeech.2020-1878

Learnable Spectro-temporal Receptive Fields for Robust Voice Type Discrimination

Authors: Tyler Vuong, Yangyang Xia, Richard Stern

Abstract: Voice Type Discrimination (VTD) refers to discrimination between regions in a recording where speech was produced by speakers that are physically within proximity of the recording device ("Live Speech") from speech and other types of audio that were played back such as traffic noise and television broadcasts ("Distractor Audio"). In this work, we propose a deep-learning-based VTD system that features an initial layer of learnable spectro-temporal receptive fields (STRFs). Our approach is also shown to provide very strong performance on a similar spoofing detection task in the ASVspoof 2019 challenge. We evaluate our approach on a new standardized VTD database that was collected to support research in this area. In particular, we study the effect of using learnable STRFs compared to static STRFs or unconstrained kernels. We also show that our system consistently improves a competitive baseline system across a wide range of signal-to-noise ratios on spoofing detection in the presence of VTD distractor noise.

Submitted 18 October, 2020; originally announced October 2020.

Comments: Accepted Interspeech 2020. Video: http://www.interspeech2020.org/index.php?m=content&c=index&a=show&catid=311&id=712

arXiv:2010.06335 [pdf, ps, other]

doi 10.1103/PhysRevB.103.014428

Freezing of molecular rotation in a paramagnetic crystal studied by $^{31}$P NMR

Authors: D. Opherden, F. Bärtl, Sh. Yamamoto, Z. T. Zhang, S. Luther, S. Molatta, J. Wosnitza, M. Baenitz, I. Heinmaa, R. Stern, C. P. Landee, H. Kühne

Abstract: We present a detailed $^{31}$P nuclear magnetic resonance (NMR) study of the molecular rotation in the compound [Cu(pz)$_{2}$(2-HOpy)$_{2}$](PF$_{6}$)$_{2}$, where pz = C$_4$H$_4$N$_2$ and 2-HOpy = C$_5$H$_4$NHO. Here, a freezing of the PF$_6$ rotation modes is revealed by several steplike increases of the temperature-dependent second spectral moment, with accompanying broad peaks of the longitudinal and transverse nuclear spin-relaxation rates. An analysis based on the Bloembergen-Purcell-Pound (BPP) theory quantifies the related activation energies as $E_{a}/k_{B}$ = 250 and 1400 K. Further, the anisotropy of the second spectral moment of the $^{31}$P absorption line was calculated for the rigid lattice, as well as in the presence of several sets of PF$_6$ reorientation modes, and is in excellent agreement with the experimental data. Whereas the anisotropy of the frequency shift and enhancement of nuclear spin-relaxation rates is driven by the molecular rotation with respect to the dipole fields stemming from the Cu ions, the second spectral moment is determined by the intramolecular interaction of nuclear $^{19}$F and $^{31}$P moments in the presence of the distinct rotation modes.

Submitted 25 January, 2021; v1 submitted 13 October, 2020; originally announced October 2020.

Comments: 9 pages, 5 figures; additionally 2 pages with 3 figures of supplemental material; typos corrected, references added, supplemental material added

Journal ref: Phys. Rev. B 103, 014428 (2021)

arXiv:2010.03109 [pdf, other]

Calibrating microscopic car following models for adaptive cruise control vehicles: a multi-objective approach

Authors: Felipe de Souza, Raphael Stern

Abstract: Adaptive cruise control (ACC) vehicles are the first step toward comprehensive vehicle automation. However, the impacts of such vehicles on the underlying traffic flow are not yet clear. Therefore, it is of interest to accurately model vehicle-level dynamics of commercially available ACC vehicles so that they may be used in further modeling efforts to quantify the impact of commercially available ACC vehicles on traffic flow. Importantly, not only model selection but also the calibration approach and error metric used for calibration are critical to accurately model ACC vehicle behavior. In this work, we explore the question of how to calibrate car following models to describe ACC vehicle dynamics. Specifically, we apply a multi-objective calibration approach to understand the tradeoff between calibrating model parameters to minimize speed error vs. spacing error. Three different car-following models are calibrated for data from six vehicles. The results are in line with recent literature and verify that targeting a low spacing error does not compromise the speed accuracy, whereas the opposite is not true for modeling ACC vehicle dynamics.

Submitted 6 October, 2020; originally announced October 2020.

Comments: 15 pages, 4 figures, 2 tables

arXiv:2009.02832 [pdf]

Non causal deep learning based dereverberation

Authors: Jorge Wuth, Richard M. Stern, Nestor Becerra Yoma

Abstract: In this paper we demonstrate the effectiveness of non-causal context for mitigating the effects of reverberation in deep-learning-based automatic speech recognition (ASR) systems. First, the value of non-causal context using a non-causal FIR filter is shown by comparing the contributions of previous vs. future information. Second, MLP- and LSTM-based dereverberation networks were trained to confirm the effects of causal and non-causal context when used in ASR systems trained with clean speech. The non-causal deep-learning-based dereverberation provides a 45% relative reduction in word error rate (WER) compared to the popular weighted prediction error (WPE) method in experiments with clean training in the REVERB challenge. Finally, an expanded multicondition training procedure used in combination with a semi-enhanced test utterance generation based on combinations of reverberated and dereverberated signals is proposed to reduce any artifacts or distortion that may be introduced by the non-causal dereverberation methods. The combination of both approaches provided average relative reductions in WER equal to 10.9% and 6.0% when compared to the baseline system obtained with the most recent REVERB challenge recipe without and with WPE, respectively.

Submitted 6 September, 2020; originally announced September 2020.

Comments: 33 pages

arXiv:2007.12778 [pdf, other]

CD-split and HPD-split: efficient conformal regions in high dimensions

Authors: Rafael Izbicki, Gilson Shimizu, Rafael B. Stern

Abstract: Conformal methods create prediction bands that control average coverage assuming solely i.i.d. data. Although the literature has mostly focused on prediction intervals, more general regions can often better represent uncertainty. For instance, a bimodal target is better represented by the union of two intervals. Such prediction regions are obtained by CD-split, which combines the split method and a data-driven partition of the feature space which scales to high dimensions. CD-split however contains many tuning parameters, and their role is not clear. In this paper, we provide new insights on CD-split by exploring its theoretical properties. In particular, we show that CD-split converges asymptotically to the oracle highest predictive density set and satisfies local and asymptotic conditional validity. We also present simulations that show how to tune CD-split. Finally, we introduce HPD-split, a variation of CD-split that requires less tuning, and show that it shares the same theoretical guarantees as CD-split. In a wide variety of our simulations, CD-split and HPD-split have better conditional coverage and yield smaller prediction regions than other methods.

Submitted 4 October, 2021; v1 submitted 24 July, 2020; originally announced July 2020.

Comments: 34 pages, 15 figures

MSC Class: 62G15

arXiv:2006.01195 [pdf, other]

Revisiting Bounded-Suboptimal Safe Interval Path Planning

Authors: Konstantin Yakovlev, Anton Andreychuk, Roni Stern

Abstract: Safe-interval path planning (SIPP) is a powerful algorithm for finding a path in the presence of dynamic obstacles. SIPP returns provably optimal solutions. However, in many practical applications of SIPP such as path planning for robots, one would like to trade off optimality for shorter planning time. In this paper we explore different ways to build a bounded-suboptimal SIPP and discuss their pros and cons. We compare the different bounded-suboptimal versions of SIPP experimentally. While there is no universal winner, the results provide insights into when each method should be used.

Submitted 1 June, 2020; originally announced June 2020.

arXiv:2005.12058 [pdf]

doi 10.1103/PhysRevB.102.064431

Extremely well isolated 2D spin-$1/2$ antiferromagnetic Heisenberg layers with small exchange coupling in the molecular-based magnet CuPOF

Authors: D. Opherden, N. Nizar, K. Richardson, J. C. Monroe, M. M. Turnbull, M. Polson, S. Vela, W. J. A. Blackmore, P. A. Goddard, J. Singleton, E. S. Choi, F. Xiao, R. C. Williams, T. Lancaster, F. L. Pratt, S. J. Blundell, Y. Skourski, M. Uhlarz, A. N. Ponomaryov, S. A. Zvyagin, J. Wosnitza, M. Baenitz, I. Heinmaa, R. Stern, H. Kühne, et al. (1 additional author not shown)

Abstract: We report on a comprehensive characterization of the newly synthesized Cu$^{2+}$-based molecular magnet [Cu(pz)$_2$(2-HOpy)$_2$](PF$_6$)$_2$ (CuPOF), where pz = C$_4$H$_4$N$_2$ and 2-HOpy = C$_5$H$_4$NHO. From a comparison of theoretical modeling to results of bulk magnetometry, specific heat, $\mu^+$SR, ESR, and NMR spectroscopy, this material is determined as an excellent realization of the 2D square-lattice $S=1/2$ antiferromagnetic Heisenberg model with a moderate intraplane nearest-neighbor exchange coupling of $J/k_\mathrm{B} = 6.80(5)$ K, and an extremely small interlayer interaction of about 1 mK. At zero field, the bulk magnetometry reveals a temperature-driven crossover of spin correlations from isotropic to $XY$ type, caused by the presence of a weak intrinsic easy-plane anisotropy. A transition to long-range order, driven by the low-temperature $XY$ anisotropy under the influence of the interlayer coupling, occurs at $T_\mathrm{N} = 1.38(2)$ K, as revealed by $\mu^+$SR. In applied magnetic fields, our $^1$H-NMR data reveal a strong increase of the magnetic anisotropy, manifested by a pronounced enhancement of the transition temperature to commensurate long-range order at $T_\mathrm{N} = 2.8$ K and 7 T.

Submitted 1 September, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

Comments: 14 pages, 8 figures, as well as 10 pages and 18 figures of supplemental material

Journal ref: Phys. Rev. B 102, 064431 (2020)

arXiv:2001.07223 [pdf, other]

doi 10.1103/PhysRevC.101.021302

A stringent upper limit on the direct 3$\alpha$ decay of the Hoyle State in $^{12}$C

Authors: R. Smith, M. Gai, M. W. Ahmed, M. Freer, H. O. U. Fynbo, D. Schweitzer, S. R. Stern

Abstract: We investigate an implication of the most recent observation of a second $J^\pi = 2^+$ state in $^{12}$C, which was measured using the $^{12}$C($\gamma$,$\alpha$)$^8$Be$_{\textrm{(g.s.)}}$ reaction. In addition to the dissociation of $^{12}$C to an $\alpha$-particle and $^8$Be in its ground state, a small fraction of events (2%) were identified as direct decays and decays to excited states in $^8$Be. This allowed a limit on the direct 3$\alpha$ partial decay width to be determined as $\Gamma_{3\alpha} < 32(4)$ keV. Since this 2$^+$ state is predicted by all theoretical models to be a collective excitation of the Hoyle state, the 3$\alpha$ partial width of the Hoyle state is calculable from the ratio of 3$\alpha$ decay penetrabilities of the Hoyle and 2$^+$ states. This was calculated using the semi-classical WKB approach and we deduce a stringent upper limit for the direct decay branching ratio of the Hoyle state of ${\Gamma_{3\alpha} \over \Gamma} < 5.7 \times 10^{-6}$, over an order of magnitude lower than previously reported. This result places the direct measurement of this rare decay mode beyond current experimental capabilities.

Submitted 20 January, 2020; originally announced January 2020.

Comments: Accepted for publication in Physical Review C (Rapid) on 01/16/2020

Journal ref: Phys. Rev. C 101, 021302 (2020)

arXiv:2001.03384 [pdf, other]

Decentralized Optimization of Vehicle Route Planning -- A Cross-City Comparative Study

Authors: Brionna Davis, Grace Jennings, Taylor Pothast, Ilias Gerostathopoulos, Evangelos Pournaras, Raphael E. Stern

Abstract: New mobility concepts are at the forefront of research and innovation in smart cities. The introduction of connected and autonomous vehicles enables new possibilities in vehicle routing. Specifically, knowing the origin and destination of each agent in the network can allow for real-time routing of the vehicles to optimize network performance. However, this relies on individual vehicles being "altruistic", i.e., being willing to accept an alternative non-preferred route in order to achieve a network-level performance goal. In this work, we conduct a study to compare different levels of agent altruism and the resulting effect on the network-level traffic performance. Specifically, this study compares the effects of different underlying urban structures on the overall network performance, and investigates which characteristics of the network make it possible to realize routing improvements using a decentralized optimization router. The main finding is that, with increased vehicle altruism, it is possible to balance traffic flow among the links of the network. We show evidence that the decentralized optimization router is more effective in networks with high load. We also study the influence of city characteristics; in particular, networks with a higher number of nodes (intersections) or edges (roads) per unit area allow for more possible alternate routes, and thus higher potential to improve network performance.

Submitted 10 January, 2020; originally announced January 2020.

Showing 1–50 of 130 results for author: Stern, R