Search | arXiv e-print repository

Showing 1–50 of 773 results for author: Zhang, Y

Searching in archive stat. Search in all archives.
  1. arXiv:2409.08551  [pdf, other]

    stat.ML cs.LG

    Think Twice Before You Act: Improving Inverse Problem Solving With MCMC

    Authors: Yaxuan Zhu, Zehao Dou, Haoxin Zheng, Yasi Zhang, Ying Nian Wu, Ruiqi Gao

    Abstract: Recent studies demonstrate that diffusion models can serve as a strong prior for solving inverse problems. A prominent example is Diffusion Posterior Sampling (DPS), which approximates the posterior distribution of data given the measure using Tweedie's formula. Despite the merits of being versatile in solving various inverse problems without re-training, the performance of DPS is hindered by the…

    Submitted 13 September, 2024; originally announced September 2024.

  2. arXiv:2409.06879  [pdf, other]

    q-bio.QM cs.LG stat.ML

    Joint trajectory and network inference via reference fitting

    Authors: Stephen Y Zhang

    Abstract: Network inference, the task of reconstructing interactions in a complex system from experimental observables, is a central yet extremely challenging problem in systems biology. While much progress has been made in the last two decades, network inference remains an open problem. For systems observed at steady state, limited insights are available since temporal information is unavailable and thus c…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 14 pages, 6 figures

    MSC Class: 92C42; 62M10; 49Q22;

  3. arXiv:2409.06302  [pdf, other]

    math.MG math.OC stat.ML

    Geometry of the Space of Partitioned Networks: A Unified Theoretical and Computational Framework

    Authors: Stephen Y Zhang, Fangfei Lan, Youjia Zhou, Agnese Barbensi, Michael P H Stumpf, Bei Wang, Tom Needham

    Abstract: Interactions and relations between objects may be pairwise or higher-order in nature, and so network-valued data are ubiquitous in the real world. The "space of networks", however, has a complex structure that cannot be adequately described using conventional statistical tools. We introduce a measure-theoretic formalism for modeling generalized network structures such as graphs, hypergraphs, or gr…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 64 pages, 10 figures

    MSC Class: 51F99; 62R20; 49Q22; 05C65

  4. arXiv:2409.05798  [pdf, other]

    cs.LG cs.AI cs.HC econ.EM stat.ML

    Enhancing Preference-based Linear Bandits via Human Response Time

    Authors: Shen Li, Yuyang Zhang, Zhaolin Ren, Claire Liang, Na Li, Julie A. Shah

    Abstract: Binary human choice feedback is widely used in interactive preference learning for its simplicity, but it provides limited information about preference strength. To overcome this limitation, we leverage human response times, which inversely correlate with preference strength, as complementary information. Our work integrates the EZ-diffusion model, which jointly models human choices and response t…

    Submitted 9 September, 2024; originally announced September 2024.

  5. arXiv:2409.05160  [pdf, other]

    stat.ME stat.AP stat.CO stat.ML

    Inference for Large Scale Regression Models with Dependent Errors

    Authors: Lionel Voirol, Haotian Xu, Yuming Zhang, Luca Insolia, Roberto Molinari, Stéphane Guerrier

    Abstract: The exponential growth in data sizes and storage costs has brought considerable challenges to the data science community, requiring solutions to run learning methods on such data. While machine learning has scaled to achieve predictive accuracy in big data settings, statistical inference and uncertainty quantification tools are still lagging. Priority scientific fields collect vast data to underst…

    Submitted 8 September, 2024; originally announced September 2024.

  6. arXiv:2409.03296  [pdf, other]

    stat.ME stat.AP

    An Efficient Two-Dimensional Functional Mixed-Effect Model Framework for Repeatedly Measured Functional Data

    Authors: Cheng Cao, Jiguo Cao, Hao Pan, Yunting Zhang, Fan Jiang, Xinyue Li

    Abstract: With the rapid development of wearable device technologies, accelerometers can record minute-by-minute physical activity for consecutive days, which provides important insight into a dynamic association between the intensity of physical activity and mental health outcomes for large-scale population studies. Using Shanghai school adolescent cohort we estimate the effect of health assessment results…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 50 pages, 8 figures in main, 6 figures in supp

  7. arXiv:2409.01908  [pdf, other]

    stat.ME cs.LG q-fin.ST stat.AP stat.ML

    Bayesian CART models for aggregate claim modeling

    Authors: Yaojun Zhang, Lanpeng Ji, Georgios Aivaliotis, Charles C. Taylor

    Abstract: This paper proposes three types of Bayesian CART (or BCART) models for aggregate claim amount, namely, frequency-severity models, sequential models and joint models. We propose a general framework for the BCART models applicable to data with multivariate responses, which is particularly useful for the joint BCART models with a bivariate response: the number of claims and aggregate claim amount. To…

    Submitted 3 September, 2024; originally announced September 2024.

  8. arXiv:2409.01220  [pdf, other]

    math.ST stat.ME

    Simultaneous Inference for Non-Stationary Random Fields, with Application to Gridded Data Analysis

    Authors: Yunyi Zhang, Zhou Zhou

    Abstract: Current statistics literature on statistical inference of random fields typically assumes that the fields are stationary or focuses on models of non-stationary Gaussian fields with parametric/semiparametric covariance families, which may not be sufficiently flexible to tackle complex modern-era random field data. This paper performs simultaneous nonparametric statistical inference for a general cl…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: Main part includes 31 pages and 5 figures

  9. arXiv:2408.15117  [pdf, other]

    physics.soc-ph stat.AP

    Inferring ghost cities on the globe in newly developed urban areas based on urban vitality with multi-source data

    Authors: Yecheng Zhang, Tangqi Tu, Ying Long

    Abstract: Due to rapid urbanization over the past 20 years, many newly developed areas have lagged in socio-economic maturity, creating an imbalance with older cities and leading to the rise of "ghost cities." However, due to the complexity of socio-economic factors, no global studies have measured this phenomenon. We propose a unified framework based on urban vitality theory and multi-source data, validate…

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 28 pages, 13 figures

  10. arXiv:2408.07219  [pdf, other]

    cs.LG stat.ME

    Causal Effect Estimation using identifiable Variational AutoEncoder with Latent Confounders and Post-Treatment Variables

    Authors: Yang Xie, Ziqi Xu, Debo Cheng, Jiuyong Li, Lin Liu, Yinghao Zhang, Zaiwen Feng

    Abstract: Estimating causal effects from observational data is challenging, especially in the presence of latent confounders. Much work has been done on addressing this challenge, but most of the existing research ignores the bias introduced by the post-treatment variables. In this paper, we propose a novel method of joint Variational AutoEncoder (VAE) and identifiable Variational AutoEncoder (iVAE) for lea…

    Submitted 13 August, 2024; originally announced August 2024.

  11. arXiv:2408.05990  [pdf, other]

    stat.ML cs.LG

    Parameters Inference for Nonlinear Wave Equations with Markovian Switching

    Authors: Yi Zhang, Zhikun Zhang, Xiangjun Wang

    Abstract: Traditional partial differential equations with constant coefficients often struggle to capture abrupt changes in real-world phenomena, leading to the development of variable coefficient PDEs and Markovian switching models. Recently, research has introduced the concept of PDEs with Markov switching models, established their well-posedness and presented numerical methods. However, there has been li…

    Submitted 30 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

  12. arXiv:2407.17694  [pdf, other]

    stat.ME stat.ML

    Doubly Robust Conditional Independence Testing with Generative Neural Networks

    Authors: Yi Zhang, Linjun Huang, Yun Yang, Xiaofeng Shao

    Abstract: This article addresses the problem of testing the conditional independence of two generic random vectors $X$ and $Y$ given a third random vector $Z$, which plays an important role in statistical and machine learning applications. We propose a new non-parametric testing procedure that avoids explicitly estimating any conditional distributions but instead requires sampling from the two marginal cond…

    Submitted 24 July, 2024; originally announced July 2024.

  13. arXiv:2407.15247  [pdf, other]

    cs.LG stat.ML

    TimeInf: Time Series Data Contribution via Influence Functions

    Authors: Yizi Zhang, Jingyan Shen, Xiaoxue Xiong, Yongchan Kwon

    Abstract: Evaluating the contribution of individual data points to a model's prediction is critical for interpreting model predictions and improving model performance. Existing data contribution methods have been applied to various data types, including tabular data, images, and texts; however, their primary focus has been on i.i.d. settings. Despite the pressing need for principled approaches tailored to t…

    Submitted 23 July, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

  14. arXiv:2407.08560  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Causal inference through multi-stage learning and doubly robust deep neural networks

    Authors: Yuqian Zhang, Jelena Bradic

    Abstract: Deep neural networks (DNNs) have demonstrated remarkable empirical performance in large-scale supervised learning problems, particularly in scenarios where both the sample size $n$ and the dimension of covariates $p$ are large. This study delves into the application of DNNs across a wide spectrum of intricate causal inference tasks, where direct estimation falls short and necessitates multi-stage…

    Submitted 11 July, 2024; originally announced July 2024.

  15. arXiv:2407.03082  [pdf, other]

    cs.LG stat.ML

    Stable Heterogeneous Treatment Effect Estimation across Out-of-Distribution Populations

    Authors: Yuling Zhang, Anpeng Wu, Kun Kuang, Liang Du, Zixun Sun, Zhi Wang

    Abstract: Heterogeneous treatment effect (HTE) estimation is vital for understanding the change of treatment effect across individuals or subgroups. Most existing HTE estimation methods focus on addressing selection bias induced by imbalanced distributions of confounders between treated and control units, but ignore distribution shifts across populations. Thereby, their applicability has been limited to the…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted by ICDE'2024

  16. arXiv:2406.18035  [pdf, other]

    cs.LG stat.ML

    Local Linear Recovery Guarantee of Deep Neural Networks at Overparameterization

    Authors: Yaoyu Zhang, Leyang Zhang, Zhongwang Zhang, Zhiwei Bai

    Abstract: Determining whether deep neural network (DNN) models can reliably recover target functions at overparameterization is a critical yet complex issue in the theory of deep learning. To advance understanding in this area, we introduce a concept we term "local linear recovery" (LLR), a weaker form of target function recovery that renders the problem more amenable to theoretical analysis. In the sense o…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2211.11623

  17. arXiv:2406.16708  [pdf, other]

    cs.LG stat.ME

    CausalFormer: An Interpretable Transformer for Temporal Causal Discovery

    Authors: Lingbai Kong, Wengen Li, Hanchen Yang, Yichao Zhang, Jihong Guan, Shuigeng Zhou

    Abstract: Temporal causal discovery is a crucial task aimed at uncovering the causal relations within time series data. The latest temporal causal discovery methods usually train deep learning models on prediction tasks to uncover the causality between time series. They capture causal relations by analyzing the parameters of some components of the trained models, e.g., attention weights and convolution weig…

    Submitted 24 June, 2024; originally announced June 2024.

  18. arXiv:2406.13478  [pdf, other]

    stat.ME

    Semiparametric Localized Principal Stratification Analysis with Continuous Strata

    Authors: Yichi Zhang, Shu Yang

    Abstract: Principal stratification is essential for revealing causal mechanisms involving post-treatment intermediate variables. Principal stratification analysis with continuous intermediate variables is increasingly common but challenging due to the infinite principal strata and the nonidentifiability and nonregularity of principal causal effects. Inspired by recent research, we resolve these challenges b…

    Submitted 19 June, 2024; originally announced June 2024.

  19. arXiv:2406.12764  [pdf, other]

    stat.ML cs.LG

    Quasi-Bayes meets Vines

    Authors: David Huk, Yuanhe Zhang, Mark Steel, Ritabrata Dutta

    Abstract: Recently proposed quasi-Bayesian (QB) methods initiated a new era in Bayesian computation by directly constructing the Bayesian predictive distribution through recursion, removing the need for expensive computations involved in sampling the Bayesian posterior distribution. This has proved to be data-efficient for univariate predictions, but extensions to multiple dimensions rely on a conditional d…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 36 pages, 2 figures

    MSC Class: 62G07

  20. arXiv:2406.03849  [pdf]

    cs.LG stat.AP stat.ML

    A Noise-robust Multi-head Attention Mechanism for Formation Resistivity Prediction: Frequency Aware LSTM

    Authors: Yongan Zhang, Junfeng Zhao, Jian Li, Xuanran Wang, Youzhuang Sun, Yuntian Chen, Dongxiao Zhang

    Abstract: The prediction of formation resistivity plays a crucial role in the evaluation of oil and gas reservoirs, identification and assessment of geothermal energy resources, groundwater detection and monitoring, and carbon capture and storage. However, traditional well logging techniques fail to measure accurate resistivity in cased boreholes, and the transient electromagnetic method for cased borehole…

    Submitted 6 June, 2024; originally announced June 2024.

  21. arXiv:2406.00633  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Improving GFlowNets for Text-to-Image Diffusion Alignment

    Authors: Dinghuai Zhang, Yizhe Zhang, Jiatao Gu, Ruixiang Zhang, Josh Susskind, Navdeep Jaitly, Shuangfei Zhai

    Abstract: Diffusion models have become the de-facto approach for generating visual data, which are trained to match the distribution of the training dataset. In addition, we also want to control generation to fulfill desired properties such as alignment to a text description, which can be specified with a black-box reward function. Prior works fine-tune pretrained diffusion models to achieve this goal throu…

    Submitted 16 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  22. arXiv:2406.00196  [pdf, other]

    stat.ME stat.AP

    A Seamless Phase II/III Design with Dose Optimization for Oncology Drug Development

    Authors: Yuhan Li, Yiding Zhang, Gu Mi, Ji Lin

    Abstract: The US FDA's Project Optimus initiative that emphasizes dose optimization prior to marketing approval represents a pivotal shift in oncology drug development. It has a ripple effect for rethinking what changes may be made to conventional pivotal trial designs to incorporate a dose optimization component. Aligned with this initiative, we propose a novel Seamless Phase II/III Design with Dose Optimi…

    Submitted 31 May, 2024; originally announced June 2024.

  23. arXiv:2405.20677  [pdf, other]

    cs.LG stat.ML

    Provably Efficient Interactive-Grounded Learning with Personalized Reward

    Authors: Mengxiao Zhang, Yuheng Zhang, Haipeng Luo, Paul Mineiro

    Abstract: Interactive-Grounded Learning (IGL) [Xie et al., 2021] is a powerful framework in which a learner aims at maximizing unobservable rewards through interacting with an environment and observing reward-dependent feedback on the taken actions. To deal with personalized rewards that are ubiquitous in applications such as recommendation systems, Maghakian et al. [2022] study a version of IGL with contex…

    Submitted 31 May, 2024; originally announced May 2024.

  24. arXiv:2405.17828  [pdf, other]

    stat.ME

    On Robust Clustering of Temporal Point Process

    Authors: Yuecheng Zhang, Guanhua Fang, Wen Yu

    Abstract: Clustering of event stream data is of great importance in many application scenarios, including but not limited to, e-commerce, electronic health, online testing, mobile music service, etc. Existing clustering algorithms fail to take outlier data into consideration and are implemented without theoretical guarantees. In this paper, we propose a robust temporal point processes clustering framework w…

    Submitted 28 May, 2024; originally announced May 2024.

  25. arXiv:2405.17591  [pdf, other]

    stat.ME

    Individualized Dynamic Mediation Analysis Using Latent Factor Models

    Authors: Yijiao Zhang, Yubai Yuan, Yuexia Zhang, Zhongyi Zhu, Annie Qu

    Abstract: Mediation analysis plays a crucial role in causal inference as it can investigate the pathways through which treatment influences outcome. Most existing mediation analysis assumes that mediation effects are static and homogeneous within populations. However, mediation effects usually change over time and exhibit significant heterogeneity in many real-world applications. Additionally, the presence…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 25 pages, 3 figures, 3 tables

  26. arXiv:2405.17479  [pdf, other]

    cs.LG cs.NE stat.ML

    A rationale from frequency perspective for grokking in training neural network

    Authors: Zhangchen Zhou, Yaoyu Zhang, Zhi-Qin John Xu

    Abstract: Grokking is the phenomenon where neural networks (NNs) initially fit the training data and later generalize to the test data during training. In this paper, we empirically provide a frequency perspective to explain the emergence of this phenomenon in NNs. The core insight is that the networks initially learn the less salient frequency components present in the test data. We observe this phenomenon a…

    Submitted 24 May, 2024; originally announced May 2024.

  27. arXiv:2405.16732  [pdf, ps, other]

    stat.ML cs.LG math.OC math.ST

    The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize

    Authors: Dongyan Huo, Yixuan Zhang, Yudong Chen, Qiaomin Xie

    Abstract: In this work, we investigate stochastic approximation (SA) with Markovian data and nonlinear updates under constant stepsize $\alpha>0$. Existing work has primarily focused on either i.i.d. data or linear update rules. We take a new perspective and carefully examine the simultaneous presence of Markovian dependency of data and nonlinear update rules, delineating how the interplay between these two stru…

    Submitted 26 May, 2024; originally announced May 2024.

  28. arXiv:2405.16730  [pdf, other]

    cs.LG cs.AI stat.AP

    Latent Energy-Based Odyssey: Black-Box Optimization via Expanded Exploration in the Energy-Based Latent Space

    Authors: Peiyu Yu, Dinghuai Zhang, Hengzhi He, Xiaojian Ma, Ruiyao Miao, Yifan Lu, Yasi Zhang, Deqian Kong, Ruiqi Gao, Jianwen Xie, Guang Cheng, Ying Nian Wu

    Abstract: Offline Black-Box Optimization (BBO) aims at optimizing a black-box function using the knowledge from a pre-collected offline dataset of function values and corresponding input designs. However, the high-dimensional and highly-multimodal input design space of black-box function pose inherent challenges for most existing methods that model and operate directly upon input designs. These issues inclu…

    Submitted 26 May, 2024; originally announced May 2024.

  29. arXiv:2405.16387  [pdf, other]

    stat.ML cs.LG

    Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference

    Authors: Xunpeng Huang, Difan Zou, Hanze Dong, Yi Zhang, Yi-An Ma, Tong Zhang

    Abstract: To generate data from trained diffusion models, most inference algorithms, such as DDPM, DDIM, and other variants, rely on discretizing the reverse SDEs or their equivalent ODEs. In this paper, we view such approaches as decomposing the entire denoising diffusion process into several segments, each corresponding to a reverse transition kernel (RTK) sampling subproblem. Specifically, DDPM uses a Ga…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 68 pages, 2 figures

  30. arXiv:2405.15050  [pdf, ps, other]

    stat.ML cs.LG

    Provably Efficient Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs

    Authors: Kihyuk Hong, Yufan Zhang, Ambuj Tewari

    Abstract: We resolve the open problem of designing a computationally efficient algorithm for infinite-horizon average-reward linear Markov Decision Processes (MDPs) with $\widetilde{O}(\sqrt{T})$ regret. Previous approaches with $\widetilde{O}(\sqrt{T})$ regret either suffer from computational inefficiency or require strong assumptions on dynamics, such as ergodicity. In this paper, we approximate the avera…

    Submitted 23 May, 2024; originally announced May 2024.

  31. arXiv:2405.14681  [pdf, other]

    cs.LG stat.ML

    Recursive PAC-Bayes: A Frequentist Approach to Sequential Prior Updates with No Information Loss

    Authors: Yi-Shan Wu, Yijie Zhang, Badr-Eddine Chérief-Abdellatif, Yevgeny Seldin

    Abstract: PAC-Bayesian analysis is a frequentist framework for incorporating prior knowledge into learning. It was inspired by Bayesian learning, which allows sequential data processing and naturally turns posteriors from one processing step into priors for the next. However, despite two and a half decades of research, the ability to update priors sequentially without losing confidence information along the…

    Submitted 23 May, 2024; originally announced May 2024.

  32. arXiv:2405.13912  [pdf, other]

    math.ST cs.IT cs.LG math.PR stat.ML

    Matrix Denoising with Doubly Heteroscedastic Noise: Fundamental Limits and Optimal Spectral Methods

    Authors: Yihan Zhang, Marco Mondelli

    Abstract: We study the matrix denoising problem of estimating the singular vectors of a rank-$1$ signal corrupted by noise with both column and row correlations. Existing works are either unable to pinpoint the exact asymptotic estimation error or, when they do so, the resulting approaches (e.g., based on whitening or singular value shrinkage) remain vastly suboptimal. On top of this, most of the literature…

    Submitted 22 May, 2024; originally announced May 2024.

  33. arXiv:2405.11681  [pdf, other]

    stat.ME math.ST

    Distributed Tensor Principal Component Analysis

    Authors: Elynn Chen, Xi Chen, Wenbo Jing, Yichen Zhang

    Abstract: As tensors become widespread in modern data analysis, Tucker low-rank Principal Component Analysis (PCA) has become essential for dimensionality reduction and structural discovery in tensor datasets. Motivated by the common scenario where large-scale tensors are distributed across diverse geographic locations, this paper investigates tensor PCA within a distributed framework where direct data pool…

    Submitted 19 May, 2024; originally announced May 2024.

  34. arXiv:2405.09003  [pdf, other]

    stat.ME math.ST stat.AP

    Nonparametric Inference on Dose-Response Curves Without the Positivity Condition

    Authors: Yikun Zhang, Yen-Chi Chen, Alexander Giessing

    Abstract: Existing statistical methods in causal inference often rely on the assumption that every individual has some chance of receiving any treatment level regardless of its associated covariates, which is known as the positivity condition. This assumption could be violated in observational studies with continuous treatments. In this paper, we present a novel integral estimator of the causal effects with…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 74 pages (23 pages for the main paper), 4 figures

    MSC Class: 62G05 (Primary) 62D20; 62G20 (Secondary)

  35. arXiv:2405.07549  [pdf, other]

    q-fin.RM math.PR stat.AP

    On Joint Marginal Expected Shortfall and Associated Contribution Risk Measures

    Authors: Tong Pu, Yifei Zhang, Yiying Zhang

    Abstract: Systemic risk is the risk that a company- or industry-level risk could trigger a huge collapse of another or even the whole institution. Various systemic risk measures have been proposed in the literature to quantify the domino and (relative) spillover effects induced by systemic risks such as the well-known CoVaR, CoES, MES and CoD risk measures, and associated contribution measures. This paper p…

    Submitted 13 May, 2024; originally announced May 2024.

  36. arXiv:2405.07343  [pdf, other]

    eess.SY cs.LG stat.ME

    Graph neural networks for power grid operational risk assessment under evolving grid topology

    Authors: Yadong Zhang, Pranav M Karve, Sankaran Mahadevan

    Abstract: This article investigates the ability of graph neural networks (GNNs) to identify risky conditions in a power grid over the subsequent few hours, without explicit, high-resolution information regarding future generator on/off status (grid topology) or power dispatch decisions. The GNNs are trained using supervised learning, to predict the power grid's aggregated bus-level (either zonal or system-l…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Manuscript submitted to Applied Energy

  37. arXiv:2405.05679  [pdf, other]

    math.ST math.PR stat.CO stat.ML

    Non-asymptotic estimates for accelerated high order Langevin Monte Carlo algorithms

    Authors: Ariel Neufeld, Ying Zhang

    Abstract: In this paper, we propose two new algorithms, namely aHOLA and aHOLLA, to sample from high-dimensional target distributions with possibly super-linearly growing potentials. We establish non-asymptotic convergence bounds for aHOLA in Wasserstein-1 and Wasserstein-2 distances with rates of convergence equal to $1+q/2$ and $1/2+q/4$, respectively, under a local Hölder condition with exponent…

    Submitted 9 May, 2024; originally announced May 2024.

  38. arXiv:2405.03624  [pdf, ps, other]

    cs.LG math.OC q-fin.ST stat.ML

    $\epsilon$-Policy Gradient for Online Pricing

    Authors: Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

    Abstract: Combining model-based and model-free reinforcement learning approaches, this paper proposes and analyzes an $\epsilon$-policy gradient algorithm for the online pricing learning task. The algorithm extends $\epsilon$-greedy algorithm by replacing greedy exploitation with gradient descent step and facilitates learning via model inference. We optimize the regret of the proposed algorithm by quantifying the explora…

    Submitted 6 May, 2024; originally announced May 2024.

    MSC Class: 62J12; 68Q32; 65Y20

  39. arXiv:2404.15060  [pdf, other]

    stat.ME math.ST

    Fast and reliable confidence intervals for a variance component or proportion

    Authors: Yiqiao Zhang, Karl Oskar Ekvall, Aaron J. Molstad

    Abstract: We show that confidence intervals for a variance component or proportion, with asymptotically correct uniform coverage probability, can be obtained by inverting certain test-statistics based on the score for the restricted likelihood. The results apply in settings where the variance or proportion is near or at the boundary of the parameter set. Simulations indicate the proposed test-statistics are…

    Submitted 23 April, 2024; originally announced April 2024.

  40. arXiv:2404.12312  [pdf, ps, other]

    cs.LG math.OC stat.ML

    A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimiax Optimization

    Authors: Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen

    Abstract: This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence o…

    Submitted 25 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Submitted

  41. arXiv:2404.11212  [pdf]

    stat.AP

    Deciphering seasonal depression variations and interplays between weather changes, physical activity, and depression severity in real-world settings: Learnings from RADAR-MDD longitudinal mobile health study

    Authors: Yuezhou Zhang, Amos A. Folarin, Yatharth Ranjan, Nicholas Cummins, Zulqarnain Rashid, Pauline Conde, Callum Stewart, Shaoxiong Sun, Srinivasan Vairavan, Faith Matcham, Carolin Oetzmann, Sara Siddi, Femke Lamers, Sara Simblett, Til Wykes, David C. Mohr, Josep Maria Haro, Brenda W. J. H. Penninx, Vaibhav A. Narayan, Matthew Hotopf, Richard J. B. Dobson, Abhishek Pratap, RADAR-CNS consortium

    Abstract: Prior research has shown that changes in seasons and weather can have a significant impact on depression severity. However, findings are inconsistent across populations, and the interplay between weather, behavior, and depression has not been fully quantified. This study analyzed real-world data from 428 participants (a subset; 68.7% of the cohort) in the RADAR-MDD longitudinal mobile health study…

    Submitted 17 April, 2024; originally announced April 2024.

  42. arXiv:2404.10985  [pdf, ps, other]

    cs.CV stat.ML

    Pixel-Wise Symbol Spotting via Progressive Points Location for Parsing CAD Images

    Authors: Junbiao Pang, Zailin Dong, Jiaxin Deng, Mengyuan Zhu, Yunwei Zhang

    Abstract: Parsing Computer-Aided Design (CAD) drawings is a fundamental step for CAD revision, semantic-based management, and the generation of 3D prototypes in both the architecture and engineering industries. Labeling symbols from a CAD drawing is a challenging yet notorious task from a practical point of view. In this work, we propose to label and spot symbols from CAD images that are converted from CAD…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 10 pages, 10 figures, 6 tables

  43. arXiv:2404.06023  [pdf, other]

    stat.ML cs.LG math.OC math.PR

    Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA

    Authors: Yixuan Zhang, Dongyan Huo, Yudong Chen, Qiaomin Xie

    Abstract: Motivated by Q-learning, we study nonsmooth contractive stochastic approximation (SA) with constant stepsize. We focus on two important classes of dynamics: 1) nonsmooth contractive SA with additive noise, and 2) synchronous and asynchronous Q-learning, which features both additive and multiplicative noise. For both dynamics, we establish weak convergence of the iterates to a stationary limit dist…

    Submitted 24 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: ACM SIGMETRICS 2024. 71 pages, 3 figures

  44. arXiv:2404.03764  [pdf, other]

    cs.LG stat.ME stat.ML

    Covariate-Elaborated Robust Partial Information Transfer with Conditional Spike-and-Slab Prior

    Authors: Ruqian Zhang, Yijiao Zhang, Annie Qu, Zhongyi Zhu, Juan Shen

    Abstract: The popularity of transfer learning stems from the fact that it can borrow information from useful auxiliary datasets. Existing statistical transfer learning methods usually adopt a global similarity measure between the source data and the target data, which may lead to inefficiency when only partial information is shared. In this paper, we propose a novel Bayesian transfer learning method named `…

    Submitted 21 August, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: 35 pages, 4 figures

  45. Statistical Inference For Noisy Matrix Completion Incorporating Auxiliary Information

    Authors: Shujie Ma, Po-Yao Niu, Yichong Zhang, Yinchu Zhu

    Abstract: This paper investigates statistical inference for noisy matrix completion in a semi-supervised model when auxiliary covariates are available. The model consists of two parts. One part is a low-rank matrix induced by unobserved latent factors; the other part models the effects of the observed covariates through a coefficient matrix which is composed of high-dimensional column vectors. We model the…

    Submitted 21 March, 2024; originally announced March 2024.

  46. arXiv:2403.14593  [pdf, other]

    cs.LG stat.ML

    Rethinking Adversarial Inverse Reinforcement Learning: Policy Imitation, Transferable Reward Recovery and Algebraic Equilibrium Proof

    Authors: Yangchun Zhang, Qiang Liu, Weiming Li, Yirui Zhou

    Abstract: Adversarial inverse reinforcement learning (AIRL) stands as a cornerstone approach in imitation learning, yet it faces criticisms from prior studies. In this paper, we rethink AIRL and respond to these criticisms. Criticism 1 lies in Inadequate Policy Imitation. We show that substituting the built-in algorithm with soft actor-critic (SAC) during policy updating (requires multi-iterations) signific…

    Submitted 14 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  47. arXiv:2403.14573  [pdf, other]

    stat.ME stat.AP stat.ML

    A Transfer Learning Causal Approach to Evaluate Racial/Ethnic and Geographic Variation in Outcomes Following Congenital Heart Surgery

    Authors: Larry Han, Yi Zhang, Meena Nathan, John E. Mayer, Jr., Sara K. Pasquali, Katya Zelevinsky, Rui Duan, Sharon-Lise T. Normand

    Abstract: Congenital heart defects (CHD) are the most prevalent birth defects in the United States and surgical outcomes vary considerably across the country. The outcomes of treatment for CHD differ for specific patient subgroups, with non-Hispanic Black and Hispanic populations experiencing higher rates of mortality and morbidity. A valid comparison of outcomes within racial/ethnic subgroups is difficult…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 26 pages

  48. arXiv:2403.12166  [pdf, other]

    cs.LG stat.ML

    The Power of Few: Accelerating and Enhancing Data Reweighting with Coreset Selection

    Authors: Mohammad Jafari, Yimeng Zhang, Yihua Zhang, Sijia Liu

    Abstract: As machine learning tasks continue to evolve, the trend has been to gather larger datasets and train increasingly larger models. While this has led to advancements in accuracy, it has also escalated computational costs to unsustainable levels. Addressing this, our work aims to strike a delicate balance between computational efficiency and model accuracy, a persisting challenge in the field. We int…

    Submitted 30 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to ICASSP 2024

  49. arXiv:2403.12143  [pdf, other]

    cs.LG cs.AI stat.ML

    Graph Neural Networks for Learning Equivariant Representations of Neural Networks

    Authors: Miltiadis Kofinas, Boris Knyazev, Yan Zhang, Yunlu Chen, Gertjan J. Burghouts, Efstratios Gavves, Cees G. M. Snoek, David W. Zhang

    Abstract: Neural networks that process the parameters of other neural networks find applications in domains as diverse as classifying implicit neural representations, generating neural network weights, and predicting generalization errors. However, existing approaches either overlook the inherent permutation symmetry in the neural network or rely on intricate weight-sharing patterns to achieve equivariance,…

    Submitted 23 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: In ICLR 2024. Source code: https://github.com/mkofinas/neural-graphs

  50. arXiv:2403.09869  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    Mind the GAP: Improving Robustness to Subpopulation Shifts with Group-Aware Priors

    Authors: Tim G. J. Rudner, Ya Shi Zhang, Andrew Gordon Wilson, Julia Kempe

    Abstract: Machine learning models often perform poorly under subpopulation shifts in the data distribution. Developing methods that allow machine learning models to better generalize to such shifts is crucial for safe deployment in real-world settings. In this paper, we develop a family of group-aware prior (GAP) distributions over neural network parameters that explicitly favor models that generalize well…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Published in Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)