(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 216 results for author: Xing, P

.
  1. arXiv:2407.13642  [pdf, other

    cs.CV

    Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

    Authors: Xiaoyu Zhu, Hao Zhou, Pengfei Xing, Long Zhao, Hao Xu, Junwei Liang, Alexander Hauptmann, Ting Liu, Andrew Gallagher

    Abstract: In this paper, we investigate the use of diffusion models which are pre-trained on large-scale image-caption pairs for open-vocabulary 3D semantic understanding. We propose a novel method, namely Diff2Scene, which leverages frozen representations from text-image generative models, along with salient-aware and geometric-aware masks, for open-vocabulary 3D semantic segmentation and visual grounding… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2407.10960  [pdf, other

    cs.LG cs.CL cs.DC

    Fast Matrix Multiplications for Lookup Table-Quantized LLMs

    Authors: Han Guo, William Brandon, Radostin Cholakov, Jonathan Ragan-Kelley, Eric P. Xing, Yoon Kim

    Abstract: The deployment of large language models (LLMs) is often constrained by memory bandwidth, where the primary bottleneck is the cost of transferring model parameters from the GPU's global memory to its registers. When coupled with custom kernels that fuse the dequantization and matmul operations, weight-only quantization can thus enable faster inference by reducing the amount of memory movement. Howe… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2407.00924  [pdf, other

    cs.CL

    EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction

    Authors: Jingheng Ye, Shang Qin, Yinghui Li, Xuxin Cheng, Libo Qin, Hai-Tao Zheng, Peng Xing, Zishan Xu, Guo Cheng, Zhao Wei

    Abstract: Existing studies explore the explainability of Grammatical Error Correction (GEC) in a limited scenario, where they ignore the interaction between corrections and explanations. To bridge the gap, this paper introduces the task of EXplainable GEC (EXGEC), which focuses on the integral role of both correction and explanation tasks. To facilitate the task, we propose EXCGEC, a tailored benchmark for… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 22 pages, 10 tables, 9 figures. Under review

  4. arXiv:2407.00788  [pdf, other

    cs.CV

    InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation

    Authors: Haofan Wang, Peng Xing, Renyuan Huang, Hao Ai, Qixun Wang, Xu Bai

    Abstract: Style transfer is an inventive process designed to create an image that maintains the essence of the original while embracing the visual style of another. Although diffusion models have demonstrated impressive generative power in personalized subject-driven or style-driven applications, existing state-of-the-art methods still encounter difficulties in achieving a seamless balance between content p… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: Technical Report

  5. arXiv:2406.20098  [pdf, other

    cs.CV cs.AI cs.CL

    Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

    Authors: Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen

    Abstract: Multimodal large language models (MLLMs) have shown impressive success across modalities such as image, video, and audio in a variety of understanding and generation tasks. However, current MLLMs are surprisingly poor at understanding webpage screenshots and generating their corresponding HTML code. To address this problem, we propose Web2Code, a benchmark consisting of a new large-scale webpage-t… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Website at https://mbzuai-llm.github.io/webpage2code/

  6. arXiv:2406.09455  [pdf, other

    cs.CV cs.AI cs.CL

    Pandora: Towards General World Model with Natural Language Actions and Video States

    Authors: Jiannan Xiang, Guangyi Liu, Yi Gu, Qiyue Gao, Yuting Ning, Yuheng Zha, Zeyu Feng, Tianhua Tao, Shibo Hao, Yemin Shi, Zhengzhong Liu, Eric P. Xing, Zhiting Hu

    Abstract: World models simulate future states of the world in response to different actions. They facilitate interactive content creation and provides a foundation for grounded, long-horizon reasoning. Current foundation models do not fully meet the capabilities of general world models: large language models (LLMs) are constrained by their reliance on language modality and their limited understanding of the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Website: https://world-model.maitrix.org/

  7. arXiv:2406.04608  [pdf, other

    cs.CV

    A Recover-then-Discriminate Framework for Robust Anomaly Detection

    Authors: Peng Xing, Dong Zhang, Jinhui Tang, Zechao li

    Abstract: Anomaly detection (AD) has been extensively studied and applied in a wide range of scenarios in the recent past. However, there are still gaps between achieved and desirable levels of recognition accuracy for making AD for practical applications. In this paper, we start from an insightful analysis of two types of fundamental yet representative failure cases in the baseline model, and reveal reason… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 17 pages, 10 figures

  8. arXiv:2406.02881  [pdf, other

    cs.CV

    Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter

    Authors: Peng Xing, Ning Wang, Jianbo Ouyang, Zechao Li

    Abstract: The remarkable advancement in text-to-image generation models significantly boosts the research in ID customization generation. However, existing personalization methods cannot simultaneously satisfy high fidelity and high-efficiency requirements. Their main bottleneck lies in the prompt image encoder, which produces weak alignment signals with the text-to-image model and significantly increased m… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: technical report

  9. arXiv:2406.01746  [pdf

    physics.med-ph

    3D transcranial Dynamic Ultrasound Localization Microscopy in the mouse brain using a Row-Column Array

    Authors: Alice Wu, Jonathan Porée, Gerardo Ramos-Palacios, Chloé Bourquin, Nin Ghigo, Alexis Leconte, Paul Xing, Abbas F. Sadikot, Michaël Chassé, Jean Provost

    Abstract: The role of brain hemodynamics in neurodegenerative diseases cannot be fully assessed using existing imaging technologies. Recently, 2D Dynamic Ultrasound Localization Microscopy (DULM) has allowed for the quantitative mapping of the pulsatile flow at sub-wavelength resolution. However, to obtain accurate velocity estimates, 3D imaging is more adapted, especially for complex vascularized organs li… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures

  10. arXiv:2406.00519  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Discrete Concepts in Latent Hierarchical Models

    Authors: Lingjing Kong, Guangyi Chen, Biwei Huang, Eric P. Xing, Yuejie Chi, Kun Zhang

    Abstract: Learning concepts from natural high-dimensional data (e.g., images) holds potential in building human-aligned and interpretable machine learning models. Despite its encouraging prospect, formalization and theoretical insights into this crucial task are still lacking. In this work, we formalize concepts as discrete latent causal variables that are related via a hierarchical causal model that encode… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  11. arXiv:2404.03547  [pdf, other

    eess.IV eess.SP

    Towards Transcranial 3D Ultrasound Localization Microscopy of the Nonhuman Primate Brain

    Authors: Paul Xing, Vincent Perrot, Adan Ulises Dominguez-Vargas, Stephan Quessy, Numa Dancause, Jean Provost

    Abstract: Hemodynamic changes occur in stroke and neurodegenerative diseases. Developing imaging techniques allowing the in vivo visualization and quantification of cerebral blood flow would help better understand the underlying mechanism of those cerebrovascular diseases. 3D ultrasound localization microscopy (ULM) is a novel technology that can map the microvasculature of the brain at large depth and has… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  12. arXiv:2404.02852  [pdf, other

    cs.LG

    Toward Inference-optimal Mixture-of-Expert Large Language Models

    Authors: Longfei Yun, Yonghao Zhuang, Yao Fu, Eric P Xing, Hao Zhang

    Abstract: Mixture-of-Expert (MoE) based large language models (LLMs), such as the recent Mixtral and DeepSeek-MoE, have shown great promise in scaling model size without suffering from the quadratic growth of training cost of dense transformers. Like dense models, training MoEs requires answering the same question: given a training budget, what is the optimal allocation on the model size and number of token… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 15 pages, 8 figures

  13. arXiv:2402.19009  [pdf, other

    cs.LG cs.AI

    Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-Decoding

    Authors: Guangyi Liu, Yu Wang, Zeyu Feng, Qiyu Wu, Liping Tang, Yuan Gao, Zhen Li, Shuguang Cui, Julian McAuley, Zichao Yang, Eric P. Xing, Zhiting Hu

    Abstract: The vast applications of deep generative models are anchored in three core capabilities -- generating new instances, reconstructing inputs, and learning compact representations -- across various data types, such as discrete text/protein sequences and continuous images. Existing model families, like variational autoencoders (VAEs), generative adversarial networks (GANs), autoregressive models, and… ▽ More

    Submitted 5 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: ICML 2024 camera-ready. Code is available at https://github.com/guangyliu/EDDPM

  14. arXiv:2402.16840  [pdf, other

    cs.CL

    MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

    Authors: Omkar Thawakar, Ashmal Vayani, Salman Khan, Hisham Cholakal, Rao M. Anwer, Michael Felsberg, Tim Baldwin, Eric P. Xing, Fahad Shahbaz Khan

    Abstract: "Bigger the better" has been the predominant trend in recent Large Language Models (LLMs) development. However, LLMs do not suit well for scenarios that require on-device processing, energy efficiency, low memory footprint, and response efficiency. These requisites are crucial for privacy, security, and sustainable deployment. This paper explores the "less is more" paradigm by addressing the chall… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Code available at : https://github.com/mbzuai-oryx/MobiLlama

  15. arXiv:2402.11422  [pdf, other

    cs.CL

    Mitigating Catastrophic Forgetting in Multi-domain Chinese Spelling Correction by Multi-stage Knowledge Transfer Framework

    Authors: Peng Xing, Yinghui Li, Shirong Ma, Xinnian Liang, Haojing Huang, Yangning Li, Hai-Tao Zheng, Wenhao Jiang, Ying Shen

    Abstract: Chinese Spelling Correction (CSC) aims to detect and correct spelling errors in given sentences. Recently, multi-domain CSC has gradually attracted the attention of researchers because it is more practicable. In this paper, we focus on the key flaw of the CSC model when adapting to multi-domain scenarios: the tendency to forget previously acquired knowledge upon learning new domain-specific knowle… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  16. arXiv:2402.09359  [pdf, other

    eess.IV cs.CV

    Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy

    Authors: Brice Rauby, Paul Xing, Jonathan Porée, Maxime Gasse, Jean Provost

    Abstract: Ultrasound Localization Microscopy (ULM) is a non-invasive technique that allows for the imaging of micro-vessels in vivo, at depth and with a resolution on the order of ten microns. ULM is based on the sub-resolution localization of individual microbubbles injected in the bloodstream. Mapping the whole angioarchitecture requires the accumulation of microbubbles trajectories from thousands of fram… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    ACM Class: I.4.9

  17. arXiv:2401.10389  [pdf, other

    eess.IV physics.med-ph

    Inverse Problem Approach to Aberration Correction for in vivo Transcranial Imaging Based on a Sparse Representation of Contrast-enhanced Ultrasound Data

    Authors: Paul Xing, Antoine Malescot, Eric Martineau, Ravi Rungta, Jean Provost

    Abstract: Transcranial ultrasound imaging is currently limited by attenuation and aberration induced by the skull. First used in contrast-enhanced ultrasound (CEUS), highly echoic microbubbles allowed for the development of novel imaging modalities such as ultrasound localization microscopy (ULM). Herein, we develop an inverse problem approach to aberration correction (IPAC) that leverages the sparsity of m… ▽ More

    Submitted 14 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  18. arXiv:2312.06550  [pdf, other

    cs.CL cs.AI cs.LG

    LLM360: Towards Fully Transparent Open-Source LLMs

    Authors: Zhengzhong Liu, Aurick Qiao, Willie Neiswanger, Hongyi Wang, Bowen Tan, Tianhua Tao, Junbo Li, Yuqi Wang, Suqi Sun, Omkar Pangarkar, Richard Fan, Yi Gu, Victor Miller, Yonghao Zhuang, Guowei He, Haonan Li, Fajri Koto, Liping Tang, Nikhil Ranjan, Zhiqiang Shen, Xuguang Ren, Roberto Iriondo, Cun Mu, Zhiting Hu, Mark Schulze , et al. (3 additional authors not shown)

    Abstract: The recent surge in open-source Large Language Models (LLMs), such as LLaMA, Falcon, and Mistral, provides diverse options for AI practitioners and researchers. However, most LLMs have only released partial artifacts, such as the final model weights or inference code, and technical reports increasingly limit their scope to high-level design choices and surface statistics. These choices hinder prog… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  19. arXiv:2312.01628  [pdf

    physics.bio-ph

    A unified framework combining coherent compounding, harmonic imaging and angular coherence for simultaneous high-quality B-mode and tissue Doppler in ultrafast echocardiography

    Authors: Michael Mougharbel, Jonathan Porée, Stephen A. Lee, Paul Xing, Alice Wu, Jean-Claude Tardif, Jean Provost

    Abstract: Various methods have been proposed to enhance image quality in ultrafast ultrasound. Coherent compounding can improve image quality using multiple steered diverging transmits when motion occurring between transmits is corrected for. Harmonic imaging, a standard technique in conventional focused echocardiography, has been adapted for ultrafast imaging, reducing clutter. Coherence-based approaches h… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  20. arXiv:2311.12023  [pdf, other

    cs.CL cs.LG

    LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning

    Authors: Han Guo, Philip Greengard, Eric P. Xing, Yoon Kim

    Abstract: We propose a simple approach for memory-efficient adaptation of pretrained language models. Our approach uses an iterative algorithm to decompose each pretrained matrix into a high-precision low-rank component and a memory-efficient quantized component. During finetuning, the quantized component remains fixed and only the low-rank component is updated. We present an integer linear programming form… ▽ More

    Submitted 30 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

  21. Dynamic Imaging using any Ultrasound Localization Microscopy Dataset

    Authors: Nin Ghigo, Gerardo Ramos-Palacios, Chloé Bourquin, Paul Xing, Alice Wu, Nelson Cortés, Hugo Ladret, Lamyae Ikan, Christian Casanova, Jonathan Porée, Abbas Sadikot, Jean Provost

    Abstract: Ultrasound Localization Microscopy (ULM) relies on the injection of microbubbles (MBs) to obtain highly resolved density maps of blood circulation in vivo, with a resolution that can reach 10 μみゅーm ~ λらむだ/10 in the rodent brain. Static mean velocity maps can be extracted but are intrinsically biased by potential significant changes in the number of MBs detected during the cardiac cycle. Dynamic ULM (DUL… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 11 pages, 6 figures, 4 tables

    Journal ref: Dynamic Ultrasound Localization Microscopy Without ECG-Gating,Ultrasound in Medicine & Biology, 2024

  22. arXiv:2310.16427  [pdf, other

    cs.CL

    PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization

    Authors: Xinyuan Wang, Chenxi Li, Zhen Wang, Fan Bai, Haotian Luo, Jiayou Zhang, Nebojsa Jojic, Eric P. Xing, Zhiting Hu

    Abstract: Highly effective, task-specific prompts are often heavily engineered by experts to integrate detailed instructions and domain insights based on a deep understanding of both instincts of large language models (LLMs) and the intricacies of the target task. However, automating the generation of such expert-level prompts remains elusive. Existing prompt optimization methods tend to overlook the depth… ▽ More

    Submitted 7 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 34 pages, 10 figures

  23. arXiv:2310.11340  [pdf, other

    stat.ML cs.LG

    Contextualized Machine Learning

    Authors: Benjamin Lengerich, Caleb N. Ellington, Andrea Rubbi, Manolis Kellis, Eric P. Xing

    Abstract: We examine Contextualized Machine Learning (ML), a paradigm for learning heterogeneous and context-dependent effects. Contextualized ML estimates heterogeneous functions by applying deep learning to the meta-relationship between contextual information and context-specific parametric models. This is a form of varying-coefficient modeling that unifies existing frameworks including cluster analysis a… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  24. arXiv:2310.07918  [pdf, other

    cs.LG cs.AI stat.ML

    Contextualized Policy Recovery: Modeling and Interpreting Medical Decisions with Adaptive Imitation Learning

    Authors: Jannik Deuschel, Caleb N. Ellington, Yingtao Luo, Benjamin J. Lengerich, Pascal Friederich, Eric P. Xing

    Abstract: Interpretable policy learning seeks to estimate intelligible decision policies from observed actions; however, existing models force a tradeoff between accuracy and interpretability, limiting data-driven interpretations of human decision-making processes. Fundamentally, existing approaches are burdened by this tradeoff because they represent the underlying decision process as a universal policy, w… ▽ More

    Submitted 7 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  25. arXiv:2310.03294  [pdf, other

    cs.LG cs.AI cs.DC

    DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training

    Authors: Dacheng Li, Rulin Shao, Anze Xie, Eric P. Xing, Xuezhe Ma, Ion Stoica, Joseph E. Gonzalez, Hao Zhang

    Abstract: FlashAttention (Dao, 2023) effectively reduces the quadratic peak memory usage to linear in training transformer-based large language models (LLMs) on a single GPU. In this paper, we introduce DISTFLASHATTN, a distributed memory-efficient attention mechanism optimized for long-context LLMs training. We propose three key techniques: token-level workload balancing, overlapping key-value communicatio… ▽ More

    Submitted 31 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  26. arXiv:2310.03163  [pdf, other

    cs.LG

    FedNAR: Federated Optimization with Normalized Annealing Regularization

    Authors: Junbo Li, Ang Li, Chong Tian, Qirong Ho, Eric P. Xing, Hongyi Wang

    Abstract: Weight decay is a standard technique to improve generalization performance in modern deep neural network optimization, and is also widely adopted in federated learning (FL) to prevent overfitting in local clients. In this paper, we first explore the choices of weight decay and identify that weight decay value appreciably influences the convergence of existing FL algorithms. While preventing overfi… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Thirty-seventh Conference on Neural Information Processing Systems

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems, 2023

  27. arXiv:2309.11998  [pdf, other

    cs.CL cs.AI

    LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

    Authors: Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang

    Abstract: Studying how people interact with large language models (LLMs) in real-world scenarios is increasingly important due to their widespread use in various applications. In this paper, we introduce LMSYS-Chat-1M, a large-scale dataset containing one million real-world conversations with 25 state-of-the-art LLMs. This dataset is collected from 210K unique IP addresses in the wild on our Vicuna demo and… ▽ More

    Submitted 10 March, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  28. arXiv:2308.15324  [pdf, other

    cs.AI

    Federated Neuro-Symbolic Learning

    Authors: Pengwei Xing, Songtao Lu, Han Yu

    Abstract: Neuro-symbolic learning (NSL) models complex symbolic rule patterns into latent variable distributions by neural networks, which reduces rule search space and generates unseen rules to improve downstream task performance. Centralized NSL learning involves directly acquiring data from downstream tasks, which is not feasible for federated learning (FL). To address this limitation, we shift the focus… ▽ More

    Submitted 27 May, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: accepted by ICML 2024

  29. arXiv:2308.02724  [pdf, other

    physics.med-ph eess.SP

    A Tracking prior to Localization workflow for Ultrasound Localization Microscopy

    Authors: Alexis Leconte, Jonathan Porée, Brice Rauby, Alice Wu, Nin Ghigo, Paul Xing, Chloé Bourquin, Gerardo Ramos-Palacios, Abbas F. Sadikot, Jean Provost

    Abstract: Ultrasound Localization Microscopy (ULM) has proven effective in resolving microvascular structures and local mean velocities at sub-diffraction-limited scales, offering high-resolution imaging capabilities. Dynamic ULM (DULM) enables the creation of angiography or velocity movies throughout cardiac cycles. Currently, these techniques rely on a Localization-and-Tracking (LAT) workflow consisting i… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  30. arXiv:2306.05685  [pdf, other

    cs.CL cs.AI

    Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

    Authors: Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric P. Xing, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

    Abstract: Evaluating large language model (LLM) based chat assistants is challenging due to their broad capabilities and the inadequacy of existing benchmarks in measuring human preferences. To address this, we explore using strong LLMs as judges to evaluate these models on more open-ended questions. We examine the usage and limitations of LLM-as-a-judge, including position, verbosity, and self-enhancement… ▽ More

    Submitted 23 December, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks Track

  31. arXiv:2306.04898  [pdf, other

    cs.LG cs.CV

    Understanding Masked Autoencoders via Hierarchical Latent Variable Models

    Authors: Lingjing Kong, Martin Q. Ma, Guangyi Chen, Eric P. Xing, Yuejie Chi, Louis-Philippe Morency, Kun Zhang

    Abstract: Masked autoencoder (MAE), a simple and effective self-supervised learning framework based on the reconstruction of masked image regions, has recently achieved prominent success in a variety of vision tasks. Despite the emergence of intriguing empirical observations on MAE, a theoretically principled understanding is still lacking. In this work, we formally characterize and justify existing empiric… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: CVPR 2023 Highlight

  32. arXiv:2305.02538  [pdf, other

    cs.LG

    Cuttlefish: Low-Rank Model Training without All the Tuning

    Authors: Hongyi Wang, Saurabh Agarwal, Pongsakorn U-chupala, Yoshiki Tanaka, Eric P. Xing, Dimitris Papailiopoulos

    Abstract: Recent research has shown that training low-rank neural networks can effectively reduce the total number of trainable parameters without sacrificing predictive accuracy, resulting in end-to-end speedups. However, low-rank model training necessitates adjusting several additional factorization hyperparameters, such as the rank of the factorization at each layer. In this paper, we tackle this challen… ▽ More

    Submitted 5 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted for presentation at MLSys 2023

  33. arXiv:2302.04228  [pdf, other

    cs.LG

    Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach

    Authors: Han Guo, Philip Greengard, Hongyi Wang, Andrew Gelman, Yoon Kim, Eric P. Xing

    Abstract: The canonical formulation of federated learning treats it as a distributed optimization problem where the model parameters are optimized against a global loss function that decomposes across client loss functions. A recent alternative formulation instead treats federated learning as a distributed inference problem, where the goal is to infer a global posterior from partitioned client data (Al-Shed… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  34. arXiv:2301.02654  [pdf, other

    cs.LG

    Does compressing activations help model parallel training?

    Authors: Song Bian, Dacheng Li, Hongyi Wang, Eric P. Xing, Shivaram Venkataraman

    Abstract: Large-scale Transformer models are known for their exceptional performance in a range of tasks, but training them can be difficult due to the requirement for communication-intensive model parallelism. One way to improve training speed is to compress the message size in communication. Previous approaches have primarily focused on compressing gradients in a data parallelism setting, but compression… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: 16 pages, 5 figures

  35. A novel method for searching the $Ξくしー_c^{0/+}$-$Ξくしー_c^{\prime 0/+}$ mixing effect in the angular distribution analysis of a four-body $Ξくしー_c^{0/+}$ decay

    Authors: Zhi Peng Xing, Yu ji Shi

    Abstract: In this work, we raised a novel method for searching the $Ξくしー^{0+}_c$-$Ξくしー_c^{0+\prime}$ mixing effect in an angular distribution analysis of the $Ξくしー_c\toΞくしー^{(\prime)}(Λらむだπぱい)\ell^+νにゅー$ decay, where the mixing effect can be observed by the appearance of the $Ξくしー^{\prime}$ resonant. Armed with this angular distribution, the decay branching fraction and the forward-backward asymmetry are predicted. We pointed out… ▽ More

    Submitted 17 December, 2022; originally announced December 2022.

    Comments: 7 pages, 3 figures

  36. arXiv:2212.04875  [pdf, other

    cs.CV cs.AI

    Expeditious Saliency-guided Mix-up through Random Gradient Thresholding

    Authors: Minh-Long Luu, Zeyi Huang, Eric P. Xing, Yong Jae Lee, Haohan Wang

    Abstract: Mix-up training approaches have proven to be effective in improving the generalization ability of Deep Neural Networks. Over the years, the research community expands mix-up methods into two directions, with extensive efforts to improve saliency-guided procedures but minimal focus on the arbitrary path, leaving the randomization domain unexplored. In this paper, inspired by the superior qualities… ▽ More

    Submitted 10 August, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: Accepted Long paper at 2nd Practical-DL Workshop at AAAI 2023

  37. Puffy: A Step-by-step Guide to Craft Bio-inspired Artifacts with Interactive Materiality

    Authors: Sark Pangrui Xing, Bart van Dijk, Pengcheng An, Miguel Bruns, Yaliang Chuang, Stephen Jia Wang

    Abstract: A rising number of HCI scholars have begun to use materiality as a starting point for exploring the design's potential and restrictions. Despite the theoretical flourishing, the practical design process and instruction for beginner practitioners are still in scarcity. We leveraged the pictorial format to illustrate our crafting process of Puffy, a bio-inspired artifact that features a cilia-mimeti… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: 17th International Conference On Tangible Embedded And Embodied Interaction

  38. arXiv:2211.05322  [pdf, other

    cs.LG cs.DC

    On Optimizing the Communication of Model Parallelism

    Authors: Yonghao Zhuang, Hexu Zhao, Lianmin Zheng, Zhuohan Li, Eric P. Xing, Qirong Ho, Joseph E. Gonzalez, Ion Stoica, Hao Zhang

    Abstract: We study a novel and important communication pattern in large-scale model-parallel deep learning (DL), which we call cross-mesh resharding. This pattern emerges when the two paradigms of model parallelism - intra-operator and inter-operator parallelism - are combined to support large models on large clusters. In cross-mesh resharding, a sharded tensor needs to be sent from a source device mesh to… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  39. arXiv:2211.01452  [pdf, other

    cs.LG cs.CR

    MPCFormer: fast, performant and private Transformer inference with MPC

    Authors: Dacheng Li, Rulin Shao, Hongyi Wang, Han Guo, Eric P. Xing, Hao Zhang

    Abstract: Enabling private inference is crucial for many cloud inference services that are based on Transformer models. However, existing private inference solutions can increase the inference latency by more than 60x or significantly compromise the inference quality. In this paper, we design the framework MPCFORMER as a practical solution, using Secure Multi-Party Computation (MPC) and Knowledge Distillati… ▽ More

    Submitted 16 March, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  40. arXiv:2210.10495  [pdf, other

    cs.CV

    ADPS: Asymmetric Distillation Post-Segmentation for Image Anomaly Detection

    Authors: Peng Xing, Hao Tang, Jinhui Tang, Zechao Li

    Abstract: Knowledge Distillation-based Anomaly Detection (KDAD) methods rely on the teacher-student paradigm to detect and segment anomalous regions by contrasting the unique features extracted by both networks. However, existing KDAD methods suffer from two main limitations: 1) the student network can effortlessly replicate the teacher network's representations, and 2) the features of the teacher network s… ▽ More

    Submitted 24 July, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 11pages,9 figures

  41. arXiv:2210.04325  [pdf, other

    cs.CL cs.AI cs.LG

    ASDOT: Any-Shot Data-to-Text Generation with Pretrained Language Models

    Authors: Jiannan Xiang, Zhengzhong Liu, Yucheng Zhou, Eric P. Xing, Zhiting Hu

    Abstract: Data-to-text generation is challenging due to the great variety of the input data in terms of domains (e.g., finance vs sports) or schemata (e.g., diverse predicates). Recent end-to-end neural methods thus require substantial training examples to learn to disambiguate and describe the data. Yet, real-world data-to-text problems often suffer from various data-scarce issues: one may have access to o… ▽ More

    Submitted 22 October, 2022; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  42. arXiv:2209.12441  [pdf, other

    cs.CV

    Visual Anomaly Detection Via Partition Memory Bank Module and Error Estimation

    Authors: Peng Xing, Zechao Li

    Abstract: Reconstruction method based on the memory module for visual anomaly detection attempts to narrow the reconstruction error for normal samples while enlarging it for anomalous samples. Unfortunately, the existing memory module is not fully applicable to the anomaly detection task, and the reconstruction error of the anomaly samples remains small. Towards this end, this work proposes a new unsupervis… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 12 pages, 9 figures

  43. arXiv:2209.12440  [pdf, other

    cs.CV

    Self-Supervised Guided Segmentation Framework for Unsupervised Anomaly Detection

    Authors: Peng Xing, Yanpeng Sun, Zechao Li

    Abstract: Unsupervised anomaly detection is a challenging task in industrial applications since it is impracticable to collect sufficient anomalous samples. In this paper, a novel Self-Supervised Guided Segmentation Framework (SGSF) is proposed by jointly exploring effective generation method of forged anomalous samples and the normal sample features as the guidance information of segmentation for anomaly d… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 figures

  44. Phase Aberration Correction for in vivo Ultrasound Localization Microscopy Using a Spatiotemporal Complex-Valued Neural Network

    Authors: Paul Xing, Jonathan Porée, Brice Rauby, Antoine Malescot, Éric Martineau, Vincent Perrot, Ravi L. Rungta, Jean Provost

    Abstract: Ultrasound Localization Microscopy (ULM) can map microvessels at a resolution of a few micrometers (μみゅーm). Transcranial ULM remains challenging in presence of aberrations caused by the skull, which lead to localization errors. Herein, we propose a deep learning approach based on complex-valued convolutional neural networks (CV-CNNs) to retrieve the aberration function, which can then be used to form… ▽ More

    Submitted 17 July, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

  45. arXiv:2208.00219  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation Exploitation

    Authors: Gongjie Zhang, Zhipeng Luo, Kaiwen Cui, Shijian Lu, Eric P. Xing

    Abstract: Few-shot object detection has been extensively investigated by incorporating meta-learning into region-based detection frameworks. Despite its success, the said paradigm is still constrained by several factors, such as (i) low-quality region proposals for novel classes and (ii) negligence of the inter-class correlation among different classes. Such limitations hinder the generalization of base-cla… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: Accepted by T-PAMI (IEEE Transactions on Pattern Analysis and Machine Intelligence). Codes: https://github.com/ZhangGongjie/Meta-DETR

  46. arXiv:2207.14172  [pdf, other

    cs.CV

    Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion

    Authors: Gongjie Zhang, Zhipeng Luo, Jiaxing Huang, Shijian Lu, Eric P. Xing

    Abstract: The recently proposed DEtection TRansformer (DETR) has established a fully end-to-end paradigm for object detection. However, DETR suffers from slow training convergence, which hinders its applicability to various detection tasks. We observe that DETR's slow convergence is largely attributed to the difficulty in matching object queries to relevant regions due to the unaligned semantics between obj… ▽ More

    Submitted 6 February, 2023; v1 submitted 28 July, 2022; originally announced July 2022.

  47. arXiv:2207.08944  [pdf, other

    cs.CV cs.LG

    Robustar: Interactive Toolbox Supporting Precise Data Annotation for Robust Vision Learning

    Authors: Chonghan Chen, Haohan Wang, Leyang Hu, Yuhao Zhang, Shuguang Lyu, Jingcheng Wu, Xinnuo Li, Linjing Sun, Eric P. Xing

    Abstract: We introduce the initial release of our software Robustar, which aims to improve the robustness of vision classification machine learning models through a data-driven perspective. Building upon the recent understanding that the lack of machine learning model's robustness is the tendency of the model's learning of spurious features, we aim to solve this problem from its root at the data perspective… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: This paper introduces the first release of our software. The paper is expected to be updated as we continue to develop the software

  48. arXiv:2207.08943  [pdf, ps, other

    cs.CL cs.LG

    MRCLens: an MRC Dataset Bias Detection Toolkit

    Authors: Yifan Zhong, Haohan Wang, Eric P. Xing

    Abstract: Many recent neural models have shown remarkable empirical results in Machine Reading Comprehension, but evidence suggests sometimes the models take advantage of dataset biases to predict and fail to generalize on out-of-sample data. While many other approaches have been proposed to address this issue from the computation perspective such as new architectures or training procedures, we believe a me… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: dataperf workshop at IMCL

  49. arXiv:2206.14268  [pdf, other

    cs.CL

    BertNet: Harvesting Knowledge Graphs with Arbitrary Relations from Pretrained Language Models

    Authors: Shibo Hao, Bowen Tan, Kaiwen Tang, Bin Ni, Xiyan Shao, Hengzhe Zhang, Eric P. Xing, Zhiting Hu

    Abstract: It is crucial to automatically construct knowledge graphs (KGs) of diverse new relations to support knowledge discovery and broad applications. Previous KG construction methods, based on either crowdsourcing or text mining, are often limited to a small predefined set of relations due to manual cost or restrictions in text corpus. Recent research proposed to use pretrained language models (LMs) as… ▽ More

    Submitted 2 June, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: ACL 2023 (Findings); Code available at https://github.com/tanyuqian/knowledge-harvest-from-lms

  50. arXiv:2206.01909  [pdf, ps, other

    cs.LG

    Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation

    Authors: Haohan Wang, Zeyi Huang, Xindi Wu, Eric P. Xing

    Abstract: Data augmentation has been proven to be an effective technique for developing machine learning models that are robust to known classes of distributional shifts (e.g., rotations of images), and alignment regularization is a technique often used together with data augmentation to further help the model learn representations invariant to the shifts used to augment the data. In this paper, motivated b… ▽ More

    Submitted 4 June, 2022; originally announced June 2022.

    Comments: to appear at KDD 2022, the software package is at https://github.com/jyanln/AlignReg. arXiv admin note: text overlap with arXiv:2011.13052