(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 280 results for author: Yuan, B

.
  1. arXiv:2407.07640  [pdf, other

    cond-mat.str-el

    Single Crystal Diffuse Neutron Scattering Study of the Dipole-Octupole Quantum Spin Ice Candidate Ce$_2$Zr$_2$O$_7$: No Apparent Octupolar Correlations Above $T = 0.05$ K

    Authors: E. M. Smith, R. Schäfer, J. Dudemaine, B. Placke, B. Yuan, Z. Morgan, F. Ye, R. Moessner, O. Benton, A. D. Bianchi, B. D. Gaulin

    Abstract: The insulating magnetic pyrochlore Ce$_2$Zr$_2$O$_7$ has attracted much attention as a quantum spin ice candidate with dipole-octupole character that permits spin ice phases based not only on magnetic dipole moments but also allows for even-more-exotic octupole-based spin ice phases. This work reports low-temperature neutron diffraction measurements on single crystal Ce$_2$Zr$_2$O$_7$ with $Q$-cov… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2407.03680  [pdf, other

    math.NA

    The condition for constructing a finite element from a superspline

    Authors: Jun Hu, Ting Lin, Qingyu Wu, Beihui Yuan

    Abstract: This paper addresses the sufficient and necessary conditions for constructing $C^r$ conforming finite element spaces from a superspline spaces on general simplicial triangulations. We introduce the concept of extendability for the pre-element spaces, which encompasses both the superspline space and the finite element space. By examining the extendability condition for both types of spaces, we prov… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 22 pages, 4 figures

    MSC Class: 65N30; 65D07

  3. arXiv:2407.02688  [pdf, other

    cs.CV

    Funny-Valen-Tine: Planning Solution Distribution Enhances Machine Abstract Reasoning Ability

    Authors: Ruizhuo Song, Beiming Yuan

    Abstract: Visual abstract reasoning problems hold immense importance in the field of image processing. Both Bongard-Logo and Raven's Progressive Matrices (RPM) belong to this domain, with Bongard-Logo categorized as image clustering reasoning and RPM involving image progression pattern reasoning. This paper introduces Valen, a novel baseline model under probabilistic highlighting models. Valen exhibits rema… ▽ More

    Submitted 7 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 14 pages, 20 figures, 3 tables

  4. arXiv:2406.04333  [pdf, other

    cs.CV

    BitsFusion: 1.99 bits Weight Quantization of Diffusion Model

    Authors: Yang Sui, Yanyu Li, Anil Kag, Yerlan Idelbayev, Junli Cao, Ju Hu, Dhritiman Sagar, Bo Yuan, Sergey Tulyakov, Jian Ren

    Abstract: Diffusion-based image generation models have achieved great success in recent years by showing the capability of synthesizing high-quality content. However, these models contain a huge number of parameters, resulting in a significantly large model size. Saving and transferring them is a major bottleneck for various applications, especially those running on resource-constrained devices. In this wor… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://snap-research.github.io/BitsFusion

  5. arXiv:2406.03102  [pdf, other

    cs.LG cs.AI

    DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays

    Authors: Bo Xia, Yilun Kong, Yongzhe Chang, Bo Yuan, Zhiheng Li, Xueqian Wang, Bin Liang

    Abstract: Classic reinforcement learning (RL) frequently confronts challenges in tasks involving delays, which cause a mismatch between received observations and subsequent actions, thereby deviating from the Markov assumption. Existing methods usually tackle this issue with end-to-end solutions using state augmentation. However, these black-box approaches often involve incomprehensible processes and redund… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  6. arXiv:2405.18639  [pdf, other

    q-bio.NC cs.CL cs.LG cs.SD eess.AS

    Improving Speech Decoding from ECoG with Self-Supervised Pretraining

    Authors: Brian A. Yuan, Joseph G. Makin

    Abstract: Recent work on intracranial brain-machine interfaces has demonstrated that spoken speech can be decoded with high accuracy, essentially by treating the problem as an instance of supervised learning and training deep neural networks to map from neural activity to text. However, such networks pay for their expressiveness with very large numbers of labeled data, a requirement that is particularly bur… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2405.16640  [pdf, other

    cs.AI cs.CL cs.CV cs.MM

    A Survey of Multimodal Large Language Model from A Data-centric Perspective

    Authors: Tianyi Bai, Hao Liang, Binwang Wan, Ling Yang, Bozhou Li, Yifan Wang, Bin Cui, Conghui He, Binhang Yuan, Wentao Zhang

    Abstract: Human beings perceive the world through diverse senses such as sight, smell, hearing, and touch. Similarly, multimodal large language models (MLLMs) enhance the capabilities of traditional large language models by integrating and processing data from multiple modalities including text, vision, audio, video, and 3D environments. Data plays a pivotal role in the development and refinement of these m… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  8. arXiv:2405.11703  [pdf, other

    cs.LG

    QComp: A QSAR-Based Data Completion Framework for Drug Discovery

    Authors: Bingjia Yang, Yunsie Chung, Archer Y. Yang, Bo Yuan, Xiang Yu

    Abstract: In drug discovery, in vitro and in vivo experiments reveal biochemical activities related to the efficacy and toxicity of compounds. The experimental data accumulate into massive, ever-evolving, and sparse datasets. Quantitative Structure-Activity Relationship (QSAR) models, which predict biochemical activities using only the structural information of compounds, face challenges in integrating the… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  9. arXiv:2405.09418  [pdf, other

    cond-mat.str-el

    Highly Tunable Ru-dimer Molecular Orbital State in 6H-perovskite Ba$_3$MRu$_2$O$_9$

    Authors: Bo Yuan, Beom Hyun Kim, Qiang Chen, Daniel Dobrowolski, Monika Azmanska, G. M. Luke, Shiyu Fan, Valentina Bisogni, Jonathan Pelliciari, J. P. Clancy

    Abstract: Molecular orbital (MO) systems with clusters of heavy transition metal (TM) ions are one of the most important classes of model materials for studying the interplay between local physics and effects of itinerancy. Despite a large number of candidates identified in the family of 4d TM materials, an understanding of their physics from competing \textit{microscopic} energy scales is still missing. We… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures, Supplemental Materials available upon request

  10. arXiv:2405.09207  [pdf, other

    cs.IT eess.SY

    An Exact Theory of Causal Emergence for Linear Stochastic Iteration Systems

    Authors: Kaiwei Liu, Bing Yuan, Jiang Zhang

    Abstract: After coarse-graining a complex system, the dynamics of its macro-state may exhibit more pronounced causal effects than those of its micro-state. This phenomenon, known as causal emergence, is quantified by the indicator of effective information. However, two challenges confront this theory: the absence of well-developed frameworks in continuous stochastic dynamical systems and the reliance on coa… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  11. arXiv:2405.06007  [pdf

    cond-mat.mtrl-sci

    Anomalous properties of spark plasma sintered boron nitride solids

    Authors: Abhijit Biswas, Peter Serles, Gustavo A. Alvarez, Jesse Schimpf, Michel Hache, Jonathan Kong, Pedro Guerra Demingos, Bo Yuan, Tymofii S. Pieshkov, Chenxi Li, Anand B. Puthirath, Bin Gao, Tia Gray, Xiang Zhang, Jishnu Murukeshan, Robert Vajtai, Pengcheng Dai, Chandra Veer Singh, Jane Howe, Yu Zou, Lane W. Martin, James Patrick Clancy, Zhiting Tian, Tobin Filleter, Pulickel M. Ajayan

    Abstract: Hexagonal boron nitride (h-BN) is brittle, however, its atomic-scale structural engineering can lead to unprecedented physical properties. Here we report the bulk synthesis of high-density crystalline h-BN solids by using high-temperature spark plasma sintering (SPS) of micron size h-BN powders. In addition to the high mechanical strength and ductile response of such materials, we have obtained an… ▽ More

    Submitted 10 July, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Authors revised version, 46 pages, 4 figures

  12. arXiv:2404.07737  [pdf, ps, other

    math.AP

    Global regularity of 2D Rayleigh-Bénard equations with logarithmic supercritical dissipation

    Authors: Baoquan Yuan, Xinyuan Xu, Changhao Li

    Abstract: In this paper, we study the global regularity problem for the 2D Rayleigh-Bénard equations with logarithmic supercritical dissipation. By exploiting a combined quantity of the system, the technique of Littlewood-Paley decomposition and Besov spaces, and some commutator estimates, we establish the global regularity of a strong solution to this equations in the Sobolev space $H^{s}(\mathbb{R}^{2})$… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 18 pages

    MSC Class: 35Q35; 76D03; 35B65

  13. arXiv:2404.07509   

    quant-ph

    Multiparameter cascaded quantum interferometer

    Authors: Baihong Li, Zhuo-zhuo Wang, Qi-qi Li, Changhua Chen, Boxin Yuan, Yiwei Zhai, Rui-Bo Jin, Xiaofei Zhang

    Abstract: We theoretically propose a multiparameter cascaded quantum interferometer in which a two-input and two-output setup is obtained by concatenating 50:50 beam splitters with n independent and adjustable time delays. A general method for deriving the coincidence probability of such an interferometer is given based on the linear transformation of the matrix of beam splitters. As examples, we analyze th… ▽ More

    Submitted 8 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: We have found a serious error in this version, which may mislead readers

  14. Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation

    Authors: Danpei Zhao, Bo Yuan, Ziqiang Chen, Tian Li, Zhuoran Liu, Wentao Li, Yue Gao

    Abstract: Current remote-sensing interpretation models often focus on a single task such as detection, segmentation, or caption. However, the task-specific designed models are unattainable to achieve the comprehensive multi-level interpretation of images. The field also lacks support for multi-task joint interpretation datasets. In this paper, we propose Panoptic Perception, a novel task and a new fine-grai… ▽ More

    Submitted 25 April, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, 2024

  15. arXiv:2404.04390  [pdf, other

    cond-mat.str-el

    Field-dependent Magnons in a Honeycomb Antiferromagnet CoTiO$_3$

    Authors: Bo Yuan, Ezekiel Horsley, M. B. Stone, Nicholas P. Butch, Guangyong Xu, Guo-Jiun Shu, J. P. Clancy, Young-June Kim

    Abstract: We report field-dependent high-resolution inelastic neutron scattering (INS) measurements on the honeycomb lattice magnet, CoTiO$_3$, to study the evolution of its magnon excitations across a spin reorientation transition driven by an in-plane magnetic field. By carrying out elastic neutron scattering in a magnetic field, we show that the sample transitions from a collinear antiferromagnetic state… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 15 pages, 9 figures, Supplemental Materials available upon request

  16. arXiv:2404.00242  [pdf, other

    cs.CL cs.AI

    DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference

    Authors: Jinwei Yao, Kaiqi Chen, Kexun Zhang, Jiaxuan You, Binhang Yuan, Zeke Wang, Tao Lin

    Abstract: Given the increasing demand for tree-structured interactions with LLMs, we introduce DeFT (Decoding with Flash Tree-Attention), an IO-aware tree attention algorithm tailored for tree-structured inference. Unlike traditional sequence-based decoding, tree-structured decoding better accommodates modern task requirements, including self-consistency, few-shot prompting, multi-step reasoning, and multi-… ▽ More

    Submitted 29 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Update DeFT-v2. DeFT-v1 was accepted by ICLR'24 AGI Workshop ( https://openreview.net/forum?id=HqfLHoX8bR ). Code will be released soon

  17. arXiv:2403.07952  [pdf, other

    cs.CV cs.AI cs.MM

    AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production

    Authors: Jiuniu Wang, Zehua Du, Yuyuan Zhao, Bo Yuan, Kexiang Wang, Jian Liang, Yaxi Zhao, Yihen Lu, Gengliang Li, Junlong Gao, Xin Tu, Zhenyu Guo

    Abstract: The Agent and AIGC (Artificial Intelligence Generated Content) technologies have recently made significant progress. We propose AesopAgent, an Agent-driven Evolutionary System on Story-to-Video Production. AesopAgent is a practical application of agent technology for multimodal content generation. The system integrates multiple generative capabilities within a unified framework, so that individual… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 22 pages, 13 figures

  18. arXiv:2403.06504  [pdf, other

    cs.DC

    Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU

    Authors: Changyue Liao, Mo Sun, Zihan Yang, Kaiqi Chen, Binhang Yuan, Fei Wu, Zeke Wang

    Abstract: Recent advances in large language models have brought immense value to the world, with their superior capabilities stemming from the massive number of parameters they utilize. However, even the GPUs with the highest memory capacities, currently peaking at 80GB, are far from sufficient to accommodate these vast parameters and their associated optimizer states when conducting stochastic gradient des… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  19. arXiv:2403.03452  [pdf, other

    cs.CV

    D4C Glove-train: Solving the RPM and Bongard-logo Problem by Circumscribing and Building Distribution for Concepts

    Authors: Ruizhuo Song, Beiming Yuan

    Abstract: This paper achieves noteworthy progress in the realm of abstract reasoning, particularly in addressing Raven's Progressive Matrices (RPM) and Bongard-Logo challenges. Initially, we introduce Lico-Net, a novel baseline model that resolves RPM problems with remarkable accuracy. Leveraging this foundation, we advance with the D3C approach, which advocates representing the underlying concepts in abstr… ▽ More

    Submitted 1 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 15 pages, 15 figures, 6 tables

  20. arXiv:2403.03190  [pdf, other

    cs.CV

    Triple-CFN: Restructuring Concept and Feature Spaces for Enhancing Abstract Reasoning Process

    Authors: Ruizhuo Song, Beiming Yuan

    Abstract: Visual abstract reasoning poses challenges to AI algorithms, requiring cognitive abilities beyond perception. For methodology, this study emphasizes the need to separately extract concepts and features from visual abstract reasoning problems, employing the responses of features to concepts as elements in the reasoning process. It also advocates for clear concept and feature spaces to tackle visual… ▽ More

    Submitted 21 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 13 pages, 16 figures, 7 tables

  21. arXiv:2403.03173  [pdf, other

    cs.CV

    Solving the Clustering Reasoning Problems by Modeling a Deep-Learning-Based Probabilistic Model

    Authors: Ruizhuo Song, Beiming Yuan

    Abstract: Visual abstract reasoning problems pose significant challenges to the perception and cognition abilities of artificial intelligence algorithms, demanding deeper pattern recognition and inductive reasoning beyond mere identification of explicit image features. Research advancements in this field often provide insights and technical support for other similar domains. In this study, we introduce PMoC… ▽ More

    Submitted 13 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 14 pages, 17 figures, 4 tables

  22. arXiv:2403.00193  [pdf

    cs.NI cs.CR

    Structural Resilience and Connectivity of the IPv6 Internet: An AS-level Topology Examination

    Authors: Bin Yuan, Tianbo Song

    Abstract: The study utilizes a comprehensive dataset informed by IPv6 routing information to provide statistics, degree distribution, joint degree distribution, and clustering analysis of the IPv6 Internet's structure and resilience.The dataset includes 17,232 unique ASes and 10,000 unique IPv6 prefixes. Analysis reveals an interconnected network with an average path length of approximately 3 hops, suggesti… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  23. arXiv:2403.00190  [pdf

    cs.SI cs.AI

    Identification of important nodes in the information propagation network based on the artificial intelligence method

    Authors: Bin Yuan, Tianbo Song, Jerry Yao

    Abstract: This study presents an integrated approach for identifying key nodes in information propagation networks using advanced artificial intelligence methods. We introduce a novel technique that combines the Decision-making Trial and Evaluation Laboratory (DEMATEL) method with the Global Structure Model (GSM), creating a synergistic model that effectively captures both local and global influences within… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  24. arXiv:2402.15054  [pdf, other

    cond-mat.stat-mech math-ph physics.data-an

    Dynamical Reversibility and A New Theory of Causal Emergence

    Authors: Jiang Zhang, Ruyi Tao, Keng Hou Leong, Mingzhe Yang, Bing Yuan

    Abstract: The theory of causal emergence based on effective information(EI) suggests that complex systems may exhibit a phenomenon called causal emergence(CE), where the macro-dynamics demonstrate a stronger causal effect than the micro-dynamics. However, a challenge in this theory is the dependence on the method used to coarse-grain the system. In this paper, we introduce a fresh concept of approximate dyn… ▽ More

    Submitted 9 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 40 pages,9 figures

  25. arXiv:2402.14477  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Pressure tunable magnetic skyrmion phase in Co8Zn8Mn4 single crystals

    Authors: Zhun Li, Xinrun Mi, Xinming Wang, Jian Lyu, Na Su, Aifeng Wang, Yisheng Chai, Bao Yuan, Wanju Luo, Hui Cheng, Jianxiang Gao, Hongliang Wang, Lijie Hao, Mingquan He, Junying Shen, Young Sun, Xin Tong

    Abstract: In a magnetic skyrmion phase, magnetic moments form vortex-like topological textures which are of both fundamental and industrial interests. In $βべーた$-Mn-type Co-Zn-Mn alloys, chrial magnetic skyrmions emerge above room temperature, providing a unique system for studying the skrymion physics and exploring spintronics applications. However, the magnetic skyrmion phase is typically confined in a narrow… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 7 pages, 4 figures

  26. arXiv:2402.02739  [pdf, other

    cs.CR cs.CV cs.LG

    DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models

    Authors: Yang Sui, Huy Phan, Jinqi Xiao, Tianfang Zhang, Zijie Tang, Cong Shi, Yan Wang, Yingying Chen, Bo Yuan

    Abstract: In the exciting generative AI era, the diffusion model has emerged as a very powerful and widely adopted content generation and editing tool for various data modalities, making the study of their potential security risks very necessary and critical. Very recently, some pioneering works have shown the vulnerability of the diffusion model against backdoor attacks, calling for in-depth analysis and i… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  27. arXiv:2402.01763  [pdf, other

    cs.DB cs.AI cs.CL cs.LG

    When Large Language Models Meet Vector Databases: A Survey

    Authors: Zhi Jing, Yongye Su, Yikun Han, Bo Yuan, Haiyun Xu, Chunjiang Liu, Kehai Chen, Min Zhang

    Abstract: This survey explores the synergistic potential of Large Language Models (LLMs) and Vector Databases (VecDBs), a burgeoning but rapidly evolving research area. With the proliferation of LLMs comes a host of challenges, including hallucinations, outdated knowledge, prohibitive commercial application costs, and memory issues. VecDBs emerge as a compelling solution to these issues by offering an effic… ▽ More

    Submitted 5 February, 2024; v1 submitted 30 January, 2024; originally announced February 2024.

  28. arXiv:2401.12994  [pdf, other

    cs.CL

    Automated Scoring of Clinical Patient Notes using Advanced NLP and Pseudo Labeling

    Authors: Jingyu Xu, Yifeng Jiang, Bin Yuan, Shulin Li, Tianbo Song

    Abstract: Clinical patient notes are critical for documenting patient interactions, diagnoses, and treatment plans in medical practice. Ensuring accurate evaluation of these notes is essential for medical education and certification. However, manual evaluation is complex and time-consuming, often resulting in variability and resource-intensive assessments. To tackle these challenges, this research introduce… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  29. arXiv:2401.11240  [pdf, other

    cs.DC

    CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference

    Authors: Suyi Li, Hanfeng Lu, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang

    Abstract: Pre-trained large language models (LLMs) often need specialization for domain-specific tasks. Low-Rank Adaptation (LoRA) is a popular approach that adapts a base model to multiple tasks by adding lightweight trainable adapters. In this paper, we present CaraServe, a system that efficiently serves many LoRA adapters derived from a common base model. CaraServe maintains the base model on GPUs and dy… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  30. arXiv:2401.10341  [pdf, other

    cs.CV cs.AI

    ELRT: Efficient Low-Rank Training for Compact Convolutional Neural Networks

    Authors: Yang Sui, Miao Yin, Yu Gong, Jinqi Xiao, Huy Phan, Bo Yuan

    Abstract: Low-rank compression, a popular model compression technique that produces compact convolutional neural networks (CNNs) with low rankness, has been well-studied in the literature. On the other hand, low-rank training, as an alternative way to train low-rank CNNs from scratch, has been exploited little yet. Unlike low-rank compression, low-rank training does not need pre-trained full-rank models, an… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  31. arXiv:2401.09699  [pdf, other

    cs.CL cs.AI

    Curriculum Recommendations Using Transformer Base Model with InfoNCE Loss And Language Switching Method

    Authors: Xiaonan Xu, Bin Yuan, Yongyao Mo, Tianbo Song, Shulin Li

    Abstract: The Curriculum Recommendations paradigm is dedicated to fostering learning equality within the ever-evolving realms of educational technology and curriculum development. In acknowledging the inherent obstacles posed by existing methodologies, such as content conflicts and disruptions from language translation, this paradigm aims to confront and overcome these challenges. Notably, it addresses cont… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 4pages, 2 figures, ICAICA2023

    MSC Class: 68T50

  32. arXiv:2401.05439  [pdf

    cs.LG cs.CE

    Physics-informed Deep Learning to Solve Three-dimensional Terzaghi Consolidation Equation: Forward and Inverse Problems

    Authors: Biao Yuan, Ana Heitor, He Wang, Xiaohui Chen

    Abstract: The emergence of neural networks constrained by physical governing equations has sparked a new trend in deep learning research, which is known as Physics-Informed Neural Networks (PINNs). However, solving high-dimensional problems with PINNs is still a substantial challenge, the space complexity brings difficulty to solving large multidirectional problems. In this paper, a novel PINN framework to… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 30 pages, 11 figures, 6 tables, 23 equations

  33. arXiv:2401.00747  [pdf, other

    cs.GT cs.MA

    Polynomial-time Approximation Scheme for Equilibriums of Games

    Authors: Hongbo Sun, Chongkun Xia, Junbo Tan, Bo Yuan, Xueqian Wang, Bin Liang

    Abstract: Whether a PTAS (polynomial-time approximation scheme) exists for equilibriums of games has been an open question, which relates to questions in three fields, the practicality of methods in algorithmic game theory, the equation PPAD=FP about the two complexity classes in computational complexity theory, and non-stationarity and curse of multiagency in MARL (multi-agent reinforcement learning). This… ▽ More

    Submitted 3 June, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: 23 pages, 7 figures, code and animation are available at https://github.com/shb20tsinghua/PTAS_Game/tree/main

    MSC Class: 90C39; 90C51; 91A15

  34. arXiv:2312.16815  [pdf, other

    physics.soc-ph cs.AI nlin.AO

    Emergence and Causality in Complex Systems: A Survey on Causal Emergence and Related Quantitative Studies

    Authors: Bing Yuan, Zhang Jiang, Aobo Lyu, Jiayun Wu, Zhipeng Wang, Mingzhe Yang, Kaiwei Liu, Muyun Mou, Peng Cui

    Abstract: Emergence and causality are two fundamental concepts for understanding complex systems. They are interconnected. On one hand, emergence refers to the phenomenon where macroscopic properties cannot be solely attributed to the cause of individual properties. On the other hand, causality can exhibit emergence, meaning that new causal laws may arise as we increase the level of abstraction. Causal emer… ▽ More

    Submitted 25 February, 2024; v1 submitted 27 December, 2023; originally announced December 2023.

    Comments: 57 pages, 17 figures, 1 table

    MSC Class: 68P30 ACM Class: K.3.2

  35. arXiv:2312.16679  [pdf

    cond-mat.mtrl-sci

    Square Moiré Superlattices in Twisted Two-Dimensional Halide Perovskites

    Authors: Shuchen Zhang, Linrui Jin, Yuan Lu, Linghai Zhang, Jiaqi Yang, Qiuchen Zhao, Dewei Sun, Joshua J. P. Thompson, Biao Yuan, Ke Ma, Akriti, Jee Yung Park, Yoon Ho Lee, Zitang Wei, Blake P. Finkenauer, Daria D. Blach, Sarath Kumar, Hailin Peng, Arun Mannodi-Kanakkithodi, Yi Yu, Ermin Malic, Gang Lu, Letian Dou, Libai Huang

    Abstract: Moiré superlattices have emerged as a new platform for studying strongly correlated quantum phenomena, but these systems have been largely limited to van der Waals layer two-dimensional (2D) materials. Here we introduce moiré superlattices leveraging ultra-thin, ligand-free halide perovskites, facilitated by ionic interactions. Square moiré superlattices with varying periodic lengths are clearly v… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  36. PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection

    Authors: Wentao Li, Danpei Zhao, Bo Yuan, Yue Gao, Zhenwei Shi

    Abstract: Fine-grained object detection (FGOD) extends object detection with the capability of fine-grained recognition. In recent two-stage FGOD methods, the region proposal serves as a crucial link between detection and fine-grained recognition. However, current methods overlook that some proposal-related procedures inherited from general detection are not equally suitable for FGOD, limiting the multi-tas… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: IEEE TGRS 2023

  37. arXiv:2312.10343  [pdf, other

    eess.SP cs.AR cs.LG cs.NE

    In-Sensor Radio Frequency Computing for Energy-Efficient Intelligent Radar

    Authors: Yang Sui, Minning Zhu, Lingyi Huang, Chung-Tse Michael Wu, Bo Yuan

    Abstract: Radio Frequency Neural Networks (RFNNs) have demonstrated advantages in realizing intelligent applications across various domains. However, as the model size of deep neural networks rapidly increases, implementing large-scale RFNN in practice requires an extensive number of RF interferometers and consumes a substantial amount of energy. To address this challenge, we propose to utilize low-rank dec… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  38. arXiv:2312.00843  [pdf, other

    cs.LG cs.AI cs.CR

    Exploring the Robustness of Decentralized Training for Large Language Models

    Authors: Lin Lu, Chenxi Dai, Wangcheng Tao, Binhang Yuan, Yanan Sun, Pan Zhou

    Abstract: Decentralized training of large language models has emerged as an effective way to democratize this technology. However, the potential threats associated with this approach have not been carefully discussed, which would hinder the development of decentralized training infrastructures. This paper aims to initiate discussion towards this end by exploring the robustness of decentralized training from… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 6 pages, 3 figures

  39. arXiv:2311.18103  [pdf, other

    eess.IV cs.CV

    Corner-to-Center Long-range Context Model for Efficient Learned Image Compression

    Authors: Yang Sui, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Bo Yuan, Zhenzhong Chen

    Abstract: In the framework of learned image compression, the context model plays a pivotal role in capturing the dependencies among latent representations. To reduce the decoding time resulting from the serial autoregressive context model, the parallel context model has been proposed as an alternative that necessitates only two passes during the decoding phase, thus facilitating efficient image compression… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  40. arXiv:2311.12873  [pdf, ps, other

    math.AP

    Global Strong Solutions to the incompressible Magnetohydrodynamic Equations with Density-Dependent Viscosity and Vacuum in 3D Exterior Domains

    Authors: Bing Yuan, Rong Zhang, Peng Zhou

    Abstract: The nonhomogeneous incompressible Magnetohydrodynamic Equations with density-dependent viscosity is studied in three-dimensional (3D) exterior domains with slip boundary conditions. The key is the constraint of an additional initial value condition $B_0\in L^p (1\leqslant p<12/7)$, which increase decay-in-time rates of the solutions, thus we obtain the global existence of strong solutions provided… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2205.05925, arXiv:1709.05608, arXiv:1506.03884, arXiv:2112.08111 by other authors

  41. arXiv:2311.11557  [pdf, other

    cs.LG cs.AI

    Replay-enhanced Continual Reinforcement Learning

    Authors: Tiantian Zhang, Kevin Zehua Shen, Zichuan Lin, Bo Yuan, Xueqian Wang, Xiu Li, Deheng Ye

    Abstract: Replaying past experiences has proven to be a highly effective approach for averting catastrophic forgetting in supervised continual learning. However, some crucial factors are still largely ignored, making it vulnerable to serious failure, when used as a solution to forgetting in continual reinforcement learning, even in the context of perfect memory where all data of previous tasks are accessibl… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted by Transactions on Machine Learning Research 2023

  42. arXiv:2311.11514  [pdf, other

    cs.DC

    HexGen: Generative Inference of Large Language Model over Heterogeneous Environment

    Authors: Youhe Jiang, Ran Yan, Xiaozhe Yao, Yang Zhou, Beidi Chen, Binhang Yuan

    Abstract: Serving generative inference of the large language model is a crucial component of contemporary AI applications. This paper focuses on deploying such services in a heterogeneous and cross-datacenter setting to mitigate the substantial inference costs typically associated with a single centralized datacenter. Towards this end, we propose HexGen, a flexible distributed inference engine that uniquely… ▽ More

    Submitted 27 May, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: Accepted by ICML 2024

  43. arXiv:2311.08164  [pdf, other

    quant-ph

    Full characterization of biphotons with a generalized quantum interferometer

    Authors: Baihong Li, Changhua Chen, Boxin Yuan, Xiaofei Zhang, Ruifang Dong, Shougang Zhang, Rui-Bo Jin

    Abstract: Entangled photons (biphotons) in the time-frequency degree of freedom play a crucial role in both foundational physics and advanced quantum technologies. Fully characterizing them poses a key scientific challenge. Here, we propose a theoretical approach to achieving the complete tomography of biphotons by introducing a frequency shift in one arm of the combination interferometer. Our method, a gen… ▽ More

    Submitted 20 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 14 pages, 3 figures

  44. arXiv:2310.17157  [pdf, other

    cs.LG

    Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

    Authors: Zichang Liu, Jue Wang, Tri Dao, Tianyi Zhou, Binhang Yuan, Zhao Song, Anshumali Shrivastava, Ce Zhang, Yuandong Tian, Christopher Re, Beidi Chen

    Abstract: Large language models (LLMs) with hundreds of billions of parameters have sparked a new wave of exciting AI applications. However, they are computationally expensive at inference time. Sparsity is a natural approach to reduce this cost, but existing methods either require costly retraining, have to forgo LLM's in-context learning ability, or do not yield wall-clock time speedup on modern hardware.… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, 2023, 919

  45. arXiv:2310.14277  [pdf, other

    cs.CV

    A Survey on Continual Semantic Segmentation: Theory, Challenge, Method and Application

    Authors: Bo Yuan, Danpei Zhao

    Abstract: Continual learning, also known as incremental learning or life-long learning, stands at the forefront of deep learning and AI systems. It breaks through the obstacle of one-way training on close sets and enables continuous adaptive learning on open-set conditions. In the recent decade, continual learning has been explored and applied in multiple fields especially in computer vision covering classi… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 20 pages, 12 figures. Undergoing Review

  46. arXiv:2310.08161  [pdf, other

    physics.optics physics.app-ph physics.comp-ph

    Phase offset method of ptychographic contrast reversal correction

    Authors: Christoph Hofer, Chuang Gao, Tamazouzt Chennit, Biao Yuan, Timothy J. Pennycook

    Abstract: The contrast transfer function of direct ptychography methods such as the single side band (SSB) method are single signed, yet these methods still sometimes exhibit contrast reversals, most often where the projected potentials are strong. In thicker samples central focusing often provides the best ptychographic contrast as this leads to defocus variations within the sample canceling out. However f… ▽ More

    Submitted 21 December, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  47. arXiv:2310.04696  [pdf, other

    cs.DB cs.AI

    Serving Deep Learning Model in Relational Databases

    Authors: Alexandre Eichenberger, Qi Lin, Saif Masood, Hong Min, Alexander Sim, Jie Wang, Yida Wang, Kesheng Wu, Binhang Yuan, Lixi Zhou, Jia Zou

    Abstract: Serving deep learning (DL) models on relational data has become a critical requirement across diverse commercial and scientific domains, sparking growing interest recently. In this visionary paper, we embark on a comprehensive exploration of representative architectures to address the requirement. We highlight three pivotal paradigms: The state-of-the-artDL-Centricarchitecture offloadsDL computati… ▽ More

    Submitted 9 October, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: Authors are ordered alphabetically; Jia Zou is the corresponding author

  48. arXiv:2310.00205  [pdf, other

    cs.SE cs.CR

    An Empirical Study on the Use of Static Analysis Tools in Open Source Embedded Software

    Authors: Mingjie Shen, Akul Pillai, Brian A. Yuan, James C. Davis, Aravind Machiry

    Abstract: This paper performs the first study to understand the prevalence, challenges, and effectiveness of using Static Application Security Testing (SAST) tools on Open-Source Embedded Software (EMBOSS) repositories. We collect a corpus of 258 of the most popular EMBOSS projects, representing 13 distinct categories such as real-time operating systems, network stacks, and applications. To understand the c… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  49. Inherit with Distillation and Evolve with Contrast: Exploring Class Incremental Semantic Segmentation Without Exemplar Memory

    Authors: Danpei Zhao, Bo Yuan, Zhenwei Shi

    Abstract: As a front-burner problem in incremental learning, class incremental semantic segmentation (CISS) is plagued by catastrophic forgetting and semantic drift. Although recent methods have utilized knowledge distillation to transfer knowledge from the old model, they are still unable to avoid pixel confusion, which results in severe misclassification after incremental steps due to the lack of annotati… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Journal ref: IEEE TPAMI 2023

  50. arXiv:2308.11166  [pdf, other

    cs.CV cs.AI

    Hierarchical Point-based Active Learning for Semi-supervised Point Cloud Semantic Segmentation

    Authors: Zongyi Xu, Bo Yuan, Shanshan Zhao, Qianni Zhang, Xinbo Gao

    Abstract: Impressive performance on point cloud semantic segmentation has been achieved by fully-supervised methods with large amounts of labelled data. As it is labour-intensive to acquire large-scale point cloud data with point-wise labels, many attempts have been made to explore learning 3D point cloud segmentation with limited annotations. Active learning is one of the effective strategies to achieve th… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: International Conference on Computer Vision (ICCV) 2023