(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 257 results for author: Tong, Y

.
  1. arXiv:2407.00362  [pdf, other

    cs.CV cs.AI

    JSCDS: A Core Data Selection Method with Jason-Shannon Divergence for Caries RGB Images-Efficient Learning

    Authors: Peiliang Zhang, Yujia Tong, Chenghu Du, Chao Che, Yongjun Zhu

    Abstract: Deep learning-based RGB caries detection improves the efficiency of caries identification and is crucial for preventing oral diseases. The performance of deep learning models depends on high-quality data and requires substantial training resources, making efficient deployment challenging. Core data selection, by eliminating low-quality and confusing data, aims to enhance training efficiency withou… ▽ More

    Submitted 6 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted in KDD 2024 Workshop AIDSH

  2. arXiv:2406.18606  [pdf, other

    stat.AP physics.ao-ph

    Bayesian Inference for Stochastic Predictions of Non-Gaussian Systems with Applications in Climate Change

    Authors: Yunjin Tong

    Abstract: Climate change poses significant challenges for accurate climate modeling due to the complexity and variability of non-Gaussian climate systems. To address the complexities of non-Gaussian systems in climate modeling, this thesis proposes a Bayesian framework utilizing the Unscented Kalman Filter (UKF), Ensemble Kalman Filter (EnKF), and Unscented Particle Filter (UPF) for one-dimensional and two-… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.17758  [pdf, other

    cs.CV

    MotionBooth: Motion-Aware Customized Text-to-Video Generation

    Authors: Jianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang, Qianyu Zhou, Yining Li, Yunhai Tong, Kai Chen

    Abstract: In this work, we present MotionBooth, an innovative framework designed for animating customized subjects with precise control over both object and camera movements. By leveraging a few images of a specific object, we efficiently fine-tune a text-to-video model to capture the object's shape and attributes accurately. Our approach presents subject region loss and video preservation loss to enhance t… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Project page at https://jianzongwu.github.io/projects/motionbooth

  4. arXiv:2406.16956  [pdf, other

    cs.LG physics.flu-dyn

    Data-Driven Computing Methods for Nonlinear Physics Systems with Geometric Constraints

    Authors: Yunjin Tong

    Abstract: In a landscape where scientific discovery is increasingly driven by data, the integration of machine learning (ML) with traditional scientific methodologies has emerged as a transformative approach. This paper introduces a novel, data-driven framework that synergizes physics-based priors with advanced ML techniques to address the computational and practical limitations inherent in first-principle-… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2406.15440  [pdf, ps, other

    math.OC

    Sufficient D-Stability Conditions for Non-Square Matrices

    Authors: Yuhao Tong, Steven W. Su

    Abstract: This note explores the extension of D-stability to non-square matrices, applicable to distributed/decentralized controllability analysis. We first present a definition of D-stability for non-square matrices, directly extending from square matrices. We propose sufficient conditions for specific configurations of non-square matrices. Finally, we consider the selection of configurations to ensure the… ▽ More

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.00367

  6. arXiv:2406.13368  [pdf

    cond-mat.mtrl-sci

    Lewis Acidity and Basicity Diagnostics of Molten Salt for its Properties and Structure Online Monitoring

    Authors: Changzu Zhu, Jia Song, Xiaorui Xu, Chengyu Wang, Yang Tong, Lve Lin, Shaoqiang Guo, Wentao Zhou, Adrien Couet, Yafei Wang

    Abstract: Analogous to the aqueous solution where the pH of the solvent affects its multiple behaviors, the Lewis acidity-basicity of molten salts also greatly influences their thermophysical and thermochemical properties. In the study, we develop ion probes to quantitatively determine the acidity-basicity scale of molten NaCl-xAlCl3 (x = 1.5-2.1) salt using in-situ ultra-violet visible (UV-Vis) spectroscop… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  7. arXiv:2406.11389  [pdf, other

    cs.LG

    SEFraud: Graph-based Self-Explainable Fraud Detection via Interpretative Mask Learning

    Authors: Kaidi Li, Tianmeng Yang, Min Zhou, Jiahao Meng, Shendi Wang, Yihui Wu, Boshuai Tan, Hu Song, Lujia Pan, Fan Yu, Zhenli Sheng, Yunhai Tong

    Abstract: Graph-based fraud detection has widespread application in modern industry scenarios, such as spam review and malicious account detection. While considerable efforts have been devoted to designing adequate fraud detectors, the interpretability of their results has often been overlooked. Previous works have attempted to generate explanations for specific instances using post-hoc explaining methods s… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024

  8. arXiv:2406.05422  [pdf, other

    cs.AI cs.RO

    Diffusion-based Reinforcement Learning for Dynamic UAV-assisted Vehicle Twins Migration in Vehicular Metaverses

    Authors: Yongju Tong, Jiawen Kang, Junlong Chen, Minrui Xu, Gaolei Li, Weiting Zhang, Xincheng Yan

    Abstract: Air-ground integrated networks can relieve communication pressure on ground transportation networks and provide 6G-enabled vehicular Metaverses services offloading in remote areas with sparse RoadSide Units (RSUs) coverage and downtown areas where users have a high demand for vehicular services. Vehicle Twins (VTs) are the digital twins of physical vehicles to enable more immersive and realistic v… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  9. arXiv:2406.05418  [pdf, other

    cs.AI cs.NI

    Multi-attribute Auction-based Resource Allocation for Twins Migration in Vehicular Metaverses: A GPT-based DRL Approach

    Authors: Yongju Tong, Junlong Chen, Minrui Xu, Jiawen Kang, Zehui Xiong, Dusit Niyato, Chau Yuen, Zhu Han

    Abstract: Vehicular Metaverses are developed to enhance the modern automotive industry with an immersive and safe experience among connected vehicles and roadside infrastructures, e.g., RoadSide Units (RSUs). For seamless synchronization with virtual spaces, Vehicle Twins (VTs) are constructed as digital representations of physical entities. However, resource-intensive VTs updating and high mobility of vehi… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 16 pages, 6 figures, 3 tables

  10. arXiv:2406.05383  [pdf, other

    math.DG math.NA

    A Discrete Exterior Calculus of Bundle-valued Forms

    Authors: Theo Braune, Yiying Tong, François Gay-Balmaz, Mathieu Desbrun

    Abstract: The discretization of Cartan's exterior calculus of differential forms has been fruitful in a variety of theoretical and practical endeavors: from computational electromagnetics to the development of Finite-Element Exterior Calculus, the development of structure-preserving numerical tools satisfying exact discrete equivalents to Stokes' theorem or the de Rham complex for the exterior derivative ha… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 58 pages, 20 figures, Fix erroneous line break

    MSC Class: 53A70

  11. arXiv:2406.04679  [pdf, other

    eess.IV cs.CV

    XctDiff: Reconstruction of CT Images with Consistent Anatomical Structures from a Single Radiographic Projection Image

    Authors: Qingze Bai, Tiange Liu, Zhi Liu, Yubing Tong, Drew Torigian, Jayaram Udupa

    Abstract: In this paper, we present XctDiff, an algorithm framework for reconstructing CT from a single radiograph, which decomposes the reconstruction process into two easily controllable tasks: feature extraction and CT reconstruction. Specifically, we first design a progressive feature extraction strategy that is able to extract robust 3D priors from radiographs. Then, we use the extracted prior informat… ▽ More

    Submitted 13 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  12. arXiv:2405.20282  [pdf, other

    cs.CV

    SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow

    Authors: Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang

    Abstract: Semantic segmentation and semantic image synthesis are two representative tasks in visual perception and generation. While existing methods consider them as two distinct tasks, we propose a unified diffusion-based framework (SemFlow) and model them as a pair of reverse problems. Specifically, motivated by rectified flow theory, we train an ordinary differential equation (ODE) model to transport be… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  13. arXiv:2405.04128  [pdf, other

    cs.CL cs.SD eess.AS

    Fine-grained Speech Sentiment Analysis in Chinese Psychological Support Hotlines Based on Large-scale Pre-trained Model

    Authors: Zhonglong Chen, Changwei Song, Yining Chen, Jianqiang Li, Guanghui Fu, Yongsheng Tong, Qing Zhao

    Abstract: Suicide and suicidal behaviors remain significant challenges for public policy and healthcare. In response, psychological support hotlines have been established worldwide to provide immediate help to individuals in mental crises. The effectiveness of these hotlines largely depends on accurately identifying callers' emotional states, particularly underlying negative emotions indicative of increased… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  14. arXiv:2405.04086  [pdf, other

    cs.CL

    Optimizing Language Model's Reasoning Abilities with Weak Supervision

    Authors: Yongqi Tong, Sizhe Wang, Dawei Li, Yifan Wang, Simeng Han, Zi Lin, Chengsong Huang, Jiaxin Huang, Jingbo Shang

    Abstract: While Large Language Models (LLMs) have demonstrated proficiency in handling complex queries, much of the past work has depended on extensively annotated datasets by human experts. However, this reliance on fully-supervised annotations poses scalability challenges, particularly as models and data requirements grow. To mitigate this, we explore the potential of enhancing LLMs' reasoning abilities w… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  15. arXiv:2404.12731  [pdf, other

    hep-ex physics.ins-det

    Near-Quantum-limited Haloscope Detection of Dark Photon Dark Matter Enhanced by a High-Q Superconducting Cavit

    Authors: Runqi Kang, Man Jiao, Yu Tong, Yang Liu, Youpeng Zhong, Yi-Fu Cai, Jingwei Zhou, Xing Rong, Jiangfeng Du

    Abstract: We report new experimental results on the search for dark photons based on a near-quantum-limited haloscope equipped with a superconducting cavity. The loaded quality factor of the superconducting cavity is $6\times10^{5}$, so that the expected signal from dark photon dark matter can be enhanced by more than one order compared to a copper cavity. A Josephson parametric amplifier with a near-quantu… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  16. arXiv:2404.11605  [pdf, other

    cs.CV cs.AI cs.RO

    VG4D: Vision-Language Model Goes 4D Video Recognition

    Authors: Zhichao Deng, Xiangtai Li, Xia Li, Yunhai Tong, Shen Zhao, Mengyuan Liu

    Abstract: Understanding the real world through point cloud video is a crucial aspect of robotics and autonomous driving systems. However, prevailing methods for 4D point cloud recognition have limitations due to sensor resolution, which leads to a lack of detailed information. Recent advances have shown that Vision-Language Models (VLM) pre-trained on web-scale text-image datasets can learn fine-grained vis… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: ICRA 2024

  17. arXiv:2404.11503  [pdf, other

    quant-ph math-ph math.DS

    Mixing Time of Open Quantum Systems via Hypocoercivity

    Authors: Di Fang, Jianfeng Lu, Yu Tong

    Abstract: Understanding the mixing of open quantum systems is a fundamental problem in physics and quantum information science. Existing approaches for estimating the mixing time often rely on the spectral gap estimation of the Lindbladian generator, which can be challenging to obtain in practice. We propose a novel theoretical framework to estimate the mixing time of open quantum systems that treats the Ha… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  18. arXiv:2404.09083  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Interplay between electronic dephasing and localization in finite-sized Chern insulator

    Authors: Yunhe Bai, Yuanzhao Li, Jianli Luan, Yang Chen, Zongwei Gao, Wenyu Song, Yitian Tong, Jinsong Zhang, Yayu Wang, Junjie Qi, Chui-Zhen Chen, Hua Jiang, X. C. Xie, Ke He, Yang Feng, Xiao Feng, Qi-Kun Xue

    Abstract: Anderson localization is anticipated to play a pivotal role in the manifestation of the quantum anomalous Hall effect, akin to its role in conventional quantum Hall effects. The significance of Anderson localization is particularly pronounced in elucidating the reasons behind the fragility of the observed quantum anomalous Hall state in the intrinsic magnetic topological insulator MnBi2Te4 with a… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 20 pages, 4 figures

  19. Point defects in CdTe and CdTeSe alloy: a first principles investigation with DFT+U

    Authors: Xiaofeng Xiang, Yijun Tong, Aaron Gehrke, Scott Dunham

    Abstract: CdTe and its alloy CdTeSe are widely used in optoelectronic devices, such as radiation detectors and solar cells, due to their superior electrical properties. However, the formation of defects and defect complexes in these materials can significantly affect their performance. As a result, understanding the defect formation and recombination processes in CdTe and CdTeSe alloy is of great importance… ▽ More

    Submitted 1 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 10 pages, 23 figures

  20. arXiv:2404.06970  [pdf, other

    cs.CL

    Hybrid Multi-stage Decoding for Few-shot NER with Entity-aware Contrastive Learning

    Authors: Peipei Liu, Gaosheng Wang, Ying Tong, Jian Liang, Zhenquan Ding, Hongsong Zhu

    Abstract: Few-shot named entity recognition can identify new types of named entities based on a few labeled examples. Previous methods employing token-level or span-level metric learning suffer from the computational burden and a large number of negative sample spans. In this paper, we propose the Hybrid Multi-stage Decoding for Few-shot NER with Entity-aware Contrastive Learning (MsFNER), which splits the… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  21. arXiv:2404.04490  [pdf, other

    cs.LG cs.CR

    Hyperparameter Optimization for SecureBoost via Constrained Multi-Objective Federated Learning

    Authors: Yan Kang, Ziyao Ren, Lixin Fan, Linghua Yang, Yongxin Tong, Qiang Yang

    Abstract: SecureBoost is a tree-boosting algorithm that leverages homomorphic encryption (HE) to protect data privacy in vertical federated learning. SecureBoost and its variants have been widely adopted in fields such as finance and healthcare. However, the hyperparameters of SecureBoost are typically configured heuristically for optimizing model performance (i.e., utility) solely, assuming that privacy is… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  22. arXiv:2404.02102  [pdf

    physics.optics physics.atom-ph quant-ph

    Atomic magnetometry using a metasurface polarizing beamsplitter in silicon on sapphire

    Authors: Xuting Yang, Pritha Mukherjee, Minjeong Kim, Hongyan Mei, Chengyu Fang, Soyeon Choi, Yuhan Tong, Sarah Perlowski, David A. Czaplewski, Alan M. Dibos, Mikhail A. Kats, Jennifer T. Choy

    Abstract: We demonstrate atomic magnetometry using a metasurface polarizing beamsplitter fabricated on a silicon-on-sapphire (SOS) platform. The metasurface splits a beam that is near-resonant with the rubidium atoms (795 nm) into orthogonal linear polarizations, enabling measurement of magnetically sensitive circular birefringence in a rubidium vapor through balanced polarimetry. We incorporated the metasu… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  23. arXiv:2404.01510  [pdf, ps, other

    math.AT math.GT

    Homotopy commutativity in quasitoric manifolds

    Authors: Sho Hasui, Daisuke Kishimoto, Yichen Tong, Mitsunobu Tsutaya

    Abstract: We prove that the loop space of a quasitoric manifold is homotopy commutative if and only if the underlying polytope is $(Δでるた^3)^n$ and the characteristic matrix is equivalent to a matrix of certain type. We also construct for each $n\ge 2$ and a positive integer $k$, a quasitoric manifold $M(k,n)$ over $(Δでるた^3)^n$ such that its loop space is homotopy commutative if and only if $k$ is even, where ever… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 14 pages

    MSC Class: 57S12; 55P35; 55Q15

  24. arXiv:2403.20046  [pdf, other

    cs.CL

    Can LLMs Learn from Previous Mistakes? Investigating LLMs' Errors to Boost for Reasoning

    Authors: Yongqi Tong, Dawei Li, Sizhe Wang, Yujia Wang, Fei Teng, Jingbo Shang

    Abstract: Recent works have shown the benefits to LLMs from fine-tuning golden-standard Chain-of-Thought (CoT) rationales or using them as correct examples in few-shot prompting. While humans can indeed imitate correct examples, learning from our mistakes is another vital aspect of human cognition. Hence, a question naturally arises: \textit{can LLMs learn and benefit from their mistakes, especially for the… ▽ More

    Submitted 7 June, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) - Main Conference

  25. arXiv:2403.15875  [pdf, other

    cs.AI cs.CL

    LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification

    Authors: Zhicheng Du, Zhaotian Xie, Yan Tong, Peiwu Qin

    Abstract: This study constructs the LanguAge Model with Prompt EngineeRing (LAMPER) framework, designed to systematically evaluate the adaptability of pre-trained language models (PLMs) in accommodating diverse prompts and their integration in zero-shot time series (TS) classification. We deploy LAMPER in experimental assessments using 128 univariate TS datasets sourced from the UCR archive. Our findings in… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted as tiny paper in ICLR 2024

  26. arXiv:2403.09616  [pdf, other

    cs.CV

    Explore In-Context Segmentation via Latent Diffusion Models

    Authors: Chaoyang Wang, Xiangtai Li, Henghui Ding, Lu Qi, Jiangning Zhang, Yunhai Tong, Chen Change Loy, Shuicheng Yan

    Abstract: In-context segmentation has drawn more attention with the introduction of vision foundation models. Most existing approaches adopt metric learning or masked image modeling to build the correlation between visual prompts and input image queries. In this work, we explore this problem from a new perspective, using one representative generation model, the latent diffusion model (LDM). We observe a tas… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  27. Human Activity Recognition with Low-Resolution Infrared Array Sensor Using Semi-supervised Cross-domain Neural Networks for Indoor Environment

    Authors: Cunyi Yin, Xiren Miao, Jing Chen, Hao Jiang, Deying Chen, Yixuan Tong, Shaocong Zheng

    Abstract: Low-resolution infrared-based human activity recognition (HAR) attracted enormous interests due to its low-cost and private. In this paper, a novel semi-supervised crossdomain neural network (SCDNN) based on 8 $\times$ 8 low-resolution infrared sensor is proposed for accurately identifying human activity despite changes in the environment at a low-cost. The SCDNN consists of feature extractor, dom… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  28. arXiv:2403.01387  [pdf, other

    cs.LG cs.DC

    A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications

    Authors: Wei Guo, Fuzhen Zhuang, Xiao Zhang, Yiqi Tong, Jin Dong

    Abstract: Federated learning (FL) is a novel distributed machine learning paradigm that enables participants to collaboratively train a centralized model with privacy preservation by eliminating the requirement of data sharing. In practice, FL often involves multiple participants and requires the third party to aggregate global information to guide the update of the target participant. Therefore, many FL me… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  29. arXiv:2402.16138  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci

    Integration of Conventional Surface Science Techniques with Surface-Sensitive Azimuthal and Polarization Dependent Femtosecond-Resolved Sum Frequency Generation Spectroscopy

    Authors: Zhipeng Huang, Tobias Roos, Yujin Tong, R. Kramer Campen

    Abstract: Experimental insight into the elementary processes underlying charge transfer across interfaces has blossomed with the wide-spread availability of ultra-high vacuum set-ups that allow the preparation and characterization of solid surfaces with well-defined molecular adsorbates over a wide ranges of temperatures. Thick layers of molecular adsorbates or heterostructures of 2D materials generally pre… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  30. arXiv:2402.05247  [pdf, other

    physics.flu-dyn

    A Geometric VOF Method for Interface Flow Simulations

    Authors: Dezhi Dai, Haomin Yuan, Albert Y. Tong, Adrian Tentner

    Abstract: A novel numerical technique designed for interface flow simulations using the Volume of Fluid (VOF) method on arbitrary unstructured meshes has been introduced. The method is called SimPLIC, which seamlessly integrates Piecewise Linear Interface Calculation (PLIC) and Simpson's rule. The main focus of the proposed method is to compute the volume of the primary phase that moves across a mesh face w… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  31. arXiv:2402.01713  [pdf, other

    cs.CL cs.AI cs.LG

    Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data

    Authors: Yinghao Zhu, Zixiang Wang, Junyi Gao, Yuning Tong, Jingkun An, Weibin Liao, Ewen M. Harrison, Liantao Ma, Chengwei Pan

    Abstract: The inherent complexity of structured longitudinal Electronic Health Records (EHR) data poses a significant challenge when integrated with Large Language Models (LLMs), which are traditionally tailored for natural language processing. Motivated by the urgent need for swift decision-making during new disease outbreaks, where traditional predictive models often fail due to a lack of historical data,… ▽ More

    Submitted 10 February, 2024; v1 submitted 25 January, 2024; originally announced February 2024.

  32. arXiv:2401.12544  [pdf

    cond-mat.mes-hall

    Correlation between magnetic domain structures and quantum anomalous Hall effect in epitaxial MnBi2Te4 thin films

    Authors: Yang Shi, Yunhe Bai, Yuanzhao Li, Yang Feng, Qiang Li, Huanyu Zhang, Yang Chen, Yitian Tong, Jianli Luan, Ruixuan Liu, Pengfei Ji, Zongwei Gao, Hangwen Guo, Jinsong Zhang, Yayu Wang, Xiao Feng, Ke He, Xiaodong Zhou, Jian Shen

    Abstract: We use magnetic force microscopy (MFM) to study spatial uniformity of magnetization of epitaxially grown MnBi2Te4 thin films. Compared to films which exhibit no quantum anomalous Hall effect (QAH), films with QAH are observed to have more spatial uniformity of magnetization with larger domain size. The domain evolution upon magnetic field sweeping indicates that the magnetic domains or the spatial… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 14 pages, 4 figures

  33. arXiv:2401.11450  [pdf

    cond-mat.mes-hall

    Reentrant quantum anomalous Hall effect in molecular beam epitaxy-grown MnBi2Te4 thin films

    Authors: Yuanzhao Li, Yunhe Bai, Yang Feng, Jianli Luan, Zongwei Gao, Yang Chen, Yitian Tong, Ruixuan Liu, Su Kong Chong, Kang L. Wang, Xiaodong Zhou, Jian Shen, Jinsong Zhang, Yayu Wang, Chui-Zhen Chen, XinCheng Xie, Xiao Feng, Ke He, Qi-Kun Xue

    Abstract: In this study, we investigate intrinsic magnetic topological insulator MnBi2Te4 thin films grown by molecular beam epitaxy. We observe a reentrant quantum anomalous Hall effect when the Fermi energy enters the valance band and magnetic field equals zero, indicating the emergence of the Chern Anderson insulator state. The discovery opens a new avenue for realizing the QAH effect and underscores the… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 15 pages, 4 figures

  34. arXiv:2401.10228  [pdf, other

    cs.CV

    RAP-SAM: Towards Real-Time All-Purpose Segment Anything

    Authors: Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang

    Abstract: Advanced by transformer architecture, vision foundation models (VFMs) achieve remarkable progress in performance and generalization ability. Segment Anything Model (SAM) is one remarkable model that can achieve generalized segmentation. However, most VFMs cannot run in realtime, which makes it difficult to transfer them into several products. On the other hand, current real-time segmentation mainl… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Project Page: https://xushilin1.github.io/rap_sam/

  35. arXiv:2401.10226  [pdf, other

    cs.CV

    Towards Language-Driven Video Inpainting via Multimodal Large Language Models

    Authors: Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy

    Abstract: We introduce a new task -- language-driven video inpainting, which uses natural language instructions to guide the inpainting process. This approach overcomes the limitations of traditional video inpainting methods that depend on manually labeled binary masks, a process often tedious and labor-intensive. We present the Remove Objects from Videos by Instructions (ROVI) dataset, containing 5,650 vid… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Project Page: https://jianzongwu.github.io/projects/rovi

  36. arXiv:2401.04136  [pdf, other

    cs.CR cs.AI

    The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline

    Authors: Haonan Wang, Qianli Shen, Yao Tong, Yang Zhang, Kenji Kawaguchi

    Abstract: The commercialization of text-to-image diffusion models (DMs) brings forth potential copyright concerns. Despite numerous attempts to protect DMs from copyright issues, the vulnerabilities of these solutions are underexplored. In this study, we formalized the Copyright Infringement Attack on generative AI models and proposed a backdoor attack method, SilentBadDiffusion, to induce copyright infring… ▽ More

    Submitted 26 May, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted for presentation at ICML 2024

  37. arXiv:2401.03664  [pdf

    eess.IV cs.CV cs.LG

    Dual-Channel Reliable Breast Ultrasound Image Classification Based on Explainable Attribution and Uncertainty Quantification

    Authors: Shuge Lei, Haonan Hu, Dasheng Sun, Huabin Zhang, Kehong Yuan, Jian Dai, Jijun Tang, Yan Tong

    Abstract: This paper focuses on the classification task of breast ultrasound images and researches on the reliability measurement of classification results. We proposed a dual-channel evaluation framework based on the proposed inference reliability and predictive reliability scores. For the inference reliability evaluation, human-aligned and doctor-agreed inference rationales based on the improved feature a… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  38. arXiv:2312.16197  [pdf, other

    cs.CV cs.LG

    INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields

    Authors: Andrew Hou, Feng Liu, Zhiyuan Ren, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu

    Abstract: We propose INFAMOUS-NeRF, an implicit morphable face model that introduces hypernetworks to NeRF to improve the representation power in the presence of many training subjects. At the same time, INFAMOUS-NeRF resolves the classic hypernetwork tradeoff of representation power and editability by learning semantically-aligned latent spaces despite the subject-specific models, all without requiring a l… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  39. arXiv:2312.12171  [pdf, other

    math.DS math.NA physics.comp-ph

    Equivariant divergence formula for chaotic flows

    Authors: Angxiu Ni, Yao Tong

    Abstract: We prove the equivariant divergence formula for the axiom A flow attractors, which is a recursive formula for perturbation of transfer operators of physical measures along center-unstable manifolds. Hence the linear response acquires an `ergodic theorem', which means that it can be sampled by recursively computing only $2u$ many vectors on one orbit, where $u$ is the unstable dimension.

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: comments are welcome!

  40. arXiv:2312.12012  [pdf, other

    cs.DB

    Efficient and Private Federated Trajectory Matching

    Authors: Yuxiang Wang, Yuxiang Zeng, Yi Xu, Zimu Zhou, Yongxin Tong

    Abstract: Federated Trajectory Matching (FTM) is gaining increasing importance in big trajectory data analytics, supporting diverse applications such as public health, law enforcement, and emergency response. FTM retrieves trajectories that match with a query trajectory from a large-scale trajectory database, while safeguarding the privacy of trajectories in both the query and the database. A naive solution… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 14 pages

  41. arXiv:2311.14818  [pdf, other

    quant-ph

    Stochastic error cancellation in analog quantum simulation

    Authors: Yiyi Cai, Yu Tong, John Preskill

    Abstract: Analog quantum simulation is a promising path towards solving classically intractable problems in many-body physics on near-term quantum devices. However, the presence of noise limits the size of the system and the length of time that can be simulated. In our work, we consider an error model in which the actual Hamiltonian of the simulator differs from the target Hamiltonian we want to simulate by… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 14 pages, 2 figures

  42. Large-area, freestanding single-crystal gold of single nanometer thickness

    Authors: Chenxinyu Pan, Yuanbiao Tong, Haoliang Qian, Alexey V. Krasavin, Jialin Li, Jiajie Zhu, Yiyun Zhang, Bowen Cui, Zhiyong Li, Chenming Wu, Zhenxin Wang, Lufang Liu, Linjun Li, Xin Guo, Anatoly V. Zayats, Limin Tong, Pan Wang

    Abstract: Two-dimensional single-crystal metals are highly sought after for next-generation technologies. Here, we report large-area (>10^4 μみゅーm2), single-crystal two-dimensional gold with thicknesses down to a single-nanometer level, employing an atomic-level-precision chemical etching approach. The ultrathin thickness and single-crystal quality endow two-dimensional gold with unique properties including sig… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Journal ref: Nature Commun. 15 (2024) 2840-2849

  43. arXiv:2311.05282  [pdf, other

    physics.optics eess.SP

    Empowering high-dimensional optical fiber communications with integrated photonic processors

    Authors: Kaihang Lu, Zengqi Chen, Hao Chen, Wu Zhou, Zunyue Zhang, Hon Ki Tsang, Yeyu Tong

    Abstract: Mode division multiplexing (MDM) in optical fibers enables multichannel capabilities for various applications, including data transmission, quantum networks, imaging, and sensing. However, MDM optical fiber systems, usually necessities bulk-optics approaches for launching different orthogonal fiber modes into the multimode optical fiber, and multiple-input multiple-output digital electronic signal… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  44. arXiv:2311.03675  [pdf, other

    physics.optics physics.app-ph

    Ultra-compact and efficient integrated multichannel mode multiplexer in silicon for few-mode fibers

    Authors: Wu Zhou, Zunyue Zhang, Hao Chen, Hon Ki Tsang, Yeyu Tong

    Abstract: Space-division multiplexing (SDM) is one of the key enabling technologies to increase the capacity of fiber communication systems. However, implementing SDM-based systems using multimode fiber has been challenging with the need for compact, low-cost, and scalable mode de/multiplexer (DE/MUX). Here we present a novel integrated mode MUX for few-mode fibers (FMFs) which can launch up to eight spatia… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 10 pages, 5 figures

  45. arXiv:2310.19657  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci physics.optics

    Isolating the Nonlinear Optical Response of a MoS$_2$ Monolayer under Extreme Screening of a Metal Substrate

    Authors: Tao Yang, Stephan Sleziona, Erik Pollmann, Eckart Hasselbrink, Peter Kratzer, Marika Schleberger, R. Kramer Campen, Yujin Tong

    Abstract: Transition metal dichalcogenides (TMDCs) monolayers, as two-dimensional (2D) direct bandgap semiconductors, hold promise for advanced optoelectronic and photocatalytic devices. Interaction with three-dimensional (3D) metals, like Au, profoundly affects their optical properties, posing challenges in characterizing the monolayer's optical responses within the semiconductor-metal junction. In this st… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 14 pages, 4 figures + supplemental material

    Journal ref: Physical Review B 109, L161402 (2024)

  46. arXiv:2310.17389  [pdf, other

    cs.CL cs.AI

    ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation

    Authors: Zi Lin, Zihan Wang, Yongqi Tong, Yangkun Wang, Yuxin Guo, Yujia Wang, Jingbo Shang

    Abstract: Despite remarkable advances that large language models have achieved in chatbots, maintaining a non-toxic user-AI interactive environment has become increasingly critical nowadays. However, previous efforts in toxicity detection have been mostly based on benchmarks derived from social media content, leaving the unique challenges inherent to real-world user-AI interactions insufficiently explored.… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Journal ref: EMNLP findings 2023

  47. arXiv:2310.13287  [pdf

    physics.optics cond-mat.mtrl-sci

    Space-confined solid-phase growth of two-domain 1T'-ReSe2 for tunable optoelectronics

    Authors: Yunhao Tong, Fanyi Kong, Lei Zhang, Xinyi Hou, Zhengxian Zha, Zheng Hao, Jianxun Dai, Changsen Sun, Jingfeng Song, Huolin Huang, Chenhua Ji, Lujun Pan, Dawei Li

    Abstract: Two-dimensional layered ReX2 (X = Se, S) has attracted researcher's great interest due to its unusual in-plane anisotropic optical and electrical properties and great potential in polarization-sensitive optoelectronic devices, while the clean, energy-saving, and ecological synthesis of highly-crystalline ReSe2 with controlled domains remains challenging yet promising. Here, we develop a novel spac… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 24 pages, 6 figures

  48. arXiv:2310.12342  [pdf, other

    cs.CL cs.AI

    Eliminating Reasoning via Inferring with Planning: A New Framework to Guide LLMs' Non-linear Thinking

    Authors: Yongqi Tong, Yifan Wang, Dawei Li, Sizhe Wang, Zi Lin, Simeng Han, Jingbo Shang

    Abstract: Chain-of-Thought(CoT) prompting and its variants explore equipping large language models (LLMs) with high-level reasoning abilities by emulating human-like linear cognition and logic. However, the human mind is complicated and mixed with both linear and nonlinear thinking. In this work, we propose \textbf{I}nferential \textbf{E}xclusion \textbf{P}rompting (IEP), a novel prompting that combines the… ▽ More

    Submitted 14 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  49. arXiv:2310.08850  [pdf

    cond-mat.mtrl-sci

    An unprecedented synergy of high-temperature tensile strength and ductility in a NiCoCrAlTi high-entropy alloy

    Authors: Hongmin Zhang, Fanchao Meng, Haoyan Meng, Yang Tong, Peter K. Liaw, Xiao Yang, Lei Zhao, Haizhou Wang, Yanfei Gao, Shuying Chen

    Abstract: The present work reported a novel L12-strengthening NiCoCrAlTi high entropy alloy (HEA) with an outstanding synergy of tensile strength and ductility at both ambient and high temperatures. Transmission electron microscopy (TEM) characterization revealed a high density of rod-like and spheroidal L12 precipitates distributing in the micro/nanograins and non-recrystallized regions in the annealed spe… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  50. arXiv:2310.01393  [pdf, other

    cs.CV

    DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection

    Authors: Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yunhai Tong, Chen Change Loy

    Abstract: Open-vocabulary object detection (OVOD) aims to detect the objects beyond the set of classes observed during training. This work introduces a straightforward and efficient strategy that utilizes pre-trained vision-language models (VLM), like CLIP, to identify potential novel classes through zero-shot classification. Previous methods use a class-agnostic region proposal network to detect object pro… ▽ More

    Submitted 1 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.