(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 2,154 results for author: Gao, J

.
  1. arXiv:2408.03705  [pdf, other

    hep-ph

    A 95 GeV Higgs boson and spontaneous CP-violation at the finite temperature

    Authors: Jing Gao, Jinghong Ma, Lei Wang, Haotian Xu

    Abstract: The ATLAS and CMS collaborations reported a diphoton excess in the invariant mass distribution around the 95.4 GeV with a local significance of $3.1σしぐま$. Moreover, there is another $2.3σしぐま$ local excess in the $b\bar{b}$ final state at LEP in the same mass region. A plausible solution is that the Higgs sector is extended to include an additional Higgs bosom with a mass of $95.4$ GeV. We study a comple… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 28 pages, 8 figures, 1 tables. arXiv admin note: text overlap with arXiv:2311.02828

  2. arXiv:2408.03238  [pdf, other

    cs.RO cs.CV

    LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion

    Authors: Jinyu Zhang, Yongchong Gu, Jianxiong Gao, Haitao Lin, Qiang Sun, Xinwei Sun, Xiangyang Xue, Yanwei Fu

    Abstract: This paper addresses the challenge of perceiving complete object shapes through visual perception. While prior studies have demonstrated encouraging outcomes in segmenting the visible parts of objects within a scene, amodal segmentation, in particular, has the potential to allow robots to infer the occluded parts of objects. To this end, this paper introduces a new framework that explores amodal s… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: accepted by IROS2024

  3. arXiv:2408.03079  [pdf, other

    cs.CL cs.AI

    Enhancing Complex Causality Extraction via Improved Subtask Interaction and Knowledge Fusion

    Authors: Jinglong Gao, Chen Lu, Xiao Ding, Zhongyang Li, Ting Liu, Bing Qin

    Abstract: Event Causality Extraction (ECE) aims at extracting causal event pairs from texts. Despite ChatGPT's recent success, fine-tuning small models remains the best approach for the ECE task. However, existing fine-tuning based ECE methods cannot address all three key challenges in ECE simultaneously: 1) Complex Causality Extraction, where multiple causal-effect pairs occur within a single sentence; 2)… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: NLPCC 2024 Oral

  4. arXiv:2408.03075  [pdf

    astro-ph.EP physics.space-ph

    Characterizing the current systems in the Martian ionosphere

    Authors: Jiawei Gao, Shibang Li, Anna Mittelholz, Zhaojin Rong, Moa Persson, Zhen Shi, Haoyu Lu, Chi Zhang, Xiaodong Wang, Chuanfei Dong, Lucy Klinger, Jun Cui, Yong Wei, Yongxin Pan

    Abstract: When the solar wind interacts with the ionosphere of an unmagnetized planet, it induces currents that form an induced magnetosphere. These currents and their associated magnetic fields play a pivotal role in controlling the movement of charged particles, which is essential for understanding the escape of planetary ions. Unlike the well-documented magnetospheric current systems, the ionospheric cur… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 20 pages, 6 figures

  5. arXiv:2408.01970  [pdf, other

    cs.AI cs.CV

    SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning

    Authors: Biqing Qi, Junqi Gao, Xinquan Chen, Dong Li, Weinan Zhang, Bowen Zhou

    Abstract: The ability of humans to rapidly learn new knowledge while retaining old memories poses a significant challenge for current deep learning models. To handle this challenge, we draw inspiration from human memory and learning mechanisms and propose the Self-Reflective Complementary Incremental System (SR-CIS). Comprising the deconstructed Complementary Inference Module (CIM) and Complementary Memory… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

  6. arXiv:2408.01766  [pdf, other

    cs.CV

    MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action Recognition

    Authors: Ruoyu Wang, Wenqian Wang, Jianjun Gao, Dan Lin, Kim-Hui Yap, Bingbing Li

    Abstract: Driver action recognition, aiming to accurately identify drivers' behaviours, is crucial for enhancing driver-vehicle interactions and ensuring driving safety. Unlike general action recognition, drivers' environments are often challenging, being gloomy and dark, and with the development of sensors, various cameras such as IR and depth cameras have emerged for analyzing drivers' behaviors. Therefor… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

  7. arXiv:2408.01343  [pdf, other

    cs.CV cs.AI cs.LG

    StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation

    Authors: Bingyu Li, Da Zhang, Zhiyuan Zhao, Junyu Gao, Xuelong Li

    Abstract: Multimodal semantic segmentation shows significant potential for enhancing segmentation accuracy in complex scenes. However, current methods often incorporate specialized feature fusion modules tailored to specific modalities, thereby restricting input flexibility and increasing the number of training parameters. To address these challenges, we propose StitchFusion, a straightforward yet effective… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  8. arXiv:2408.01132  [pdf, other

    math.NA

    Spectral methods on a triangle and W-systems

    Authors: Jing Gao, Arieh Iserles

    Abstract: We present an overarching framework for stable spectral methods on a triangle, defined by a multivariate W-system and based on orthogonal polynomials on the triangle. Motivated by the Koornwinder orthogonal polynomials on the triangle, we introduce a Koornwinder W-system. Once discretised by this W-system, the resulting spatial differentiation matrix is skew symmetric, affording important advantag… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    MSC Class: Primary 65M70; secondary 42C05

  9. arXiv:2408.01091  [pdf, other

    cs.AI

    Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions

    Authors: Jin Gao, Lei Gan, Yuankai Li, Yixin Ye, Dequan Wang

    Abstract: Large multimodal models (LMMs) excel in adhering to human instructions. However, self-contradictory instructions may arise due to the increasing trend of multimodal interaction and context length, which is challenging for language beginners and vulnerable populations. We introduce the Self-Contradictory Instructions benchmark to evaluate the capability of LMMs in recognizing conflicting commands.… ▽ More

    Submitted 5 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: Accepted by the 18th European Conference on Computer Vision ECCV 2024

  10. arXiv:2408.00355  [pdf, other

    cs.CV cs.AI

    DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training

    Authors: Yu Xie, Qian Qiao, Jun Gao, Tianxiang Wu, Shaoyao Huang, Jiaqing Fan, Ziqiang Cao, Zili Wang, Yue Zhang, Jielei Zhang, Huyang Sun

    Abstract: More and more end-to-end text spotting methods based on Transformer architecture have demonstrated superior performance. These methods utilize a bipartite graph matching algorithm to perform one-to-one optimal matching between predicted objects and actual objects. However, the instability of bipartite graph matching can lead to inconsistent optimization targets, thereby affecting the training perf… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted by ACMMM2024

  11. arXiv:2407.21308  [pdf, other

    cs.CV

    Enhanced Self-Checkout System for Retail Based on Improved YOLOv10

    Authors: Lianghao Tan, Shubing Liu, Jing Gao, Xiaoyi Liu, Linyue Chu, Huangqi Jiang

    Abstract: With the rapid advancement of deep learning technologies, computer vision has shown immense potential in retail automation. This paper presents a novel self-checkout system for retail based on an improved YOLOv10 network, aimed at enhancing checkout efficiency and reducing labor costs. We propose targeted optimizations to the YOLOv10 model, by incorporating the detection head structure from YOLOv8… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  12. arXiv:2407.20789  [pdf, other

    math.AP math.FA math.MG

    Hölder regularity of harmonic functions on metric measure spaces

    Authors: Jin Gao, Meng Yang

    Abstract: We introduce the Hölder regularity condition for harmonic functions on metric measure spaces and prove that under mild volume regular condition and upper heat kernel estimate, the Hölder regularity condition, the weak Bakry-Émery non-negative curvature condition, the heat kernel Hölder continuity with or without exponential terms and the heat kernel near-diagonal lower bound are equivalent. As app… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 36 pages

    MSC Class: 28A80; 35K08

  13. arXiv:2407.20613  [pdf

    cond-mat.soft physics.bio-ph physics.chem-ph

    Escape of an Active Ring from an Attractive Surface: Behaving Like a Self-Propelled Brownian Particle

    Authors: Bin Tang, Jin-cheng Gao, Kang Chen, Tian Hui Zhang, Wen-de Tian

    Abstract: Escape of active agents from metastable states are of great current interest in statistical and biological physics. In this study, we find that a flexible active Brownian ring escapes from a flat attractive surface though two distinct mechanisms: Kramers-like thermal activation at large rotational diffusion coefficients, but with an effective temperature, and the first-passage process at small rot… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  14. arXiv:2407.20610  [pdf

    cond-mat.soft physics.bio-ph

    Constrained motion of self-propelling eccentric disks linked by a spring

    Authors: Tian-liang Xu, Chao-ran Qin, Bin Tang, Jin-cheng Gao, Jiankang Zhou, Kang Chen, Tian Hui Zhang, Wen-de Tian

    Abstract: It has been supposed that the interplay of elasticity and activity plays a key role in triggering the non-equilibrium behaviors in biological systems. However, the experimental model system is missing to investigate the spatiotemporally dynamical phenomena. Here, a model system of an active chain, where active eccentric-disks are linked by a spring, is designed to study the interplay of activity,… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  15. arXiv:2407.20606  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Evidence for Two-dimensional Weyl Fermions in Air-Stable Monolayer PtTe$_{1.75}$

    Authors: Zhihao Cai, Haijun Cao, Haohao Sheng, Xuegao Hu, Zhenyu Sun, Qiaoxiao Zhao, Jisong Gao, Shin-ichiro Ideta, Kenya Shimada, Jiawei Huang, Peng Cheng, Lan Chen, Yugui Yao, Sheng Meng, Kehui Wu, Zhijun Wang, Baojie Feng

    Abstract: The Weyl semimetals represent a distinct category of topological materials wherein the low-energy excitations appear as the long-sought Weyl fermions. Exotic transport and optical properties are expected because of the chiral anomaly and linear energy-momentum dispersion. While three-dimensional Weyl semimetals have been successfully realized, the quest for their two-dimensional (2D) counterparts… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Nano Letters, In Press

  16. arXiv:2407.20478  [pdf

    physics.soc-ph cond-mat.stat-mech

    Hidden high-risky states identification from routine urban traffic

    Authors: Shiyan Liu, Mingyang Bai, Shengmin Guo, Jianxi Gao, Huijun Sun, Ziyou Gao, Daqing Li

    Abstract: One of the core risk management tasks is to identify hidden high-risky states that may lead to system breakdown, which can provide valuable early warning knowledge. However, due to high dimensionality and nonlinear interaction embedded in large-scale complex systems like urban traffic, it remains challenging to identify hidden high-risky states from huge system state space where over 99% of possib… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  17. arXiv:2407.19679  [pdf

    cs.CV cs.AI

    Harnessing Large Vision and Language Models in Agriculture: A Review

    Authors: Hongyan Zhu, Shuai Qin, Min Su, Chengzhi Lin, Anjie Li, Junfeng Gao

    Abstract: Large models can play important roles in many domains. Agriculture is another key factor affecting the lives of people around the world. It provides food, fabric, and coal for humanity. However, facing many challenges such as pests and diseases, soil degradation, global warming, and food security, how to steadily increase the yield in the agricultural sector is a problem that humans still need to… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  18. arXiv:2407.19389  [pdf, other

    cs.DC cs.LG math.OC

    FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware Submodel Extraction

    Authors: Feijie Wu, Xingchen Wang, Yaqing Wang, Tianci Liu, Lu Su, Jing Gao

    Abstract: In federated learning (FL), accommodating clients' varied computational capacities poses a challenge, often limiting the participation of those with constrained resources in global model training. To address this issue, the concept of model heterogeneity through submodel extraction has emerged, offering a tailored solution that aligns the model's complexity with each client's computational capacit… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  19. arXiv:2407.19150  [pdf, other

    cs.AR

    RoSE-Opt: Robust and Efficient Analog Circuit Parameter Optimization with Knowledge-infused Reinforcement Learning

    Authors: Weidong Cao, Jian Gao, Tianrui Ma, Rui Ma, Mouhacine Benosman, Xuan Zhang

    Abstract: This paper proposes a learning framework, RoSE-Opt, to achieve robust and efficient analog circuit parameter optimization. RoSE-Opt has two important features. First, it incorporates key domain knowledge of analog circuit design, such as circuit topology, couplings between circuit specifications, and variations of process, supply voltage, and temperature, into the learning loop. This strategy faci… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 14 pages, 12 Figures. Accepted by IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

  20. arXiv:2407.18590  [pdf, other

    cs.CV

    From 2D to 3D: AISG-SLA Visual Localization Challenge

    Authors: Jialin Gao, Bill Ong, Darld Lwi, Zhen Hao Ng, Xun Wei Yee, Mun-Thye Mak, Wee Siong Ng, See-Kiong Ng, Hui Ying Teo, Victor Khoo, Georg Bökman, Johan Edstedt, Kirill Brodt, Clémentin Boittiaux, Maxime Ferrera, Stepan Konev

    Abstract: Research in 3D mapping is crucial for smart city applications, yet the cost of acquiring 3D data often hinders progress. Visual localization, particularly monocular camera position estimation, offers a solution by determining the camera's pose solely through visual cues. However, this task is challenging due to limited data from a single camera. To tackle these challenges, we organized the AISG-SL… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  21. arXiv:2407.18525  [pdf, other

    cs.CL cs.AI cs.LG

    Is larger always better? Evaluating and prompting large language models for non-generative medical tasks

    Authors: Yinghao Zhu, Junyi Gao, Zixiang Wang, Weibin Liao, Xiaochen Zheng, Lifang Liang, Yasha Wang, Chengwei Pan, Ewen M. Harrison, Liantao Ma

    Abstract: The use of Large Language Models (LLMs) in medicine is growing, but their ability to handle both structured Electronic Health Record (EHR) data and unstructured clinical notes is not well-studied. This study benchmarks various models, including GPT-based LLMs, BERT-based models, and traditional clinical predictive models, for non-generative medical tasks utilizing renowned datasets. We assessed 14… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.01713

  22. arXiv:2407.18116  [pdf

    cond-mat.soft physics.flu-dyn

    Sedimenting microrollers navigate saturated porous media

    Authors: Samuel R. Wilson-Whitford, David Kramer, Jinghui Gao, Maria Chiara Roffin, James F. Gilchrist

    Abstract: Particle sedimentation through porous media is limited by the inability of passive material to overcome surface interactions and a tortuous network of pores. This limits transport, delivery, and effectiveness of chemicals used as reactants, nutrients, pesticides, or for waste remediation. This work develops magnetically responsive microrollers that navigate the complex interstitial network of poro… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  23. arXiv:2407.16935  [pdf, other

    stat.ML cs.LG

    Federated Automatic Latent Variable Selection in Multi-output Gaussian Processes

    Authors: Jingyi Gao, Seokhyun Chung

    Abstract: This paper explores a federated learning approach that automatically selects the number of latent processes in multi-output Gaussian processes (MGPs). The MGP has seen great success as a transfer learning tool when data is generated from multiple sources/units/entities. A common approach in MGPs to transfer knowledge across units involves gathering all data from each unit to a central server and e… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  24. arXiv:2407.16584  [pdf

    q-bio.BM

    The need to implement FAIR principles in biomolecular simulations

    Authors: Rommie Amaro, Johan Åqvist, Ivet Bahar, Federica Battistini, Adam Bellaiche, Daniel Beltran, Philip C. Biggin, Massimiliano Bonomi, Gregory R. Bowman, Richard Bryce, Giovanni Bussi, Paolo Carloni, David Case, Andrea Cavalli, Chie-En A. Chang, Thomas E. Cheatham III, Margaret S. Cheung, Cris Chipot, Lillian T. Chong, Preeti Choudhary, Cecilia Clementi, Rosana Collepardo-Guevara, Peter Coveney, T. Daniel Crawford, Matteo Dal Peraro , et al. (96 additional authors not shown)

    Abstract: This letter illustrates the opinion of the molecular dynamics (MD) community on the need to adopt a new FAIR paradigm for the use of molecular simulations. It highlights the necessity of a collaborative effort to create, establish, and sustain a database that allows findability, accessibility, interoperability, and reusability of molecular dynamics simulation data. Such a development would democra… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  25. arXiv:2407.16326   

    cs.AI cs.LG

    On The Expressive Power of Knowledge Graph Embedding Methods

    Authors: Jiexing Gao, Dmitry Rodin, Vasily Motolygin, Denis Zaytsev

    Abstract: Knowledge Graph Embedding (KGE) is a popular approach, which aims to represent entities and relations of a knowledge graph in latent spaces. Their representations are known as embeddings. To measure the plausibility of triplets, score functions are defined over embedding spaces. Despite wide dissemination of KGE in various tasks, KGE methods have limitations in reasoning abilities. In this paper w… ▽ More

    Submitted 26 July, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: This paper may involve data that is not readily available to the public

    MSC Class: MCS 68T30 ACM Class: I.2.4

  26. arXiv:2407.16061  [pdf, other

    hep-ph

    A framework for simultaneous fit of QCD and BSM parameters with xFitter

    Authors: XiaoMin Shen, Simone Amoroso, Jun Gao, Katerina Lipka, Oleksandr Zenaiev

    Abstract: An extension of the xFitter open-source program for QCD analyses is presented, allowing for a polynomial parameterization of the dependence of physical observables on theoretical parameters. This extension enables simultaneous determination of parton distribution functions (PDFs) and new physics parameters within the framework of the Standard Model Effective Field Theory (SMEFT). The functionaliti… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 31 pages, 10 figures

  27. arXiv:2407.14086  [pdf, other

    cs.CV

    Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking

    Authors: Yunfei Zhang, Chao Liang, Jin Gao, Zhipeng Zhang, Weiming Hu, Stephen Maybank, Xue Zhou, Liang Li

    Abstract: Joint Detection and Embedding (JDE) trackers have demonstrated excellent performance in Multi-Object Tracking (MOT) tasks by incorporating the extraction of appearance features as auxiliary tasks through embedding Re-Identification task (ReID) into the detector, achieving a balance between inference speed and tracking performance. However, solving the competition between the detector and the featu… ▽ More

    Submitted 6 August, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: A submission to IJCV

  28. arXiv:2407.13985  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Cluster Sliding Ferroelectricity in Trilayer Quasi-Hexagonal C60

    Authors: Xuefei Wang, Yanhan Ren, Shi Qiu, Fan Zhang, Xueao Li, Junfeng Gao, Weiwei Gao, Jijun Zhao

    Abstract: Electric polarization typically originates from non-centrosymmetric charge distributions. Since chemical bonds between atoms of the same elements favor centrosymmetric crystal structures and symmetrically distributed electron charges, elemental ferroelectrics are extremely rare. In comparison to atoms, elemental clusters are less symmetric and typically have various preferred orientations in cryst… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 5 figures

  29. arXiv:2407.13899  [pdf, other

    astro-ph.EP

    Main-sequence systems: orbital stability around single star hosts

    Authors: Hareesh Gautham Bhaskar, Nathaniel W. H. Moore, Jiapeng Gao, Gongjie Li, Billy Quarles

    Abstract: Stability is one of the most fundamental aspects regarding planetary systems. It plays an important role in our understanding on the formation channel of the planetary systems, as well as their habitability. Many approaches have been adopted to determine the stability of these systems, including brute-force N-body simulations, semi-analytical calculations, and more recently machine learning method… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Preprint of a chapter for the 'Encyclopedia of Astrophysics' (Editor-in-Chief Ilya Mandel, Section Editor Dimitri Veras) to be published by Elsevier as a Reference Module. The number of references was capped

  30. arXiv:2407.13227  [pdf, other

    eess.SY

    Solving the Model Unavailable MARE using Q-Learning Algorithm

    Authors: Fei Yan, Jie Gao, Tao Feng, Jianxing Liu

    Abstract: In this paper, the discrete-time modified algebraic Riccati equation (MARE) is solved when the system model is completely unavailable. To achieve this, firstly a brand new iterative method based on the standard discrete-time algebraic Riccati equation (DARE) and its input weighting matrix is proposed to solve the MARE. For the single-input case, the iteration can be initialized by an arbitrary pos… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  31. arXiv:2407.12204  [pdf, other

    cond-mat.mes-hall physics.app-ph

    Acoustic modulation of individual nanowire quantum dots integrated into a hybrid thin-film lithium niobate photonic platform

    Authors: Thomas Descamps, Tanguy Schetelat, Jun Gao, Philip J. Poole, Dan Dalacu, Ali W. Elshaari, Val Zwiller

    Abstract: Surface acoustic waves (SAWs) are a powerful tool for controlling a wide range of quantum systems, particularly quantum dots (QDs) via their oscillating strain fields. The resulting energy modulation of these single photon sources can be harnessed to achieve spectral overlap between two QDs otherwise emitting at different wavelengths. In this study, we integrate InAsP/InP nanowire quantum dots ont… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  32. arXiv:2407.11781  [pdf, other

    cs.CV

    SlingBAG: Sliding ball adaptive growth algorithm with differentiable radiation enables super-efficient iterative 3D photoacoustic image reconstruction

    Authors: Shuang Li, Yibing Wang, Jian Gao, Chulhong Kim, Seongwook Choi, Yu Zhang, Qian Chen, Yao Yao, Changhui Li

    Abstract: High-quality 3D photoacoustic imaging (PAI) reconstruction under sparse view or limited view has long been challenging. Traditional 3D iterative-based reconstruction methods suffer from both slow speed and high memory consumption. Recently, in computer graphics, the differentiable rendering has made significant progress, particularly with the rise of 3D Gaussian Splatting. Inspired by these, we in… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  33. arXiv:2407.11398  [pdf, other

    cs.CV

    Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

    Authors: Yanqin Jiang, Chaohui Yu, Chenjie Cao, Fan Wang, Weiming Hu, Jin Gao

    Abstract: Recent advances in 4D generation mainly focus on generating 4D content by distilling pre-trained text or single-view image-conditioned models. It is inconvenient for them to take advantage of various off-the-shelf 3D assets with multi-view attributes, and their results suffer from spatiotemporal inconsistency owing to the inherent ambiguity in the supervision signals. In this work, we present Anim… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Project Page: https://animate3d.github.io/

  34. arXiv:2407.10249  [pdf, other

    cs.DS

    Low Sensitivity Hopsets

    Authors: Vikrant Ashvinkumar, Aaron Bernstein, Chengyuan Deng, Jie Gao, Nicole Wein

    Abstract: Given a weighted graph $G$, a $(βべーた,\varepsilon)$-hopset $H$ is an edge set such that for any $s,t \in V(G)$, where $s$ can reach $t$ in $G$, there is a path from $s$ to $t$ in $G \cup H$ which uses at most $βべーた$ hops whose length is in the range $[dist_G(s,t), (1+\varepsilon)dist_G(s,t)]$. We break away from the traditional question that asks for a hopset that achieves small $|H|$ and instead study i… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Abstract shortened to meet arXiv requirements

  35. arXiv:2407.10105  [pdf, other

    cs.CV cs.AI

    Hierarchical Multi-modal Transformer for Cross-modal Long Document Classification

    Authors: Tengfei Liu, Yongli Hu, Junbin Gao, Yanfeng Sun, Baocai Yin

    Abstract: Long Document Classification (LDC) has gained significant attention recently. However, multi-modal data in long documents such as texts and images are not being effectively utilized. Prior studies in this area have attempted to integrate texts and images in document-related tasks, but they have only focused on short text sequences and images of pages. How to classify long documents with hierarchic… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: IEEE Transactions on Multimedia

  36. arXiv:2407.10059  [pdf, other

    hep-ph hep-ex

    Towards ultimate fragmentation functions at future lepton colliders

    Authors: Bin Zhou, Jun Gao

    Abstract: In this work, we study the constraining power of future lepton colliders on fragmentation functions (FFs) to light charged hadrons from quarks and gluon in the framework of QCD collinear factorization. We perform analyses of FFs by including a wide range of pseudo--data from future lepton colliders, such as measurements on hadron multiplicities in the inclusive production of two jets and $W$ boson… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 30 pages, 9 figures, 2 tables. Code is available at https://fmnlo.sjtu.edu.cn/~fmnlo/

  37. arXiv:2407.09698  [pdf, other

    cs.LG

    RIO-CPD: A Riemannian Geometric Method for Correlation-aware Online Change Point Detection

    Authors: Chengyuan Deng, Zhengzhang Chen, Xujiang Zhao, Haoyu Wang, Junxiang Wang, Haifeng Chen, Jie Gao

    Abstract: The objective of change point detection is to identify abrupt changes at potentially multiple points within a data sequence. This task is particularly challenging in the online setting where various types of changes can occur, including shifts in both the marginal and joint distributions of the data. This paper tackles these challenges by sequentially tracking correlation matrices on the Riemannia… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  38. arXiv:2407.09590  [pdf, other

    cs.CL cs.LG

    Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts

    Authors: Zeliang Zhang, Xiaodong Liu, Hao Cheng, Chenliang Xu, Jianfeng Gao

    Abstract: By increasing model parameters but activating them sparsely when performing a task, the use of Mixture-of-Experts (MoE) architecture significantly improves the performance of Large Language Models (LLMs) without increasing the inference cost. However, the memory consumption due to the growing number of experts presents a challenge to the deployment of these models in many real world settings. Our… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 13pages, 6 figures

  39. arXiv:2407.08959  [pdf, other

    cs.CL

    Domain-Hierarchy Adaptation via Chain of Iterative Reasoning for Few-shot Hierarchical Text Classification

    Authors: Ke Ji, Peng Wang, Wenjun Ke, Guozheng Li, Jiajun Liu, Jingsheng Gao, Ziyu Shang

    Abstract: Recently, various pre-trained language models (PLMs) have been proposed to prove their impressive performances on a wide range of few-shot tasks. However, limited by the unstructured prior knowledge in PLMs, it is difficult to maintain consistent performance on complex structured scenarios, such as hierarchical text classification (HTC), especially when the downstream data is extremely scarce. The… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 9 pages, 2 figures, Accepted by IJCAI2024

  40. arXiv:2407.08937  [pdf, other

    cs.CL cs.AI

    Self-Evolving GPT: A Lifelong Autonomous Experiential Learner

    Authors: Jinglong Gao, Xiao Ding, Yiming Cui, Jianbai Zhao, Hepeng Wang, Ting Liu, Bing Qin

    Abstract: To improve the performance of large language models (LLMs), researchers have explored providing LLMs with textual task-solving experience via prompts. However, they rely on manual efforts to acquire and apply such experience for each task, which is not feasible for the growing demand for LLMs and the variety of user questions. To address this issue, we design a lifelong autonomous experiential lea… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by ACL 2024 MAIN

  41. arXiv:2407.08639  [pdf, other

    cs.AI cs.LG

    $βべーた$-DPO: Direct Preference Optimization with Dynamic $βべーた$

    Authors: Junkang Wu, Yuexiang Xie, Zhengyi Yang, Jiancan Wu, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He

    Abstract: Direct Preference Optimization (DPO) has emerged as a compelling approach for training Large Language Models (LLMs) to adhere to human preferences. However, the performance of DPO is sensitive to the fine-tuning of its trade-off parameter $βべーた$, as well as to the quality of the preference data. We analyze the impact of $βべーた$ and data quality on DPO, uncovering that optimal $βべーた$ values vary with the inf… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  42. arXiv:2407.07880  [pdf, other

    cs.LG cs.AI cs.CL

    Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

    Authors: Junkang Wu, Yuexiang Xie, Zhengyi Yang, Jiancan Wu, Jiawei Chen, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He

    Abstract: This study addresses the challenge of noise in training datasets for Direct Preference Optimization (DPO), a method for aligning Large Language Models (LLMs) with human preferences. We categorize noise into pointwise noise, which includes low-quality data points, and pairwise noise, which encompasses erroneous data pair associations that affect preference rankings. Utilizing Distributionally Robus… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  43. arXiv:2407.06453  [pdf, ps, other

    math.RA

    Dual minus partial order

    Authors: Ju Gao, Hongxing Wang, Xiaoji Liu

    Abstract: In this paper, we introduce the Dual-minus partial order, get some characterizations of the partial order, and prove that both the dual star partial order and the dual sharp partial order are Dual-minus-type partial orders. Based on the Dual-minus partial order, we introduce the Dual-minus sharp partial order and the Dual-minus star partial order, which are also Dual-minus-type partial orders. In… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 23 pages

    MSC Class: 15A09; 15A24; 62G30

  44. arXiv:2407.06022  [pdf

    math.NA

    Investigation of microstructural evolution of irradiation-induced defects in tungsten: an experimental-numerical approach

    Authors: Salahudeen Mohamed, Qian Yuan, Dimitri Litvinov, Jie Gao, Ermile Gaganidze, Dmitry Terentyev, Hans-Christian Schneider, Jarir Aktaa

    Abstract: The hostile condition in a fusion tokomak reactor poses the main challenge in the development and design of in-vessel components such as divertor and breeding blanket due to fusion relevant irradiation conditions (14 MeV) and large thermal loads. The current work describes the employment of an integrated experimental-numerical approach to assess the microstructure evolution of dislocation loops an… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  45. arXiv:2407.05771  [pdf, other

    cs.CV

    Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction

    Authors: Tengjie Zhu, Zhuo Chen, Jingnan Gao, Yichao Yan, Xiaokang Yang

    Abstract: Inverse rendering methods have achieved remarkable performance in reconstructing high-fidelity 3D objects with disentangled geometries, materials, and environmental light. However, they still face huge challenges in reflective surface reconstruction. Although recent methods model the light trace to learn specularity, the ignorance of indirect illumination makes it hard to handle inter-reflections… ▽ More

    Submitted 7 August, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 10 pages,6 figures,NeurIPS 2024 Submitted

  46. arXiv:2407.05705  [pdf, other

    cs.AI

    Fast and Continual Knowledge Graph Embedding via Incremental LoRA

    Authors: Jiajun Liu, Wenjun Ke, Peng Wang, Jiahao Wang, Jinhua Gao, Ziyu Shang, Guozheng Li, Zijie Xu, Ke Ji, Yining Li

    Abstract: Continual Knowledge Graph Embedding (CKGE) aims to efficiently learn new knowledge and simultaneously preserve old knowledge. Dominant approaches primarily focus on alleviating catastrophic forgetting of old knowledge but neglect efficient learning for the emergence of new knowledge. However, in real-world scenarios, knowledge graphs (KGs) are continuously growing, which brings a significant chall… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Accepted by IJCAI2024

  47. arXiv:2407.04422  [pdf, other

    hep-ph hep-ex nucl-ex nucl-th

    Global analysis of fragmentation functions to charged hadrons with high-precision data from the LHC

    Authors: Jun Gao, ChongYang Liu, XiaoMin Shen, Hongxi Xing, Yuxiang Zhao

    Abstract: Fragmentation functions (FFs) are essential non-perturbative QCD inputs for predicting hadron production cross sections in high energy scatterings. In this study, we present a joint determination of FFs for light charged hadrons through a global analysis at next-to-leading order (NLO) in QCD. Our analysis incorporates a wide range of precision measurements from the LHC, as well as data from electr… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 44 pages, 45 figures

  48. arXiv:2407.03038  [pdf, other

    cs.CL cs.DC cs.LG

    On the Client Preference of LLM Fine-tuning in Federated Learning

    Authors: Feijie Wu, Xiaoze Liu, Haoyu Wang, Xingchen Wang, Jing Gao

    Abstract: Reinforcement learning with human feedback (RLHF) fine-tunes a pretrained large language model (LLM) using preference datasets, enabling the LLM to generate outputs that align with human preferences. Given the sensitive nature of these preference datasets held by various clients, there is a need to implement RLHF within a federated learning (FL) framework, where clients are reluctant to share thei… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Work in progress

  49. arXiv:2407.01875  [pdf, ps, other

    cs.AI

    Spatio-Temporal Graphical Counterfactuals: An Overview

    Authors: Mingyu Kang, Duxin Chen, Ziyuan Pu, Jianxi Gao, Wenwu Yu

    Abstract: Counterfactual thinking is a critical yet challenging topic for artificial intelligence to learn knowledge from data and ultimately improve their performances for new scenarios. Many research works, including Potential Outcome Model and Structural Causal Model, have been proposed to realize it. However, their modelings, theoretical foundations and application approaches are usually different. More… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  50. arXiv:2407.01414  [pdf, other

    cs.CV

    StyleShot: A Snapshot on Any Style

    Authors: Junyao Gao, Yanchen Liu, Yanan Sun, Yinhao Tang, Yanhong Zeng, Kai Chen, Cairong Zhao

    Abstract: In this paper, we show that, a good style representation is crucial and sufficient for generalized style transfer without test-time tuning. We achieve this through constructing a style-aware encoder and a well-organized style dataset called StyleGallery. With dedicated design for style learning, this style-aware encoder is trained to extract expressive style representation with decoupling training… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: project page:https://styleshot.github.io/