(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 3,506 results for author: Huang, Z

.
  1. arXiv:2407.08440  [pdf, other

    cs.CL cs.AI

    Beyond Instruction Following: Evaluating Rule Following of Large Language Models

    Authors: Wangtao Sun, Chenxiang Zhang, Xueyou Zhang, Ziyang Huang, Haotian Xu, Pei Chen, Shizhu He, Jun Zhao, Kang Liu

    Abstract: Although Large Language Models (LLMs) have demonstrated strong instruction-following ability to be helpful, they are further supposed to be controlled and guided by rules in real-world scenarios to be safe, and accurate in responses. This demands the possession of rule-following capability of LLMs. However, few works have made a clear evaluation of the rule-following capability of LLMs. Previous s… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.08297  [pdf, other

    quant-ph cond-mat.stat-mech

    The trade-off between diagonal and off-diagonal elements in the eigenstate thermalization hypothesis

    Authors: Zhiqiang Huang

    Abstract: In order to bypass the local measurement as an intermediate quantity in the proof of the eigenstate thermalization hypothesis (ETH), we have proposed a measurement-independent distinguishability measure. In this paper, we establish the connection between this measure and several other ETH measures in a more unified way. We first prove a universal trade-off relation for the diagonal and off-diagona… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 10 pages, 6 figures

  3. arXiv:2407.07717  [pdf, other

    stat.ME math.ST

    High-dimensional Covariance Estimation by Pairwise Likelihood Truncation

    Authors: Alessandro Casa, Davide Ferrari, Zhendong Huang

    Abstract: Pairwise likelihood offers a useful approximation to the full likelihood function for covariance estimation in high-dimensional context. It simplifies high-dimensional dependencies by combining marginal bivariate likelihood objects, thereby making estimation more manageable. In certain models, including the Gaussian model, both pairwise and full likelihoods are known to be maximized by the same pa… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 37 pages

  4. arXiv:2407.07614  [pdf, other

    cs.CV

    MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

    Authors: Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, LeiLei Gan, Hao Jiang

    Abstract: Auto-regressive models have made significant progress in the realm of language generation, yet they do not perform on par with diffusion models in the domain of image synthesis. In this work, we introduce MARS, a novel framework for T2I generation that incorporates a specially designed Semantic Vision-Language Integration Expert (SemVIE). This innovative component integrates pre-trained LLMs by in… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures

  5. arXiv:2407.07554  [pdf, other

    cs.GR cs.SD eess.AS

    Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation

    Authors: Zikai Huang, Xuemiao Xu, Cheng Xu, Huaidong Zhang, Chenxi Zheng, Jing Qin, Shengfeng He

    Abstract: Dance, as an art form, fundamentally hinges on the precise synchronization with musical beats. However, achieving aesthetically pleasing dance sequences from music is challenging, with existing methods often falling short in controllability and beat alignment. To address these shortcomings, this paper introduces Beat-It, a novel framework for beat-specific, key pose-guided dance generation. Unlike… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  6. arXiv:2407.06491  [pdf, other

    cs.CV

    VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model

    Authors: Xinhao Li, Zhenpeng Huang, Jing Wang, Kunchang Li, Limin Wang

    Abstract: With the growth of high-quality data and advancement in visual pre-training paradigms, Video Foundation Models (VFMs) have made significant progress recently, demonstrating their remarkable performance on traditional video understanding benchmarks. However, the existing benchmarks (e.g. Kinetics) and their evaluation protocols are often limited by relatively poor diversity, high evaluation costs,… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  7. arXiv:2407.06176  [pdf, other

    cs.CV eess.IV

    Contour-weighted loss for class-imbalanced image segmentation

    Authors: Zhhengyong Huang, Yao Sui

    Abstract: Image segmentation is critically important in almost all medical image analysis for automatic interpretations and processing. However, it is often challenging to perform image segmentation due to data imbalance between intra- and inter-class, resulting in over- or under-segmentation. Consequently, we proposed a new methodology to address the above issue, with a compact yet effective contour-weight… ▽ More

    Submitted 7 June, 2024; originally announced July 2024.

    Comments: ICIP 2024

  8. arXiv:2407.06127  [pdf, other

    cs.CV

    Better Sampling, towards Better End-to-end Small Object Detection

    Authors: Zile Huang, Chong Zhang, Mingyu Jin, Fangyu Wu, Chengzhi Liu, Xiaobo Jin

    Abstract: While deep learning-based general object detection has made significant strides in recent years, the effectiveness and efficiency of small object detection remain unsatisfactory. This is primarily attributed not only to the limited characteristics of such small targets but also to the high density and mutual overlap among these targets. The existing transformer-based small object detectors do not… ▽ More

    Submitted 17 May, 2024; originally announced July 2024.

    Comments: 14 pages, 5 figures

  9. arXiv:2407.05784  [pdf, other

    cs.AR

    Hecaton: Training and Finetuning Large Language Models with Scalable Chiplet Systems

    Authors: Zongle Huang, Shupei Fan, Chen Tang, Xinyuan Lin, Shuwen Deng, Yongpan Liu

    Abstract: Large Language Models (LLMs) have achieved remarkable success in various fields, but their training and finetuning require massive computation and memory, necessitating parallelism which introduces heavy communication overheads. Driven by advances in packaging, the chiplet architecture emerges as a potential solution, as it can integrate computing power, as well as utilize on-package links with be… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  10. arXiv:2407.05458  [pdf, other

    cs.AI

    A Survey of Models for Cognitive Diagnosis: New Developments and Future Directions

    Authors: Fei Wang, Weibo Gao, Qi Liu, Jiatong Li, Guanhao Zhao, Zheng Zhang, Zhenya Huang, Mengxiao Zhu, Shijin Wang, Wei Tong, Enhong Chen

    Abstract: Cognitive diagnosis has been developed for decades as an effective measurement tool to evaluate human cognitive status such as ability level and knowledge mastery. It has been applied to a wide range of fields including education, sport, psychological diagnosis, etc. By providing better awareness of cognitive status, it can serve as the basis for personalized services such as well-designed medical… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  11. arXiv:2407.05381  [pdf, ps, other

    cs.DS

    Online Matching: A Brief Survey

    Authors: Zhiyi Huang, Zhihao Gavin Tang, David Wajc

    Abstract: Matching, capturing allocation of items to unit-demand buyers, or tasks to workers, or pairs of collaborators, is a central problem in economics. Indeed, the growing prevalence of matching-based markets, many of which online in nature, has motivated much research in economics, operations research, computer science, and their intersection. This brief survey is meant as an introduction to the area o… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Also in SIGECOM Exchanges

  12. arXiv:2407.04779  [pdf, other

    cond-mat.quant-gas cond-mat.mes-hall cond-mat.stat-mech cond-mat.str-el

    Interaction-induced topological phase transition at finite temperature

    Authors: Ze-Min Huang, Sebastian Diehl

    Abstract: We demonstrate the existence of topological phase transitions in interacting, symmetry-protected quantum matter at finite temperatures. Using a combined numerical and analytical approach, we study a one-dimensional Su-Schrieffer-Heeger model with added Hubbard interactions, where no thermodynamic phase transition occurs at finite temperatures. The transition is signalled by a quantized, non-local… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 6+4 pages, 7 figures

  13. arXiv:2407.04334  [pdf, other

    cs.CV cs.LG

    Learning Geometric Invariant Features for Classification of Vector Polygons with Graph Message-passing Neural Network

    Authors: Zexian Huang, Kourosh Khoshelham, Martin Tomko

    Abstract: Geometric shape classification of vector polygons remains a non-trivial learning task in spatial analysis. Previous studies mainly focus on devising deep learning approaches for representation learning of rasterized vector polygons, whereas the study of discrete representations of polygons and subsequent deep learning approaches have not been fully investigated. In this study, we investigate a gra… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  14. arXiv:2407.04326  [pdf, other

    cs.CV

    LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes

    Authors: Zexian Huang, Kourosh Khoshelham, Gunditj Mirring Traditional Owners Corporation, Martin Tomko

    Abstract: Semantic segmentation of large-scale 3D landscape meshes is pivotal for various geospatial applications, including spatial analysis, automatic mapping and localization of target objects, and urban planning and development. This requires an efficient and accurate 3D perception system to understand and analyze real-world environments. However, traditional mesh segmentation methods face challenges in… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  15. arXiv:2407.04185  [pdf, other

    cs.CL

    HAF-RM: A Hybrid Alignment Framework for Reward Model Training

    Authors: Shujun Liu, Xiaoyu Shen, Yuhang Lai, Siyuan Wang, Shengbin Yue, Zengfeng Huang, Xuanjing Huang, Zhongyu Wei

    Abstract: The reward model has become increasingly important in alignment, assessment, and data construction for large language models (LLMs). Most existing researchers focus on enhancing reward models through data improvements, following the conventional training framework for reward models that directly optimizes the predicted rewards. In this paper, we propose a hybrid alignment framework HaF-RM for rewa… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  16. arXiv:2407.03408  [pdf, ps, other

    hep-th

    All Next-Next-to-Extremal One-Loop Correlators of AdS Supergluons and Supergravitons

    Authors: Zhongjie Huang, Bo Wang, Ellis Ye Yuan

    Abstract: We bootstrap all of the next-next-to-extremal one-loop four-point correlators of supergravitons and supergluons in ${\rm AdS_5}$ using a differential representation, and obtain closed formulas that are valid in both position space and Mellin space simultaneously.

    Submitted 8 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 10 pages + 1 Mathematica notebook

  17. Thermal and mechanical properties and the structural phase transition under pressure in $A$In$_2$As$_2$ ($A$ = Ca, Sr, Ba)

    Authors: Wen-Ti Guo, Zhigao Huang, Jian-Min Zhang

    Abstract: Experimental results that BaIn2As2 and Ca(Sr)In2As2, which are the same class of alkali metal compounds, belong to different structural phases have puzzled the current materials physics community. Here, we investigate the pressure-induced structural phase transition of AIn2As2 and its accompanying improvement in mechanical and thermal properties. Firstly, the structural stability of the materials… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 22 pages, 13 figures

    Journal ref: Phys. Rev. B 108, 094111 (2023)

  18. arXiv:2407.02888  [pdf, ps, other

    cs.LG cs.AI

    Joint Optimization of Resource Allocation and Data Selection for Fast and Cost-Efficient Federated Edge Learning

    Authors: Yunjian Jia, Zhen Huang, Jiping Yan, Yulu Zhang, Kun Luo, Wanli Wen

    Abstract: Deploying federated learning at the wireless edge introduces federated edge learning (FEEL). Given FEEL's limited communication resources and potential mislabeled data on devices, improper resource allocation or data selection can hurt convergence speed and increase training costs. Thus, to realize an efficient FEEL system, this paper emphasizes jointly optimizing resource allocation and data sele… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  19. arXiv:2407.02827  [pdf, ps, other

    cs.LG math.OC

    Convergence of Implicit Gradient Descent for Training Two-Layer Physics-Informed Neural Networks

    Authors: Xianliang Xu, Zhongyi Huang, Ye Li

    Abstract: Optimization algorithms is crucial in training physics-informed neural networks (PINNs), unsuitable methods may lead to poor solutions. Compared to the common gradient descent algorithm, implicit gradient descent (IGD) outperforms it in handling some multi-scale problems. In this paper, we provide convergence analysis for the implicit gradient descent for training over-parametrized two-layer PINNs… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  20. arXiv:2407.02319  [pdf, other

    cond-mat.mtrl-sci

    Catalogue of $C$-paired spin-valley locking in antiferromagnetic systems

    Authors: Mengli Hu, Xingkai Cheng, Zhenqiao Huang, Junwei Liu

    Abstract: Antiferromagnetic materials (AFMs) have been gaining lots of attentions due to its great potential in spintronics devices and the recently discovered novel spin structure in the momentum space, i.e., $C$-paired spin-valley or spin-momentum locking (CSVL), where spins and valleys/momenta are locked to each other due to the crystal symmetry guaranteeing zero magnetization. Here, we systematically st… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Main text: 9 pages, 4 figures

  21. arXiv:2407.02165  [pdf, other

    cs.CV

    WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation

    Authors: Zihao Huang, Shoukang Hu, Guangcong Wang, Tianqi Liu, Yuhang Zang, Zhiguo Cao, Wei Li, Ziwei Liu

    Abstract: Existing human datasets for avatar creation are typically limited to laboratory environments, wherein high-quality annotations (e.g., SMPL estimation from 3D scans or multi-view images) can be ideally provided. However, their annotating requirements are impractical for real-world images or videos, posing challenges toward real-world applications on current avatar creation methods. To this end, we… ▽ More

    Submitted 10 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Project page: https://wildavatar.github.io/

  22. arXiv:2407.02124  [pdf

    eess.SY

    Data-Driven Subsynchronous Oscillation Suppression for Renewable Energy Integrated Power Systems Based on Koopman Operator

    Authors: Zihan Wang, Ziyang Huang, Xiaonan Zhang, Gengyin Li, Le Zheng

    Abstract: Recently, subsynchronous oscillations (SSOs) have emerged frequently worldwide, with the high penetration of renewable power generation in modern power systems. The SSO introduced by renewables has become a prominent new stability problem, seriously threatening the stable operation of systems. This paper proposes a data-driven dynamic optimal controller for renewable energy integrated power system… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  23. arXiv:2407.01951  [pdf, other

    cs.CG cs.DS

    Spanner for the $0/1/\infty$ weighted region problem

    Authors: Joachim Gudmundsson, Zijin Huang, André van Renssen, Sampson Wong

    Abstract: We consider the problem of computing an approximate weighted shortest path in a weighted subdivision, with weights assigned from the set $\{0, 1, \infty\}$. We present a data structure $B$, which stores a set of convex, non-overlapping regions. These include zero-cost regions (0-regions) with a weight of $0$ and obstacles with a weight of $\infty$, all embedded in a plane with a weight of $1$. The… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  24. arXiv:2407.01886  [pdf, other

    cs.LG cs.AI

    Core Knowledge Learning Framework for Graph Adaptation and Scalability Learning

    Authors: Bowen Zhang, Zhichao Huang, Genan Dai, Guangning Xu, Xiaomao Fan, Hu Huang

    Abstract: Graph classification is a pivotal challenge in machine learning, especially within the realm of graph-based data, given its importance in numerous real-world applications such as social network analysis, recommendation systems, and bioinformatics. Despite its significance, graph classification faces several hurdles, including adapting to diverse prediction tasks, training across multiple target do… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  25. arXiv:2407.01619  [pdf, other

    cs.LG cs.AI cs.DB

    TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes

    Authors: Aamod Khatiwada, Harsha Kokel, Ibrahim Abdelaziz, Subhajit Chaudhury, Julian Dolby, Oktie Hassanzadeh, Zhenhan Huang, Tejaswini Pedapati, Horst Samulowitz, Kavitha Srinivas

    Abstract: Enterprises have a growing need to identify relevant tables in data lakes; e.g. tables that are unionable, joinable, or subsets of each other. Tabular neural models can be helpful for such data discovery tasks. In this paper, we present TabSketchFM, a neural tabular model for data discovery over data lakes. First, we propose a novel pre-training sketch-based approach to enhance the effectiveness o… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2307.04217

  26. arXiv:2407.01461  [pdf, other

    cs.CL

    Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement

    Authors: Zisu Huang, Xiaohua Wang, Feiran Zhang, Zhibo Xu, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang

    Abstract: The capacity of large language models (LLMs) to generate honest, harmless, and helpful responses heavily relies on the quality of user prompts. However, these prompts often tend to be brief and vague, thereby significantly limiting the full potential of LLMs. Moreover, harmful prompts can be meticulously crafted and manipulated by adversaries to jailbreak LLMs, inducing them to produce potentially… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  27. arXiv:2407.01231  [pdf, other

    cs.CL cs.AI

    MIRAI: Evaluating LLM Agents for Event Forecasting

    Authors: Chenchen Ye, Ziniu Hu, Yihe Deng, Zijie Huang, Mingyu Derek Ma, Yanqiao Zhu, Wei Wang

    Abstract: Recent advancements in Large Language Models (LLMs) have empowered LLM agents to autonomously collect world information, over which to conduct reasoning to solve complex problems. Given this capability, increasing interests have been put into employing LLM agents for predicting international events, which can influence decision-making and shape policy development on an international scale. Despite… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 66 pages, 8 figures, 6 tables; Website: https://mirai-llm.github.io/

  28. arXiv:2407.01155  [pdf, other

    cs.LG

    CPT: Consistent Proxy Tuning for Black-box Optimization

    Authors: Yuanyang He, Zitong Huang, Xinxing Xu, Rick Siow Mong Goh, Salman Khan, Wangmeng Zuo, Yong Liu, Chun-Mei Feng

    Abstract: Black-box tuning has attracted recent attention due to that the structure or inner parameters of advanced proprietary models are not accessible. Proxy-tuning provides a test-time output adjustment for tuning black-box language models. It applies the difference of the output logits before and after tuning a smaller white-box "proxy" model to improve the black-box model. However, this technique serv… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 10 pages,2 figures plus supplementary materials

  29. arXiv:2407.00915  [pdf, ps, other

    astro-ph.SR physics.space-ph

    The differences in the origination and properties of the near-Earth solar wind between solar cycles 23 and 24

    Authors: Xinzheng Shi, Hui Fu, Zhenghua Huang, Limei Yan, Chi Ma, Chenxi Huangfu, Hongqiang Song, Lidong Xia

    Abstract: The dependence of the sources and properties of the near-Earth solar wind on solar cycle activity is an important issue in solar and space physics. We use the improved two-step mapping procedure that takes into account the initial acceleration processes to trace the near-Earth solar winds back to their source regions from 1999 to 2020, covering solar cycles (SCs) 23 and 24. Then the solar wind is… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 21 pages, 11 figures

  30. arXiv:2407.00488  [pdf, other

    cs.CL cs.AI

    PFME: A Modular Approach for Fine-grained Hallucination Detection and Editing of Large Language Models

    Authors: Kunquan Deng, Zeyu Huang, Chen Li, Chenghua Lin, Min Gao, Wenge Rong

    Abstract: Large Language Models (LLMs) excel in fluency but risk producing inaccurate content, called "hallucinations." This paper outlines a standardized process for categorizing fine-grained hallucination types and proposes an innovative framework--the Progressive Fine-grained Model Editor (PFME)--specifically designed to detect and correct fine-grained hallucinations in LLMs. PFME consists of two collabo… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  31. arXiv:2407.00136  [pdf, other

    hep-ex

    Observation of the Electromagnetic Dalitz Transition $h_c \rightarrow e^+e^-ηいーた_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, S. Ahmed, M. Albrecht, R. Aliberti, A. Amoroso, M. R. An, Q. An, X. H. Bai, Y. Bai, O. Bakina, R. Baldini Ferroli, I. Balossino, Y. Ban, K. Begzsuren, N. Berger, M. Bertani, D. Bettoni, F. Bianchi, J. Bloms, A. Bortone, I. Boyko, R. A. Briere , et al. (495 additional authors not shown)

    Abstract: Using $(27.12\pm 0.14)\times10^8$ $ψぷさい(3686)$ decays and data samples of $e^+e^-$ collisions with $\sqrt{s}$ from 4.130 to 4.780~GeV collected with the BESIII detector, we report the first observation of the electromagnetic Dalitz transition $h_c\to e^+e^-ηいーた_c$ with a statistical significance of $5.4σしぐま$. We measure the ratio of the branching fractions… ▽ More

    Submitted 2 July, 2024; v1 submitted 28 June, 2024; originally announced July 2024.

  32. arXiv:2406.19640  [pdf, other

    cs.CV

    Efficient Event Stream Super-Resolution with Recursive Multi-Branch Fusion

    Authors: Quanmin Liang, Zhilin Huang, Xiawu Zheng, Feidiao Yang, Jun Peng, Kai Huang, Yonghong Tian

    Abstract: Current Event Stream Super-Resolution (ESR) methods overlook the redundant and complementary information present in positive and negative events within the event stream, employing a direct mixing approach for super-resolution, which may lead to detail loss and inefficiency. To address these issues, we propose an efficient Recursive Multi-Branch Information Fusion Network (RMFNet) that separates po… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Journal ref: International Joint Conference on Artificial Intelligence 2024

  33. arXiv:2406.19072  [pdf, other

    eess.SP

    Scatterer Recognition from LiDAR Point Clouds for Environment-Embedded Vehicular Channel Modeling via Synesthesia of Machines

    Authors: Ziwei Huang, Lu Bai, Zengrui Han, Xiang Cheng

    Abstract: In this paper, a novel environment-embedded vehicular channel model is proposed by scatterer recognition from light detection and ranging (LiDAR) point clouds via Synesthesia of Machines (SoM). To provide a robust data foundation, a new intelligent sensing-communication integration dataset in vehicular urban scenarios is constructed. Based on the constructed dataset, the complex SoM mechanism, i.e… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  34. arXiv:2406.19060  [pdf, other

    quant-ph cs.IT math-ph math.OC

    Semi-definite optimization of the measured relative entropies of quantum states and channels

    Authors: Zixin Huang, Mark M. Wilde

    Abstract: The measured relative entropies of quantum states and channels find operational significance in quantum information theory as achievable error rates in hypothesis testing tasks. They are of interest in the near term, as they correspond to hybrid quantum-classical strategies with technological requirements far less challenging to implement than required by the most general strategies allowed by qua… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 33 pages

  35. arXiv:2406.18916  [pdf, other

    cs.CL cs.AI

    TrustUQA: A Trustful Framework for Unified Structured Data Question Answering

    Authors: Wen Zhang, Long Jin, Yushan Zhu, Jiaoyan Chen, Zhiwei Huang, Junjie Wang, Yin Hua, Lei Liang, Huajun Chen

    Abstract: Natural language question answering (QA) over structured data sources such as tables and knowledge graphs (KGs) have been widely investigated, for example with Large Language Models (LLMs). The main solutions include question to formal query parsing and retrieval-based answer generation. However, current methods of the former often suffer from weak generalization, failing to dealing with multiple… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  36. arXiv:2406.18426  [pdf

    physics.med-ph

    Fast 3D 31P B1+ mapping with a weighted stack of spiral trajectory at 7 Tesla

    Authors: Mark Widmaier, Antonia Kaiser, Salome Baup, Daniel Wenz, Katarzyna Pierzchala, Ying Xiao, Zhiwei Huang, Yun Jiang, Lijing Xin

    Abstract: Purpose: Phosphorus Magnetic Resonance Spectroscopy (31P MRS) enables non-invasive assessment of energy metabolism, yet its application is hindered by sensitivity limitations. To overcome this, often high magnetic fields are used, leading to challenges such as spatial B_1^+ inhomogeneity and therefore the need for accurate flip angle determination in accelerated acquisitions with short repetition… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  37. arXiv:2406.18259  [pdf, other

    cs.CL cs.AI

    Detecting Machine-Generated Texts: Not Just "AI vs Humans" and Explainability is Complicated

    Authors: Jiazhou Ji, Ruizhe Li, Shujun Li, Jie Guo, Weidong Qiu, Zheng Huang, Chiyu Chen, Xiaoyu Jiang, Xinru Lu

    Abstract: As LLMs rapidly advance, increasing concerns arise regarding risks about actual authorship of texts we see online and in real world. The task of distinguishing LLM-authored texts is complicated by the nuanced and overlapping behaviors of both machines and humans. In this paper, we challenge the current practice of considering LLM-generated text detection a binary classification task of differentia… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 19 pages, 2 figures

  38. arXiv:2406.18219  [pdf, other

    cs.CL cs.LG

    A Closer Look into Mixture-of-Experts in Large Language Models

    Authors: Ka Man Lo, Zeyu Huang, Zihan Qiu, Zili Wang, Jie Fu

    Abstract: Mixture-of-experts (MoE) is gaining increasing attention due to its unique properties and remarkable performance, especially for language tasks. By sparsely activating a subset of parameters for each token, MoE architecture could increase the model size without sacrificing computational efficiency, achieving a better trade-off between performance and training costs. However, the underlying mechani… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  39. arXiv:2406.18167  [pdf, other

    astro-ph.HE

    H.E.S.S. observations of the 2021 periastron passage of PSR B1259-63/LS 2883

    Authors: H. E. S. S. Collaboration, F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, V. Barbosa Martins, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, J. Borowska, M. Bouyahiaoui, R. Brose, A. Brown, F. Brun, B. Bruno, T. Bulik, C. Burger-Scheidlin, S. Caroff, S. Casanova , et al. (119 additional authors not shown)

    Abstract: PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ day… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: accepted to A&A

  40. Start from Zero: Triple Set Prediction for Automatic Knowledge Graph Completion

    Authors: Wen Zhang, Yajing Xu, Peng Ye, Zhiwei Huang, Zezhong Xu, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graph (KG) completion aims to find out missing triples in a KG. Some tasks, such as link prediction and instance completion, have been proposed for KG completion. They are triple-level tasks with some elements in a missing triple given to predict the missing element of the triple. However, knowing some elements of the missing triple in advance is not always a realistic setting. In this p… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Paper accepted by TKDE in 2024

  41. arXiv:2406.18007  [pdf, other

    cs.MM

    Deep Mamba Multi-modal Learning

    Authors: Jian Zhu, Xin Zou, Yu Cui, Zhangmin Huang, Chenshu Hu, Bo Lyu

    Abstract: Inspired by the excellent performance of Mamba networks, we propose a novel Deep Mamba Multi-modal Learning (DMML). It can be used to achieve the fusion of multi-modal features. We apply DMML to the field of multimedia retrieval and propose an innovative Deep Mamba Multi-modal Hashing (DMMH) method. It combines the advantages of algorithm accuracy and inference speed. We validated the effectivenes… ▽ More

    Submitted 9 April, 2024; originally announced June 2024.

    Comments: Deep Mamba Multi-modal Learning; Deep Mamba Multi-modal Hashing

  42. arXiv:2406.17335  [pdf, other

    cs.IR cs.LG

    A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems

    Authors: Hung Vinh Tran, Tong Chen, Quoc Viet Hung Nguyen, Zi Huang, Lizhen Cui, Hongzhi Yin

    Abstract: Since the creation of the Web, recommender systems (RSs) have been an indispensable mechanism in information filtering. State-of-the-art RSs primarily depend on categorical features, which ecoded by embedding vectors, resulting in excessively large embedding tables. To prevent over-parameterized embedding tables from harming scalability, both academia and industry have seen increasing efforts in c… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  43. arXiv:2406.17263  [pdf, other

    cs.LG math.DS math.NA

    Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient Flows

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

    Abstract: In this paper, we study efficient approximate sampling for probability distributions known up to normalization constants. We specifically focus on a problem class arising in Bayesian inference for large-scale inverse problems in science and engineering applications. The computational challenges we address with the proposed methodology are: (i) the need for repeated evaluations of expensive forward… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 42 pages, 9 figures

  44. arXiv:2406.17245  [pdf, other

    cs.LG cs.AI cs.CL

    Unlocking Continual Learning Abilities in Language Models

    Authors: Wenyu Du, Shuang Cheng, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu

    Abstract: Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing approaches usually address the issue by incorporating old task data or task-wise inductive bias into LMs. However, old data and accurate task informa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: preprint, 19 pages

  45. arXiv:2406.16772  [pdf, other

    cs.CL cs.AI

    OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

    Authors: Zhen Huang, Zengzhi Wang, Shijie Xia, Pengfei Liu

    Abstract: In this report, we pose the following question: Who is the most intelligent AI model to date, as measured by the OlympicArena (an Olympic-level, multi-discipline, multi-modal benchmark for superintelligent AI)? We specifically focus on the most recently released models: Claude-3.5-Sonnet, Gemini-1.5-Pro, and GPT-4o. For the first time, we propose using an Olympic medal Table approach to rank AI mo… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 10 pages

  46. arXiv:2406.16101  [pdf, ps, other

    math.CO

    Extremal digraphs containing at most $t$ paths of length 2 with the same endpoints

    Authors: Zejun Huang, Zhenhua Lyu

    Abstract: Given a positive integer $t$, let $P_{t,2}$ be the digraph consisting of $t$ directed paths of length 2 with the same initial and terminal vertices. In this paper, we study the maximum size of $P_{t+1,2}$-free digraphs of order $n$, which is denoted by $ex(n, P_{t+1,2})$. For sufficiently large $n$, we prove that $ex(n, P_{t+1})=g(n,t)$ when $\lfloor(n-t)/{2} \rfloor$ is odd and… ▽ More

    Submitted 27 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  47. arXiv:2406.15349  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

    Authors: Daniel Dauner, Marcel Hallgarten, Tianyu Li, Xinshuo Weng, Zhiyu Huang, Zetong Yang, Hongyang Li, Igor Gilitschenski, Boris Ivanovic, Marco Pavone, Andreas Geiger, Kashyap Chitta

    Abstract: Benchmarking vision-based driving policies is challenging. On one hand, open-loop evaluation with real data is easy, but these results do not reflect closed-loop performance. On the other, closed-loop evaluation is possible in simulation, but is hard to scale due to its significant computational demands. Further, the simulators available today exhibit a large domain gap to real data. This has resu… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  48. arXiv:2406.15222  [pdf

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, Jingyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan , et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: under peer review

  49. arXiv:2406.14931  [pdf, other

    eess.SP

    Multi-beam Training for Near-field Communications in High-frequency Bands

    Authors: Cong Zhou, Changsheng You, Zixuan Huang, Shuo Shi, Yi Gong, Chan-Byoung Chae, Kaibin Huang

    Abstract: In this paper, we study efficient multi-beam training design for near-field communications to reduce the beam training overhead of conventional single-beam training methods. In particular, the array-division based multi-beam training method, which is widely used in far-field communications, cannot be directly applied to the near-field scenario, since different sub-arrays may observe different user… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: In this paper, a novel near-field multi-beam training scheme is proposed by sparsely activating a portion of antennas to form a sparse linear array

  50. arXiv:2406.14924  [pdf, other

    cs.CV

    DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection

    Authors: Jia Syuen Lim, Zhuoxiao Chen, Mahsa Baktashmotlagh, Zhi Chen, Xin Yu, Zi Huang, Yadan Luo

    Abstract: Class-agnostic object detection (OD) can be a cornerstone or a bottleneck for many downstream vision tasks. Despite considerable advancements in bottom-up and multi-object discovery methods that leverage basic visual cues to identify salient objects, consistently achieving a high recall rate remains difficult due to the diversity of object types and their contextual complexity. In this work, we in… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 19 pages