Search | arXiv e-print repository

Showing 1–50 of 212 results for author: Geng, S

  1. arXiv:2407.01651  [pdf, other]

    physics.flu-dyn

    Phase-field modeling of dendritic growth with gas bubbles in the solidification of binary alloys

    Authors: Chengjie Zhan, Zhenhua Chai, Dongke Sun, Baochang Shi, Shaoning Geng, Ping Jiang

    Abstract: In this work, a phase-field model is developed for the dendritic growth with gas bubbles in the solidification of binary alloys. In this model, a total free energy for the complex gas-liquid-dendrite system is proposed through considering the interactions of gas bubbles, liquid melt and solid dendrites, and it can reduce to the energy for gas-liquid flows in the region far from the solid phase, wh…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 29 pages, 23 figures

  2. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  3. arXiv:2406.05184  [pdf, other]

    cs.CV

    The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better

    Authors: Scott Geng, Cheng-Yu Hsieh, Vivek Ramanujan, Matthew Wallingford, Chun-Liang Li, Pang Wei Koh, Ranjay Krishna

    Abstract: Generative text-to-image models enable us to synthesize unlimited amounts of images in a controllable manner, spurring many recent efforts to train vision models with synthetic data. However, every synthetic image ultimately originates from the upstream data used to train the generator. What additional value does the intermediate generator provide over directly training on relevant parts of the up…

    Submitted 3 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Correspondence to sgeng at cs dot washington dot edu. RK and PWK equally advised the project

  4. arXiv:2405.11826  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To…

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  5. arXiv:2405.07691  [pdf, other]

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  6. arXiv:2405.05945  [pdf, other]

    cs.CV

    Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

    Authors: Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li

    Abstract: Sora unveils the potential of scaling Diffusion Transformer for generating photorealistic images and videos at arbitrary resolutions, aspect ratios, and durations, yet it still lacks sufficient implementation details. In this technical report, we introduce the Lumina-T2X family - a series of Flow-based Large Diffusion Transformers (Flag-DiT) equipped with zero-initialized attention, as a unified f…

    Submitted 13 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Technical Report; Code at: https://github.com/Alpha-VLLM/Lumina-T2X

  7. arXiv:2405.03239  [pdf, other]

    cs.LG cs.AI

    Deep Learning for Detecting and Early Predicting Chronic Obstructive Pulmonary Disease from Spirogram Time Series: A UK Biobank Study

    Authors: Shuhao Mei, Yuxi Zhou, Jiahao Xu, Yuxuan Wan, Shan Cao, Qinghao Zhao, Shijia Geng, Junqing Xie, Shenda Hong

    Abstract: Chronic Obstructive Pulmonary Disease (COPD) is a chronic inflammatory lung condition that causes airflow obstruction. The existing methods can only detect patients who already have COPD based on obvious features shown in the spirogram (In this article, the spirogram specifically involves measuring Volume-Flow curve time series). Early prediction of COPD risk is vital for monitoring COPD disease p…

    Submitted 6 May, 2024; originally announced May 2024.

  8. arXiv:2404.09399  [pdf, ps, other]

    quant-ph

    Characterizing Kirkwood-Dirac positive states based on discrete Fourier transform

    Authors: Ying-Hui Yang, Shuang Yao, Shi-Jiao Geng, Xiao-Li Wang, Pei-Ying Chen

    Abstract: Kirkwood-Dirac (KD) distribution is helpful to describe nonclassical phenomena and quantum advantages, which have been linked with nonpositive entries of KD distribution. Suppose that $\mathcal{A}$ and $\mathcal{B}$ are the eigenprojectors of the two eigenbases of two observables and the discrete Fourier transform (DFT) matrix is the transition matrix between the two eigenbases. In a system with p…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 24 pages

  9. arXiv:2404.07940  [pdf, other]

    cs.SE cs.LG

    InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models

    Authors: Linyi Li, Shijie Geng, Zhenwen Li, Yibo He, Hao Yu, Ziyue Hua, Guanghan Ning, Siwei Wang, Tao Xie, Hongxia Yang

    Abstract: Large Language Models for code (code LLMs) have witnessed tremendous progress in recent years. With the rapid development of code LLMs, many popular evaluation benchmarks, such as HumanEval, DS-1000, and MBPP, have emerged to measure the performance of code LLMs with a particular focus on code generation tasks. However, they are insufficient to cover the full range of expected capabilities of code…

    Submitted 27 June, 2024; v1 submitted 10 March, 2024; originally announced April 2024.

    Comments: 30 pages, 10 pages for main content, work in progress

  10. arXiv:2404.04801  [pdf, ps, other]

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with…

    Submitted 7 April, 2024; originally announced April 2024.

  11. arXiv:2403.11405  [pdf, other]

    eess.SP

    A Deep Learning Method for Beat-Level Risk Analysis and Interpretation of Atrial Fibrillation Patients during Sinus Rhythm

    Authors: Jun Lei, Yuxi Zhou, Xue Tian, Qinghao Zhao, Qi Zhang, Shijia Geng, Qingbo Wu, Shenda Hong

    Abstract: Atrial Fibrillation (AF) is a common cardiac arrhythmia. Many AF patients experience complications such as stroke and other cardiovascular issues. Early detection of AF is crucial. Existing algorithms can only distinguish ``AF rhythm in AF patients'' from ``sinus rhythm in normal individuals'' . However, AF patients do not always exhibit AF rhythm, posing a challenge for diagnosis when the AF rhyt…

    Submitted 17 March, 2024; originally announced March 2024.

  12. arXiv:2403.11183  [pdf, other]

    cs.CL

    Decoding Continuous Character-based Language from Non-invasive Brain Recordings

    Authors: Cenyuan Zhang, Xiaoqing Zheng, Ruicheng Yin, Shujie Geng, Jianhan Xu, Xuan Gao, Changze Lv, Zixuan Ling, Xuanjing Huang, Miao Cao, Jianfeng Feng

    Abstract: Deciphering natural language from brain activity through non-invasive devices remains a formidable challenge. Previous non-invasive decoders either require multiple experiments with identical stimuli to pinpoint cortical regions and enhance signal-to-noise ratios in brain activity, or they are limited to discerning basic linguistic elements such as letters and words. We propose a novel approach to…

    Submitted 19 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  13. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at…

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  14. arXiv:2403.06011  [pdf, other]

    cs.LG math.OC

    Reinforcement Learning Paycheck Optimization for Multivariate Financial Goals

    Authors: Melda Alaluf, Giulia Crippa, Sinong Geng, Zijian Jing, Nikhil Krishnan, Sanjeev Kulkarni, Wyatt Navarro, Ronnie Sircar, Jonathan Tang

    Abstract: We study paycheck optimization, which examines how to allocate income in order to achieve several competing financial goals. For paycheck optimization, a quantitative methodology is missing, due to a lack of a suitable problem formulation. To deal with this issue, we formulate the problem as a utility maximization problem. The proposed formulation is able to (i) unify different financial goals; (i…

    Submitted 9 March, 2024; originally announced March 2024.

    Journal ref: Risk and Decision Analysis, Volume 9, 2023

  15. arXiv:2402.05935  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

    Authors: Dongyang Liu, Renrui Zhang, Longtian Qiu, Siyuan Huang, Weifeng Lin, Shitian Zhao, Shijie Geng, Ziyi Lin, Peng Jin, Kaipeng Zhang, Wenqi Shao, Chao Xu, Conghui He, Junjun He, Hao Shao, Pan Lu, Hongsheng Li, Yu Qiao, Peng Gao

    Abstract: We propose SPHINX-X, an extensive Multimodality Large Language Model (MLLM) series developed upon SPHINX. To improve the architecture and training efficiency, we modify the SPHINX framework by removing redundant visual encoders, bypassing fully-padded sub-images with skip tokens, and simplifying multi-stage training into a one-stage all-in-one paradigm. To fully unleash the potential of MLLMs, we…

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024. Code and models are released at https://github.com/Alpha-VLLM/LLaMA2-Accessory

  16. arXiv:2402.02968  [pdf, other]

    cs.CV cs.LG

    Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

    Authors: Sheng Luo, Wei Chen, Wanxin Tian, Rui Liu, Luanxuan Hou, Xiubao Zhang, Haifeng Shen, Ruiqi Wu, Shuyi Geng, Yi Zhou, Ling Shao, Yi Yang, Bojun Gao, Qun Li, Guobin Wu

    Abstract: Foundation models have indeed made a profound impact on various fields, emerging as pivotal components that significantly shape the capabilities of intelligent systems. In the context of intelligent vehicles, leveraging the power of foundation models has proven to be transformative, offering notable advancements in visual understanding. Equipped with multi-modal and multi-task learning capabilitie…

    Submitted 26 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to IEEE Transactions on Intelligent Vehicles(T-IV). 24 pages, 9 figures, 1 table

  17. arXiv:2401.12783  [pdf, other]

    cs.AI cs.LG eess.SP

    A Review of Deep Learning Methods for Photoplethysmography Data

    Authors: Guangkun Nie, Jiabao Zhu, Gongzheng Tang, Deyun Zhang, Shijia Geng, Qinghao Zhao, Shenda Hong

    Abstract: Photoplethysmography (PPG) is a highly promising device due to its advantages in portability, user-friendly operation, and non-invasive capabilities to measure a wide range of physiological information. Recent advancements in deep learning have demonstrated remarkable outcomes by leveraging PPG signals for tasks related to personal health management and other multifaceted applications. In this rev…

    Submitted 23 January, 2024; originally announced January 2024.

  18. arXiv:2401.09967  [pdf, other]

    cs.CL

    Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access

    Authors: Saibo Geng, Berkay Döner, Chris Wendler, Martin Josifoski, Robert West

    Abstract: Constrained decoding, a technique for enforcing constraints on language model outputs, offers a way to control text generation without retraining or architectural modifications. Its application is, however, typically restricted to models that give users access to next-token distributions (usually via softmax logits), which poses a limitation with blackbox large language models (LLMs). This paper i…

    Submitted 2 July, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 Main Conference

  19. arXiv:2401.05158  [pdf, ps, other]

    math.RT

    $τ$-tilting graphs and quotients

    Authors: Changjian Fu, Shengfei Geng, Pin Liu

    Abstract: We investigate $τ$-tilting graphs of algebras and their quotient algebras, and obtain a sufficient condition for the connectivity of $τ$-tilting graphs to be maintained in quotient algebras. It is worth pointing out that $g$-tame algebras satisfy this condition. As a consequence, we newly obtain a large class of algebras whose $τ$-tilting graphs are connected, including in particular the quotient…

    Submitted 10 January, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 9 pages

  20. arXiv:2310.17082  [pdf, ps, other]

    astro-ph.HE

    Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE;…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 3 figures, Accepted by the APJL

  21. Isospin-dependence of the charge-changing cross-section shaped by the charged-particle evaporation process

    Authors: J. W. Zhao, B. -H. Sun, I. Tanihata, S. Terashima, A. Prochazka, J. Y. Xu, L. H. Zhu, J. Meng, J. Su, K. Y. Zhang, L. S. Geng, L. C. He, C. Y. Liu, G. S. Li, C. G. Lu, W. J. Lin, W. P. Lin, Z. Liu, P. P Ren, Z. Y. Sun, F. Wang, J. Wang, M. Wang, S. T. Wang, X. L. Wei , et al. (4 additional authors not shown)

    Abstract: We present the charge-changing cross sections (CCCS) of $^{11-15}$C, $^{13-17}$N, and $^{15,17-18}$O at around 300 MeV/nucleon on a carbon target, which extends to $p$-shell isotopes with $N < Z$ for the first time. The Glauber model, which considers only the proton distribution of projectile nuclei, underestimates the cross sections by more than 10\%. We show that this discrepancy can be resolved…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 5 figures

    Journal ref: Phys. Lett. B 847 (2023) 138269

  22. Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

    Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the t…

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 49 pages, 11 figures

    Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

  23. arXiv:2310.02208  [pdf, other]

    eess.SY

    An Integer Clustering Approach for Modeling Large-Scale EV Fleets with Guaranteed Performance

    Authors: Sijia Geng, Thomas Lee, Dharik Mallapragada, Audun Botterud

    Abstract: Large-scale integration of electric vehicles (EVs) leads to a tighter integration between transportation and electric energy systems. In this paper, we develop a novel integer-clustering approach to model a large number of EVs that manages vehicle charging and energy at the fleet level yet maintain individual trip dispatch. The model is then used to develop a spatially and temporally-resolved deci…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 8 pages, 4 figures

  24. arXiv:2310.01911  [pdf, ps, other]

    eess.SY

    Approximating Voltage Stability Boundary Under High Variability of Renewables Using Differential Geometry

    Authors: Dan Wu, Franz-Erich Wolter, Sijia Geng

    Abstract: This paper proposes a novel method rooted in differential geometry to approximate the voltage stability boundary of power systems under high variability of renewable generation. We extract intrinsic geometric information of the power flow solution manifold at a given operating point. Specifically, coefficients of the Levi-Civita connection are constructed to approximate the geodesics of the manifo…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 8 pages, 9 figures

  25. arXiv:2309.15940  [pdf, other]

    cs.RO cs.CV

    Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs

    Authors: Haonan Chang, Kowndinya Boyalakuntla, Shiyang Lu, Siwei Cai, Eric Jing, Shreesh Keskar, Shijie Geng, Adeeb Abbas, Lifeng Zhou, Kostas Bekris, Abdeslam Boularias

    Abstract: We present an Open-Vocabulary 3D Scene Graph (OVSG), a formal framework for grounding a variety of entities, such as object instances, agents, and regions, with free-form text-based queries. Unlike conventional semantic-based object localization approaches, our system facilitates context-aware entity localization, allowing for queries such as ``pick up a cup on a kitchen table" or ``navigate to a…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: The code and dataset used for evaluation can be found at https://github.com/changhaonan/OVSG. This paper has been accepted by CoRL 2023

  26. arXiv:2308.03312  [pdf, other]

    cs.LG cs.CR cs.PL

    Exploiting Code Symmetries for Learning Program Semantics

    Authors: Kexin Pei, Weichen Li, Qirui Jin, Shuyang Liu, Scott Geng, Lorenzo Cavallaro, Junfeng Yang, Suman Jana

    Abstract: This paper tackles the challenge of teaching code semantics to Large Language Models (LLMs) for program analysis by incorporating code symmetries into the model architecture. We introduce a group-theoretic framework that defines code symmetries as semantics-preserving transformations, where forming a code symmetry group enables precise and efficient reasoning of code semantics. Our solution, SymC,…

    Submitted 6 June, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

  27. arXiv:2308.01285  [pdf, other]

    cs.AI cs.HC

    Flows: Building Blocks of Reasoning and Collaborating AI

    Authors: Martin Josifoski, Lars Klein, Maxime Peyrard, Nicolas Baldwin, Yifei Li, Saibo Geng, Julian Paul Schnitzler, Yuxing Yao, Jiheng Wei, Debjit Paul, Robert West

    Abstract: Recent advances in artificial intelligence (AI) have produced highly capable and controllable systems. This creates unprecedented opportunities for structured reasoning as well as collaboration among multiple AI systems and humans. To fully realize this potential, it is essential to develop a principled way of designing and studying such structured interactions. For this purpose, we introduce the…

    Submitted 7 February, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  28. arXiv:2306.16046  [pdf, other]

    cs.RO

    Robo-centric ESDF: A Fast and Accurate Whole-body Collision Evaluation Tool for Any-shape Robotic Planning

    Authors: Shuang Geng, Qianhao Wang, Lei Xie, Chao Xu, Yanjun Cao, Fei Gao

    Abstract: For letting mobile robots travel flexibly through complicated environments, increasing attention has been paid to the whole-body collision evaluation. Most existing works either opt for the conservative corridor-based methods that impose strict requirements on the corridor generation, or ESDF-based methods that suffer from high computational overhead. It is still a great challenge to achieve fast…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at IROS 2023

  29. arXiv:2306.06691  [pdf, other]

    cs.CV

    Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models

    Authors: Yuguang Yang, Yiming Wang, Shupeng Geng, Runqi Wang, Yimi Wang, Sheng Wu, Baochang Zhang

    Abstract: The emergence of cross-modal foundation models has introduced numerous approaches grounded in text-image retrieval. However, on some domain-specific retrieval tasks, these models fail to focus on the key attributes required. To address this issue, we propose a self-enhancement framework, A^{3}R, based on the CLIP-ViT/G-14, one of the largest cross-modal models. First, we perform an Attribute Augme…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted by CVPR 2023 Workshop

  30. arXiv:2306.03833  [pdf]

    cs.LG

    Predicting Consultation Success in Online Health Platforms Using Dynamic Knowledge Networks and Multimodal Data Fusion

    Authors: Shuang Geng, Wenli Zhang, Jiaheng Xie, Gemin Liang, Ben Niu, Sudha Ram

    Abstract: Online healthcare consultation in virtual health is an emerging industry marked by innovation and fierce competition. Accurate and timely prediction of healthcare consultation success can proactively help online platforms address patient concerns and improve retention rates. However, predicting online consultation success is challenging due to the partial role of virtual consultations in patients'…

    Submitted 14 June, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    MSC Class: K.5; ACM Class: H.4.m

  31. arXiv:2306.00321  [pdf, other]

    cs.LG

    Improving Offline RL by Blending Heuristics

    Authors: Sinong Geng, Aldo Pacchiano, Andrey Kolobov, Ching-An Cheng

    Abstract: We propose Heuristic Blending (HUBL), a simple performance-improving technique for a broad class of offline RL algorithms based on value bootstrapping. HUBL modifies the Bellman operators used in these algorithms, partially replacing the bootstrapped values with heuristic ones that are estimated with Monte-Carlo returns. For trajectories with higher returns, HUBL relies more on the heuristic value…

    Submitted 15 March, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

  32. arXiv:2305.17030  [pdf, other]

    astro-ph.HE hep-ph

    The First LHAASO Catalog of Gamma-Ray Sources

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022.…

    Submitted 27 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 40 pages, 13 figures, 4 tables

    Journal ref: The Astrophysical Journal Supplement Series, 271 (2024) 25

  33. arXiv:2305.14302  [pdf, other]

    cs.IR cs.AI cs.HC cs.LG cs.MM

    VIP5: Towards Multimodal Foundation Models for Recommendation

    Authors: Shijie Geng, Juntao Tan, Shuchang Liu, Zuohui Fu, Yongfeng Zhang

    Abstract: Computer Vision (CV), Natural Language Processing (NLP), and Recommender Systems (RecSys) are three prominent AI applications that have traditionally developed independently, resulting in disparate modeling and engineering methodologies. This has impeded the ability for these fields to directly benefit from each other's advancements. With the recent development of foundation models, large language…

    Submitted 14 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by EMNLP 2023

  34. arXiv:2305.13971  [pdf, other]

    cs.CL cs.AI cs.LG

    Grammar-Constrained Decoding for Structured NLP Tasks without Finetuning

    Authors: Saibo Geng, Martin Josifoski, Maxime Peyrard, Robert West

    Abstract: Despite their impressive performance, large language models (LMs) still struggle with reliably generating complex output structures when not finetuned to follow the required output format exactly. To address this issue, grammar-constrained decoding (GCD) can be used to control the generation of LMs, guaranteeing that the output follows a given structure. Most existing GCD methods are, however, lim…

    Submitted 18 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023 Main Conference

  35. Measurement of ultra-high-energy diffuse gamma-ray emission of the Galactic plane from 10 TeV to 1 PeV with LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The diffuse Galactic $γ$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γ$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer ar…

    Submitted 19 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages, 8 figures, 5 tables; accepted for publication in Physical Review Letters; source mask file provided as ancillary file

    Journal ref: Phys. Rev. Lett. 131, 151001 (2023)

  36. arXiv:2304.15010  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.MM

    LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model

    Authors: Peng Gao, Jiaming Han, Renrui Zhang, Ziyi Lin, Shijie Geng, Aojun Zhou, Wei Zhang, Pan Lu, Conghui He, Xiangyu Yue, Hongsheng Li, Yu Qiao

    Abstract: How to efficiently transform large language models (LLMs) into instruction followers is recently a popular research direction, while training LLM for multi-modal reasoning remains less explored. Although the recent LLaMA-Adapter demonstrates the potential to handle visual inputs with LLMs, it still cannot generalize well to open-ended visual instructions and lags behind GPT-4. In this paper, we pr…

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Code and models are available at https://github.com/ZrrSkywalker/LLaMA-Adapter

  37. arXiv:2304.04916  [pdf, other]

    cs.LG stat.ML

    A Data-Driven State Aggregation Approach for Dynamic Discrete Choice Models

    Authors: Sinong Geng, Houssam Nassif, Carlos A. Manzanares

    Abstract: We study dynamic discrete choice models, where a commonly studied problem involves estimating parameters of agent reward functions (also known as "structural" parameters), using agent behavioral data. Maximum likelihood estimation for such models requires dynamic programming, which is limited by the curse of dimensionality. In this work, we present a novel algorithm that provides a data-driven met…

    Submitted 31 May, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Journal ref: The Conference on Uncertainty in Artificial Intelligence (UAI'23), Pittsburgh, PA, pp. 647-657, 2023

  38. Characterizing Kirkwood-Dirac nonclassicality and uncertainty diagram based on discrete Fourier transform

    Authors: Ying-Hui Yang, Bing-Bing Zhang, Xiao-Li Wang, Shi-Jiao Geng, Pei-Ying Chen

    Abstract: In this paper, we investigate the Kirkwood-Dirac nonclassicality and uncertainty diagram based on discrete Fourier transform (DFT) in a $d$ dimensional system. The uncertainty diagram of complete incompatibility bases $\mathcal {A},\mathcal {B}$ are characterized by De Bièvre [arXiv: 2207.07451]. We show that for the uncertainty diagram of the DFT matrix which is a transition matrix from basis… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Journal ref: Entropy, 25, 1075 (2023)

  39. arXiv:2303.15790  [pdf, other]

    hep-ex hep-ph physics.ins-det

    STCF Conceptual Design Report: Volume 1 -- Physics & Detector

    Authors: M. Achasov, X. C. Ai, R. Aliberti, L. P. An, Q. An, X. Z. Bai, Y. Bai, O. Bakina, A. Barnyakov, V. Blinov, V. Bobrovnikov, D. Bodrov, A. Bogomyagkov, A. Bondar, I. Boyko, Z. H. Bu, F. M. Cai, H. Cai, J. J. Cao, Q. H. Cao, Z. Cao, Q. Chang, K. T. Chao, D. Y. Chen, H. Chen , et al. (413 additional authors not shown)

    Abstract: The Super $τ$-Charm facility (STCF) is an electron-positron collider proposed by the Chinese particle physics community. It is designed to operate in a center-of-mass energy range from 2 to 7 GeV with a peak luminosity of $0.5\times 10^{35}{\rm cm}^{-2}{\rm s}^{-1}$ or higher. The STCF will produce a data sample about a factor of 100 larger than that by the present $τ$-Charm factory -- the BEPCII,… ▽ More

    Submitted 5 October, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

    Journal ref: Front. Phys. 19(1), 14701 (2024)

  40. arXiv:2303.14865  [pdf, other]

    cs.CV

    Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens

    Authors: Yuxiao Chen, Jianbo Yuan, Yu Tian, Shijie Geng, Xinyu Li, Ding Zhou, Dimitris N. Metaxas, Hongxia Yang

    Abstract: Contrastive learning-based vision-language pre-training approaches, such as CLIP, have demonstrated great success in many vision-language tasks. These methods achieve cross-modal alignment by encoding a matched image-text pair with similar feature embeddings, which are generated by aggregating information from visual patches and language tokens. However, direct aligning cross-modal information usi… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  41. arXiv:2303.02995  [pdf, other]

    cs.CV cs.CL cs.LG

    HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention

    Authors: Shijie Geng, Jianbo Yuan, Yu Tian, Yuxiao Chen, Yongfeng Zhang

    Abstract: The success of large-scale contrastive vision-language pretraining (CLIP) has benefited both visual recognition and multimodal content understanding. The concise design brings CLIP the advantage in inference efficiency against other vision-language models with heavier cross-attention fusion layers, making it a popular choice for a wide spectrum of downstream tasks. However, CLIP does not explicitl… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at ICLR 2023

  42. arXiv:2302.10301  [pdf, other]

    cs.CV cs.AI

    Artificial Intelligence System for Detection and Screening of Cardiac Abnormalities using Electrocardiogram Images

    Authors: Deyun Zhang, Shijia Geng, Yang Zhou, Weilun Xu, Guodong Wei, Kai Wang, Jie Yu, Qiang Zhu, Yongkui Li, Yonghong Zhao, Xingyue Chen, Rui Zhang, Zhaoji Fu, Rongbo Zhou, Yanqi E, Sumei Fan, Qinghao Zhao, Chuandong Cheng, Nan Peng, Liang Zhang, Linlin Zheng, Jianjun Chu, Hongbin Xu, Chen Tan, Jian Liu , et al. (6 additional authors not shown)

    Abstract: The artificial intelligence (AI) system has achieved expert-level performance in electrocardiogram (ECG) signal analysis. However, in underdeveloped countries or regions where the healthcare information system is imperfect, only paper ECGs can be provided. Analysis of real-world ECG images (photos or scans of paper ECGs) remains challenging due to complex environments or interference. In this stud… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 47 pages, 29 figures

  43. arXiv:2301.13244  [pdf, other]

    cs.RO cs.CV

    Mono-STAR: Mono-camera Scene-level Tracking and Reconstruction

    Authors: Haonan Chang, Dhruv Metha Ramesh, Shijie Geng, Yuqiu Gan, Abdeslam Boularias

    Abstract: We present Mono-STAR, the first real-time 3D reconstruction system that simultaneously supports semantic fusion, fast motion tracking, non-rigid object deformation, and topological change under a unified framework. The proposed system solves a new optimization problem incorporating optical-flow-based 2D constraints to deal with fast motion and a novel semantic-aware deformation graph (SAD-graph) f… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: This paper has been accepted by ICRA2023

  44. arXiv:2301.10939  [pdf, other]

    cs.CV cs.CL cs.LG

    Affective Faces for Goal-Driven Dyadic Communication

    Authors: Scott Geng, Revant Teotia, Purva Tendulkar, Sachit Menon, Carl Vondrick

    Abstract: We introduce a video framework for modeling the association between verbal and non-verbal communication during dyadic conversation. Given the input speech of a speaker, our approach retrieves a video of a listener, who has facial expressions that would be socially appropriate given the context. Our approach further allows the listener to be conditioned on their own goals, personalities, or backgro… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  45. Repercussion of the $a_0(1710)$ [$a_0(1817)$] resonance and future developments

    Authors: E. Oset, L. R. Dai, L. S. Geng

    Abstract: In this paper, we discuss the significance and prospect for the newly discovered a0(1710)[a0(1817)] resonance state at BESIII experiment, in which they reported the observation of a scalar meson of spin-parity $J^P=0^+$ with isospin $I=1$, branded as $a_0(1817)$. This state may be the same particle as the $a_0(1710)$ observed by the BaBar experiment earlier. As early as 2008, we found that f0(1710… ▽ More

    Submitted 25 January, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

    Comments: typos corrected. 6 pages,4 figures, to be published as a "Perspective" in Science Bulletin

    Journal ref: Sci. Bull. 68 (2023) 243

  46. arXiv:2212.11497  [pdf, other]

    math.RT math.CO math.RA

    Intersection vectors over tilings with applications to gentle algebras and cluster algebras

    Authors: Changjian Fu, Shengfei Geng

    Abstract: It is proved that a multiset of permissible arcs over a tiling is uniquely determined by its intersection vector under a mild condition. This generalizes a classical result over marked surfaces with triangulations. We apply this result to study $τ$-tilting theory of gentle algebras and denominator conjecture in cluster algebras. In the case of gentle algebras, it is proved that different $τ$-rigid… ▽ More

    Submitted 4 October, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: The proof of Lemma 2.8 (Lemma 2.3 in v1) is improved. Any comments are welcome!

  47. arXiv:2212.07016  [pdf, other]

    cs.CV

    Understanding Zero-Shot Adversarial Robustness for Large-Scale Models

    Authors: Chengzhi Mao, Scott Geng, Junfeng Yang, Xin Wang, Carl Vondrick

    Abstract: Pretrained large-scale vision-language models like CLIP have exhibited strong generalization over unseen tasks. Yet imperceptible adversarial perturbations can significantly reduce CLIP's performance on new tasks. In this work, we identify and explore the problem of \emph{adapting large-scale models for zero-shot adversarial robustness}. We first identify two key factors during model adaption -- t… ▽ More

    Submitted 21 April, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  48. arXiv:2210.02853  [pdf, other]

    cs.CR cs.LG cs.PL cs.SE

    NeuDep: Neural Binary Memory Dependence Analysis

    Authors: Kexin Pei, Dongdong She, Michael Wang, Scott Geng, Zhou Xuan, Yaniv David, Junfeng Yang, Suman Jana, Baishakhi Ray

    Abstract: Determining whether multiple instructions can access the same memory location is a critical task in binary analysis. It is challenging as statically computing precise alias information is undecidable in theory. The problem aggravates at the binary level due to the presence of compiler optimizations and the absence of symbols and types. Existing approaches either produce significant spurious depend… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: ESEC/FSE 2022

  49. A diffuse-interface lattice Boltzmann method for the dendritic growth with thermosolutal convection

    Authors: Chengjie Zhan, Zhenhua Chai, Baochang Shi, Ping Jiang, Shaoning Geng, Dongke Sun

    Abstract: In this work, we proposed a diffuse interface model for the dendritic growth with thermosolutal convection. In this model, the sharp boundary between the fluid and solid dendrite is replaced by a thin but nonzero thickness diffuse interface, which is described by the order parameter governed by the phase-field equation for the dendritic growth. The governing equations for solute and heat transfer… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 20 pages, 14 figures

  50. arXiv:2208.03550  [pdf, other]

    cs.CV

    Frozen CLIP Models are Efficient Video Learners

    Authors: Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li

    Abstract: Video recognition has been dominated by the end-to-end learning paradigm -- first initializing a video recognition model with weights of a pretrained image model and then conducting end-to-end training on videos. This enables the video network to benefit from the pretrained image model. However, this requires substantial computation and memory resources for finetuning on videos and the alternative… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: ECCV 2022