Search | arXiv e-print repository

Showing 1–50 of 295 results for author: Ding, W

Searching in archive cs.
  1. arXiv:2405.10288  [pdf, other]

    cs.CL cs.AI

    Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction

    Authors: Jianhao Chen, Haoyuan Ouyang, Junyang Ren, Wentao Ding, Wei Hu, Yuzhong Qu

    Abstract: Facts extraction is pivotal for constructing knowledge graphs. Recently, the increasing demand for temporal facts in downstream tasks has led to the emergence of the task of temporal fact extraction. In this paper, we specifically address the extraction of temporal facts from natural language text. Previous studies fail to handle the challenge of establishing time-to-fact correspondences in comple…

    Submitted 2 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL2024 main conference

  2. arXiv:2405.00334  [pdf, other]

    cs.LG

    A Survey on Deep Active Learning: Recent Advances and New Frontiers

    Authors: Dongyuan Li, Zhen Wang, Yankai Chen, Renhe Jiang, Weiping Ding, Manabu Okumura

    Abstract: Active learning seeks to achieve strong performance with fewer training samples. It does this by iteratively asking an oracle to label new selected samples in a human-in-the-loop manner. This technique has gained increasing popularity due to its broad applicability, yet its survey papers, especially for deep learning-based active learning (DAL), remain scarce. Therefore, we conduct an advanced and…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by IEEE Transactions on Neural Networks and Learning Systems

  3. arXiv:2404.15384  [pdf, other]

    cs.LG cs.AI

    FL-TAC: Enhanced Fine-Tuning in Federated Learning via Low-Rank, Task-Specific Adapter Clustering

    Authors: Siqi Ping, Yuzhu Mao, Yang Liu, Xiao-Ping Zhang, Wenbo Ding

    Abstract: Although large-scale pre-trained models hold great potential for adapting to downstream tasks through fine-tuning, the performance of such fine-tuned models is often limited by the difficulty of collecting sufficient high-quality, task-specific data. Federated Learning (FL) offers a promising solution by enabling fine-tuning across large-scale clients with a variety of task data, but it is bottlen…

    Submitted 23 April, 2024; originally announced April 2024.

  4. arXiv:2404.10342  [pdf, other]

    cs.CV cs.MM

    Referring Flexible Image Restoration

    Authors: Runwei Guan, Rongsheng Hu, Zhuhao Zhou, Tianlang Xue, Ka Lok Man, Jeremy Smith, Eng Gee Lim, Weiping Ding, Yutao Yue

    Abstract: In reality, images often exhibit multiple degradations, such as rain and fog at night (triple degradations). However, in many cases, individuals may not want to remove all degradations, for instance, a blurry lens revealing a beautiful snowy landscape (double degradations). In such scenarios, people may only desire to deblur. These situations and requirements shed light on a new challenge in image…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 15 pages, 19 figures

  5. arXiv:2404.08501  [pdf, ps, other]

    cs.NE cs.AI

    Analyzing and Overcoming Local Optima in Complex Multi-Objective Optimization by Decomposition-Based Evolutionary Algorithms

    Authors: Ting Dong, Haoxin Wang, Hengxi Zhang, Wenbo Ding

    Abstract: When addressing the challenge of complex multi-objective optimization problems, particularly those with non-convex and non-uniform Pareto fronts, Decomposition-based Multi-Objective Evolutionary Algorithms (MOEADs) often converge to local optima, thereby limiting solution diversity. Despite its significance, this issue has received limited theoretical exploration. Through a comprehensive geometric…

    Submitted 12 April, 2024; originally announced April 2024.

  6. The Survey on Multi-Source Data Fusion in Cyber-Physical-Social Systems: Foundational Infrastructure for Industrial Metaverses and Industries 5.0

    Authors: Xiao Wang, Yutong Wang, Jing Yang, Xiaofeng Jia, Lijun Li, Weiping Ding, Fei-Yue Wang

    Abstract: As the concept of Industries 5.0 develops, industrial metaverses are expected to operate in parallel with the actual industrial processes to offer "Human-Centric" Safe, Secure, Sustainable, Sensitive, Service, and Smartness "6S" manufacturing solutions. Industrial metaverses not only visualize the process of productivity in a dynamic and evolutional way, but also provide an immersive laboratory…

    Submitted 11 April, 2024; originally announced April 2024.

    Journal ref: Information Fusion 2024

  7. arXiv:2404.06836  [pdf, other]

    cs.CV

    O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation

    Authors: Muer Tie, Julong Wei, Zhengjun Wang, Ke Wu, Shansuai Yuan, Kaizhao Zhang, Jie Jia, Jieru Zhao, Zhongxue Gan, Wenchao Ding

    Abstract: Online construction of open-ended language scenes is crucial for robotic applications, where open-vocabulary interactive scene understanding is required. Recently, neural implicit representation has provided a promising direction for online interactive mapping. However, implementing open-vocabulary scene understanding capability into online neural implicit mapping still faces three challenges: lac…

    Submitted 10 April, 2024; originally announced April 2024.

  8. arXiv:2404.06036  [pdf, other]

    cs.CV

    Space-Time Video Super-resolution with Neural Operator

    Authors: Yuantong Zhang, Hanyou Zheng, Daiqin Yang, Zhenzhong Chen, Haichuan Ma, Wenpeng Ding

    Abstract: This paper addresses the task of space-time video super-resolution (ST-VSR). Existing methods generally suffer from inaccurate motion estimation and motion compensation (MEMC) problems for large motions. Inspired by recent progress in physics-informed neural networks, we model the challenges of MEMC in ST-VSR as a mapping between two continuous function spaces. Specifically, our approach transform…

    Submitted 9 April, 2024; originally announced April 2024.

  9. arXiv:2404.04970  [pdf, other]

    cs.LG

    How to characterize imprecision in multi-view clustering?

    Authors: Jinyi Xu, Zuowei Zhang, Ze Lin, Yixiang Chen, Zhe Liu, Weiping Ding

    Abstract: It is still challenging to cluster multi-view data since existing methods can only assign an object to a specific (singleton) cluster when combining different view information. As a result, it fails to characterize imprecision of objects in overlapping regions of different clusters, thus leading to a high risk of errors. In this paper, we thereby want to answer the question: how to characterize im…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 19 pages with 8 pages of supplementary

  10. arXiv:2404.04799  [pdf, other]

    cs.CV

    Few-Shot Object Detection: Research Advances and Challenges

    Authors: Zhimeng Xin, Shiming Chen, Tianxu Wu, Yuanjie Shao, Weiping Ding, Xinge You

    Abstract: Object detection as a subfield within computer vision has achieved remarkable progress, which aims to accurately identify and locate a specific object from images or videos. Such methods rely on large-scale labeled training samples for each object category to ensure accurate detection, but obtaining extensive annotated data is a labor-intensive and expensive process in many real-world scenarios. T…

    Submitted 6 April, 2024; originally announced April 2024.

  11. arXiv:2404.00443  [pdf, ps, other]

    cs.RO

    UDE-based Dynamic Motion Force Control of Mobile Manipulators

    Authors: Songqun Gao, Wendi Ding, Qinyuan Ren, Ben M. Chen

    Abstract: Mobile manipulators are known for their superior mobility over manipulators on fixed bases, offering promising applications in smart industry and housekeeping scenarios. However, the dynamic coupling nature between the mobile base and the manipulator presents challenges for the physical interactive tasks of the mobile manipulator. Current methods suffer from complex modeling processes and poor tra…

    Submitted 30 March, 2024; originally announced April 2024.

  12. arXiv:2403.20327  [pdf, other]

    cs.CL cs.AI

    Gecko: Versatile Text Embeddings Distilled from Large Language Models

    Authors: Jinhyuk Lee, Zhuyun Dai, Xiaoqi Ren, Blair Chen, Daniel Cer, Jeremy R. Cole, Kai Hui, Michael Boratko, Rajvi Kapadia, Wen Ding, Yi Luan, Sai Meher Karthik Duddu, Gustavo Hernandez Abrego, Weiqiang Shi, Nithi Gupta, Aditya Kusupati, Prateek Jain, Siddhartha Reddy Jonnalagadda, Ming-Wei Chang, Iftekhar Naim

    Abstract: We present Gecko, a compact and versatile text embedding model. Gecko achieves strong retrieval performance by leveraging a key idea: distilling knowledge from large language models (LLMs) into a retriever. Our two-step distillation process begins with generating diverse, synthetic paired data using an LLM. Next, we further refine the data quality by retrieving a set of candidate passages for each…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 18 pages

  13. arXiv:2403.20159  [pdf, other]

    cs.CV

    HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes

    Authors: Ke Wu, Kaizhao Zhang, Zhiwei Zhang, Shanshuai Yuan, Muer Tie, Julong Wei, Zijun Xu, Jieru Zhao, Zhongxue Gan, Wenchao Ding

    Abstract: Online dense mapping of urban scenes forms a fundamental cornerstone for scene understanding and navigation of autonomous vehicles. Recent advancements in mapping methods are mainly based on NeRF, whose rendering speed is too slow to meet online requirements. 3D Gaussian Splatting (3DGS), with its rendering speed hundreds of times faster than NeRF, holds greater potential in online dense mapping.…

    Submitted 29 March, 2024; originally announced March 2024.

  14. Fusion Dynamical Systems with Machine Learning in Imitation Learning: A Comprehensive Overview

    Authors: Yingbai Hu, Fares J. Abu-Dakka, Fei Chen, Xiao Luo, Zheng Li, Alois Knoll, Weiping Ding

    Abstract: Imitation Learning (IL), also referred to as Learning from Demonstration (LfD), holds significant promise for capturing expert motor skills through efficient imitation, facilitating adept navigation of complex scenarios. A persistent challenge in IL lies in extending generalization from historical demonstrations, enabling the acquisition of new skills without re-teaching. Dynamical system-based IL…

    Submitted 28 March, 2024; originally announced March 2024.

  15. arXiv:2403.18957  [pdf, other]

    cs.CY cs.CL cs.LG cs.SI

    Moderating Illicit Online Image Promotion for Unsafe User-Generated Content Games Using Large Vision-Language Models

    Authors: Keyan Guo, Ayush Utkarsh, Wenbo Ding, Isabelle Ondracek, Ziming Zhao, Guo Freeman, Nishant Vishwamitra, Hongxin Hu

    Abstract: Online user-generated content games (UGCGs) are increasingly popular among children and adolescents for social interaction and more creative online entertainment. However, they pose a heightened risk of exposure to explicit content, raising growing concerns for the online safety of children and adolescents. Despite these concerns, few studies have addressed the issue of illicit image-based promoti…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: To Appear in the 33rd USENIX Security Symposium, August 14-16, 2024

  16. arXiv:2403.13208  [pdf, other]

    cs.RO

    CaDRE: Controllable and Diverse Generation of Safety-Critical Driving Scenarios using Real-World Trajectories

    Authors: Peide Huang, Wenhao Ding, Jonathan Francis, Bingqing Chen, Ding Zhao

    Abstract: Simulation is an indispensable tool in the development and testing of autonomous vehicles (AVs), offering an efficient and safe alternative to road testing by allowing the exploration of a wide range of scenarios. Despite its advantages, a significant challenge within simulation-based testing is the generation of safety-critical scenarios, which are essential to ensure that AVs can handle rare but…

    Submitted 19 March, 2024; originally announced March 2024.

  17. arXiv:2402.11496  [pdf, other]

    cs.RO

    Point-Wise Vibration Pattern Production via a Sparse Actuator Array for Surface Tactile Feedback

    Authors: Xiaosa Li, Runze Zhao, Chengyue Lu, Xiao Xiao, Wenbo Ding

    Abstract: Surface vibration tactile feedback is capable of conveying various semantic information to humans via the handheld electronic devices, like smartphone, touch panel, and game controller. However, covering the whole device contacting surface with dense actuator arrangement can affect its normal use, how to produce desired vibration patterns at any contact point with only several sparse actuators depl…

    Submitted 18 February, 2024; originally announced February 2024.

  18. arXiv:2402.07648  [pdf, other]

    cs.RO

    DeformNet: Latent Space Modeling and Dynamics Prediction for Deformable Object Manipulation

    Authors: Chenchang Li, Zihao Ai, Tong Wu, Xiaosa Li, Wenbo Ding, Huazhe Xu

    Abstract: Manipulating deformable objects is a ubiquitous task in household environments, demanding adequate representation and accurate dynamics prediction due to the objects' infinite degrees of freedom. This work proposes DeformNet, which utilizes latent space modeling with a learned 3D representation model to tackle these challenges effectively. The proposed representation model combines a PointNet enco…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 7 pages, Submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA), Japan, Yokohama

  19. arXiv:2402.05948  [pdf, other]

    cs.LG cs.CL

    DE$^3$-BERT: Distance-Enhanced Early Exiting for BERT based on Prototypical Networks

    Authors: Jianing He, Qi Zhang, Weiping Ding, Duoqian Miao, Jun Zhao, Liang Hu, Longbing Cao

    Abstract: Early exiting has demonstrated its effectiveness in accelerating the inference of pre-trained language models like BERT by dynamically adjusting the number of layers executed. However, most existing early exiting methods only consider local information from an individual test sample to determine their exiting indicators, failing to leverage the global information offered by sample population. This…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 16 pages

  20. arXiv:2402.05725  [pdf, other]

    cs.RO eess.SP

    Dual-modal Tactile E-skin: Enabling Bidirectional Human-Robot Interaction via Integrated Tactile Perception and Feedback

    Authors: Shilong Mu, Runze Zhao, Zenan Lin, Yan Huang, Shoujie Li, Chenchang Li, Xiao-Ping Zhang, Wenbo Ding

    Abstract: To foster an immersive and natural human-robot interaction, the implementation of tactile perception and feedback becomes imperative, effectively bridging the conventional sensory gap. In this paper, we propose a dual-modal electronic skin (e-skin) that integrates magnetic tactile sensing and vibration feedback for enhanced human-robot interaction. The dual-modal tactile e-skin offers multi-functi…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 7 pages, 8 figures. Submitted to 2024 IEEE International Conference on Robotics and Automation (ICRA), Japan, Yokohama

  21. arXiv:2402.02175  [pdf, other]

    cs.CL cs.IR

    Enhancing Complex Question Answering over Knowledge Graphs through Evidence Pattern Retrieval

    Authors: Wentao Ding, Jinmao Li, Liangchuan Luo, Yuzhong Qu

    Abstract: Information retrieval (IR) methods for KGQA consist of two stages: subgraph extraction and answer reasoning. We argue current subgraph extraction methods underestimate the importance of structural dependencies among evidence facts. We propose Evidence Pattern Retrieval (EPR) to explicitly model the structural dependencies during subgraph extraction. We implement EPR by indexing the atomic adjacenc…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: Accepted to TheWebConf'24 (WWW 2024). This is a preprint version; the CR version will include more details. Github: https://github.com/nju-websoft/EPR-KGQA

  22. arXiv:2402.00585  [pdf, other]

    cs.RO

    SATac: A Thermoluminescence Enabled Tactile Sensor for Concurrent Perception of Temperature, Pressure, and Shear

    Authors: Ziwu Song, Ran Yu, Xuan Zhang, Kit Wa Sou, Shilong Mu, Dengfeng Peng, Xiao-Ping Zhang, Wenbo Ding

    Abstract: Most vision-based tactile sensors use elastomer deformation to infer tactile information, which can not sense some modalities, like temperature. As an important part of human tactile perception, temperature sensing can help robots better interact with the environment. In this work, we propose a novel multimodal vision-based tactile sensor, SATac, which can simultaneously perceive information of te…

    Submitted 1 February, 2024; originally announced February 2024.

  23. arXiv:2402.00367  [pdf, other]

    cs.CL

    Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

    Authors: Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Vidhisha Balachandran, Yulia Tsvetkov

    Abstract: Despite efforts to expand the knowledge of large language models (LLMs), knowledge gaps -- missing or outdated information in LLMs -- might always persist given the evolving nature of knowledge. In this work, we study approaches to identify LLM knowledge gaps and abstain from answering questions when knowledge gaps are present. We first adapt existing approaches to model calibration or adaptation…

    Submitted 1 February, 2024; originally announced February 2024.

  24. arXiv:2401.16791  [pdf, other]

    cs.LG

    Accelerated Cloud for Artificial Intelligence (ACAI)

    Authors: Dachi Chen, Weitian Ding, Chen Liang, Chang Xu, Junwei Zhang, Majd Sakr

    Abstract: Training an effective Machine learning (ML) model is an iterative process that requires effort in multiple dimensions. Vertically, a single pipeline typically includes an initial ETL (Extract, Transform, Load) of raw datasets, a model training stage, and an evaluation stage where the practitioners obtain statistics of the model performance. Horizontally, many such pipelines may be required to find…

    Submitted 30 January, 2024; originally announced January 2024.

  25. arXiv:2401.13462  [pdf, other]

    cs.RO cs.AI

    Growing from Exploration: A self-exploring framework for robots based on foundation models

    Authors: Shoujie Li, Ran Yu, Tong Wu, JunWen Zhong, Xiao-Ping Zhang, Wenbo Ding

    Abstract: Intelligent robot is the ultimate goal in the robotics field. Existing works leverage learning-based or optimization-based methods to accomplish human-defined tasks. However, the challenge of enabling robots to explore various environments autonomously remains unresolved. In this work, we propose a framework named GExp, which enables robots to explore and learn autonomously without human intervent…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 19 pages

  26. arXiv:2401.11547  [pdf, other]

    cs.CR

    Understanding the Security Risks of Decentralized Exchanges by Uncovering Unfair Trades in the Wild

    Authors: Jiaqi Chen, Yibo Wang, Yuxuan Zhou, Wanning Ding, Yuzhe Tang, XiaoFeng Wang, Kai Li

    Abstract: DEX, or decentralized exchange, is a prominent class of decentralized finance (DeFi) applications on blockchains, attracting a total locked value worth tens of billions of USD today. This paper presents the first large-scale empirical study that uncovers unfair trades on popular DEX services on Ethereum and Binance Smart Chain (BSC). By joining and analyzing 60 million transactions, we find 671,…

    Submitted 21 January, 2024; originally announced January 2024.

  27. arXiv:2401.09574  [pdf, ps, other]

    cs.LG cs.CR

    Towards Scalable and Robust Model Versioning

    Authors: Wenxin Ding, Arjun Nitin Bhagoji, Ben Y. Zhao, Haitao Zheng

    Abstract: As the deployment of deep learning models continues to expand across industries, the threat of malicious incursions aimed at gaining access to these deployed models is on the rise. Should an attacker gain access to a deployed model, whether through server breaches, insider attacks, or model inversion techniques, they can then construct white-box adversarial attacks to manipulate the model's classi…

    Submitted 10 March, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Published in IEEE SaTML 2024

  28. arXiv:2401.07286  [pdf, other]

    cs.CL

    CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning

    Authors: Weiqi Wang, Tianqing Fang, Chunyang Li, Haochen Shi, Wenxuan Ding, Baixuan Xu, Zhaowei Wang, Jiaxin Bai, Xin Liu, Jiayang Cheng, Chunkit Chan, Yangqiu Song

    Abstract: The sequential process of conceptualization and instantiation is essential to generalizable commonsense reasoning as it allows the application of existing knowledge to unfamiliar scenarios. However, existing works tend to undervalue the step of instantiation and heavily rely on pre-built concept taxonomies and human annotations to collect both types of knowledge, resulting in a lack of instantiate…

    Submitted 21 May, 2024; v1 submitted 14 January, 2024; originally announced January 2024.

    Comments: ACL2024

  29. arXiv:2401.05902  [pdf, other]

    cs.IT

    Optimized Asymmetric Feedback Detection for Rate-adaptive HARQ with Unreliable Feedback

    Authors: Weihang Ding, Mohammad Shikh-Bahaei

    Abstract: This work considers downlink incremental redundancy Hybrid Automatic Repeat Request (IR-HARQ) over unreliable feedback channels. Since the impact of positive feedback (i.e., ACK) error is smaller than that of negative feedback (i.e., NACK) error, an asymmetric feedback detection scheme is proposed to protect NACK and further reduce the outage probability. We formulate the HARQ process as a Markov…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.02948

  30. arXiv:2401.05898  [pdf, other]

    cs.IT eess.SY

    A Partial Compress-and-Forward Strategy for Relay-assisted Wireless Networks Based on Rateless Coding

    Authors: Weihang Ding, Mohammad Shikh-Bahaei

    Abstract: In this work, we propose a novel partial compress-and-forward (PCF) scheme for improving the maximum achievable transmission rate of a diamond relay network with two noisy relays. PCF combines conventional compress-and-forward (CF) and amplify-and-forward (AF) protocols, enabling one relay to operate alternately in the CF or the AF mode, while the other relay works purely in the CF mode. As the di…

    Submitted 11 January, 2024; originally announced January 2024.

  31. arXiv:2401.05362  [pdf, other]

    cs.CV cs.AI

    DualTeacher: Bridging Coexistence of Unlabelled Classes for Semi-supervised Incremental Object Detection

    Authors: Ziqi Yuan, Liyuan Wang, Wenbo Ding, Xingxing Zhang, Jiachen Zhong, Jianyong Ai, Jianmin Li, Jun Zhu

    Abstract: In real-world applications, an object detector often encounters object instances from new classes and needs to accommodate them effectively. Previous work formulated this critical problem as incremental object detection (IOD), which assumes the object instances of new classes to be fully annotated in incremental data. However, as supervisory signals are usually rare and expensive, the supervised I…

    Submitted 13 December, 2023; originally announced January 2024.

  32. arXiv:2312.15480  [pdf, other]

    cs.CV

    A Two-stage Personalized Virtual Try-on Framework with Shape Control and Texture Guidance

    Authors: Shufang Zhang, Minxue Ni, Lei Wang, Wenxin Ding, Shuai Chen, Yuhong Liu

    Abstract: The Diffusion model has a strong ability to generate wild images. However, the model can just generate inaccurate images with the guidance of text, which makes it very challenging to directly apply the text-guided generative model for virtual try-on scenarios. Taking images as guiding conditions of the diffusion model, this paper proposes a brand new personalized virtual try-on model (PE-VITON), w…

    Submitted 24 December, 2023; originally announced December 2023.

  33. arXiv:2312.13303  [pdf, other]

    cs.LG cs.AI

    RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

    Authors: Wenhao Ding, Yulong Cao, Ding Zhao, Chaowei Xiao, Marco Pavone

    Abstract: Simulation plays a crucial role in the development of autonomous vehicles (AVs) due to the potential risks associated with real-world testing. Although significant progress has been made in the visual aspects of simulators, generating complex behavior among agents remains a formidable challenge. It is not only imperative to ensure realism in the scenarios generated but also essential to incorporat…

    Submitted 19 December, 2023; originally announced December 2023.

  34. arXiv:2312.11053  [pdf, other]

    cs.AI cs.DB

    Conflict Detection for Temporal Knowledge Graphs: A Fast Constraint Mining Algorithm and New Benchmarks

    Authors: Jianhao Chen, Junyang Ren, Wentao Ding, Haoyuan Ouyang, Wei Hu, Yuzhong Qu

    Abstract: Temporal facts, which are used to describe events that occur during specific time periods, have become a topic of increased interest in the field of knowledge graph (KG) research. In terms of quality management, the introduction of time restrictions brings new challenges to maintaining the temporal consistency of KGs. Previous studies rely on manually enumerated temporal constraints to detect conf…

    Submitted 18 December, 2023; originally announced December 2023.

  35. arXiv:2312.08851  [pdf, other]

    cs.CV cs.CE cs.RO

    Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities

    Authors: Runwei Guan, Haocheng Zhao, Shanliang Yao, Ka Lok Man, Xiaohui Zhu, Limin Yu, Yong Yue, Jeremy Smith, Eng Gee Lim, Weiping Ding, Yutao Yue

    Abstract: Urban water-surface robust perception serves as the foundation for intelligent monitoring of aquatic environments and the autonomous navigation and operation of unmanned vessels, especially in the context of waterway safety. It is worth noting that current multi-sensor fusion and multi-task learning models consume substantial power and heavily rely on high-power GPUs for inference. This contribute…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 18 pages, 9 figures

  36. arXiv:2312.04861  [pdf, other]

    cs.CV cs.AI

    Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review

    Authors: Shanliang Yao, Runwei Guan, Zitian Peng, Chenhang Xu, Yilu Shi, Weiping Ding, Eng Gee Lim, Yong Yue, Hyungjoon Seo, Ka Lok Man, Jieming Ma, Xiaohui Zhu, Yutao Yue

    Abstract: With the rapid advancements of sensor technology and deep learning, autonomous driving systems are providing safe and efficient access to intelligent vehicles as well as intelligent transportation. Among these equipped sensors, the radar sensor plays a crucial role in providing robust perception information in diverse environmental conditions. This review focuses on exploring different radar data…

    Submitted 19 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 24 pages, 10 figures, 5 tables. arXiv admin note: text overlap with arXiv:2304.10410

  37. arXiv:2312.02684  [pdf, other]

    cs.CV cs.LG cs.RO

    DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors

    Authors: Xiaze Zhang, Ziheng Ding, Qi Jing, Yuejie Zhang, Wenchao Ding, Rui Feng

    Abstract: Point clouds have shown significant potential in various domains, including Simultaneous Localization and Mapping (SLAM). However, existing approaches either rely on dense point clouds to achieve high localization accuracy or use generalized descriptors to reduce map size. Unfortunately, these two aspects seem to conflict with each other. To address this limitation, we propose a unified architectu…

    Submitted 5 December, 2023; originally announced December 2023.

  38. arXiv:2312.02642  [pdf, other]

    cs.CR

    Understanding Ethereum Mempool Security under Asymmetric DoS by Symbolic Fuzzing

    Authors: Yibo Wang, Wanning Ding, Kai Li, Yuzhe Tang

    Abstract: In blockchains, mempool controls transaction flow before consensus, denial of whose service hurts the health and security of blockchain networks. This paper presents MPFUZZ, the first mempool fuzzer to find asymmetric DoS bugs by symbolically exploring mempool state space and optimistically estimating the promisingness an intermediate state is in reaching bug oracles. Compared to the baseline bloc…

    Submitted 21 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  39. arXiv:2312.01573  [pdf]

    eess.IV cs.CV

    Survey on deep learning in multimodal medical imaging for cancer detection

    Authors: Yan Tian, Zhaocheng Xu, Yujun Ma, Weiping Ding, Ruili Wang, Zhihong Gao, Guohua Cheng, Linyang He, Xuran Zhao

    Abstract: The task of multimodal cancer detection is to determine the locations and categories of lesions by using different imaging techniques, which is one of the key research methods for cancer diagnosis. Recently, deep learning-based object detection has made significant developments due to its strength in semantic feature extraction and nonlinear function fitting. However, multimodal cancer detection r…

    Submitted 3 December, 2023; originally announced December 2023.

    Journal ref: Neural Computing and Applications. 2023 Nov 29:1-6

  40. arXiv:2311.10747  [pdf, other]

    cs.RO cs.AI cs.LG

    Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving

    Authors: Haohong Lin, Wenhao Ding, Zuxin Liu, Yaru Niu, Jiacheng Zhu, Yuming Niu, Ding Zhao

    Abstract: In the domain of autonomous driving, the offline Reinforcement Learning (RL) approaches exhibit notable efficacy in addressing sequential decision-making problems from offline datasets. However, maintaining safety in diverse safety-critical scenarios remains a significant challenge due to long-tailed and unforeseen scenarios absent from offline datasets. In this paper, we introduce the saFety-awar…

    Submitted 12 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  41. arXiv:2311.08487  [pdf, other]

    cs.CL cs.AI

    Alignment is not sufficient to prevent large language models from generating harmful information: A psychoanalytic perspective

    Authors: Zi Yin, Wei Ding, Jia Liu

    Abstract: Large Language Models (LLMs) are central to a multitude of applications but struggle with significant risks, notably in generating harmful content and biases. Drawing an analogy to the human psyche's conflict between evolutionary survival instincts and societal norm adherence elucidated in Freud's psychoanalysis theory, we argue that LLMs suffer a similar fundamental conflict, arising between thei…

    Submitted 14 November, 2023; originally announced November 2023.

  42. arXiv:2311.05298  [pdf, other]

    cs.CV

    Improving Vision-and-Language Reasoning via Spatial Relations Modeling

    Authors: Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou

    Abstract: Visual commonsense reasoning (VCR) is a challenging multi-modal task, which requires high-level cognition and commonsense reasoning ability about the real world. In recent years, large-scale pre-training approaches have been developed and promoted the state-of-the-art performance of VCR. However, the existing approaches almost employ the BERT-like objectives to learn multi-modal representations. T…

    Submitted 9 November, 2023; originally announced November 2023.

  43. arXiv:2310.20223  [pdf, other]

    cs.LG

    STDA-Meta: A Meta-Learning Framework for Few-Shot Traffic Prediction

    Authors: Maoxiang Sun, Weilong Ding, Tianpu Zhang, Zijian Liu, Mengda Xing

    Abstract: As the development of cities, traffic congestion becomes an increasingly pressing issue, and traffic prediction is a classic method to relieve that issue. Traffic prediction is one specific application of spatio-temporal prediction learning, like taxi scheduling, weather prediction, and ship trajectory prediction. Against these problems, classical spatio-temporal prediction learning methods includ…

    Submitted 31 October, 2023; originally announced October 2023.

  44. arXiv:2310.15299  [pdf, other]

    math.NA cs.AI cs.LG physics.flu-dyn

    Neural Network with Local Converging Input (NNLCI) for Supersonic Flow Problems with Unstructured Grids

    Authors: Weiming Ding, Haoxiang Huang, Tzu Jung Lee, Yingjie Liu, Vigor Yang

    Abstract: In recent years, surrogate models based on deep neural networks (DNN) have been widely used to solve partial differential equations, which were traditionally handled by means of numerical simulations. This kind of surrogate models, however, focuses on global interpolation of the training dataset, and thus requires a large network structure. The process is both time consuming and computationally co…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 23 pages, 21 figures

    MSC Class: 35Q31

  45. arXiv:2310.13828  [pdf, other]

    cs.CR cs.AI

    Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

    Authors: Shawn Shan, Wenxin Ding, Josephine Passananti, Stanley Wu, Haitao Zheng, Ben Y. Zhao

    Abstract: Data poisoning attacks manipulate training data to introduce unexpected behaviors into machine learning models at training time. For text-to-image generative models with massive training datasets, current understanding of poisoning attacks suggests that a successful attack would require injecting millions of poison samples into their training pipeline. In this paper, we show that poisoning attacks…

    Submitted 29 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: IEEE Security and Privacy 2024

  46. arXiv:2310.13398  [pdf, other]

    cs.CV

    OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D Data

    Authors: Yijie Zhou, Likun Cai, Xianhui Cheng, Zhongxue Gan, Xiangyang Xue, Wenchao Ding

    Abstract: In the era of big data and large models, automatic annotating functions for multi-modal data are of great significance for real-world AI-driven applications, such as autonomous driving and embodied AI. Unlike traditional closed-set annotation, open-vocabulary annotation is essential to achieve human-level cognition capability. However, there are few open-vocabulary auto-labeling systems for multi-…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: The source code will be released at https://github.com/Fudan-ProjectTitan/OpenAnnotate3D

  47. arXiv:2310.11303  [pdf, other]

    cs.CL

    QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering

    Authors: Haochen Shi, Weiqi Wang, Tianqing Fang, Baixuan Xu, Wenxuan Ding, Xin Liu, Yangqiu Song

    Abstract: Zero-shot commonsense Question-Answering (QA) requires models to reason about general situations beyond specific benchmarks. State-of-the-art approaches fine-tune language models on QA pairs constructed from CommonSense Knowledge Bases (CSKBs) to equip the models with more commonsense knowledge in a QA context. However, current QA synthesis protocols may introduce noise from the CSKBs and generate…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP2023

  48. arXiv:2310.08670  [pdf, other]

    cs.LG cs.DC

    Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction

    Authors: Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding

    Abstract: Cross-device Federated Learning (FL) faces significant challenges where low-end clients that could potentially make unique contributions are excluded from training large models due to their resource bottlenecks. Recent research efforts have focused on model-heterogeneous FL, by extracting reduced-size models from the global model and applying them to local clients accordingly. Despite the empirica…

    Submitted 26 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023

  49. arXiv:2310.04455  [pdf, other]

    cs.LG cs.AI cs.CL

    Inclusive Data Representation in Federated Learning: A Novel Approach Integrating Textual and Visual Prompt

    Authors: Zihao Zhao, Zhenpeng Shi, Yang Liu, Wenbo Ding

    Abstract: Federated Learning (FL) is often impeded by communication overhead issues. Prompt tuning, as a potential solution, has been introduced to only adjust a few trainable parameters rather than the whole model. However, current single-modality prompt tuning approaches fail to comprehensively portray local clients' data. To overcome this limitation, we present Twin Prompt Federated learning (TPFL), a pi…

    Submitted 4 October, 2023; originally announced October 2023.

  50. arXiv:2310.01290  [pdf, other]

    cs.CL cs.AI

    Knowledge Crosswords: Geometric Reasoning over Structured Knowledge with Large Language Models

    Authors: Wenxuan Ding, Shangbin Feng, Yuhan Liu, Zhaoxuan Tan, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov

    Abstract: Large language models (LLMs) are widely adopted in knowledge-intensive tasks and have achieved impressive performance thanks to their knowledge abilities. While LLMs have demonstrated outstanding performance on atomic or linear (multi-hop) QA tasks, whether they can reason in knowledge-rich scenarios with interweaving constraints remains an underexplored problem. In this work, we propose geometric…

    Submitted 2 October, 2023; originally announced October 2023.