(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 92 results for author: Lim, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03867  [pdf, other

    quant-ph cs.ET

    A Comprehensive Study of Quantum Arithmetic Circuits

    Authors: Siyi Wang, Xiufan Li, Wei Jie Bryan Lee, Suman Deb, Eugene Lim, Anupam Chattopadhyay

    Abstract: In recent decades, the field of quantum computing has experienced remarkable progress. This progress is marked by the superior performance of many quantum algorithms compared to their classical counterparts, with Shor's algorithm serving as a prominent illustration. Quantum arithmetic circuits, which are the fundamental building blocks in numerous quantum algorithms, have attracted much attention.… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Under review at the Royal Society's Philosophical Transactions A

  2. arXiv:2405.12821  [pdf, other

    cs.RO cs.CV

    Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension

    Authors: Runwei Guan, Ruixiao Zhang, Ningwei Ouyang, Jianan Liu, Ka Lok Man, Xiaohao Cai, Ming Xu, Jeremy Smith, Eng Gee Lim, Yutao Yue, Hui Xiong

    Abstract: Embodied perception is essential for intelligent vehicles and robots, enabling more natural interaction and task execution. However, these advancements currently embrace vision level, rarely focusing on using 3D modeling sensors, which limits the full understanding of surrounding objects with multi-granular characteristics. Recently, as a promising automotive sensor with affordable cost, 4D Millim… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures

  3. arXiv:2405.10150  [pdf, other

    cs.CL

    Speaker Verification in Agent-Generated Conversations

    Authors: Yizhe Yang, Palakorn Achananuparp, Heyan Huang, Jing Jiang, Ee-Peng Lim

    Abstract: The recent success of large language models (LLMs) has attracted widespread interest to develop role-playing conversational agents personalized to the characteristics and styles of different speakers to enhance their abilities to perform both general and special purpose dialogue tasks. However, the ability to personalize the generated utterances to speakers, whether conducted by human or LLM, has… ▽ More

    Submitted 5 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  4. arXiv:2405.10126  [pdf

    stat.ML cs.LG math.ST

    Estimating a Function and Its Derivatives Under a Smoothness Condition

    Authors: Eunji Lim

    Abstract: We consider the problem of estimating an unknown function f* and its partial derivatives from a noisy data set of n observations, where we make no assumptions about f* except that it is smooth in the sense that it has square integrable partial derivatives of order m. A natural candidate for the estimator of f* in such a case is the best fit to the data set that satisfies a certain smoothness condi… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 27 pages. Mathematics of Operations Research 2024

    MSC Class: 62G08; 62G20

  5. arXiv:2404.19381  [pdf, other

    cs.AR

    Low-overhead General-purpose Near-Data Processing in CXL Memory Expanders

    Authors: Hyungkyu Ham, Jeongmin Hong, Geonwoo Park, Yunseon Shin, Okkyun Woo, Wonhyuk Yang, Jinhoon Bae, Eunhyeok Park, Hyojin Sung, Euicheol Lim, Gwangsun Kim

    Abstract: To overcome the memory capacity wall of large-scale AI and big data applications, Compute Express Link (CXL) enables cost-efficient memory expansion beyond the local DRAM of processors. While its CXL.mem protocol stack minimizes interconnect latency, CXL memory accesses can still result in significant slowdowns for memory-bound applications. While near-data processing (NDP) in CXL memory can overc… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  6. arXiv:2404.10342  [pdf, other

    cs.CV cs.MM

    Referring Flexible Image Restoration

    Authors: Runwei Guan, Rongsheng Hu, Zhuhao Zhou, Tianlang Xue, Ka Lok Man, Jeremy Smith, Eng Gee Lim, Weiping Ding, Yutao Yue

    Abstract: In reality, images often exhibit multiple degradations, such as rain and fog at night (triple degradations). However, in many cases, individuals may not want to remove all degradations, for instance, a blurry lens revealing a beautiful snowy landscape (double degradations). In such scenarios, people may only desire to deblur. These situations and requirements shed light on a new challenge in image… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 15 pages, 19 figures

  7. arXiv:2404.01409  [pdf, other

    cs.CV cs.AI cs.MM

    OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation

    Authors: Xiongwei Wu, Sicheng Yu, Ee-Peng Lim, Chong-Wah Ngo

    Abstract: In the realm of food computing, segmenting ingredients from images poses substantial challenges due to the large intra-class variance among the same ingredients, the emergence of new ingredients, and the high annotation costs associated with large food segmentation datasets. Existing approaches primarily utilize a closed-vocabulary and static text embeddings setting. These methods often fall short… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: CVPR 2024; 12 pages

  8. arXiv:2403.12686  [pdf, other

    cs.CV cs.MM cs.RO

    WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar

    Authors: Runwei Guan, Liye Jia, Fengyufan Yang, Shanliang Yao, Erick Purwanto, Xiaohui Zhu, Eng Gee Lim, Jeremy Smith, Ka Lok Man, Xuming Hu, Yutao Yue

    Abstract: The perception of waterways based on human intent is significant for autonomous navigation and operations of Unmanned Surface Vehicles (USVs) in water environments. Inspired by visual grounding, we introduce WaterVG, the first visual grounding dataset designed for USV-based waterway perception based on human prompts. WaterVG encompasses prompts describing multiple targets, with annotations at the… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 10 pages, 10 figures

  9. arXiv:2403.10135  [pdf, other

    cs.IR cs.AI cs.CL

    The Whole is Better than the Sum: Using Aggregated Demonstrations in In-Context Learning for Sequential Recommendation

    Authors: Lei Wang, Ee-Peng Lim

    Abstract: Large language models (LLMs) have shown excellent performance on various NLP tasks. To use LLMs as strong sequential recommenders, we explore the in-context learning approach to sequential recommendation. We investigate the effects of instruction format, task consistency, demonstration selection, and number of demonstrations. As increasing the number of demonstrations in ICL does not improve accur… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: NAACL 2024 (Findings)

  10. arXiv:2403.01206  [pdf, other

    quant-ph cs.ET

    Boosting the Efficiency of Quantum Divider through Effective Design Space Exploration

    Authors: Siyi Wang, Eugene Lim, Anupam Chattopadhyay

    Abstract: Rapid progress in the design of scalable, robust quantum computing necessitates efficient quantum circuit implementation for algorithms with practical relevance. For several algorithms, arithmetic kernels, in particular, division plays an important role. In this manuscript, we focus on enhancing the performance of quantum slow dividers by exploring the design choices of its sub-blocks, such as, ad… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: This is accepted for publication in ISCAS 2024

  11. arXiv:2402.17971  [pdf, other

    cs.CV cs.AI cs.CL

    All in an Aggregated Image for In-Image Learning

    Authors: Lei Wang, Wanyu Xu, Zhiqiang Hu, Yihuai Lan, Shan Dong, Hao Wang, Roy Ka-Wei Lee, Ee-Peng Lim

    Abstract: This paper introduces a new in-context learning (ICL) mechanism called In-Image Learning (I$^2$L) that combines demonstration examples, visual cues, and chain-of-thought reasoning into an aggregated image to enhance the capabilities of Large Multimodal Models (e.g., GPT-4V) in multimodal reasoning tasks. Unlike previous approaches that rely on converting images to text or incorporating visual inpu… ▽ More

    Submitted 2 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Preprint

  12. arXiv:2402.16075  [pdf, other

    cs.LG cs.AI cs.RO

    Don't Start from Scratch: Behavioral Refinement via Interpolant-based Policy Diffusion

    Authors: Kaiqi Chen, Eugene Lim, Kelvin Lin, Yiyang Chen, Harold Soh

    Abstract: Imitation learning empowers artificial agents to mimic behavior by learning from demonstrations. Recently, diffusion models, which have the ability to model high-dimensional and multimodal distributions, have shown impressive performance on imitation learning tasks. These models learn to shape a policy by diffusing actions (or states) from standard Gaussian noise. However, the target policy to be… ▽ More

    Submitted 22 May, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

  13. arXiv:2402.11887  [pdf, other

    cs.LG

    Generative Semi-supervised Graph Anomaly Detection

    Authors: Hezhe Qiao, Qingsong Wen, Xiaoli Li, Ee-Peng Lim, Guansong Pang

    Abstract: This work considers a practical semi-supervised graph anomaly detection (GAD) scenario, where part of the nodes in a graph are known to be normal, contrasting to the extensively explored unsupervised setting with a fully unlabeled graph. We reveal that having access to the normal nodes, even just a small percentage of normal nodes, helps enhance the detection performance of existing unsupervised G… ▽ More

    Submitted 28 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 20 pages, 11 figures

  14. arXiv:2402.06119  [pdf, other

    cs.CV

    ContPhy: Continuum Physical Concept Learning and Reasoning from Videos

    Authors: Zhicheng Zheng, Xin Yan, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Gan

    Abstract: We introduce the Continuum Physical Dataset (ContPhy), a novel benchmark for assessing machine physical commonsense. ContPhy complements existing physical reasoning benchmarks by encompassing the inference of diverse physical properties, such as mass and density, across various scenarios and predicting corresponding dynamics. We evaluated a range of AI models and found that they still struggle to… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: The first three authors contributed equally to this work

  15. arXiv:2312.08851  [pdf, other

    cs.CV cs.CE cs.RO

    Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities

    Authors: Runwei Guan, Haocheng Zhao, Shanliang Yao, Ka Lok Man, Xiaohui Zhu, Limin Yu, Yong Yue, Jeremy Smith, Eng Gee Lim, Weiping Ding, Yutao Yue

    Abstract: Urban water-surface robust perception serves as the foundation for intelligent monitoring of aquatic environments and the autonomous navigation and operation of unmanned vessels, especially in the context of waterway safety. It is worth noting that current multi-sensor fusion and multi-task learning models consume substantial power and heavily rely on high-power GPUs for inference. This contribute… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 18 pages, 9 figures

  16. arXiv:2312.04861  [pdf, other

    cs.CV cs.AI

    Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review

    Authors: Shanliang Yao, Runwei Guan, Zitian Peng, Chenhang Xu, Yilu Shi, Weiping Ding, Eng Gee Lim, Yong Yue, Hyungjoon Seo, Ka Lok Man, Jieming Ma, Xiaohui Zhu, Yutao Yue

    Abstract: With the rapid advancements of sensor technology and deep learning, autonomous driving systems are providing safe and efficient access to intelligent vehicles as well as intelligent transportation. Among these equipped sensors, the radar sensor plays a crucial role in providing robust perception information in diverse environmental conditions. This review focuses on exploring different radar data… ▽ More

    Submitted 19 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 24 pages, 10 figures, 5 tables. arXiv admin note: text overlap with arXiv:2304.10410

  17. arXiv:2312.01701  [pdf, other

    cs.CV cs.CL

    Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites

    Authors: Lei Wang, Jiabang He, Shenshen Li, Ning Liu, Ee-Peng Lim

    Abstract: Large language models (LLMs) have shown remarkable performance in natural language processing (NLP) tasks. To comprehend and execute diverse human instructions over image data, instruction-tuned large vision-language models (LVLMs) have been introduced. However, LVLMs may suffer from different types of object hallucinations. Nevertheless, LVLMs are evaluated for coarse-grained object hallucination… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: MMM 2024

  18. arXiv:2312.00353  [pdf, other

    cs.CL cs.AI

    On Exploring the Reasoning Capability of Large Language Models with Knowledge Graphs

    Authors: Pei-Chi Lo, Yi-Hang Tsai, Ee-Peng Lim, San-Yih Hwang

    Abstract: This paper examines the capacity of LLMs to reason with knowledge graphs using their internal knowledge graph, i.e., the knowledge graph they learned during pre-training. Two research questions are formulated to investigate the accuracy of LLMs in recalling information from pre-training knowledge graphs and their ability to infer knowledge graph relations from context. To address these questions,… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Presented at the Generative-IR Workshop during SIGIR 2023. https://coda.io/@sigir/gen-ir

  19. arXiv:2310.14985  [pdf, other

    cs.CL

    LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay

    Authors: Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang

    Abstract: This paper aims to investigate the open research problem of uncovering the social behaviors of LLM-based agents. To achieve this goal, we adopt Avalon, a representative communication game, as the environment and use system prompts to guide LLM agents to play the game. While previous studies have conducted preliminary investigations into gameplay with LLM agents, there lacks research on their socia… ▽ More

    Submitted 7 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  20. arXiv:2310.12498  [pdf, other

    cs.LG math.NA

    Quasi Manhattan Wasserstein Distance

    Authors: Evan Unit Lim

    Abstract: The Quasi Manhattan Wasserstein Distance (QMWD) is a metric designed to quantify the dissimilarity between two matrices by combining elements of the Wasserstein Distance with specific transformations. It offers improved time and space complexity compared to the Manhattan Wasserstein Distance (MWD) while maintaining accuracy. QMWD is particularly advantageous for large datasets or situations with l… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  21. arXiv:2310.07652  [pdf, other

    cs.HC cs.CL

    LLM4Vis: Explainable Visualization Recommendation using ChatGPT

    Authors: Lei Wang, Songheng Zhang, Yun Wang, Ee-Peng Lim, Yong Wang

    Abstract: Data visualization is a powerful tool for exploring and communicating insights in various domains. To automate visualization choice for datasets, a task known as visualization recommendation has been proposed. Various machine-learning-based approaches have been developed for this purpose, but they often require a large corpus of dataset-visualization pairs for training and lack natural explanation… ▽ More

    Submitted 15 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (Industry Track)

  22. arXiv:2308.10287  [pdf, other

    cs.CV cs.RO

    Efficient-VRNet: An Exquisite Fusion Network for Riverway Panoptic Perception based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar

    Authors: Runwei Guan, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Yong Yue, Jeremy Smith, Eng Gee Lim, Yutao Yue

    Abstract: Panoptic perception is essential to unmanned surface vehicles (USVs) for autonomous navigation. The current panoptic perception scheme is mainly based on vision only, that is, object detection and semantic segmentation are performed simultaneously based on camera sensors. Nevertheless, the fusion of camera and radar sensors is regarded as a promising method which could substitute pure vision metho… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

  23. arXiv:2307.07102  [pdf, other

    cs.CV cs.RO

    Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar

    Authors: Runwei Guan, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Eng Gee Lim, Jeremy Smith, Yong Yue, Yutao Yue

    Abstract: Current perception models for different tasks usually exist in modular forms on Unmanned Surface Vehicles (USVs), which infer extremely slowly in parallel on edge devices, causing the asynchrony between perception results and USV position, and leading to error decisions of autonomous navigation. Compared with Unmanned Ground Vehicles (UGVs), the robust perception of USVs develops relatively slowly… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted by ITSC 2023

  24. arXiv:2307.06505  [pdf, other

    cs.CV cs.RO

    WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmark for Autonomous Driving on Water Surfaces

    Authors: Shanliang Yao, Runwei Guan, Zhaodong Wu, Yi Ni, Zile Huang, Zixian Zhang, Yong Yue, Weiping Ding, Eng Gee Lim, Hyungjoon Seo, Ka Lok Man, Xiaohui Zhu, Yutao Yue

    Abstract: Autonomous driving on water surfaces plays an essential role in executing hazardous and time-consuming missions, such as maritime surveillance, survivors rescue, environmental monitoring, hydrography mapping and waste cleaning. This work presents WaterScenes, the first multi-task 4D radar-camera fusion dataset for autonomous driving on water surfaces. Equipped with a 4D radar and a monocular camer… ▽ More

    Submitted 14 August, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  25. arXiv:2306.06461  [pdf

    eess.AS cs.SD

    Semi-supervsied Learning-based Sound Event Detection using Freuqency Dynamic Convolution with Large Kernel Attention for DCASE Challenge 2023 Task 4

    Authors: Ji Won Kim, Sang Won Son, Yoonah Song, Hong Kook Kim, Il Hoon Song, Jeong Eun Lim

    Abstract: This report proposes a frequency dynamic convolution (FDY) with a large kernel attention (LKA)-convolutional recurrent neural network (CRNN) with a pre-trained bidirectional encoder representation from audio transformers (BEATs) embedding-based sound event detection (SED) model that employs a mean-teacher and pseudo-label approach to address the challenge of limited labeled data for DCASE 2023 Tas… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: DCASE 2023 Challenge Task 4A, 5 pages

  26. Ethics in conversation: Building an ethics assurance case for autonomous AI-enabled voice agents in healthcare

    Authors: Marten H. L. Kaas, Zoe Porter, Ernest Lim, Aisling Higham, Sarah Khavandi, Ibrahim Habli

    Abstract: The deployment and use of AI systems should be both safe and broadly ethically acceptable. The principles-based ethics assurance argument pattern is one proposal in the AI ethics landscape that seeks to support and achieve that aim. The purpose of this argument pattern or framework is to structure reasoning about, and to communicate and foster confidence in, the ethical acceptability of uses of sp… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 19 pages, 3 figures, 1 table, pre-print of paper for Trustworthy Autonomous Systems conference

    Journal ref: TAS 2023: Proceedings of the First International Symposium on Trustworthy Autonomous Systems

  27. arXiv:2305.04091  [pdf, other

    cs.CL

    Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models

    Authors: Lei Wang, Wanyu Xu, Yihuai Lan, Zhiqiang Hu, Yunshi Lan, Roy Ka-Wei Lee, Ee-Peng Lim

    Abstract: Large language models (LLMs) have recently been shown to deliver impressive performance in various NLP tasks. To tackle multi-step reasoning tasks, few-shot chain-of-thought (CoT) prompting includes a few manually crafted step-by-step reasoning demonstrations which enable LLMs to explicitly generate reasoning steps and improve their reasoning task accuracy. To eliminate the manual effort, Zero-sho… ▽ More

    Submitted 26 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  28. arXiv:2304.10893  [pdf, other

    cs.CV cs.MM

    FindVehicle and VehicleFinder: A NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval system

    Authors: Runwei Guan, Ka Lok Man, Feifan Chen, Shanliang Yao, Rongsheng Hu, Xiaohui Zhu, Jeremy Smith, Eng Gee Lim, Yutao Yue

    Abstract: Natural language (NL) based vehicle retrieval is a task aiming to retrieve a vehicle that is most consistent with a given NL query from among all candidate vehicles. Because NL query can be easily obtained, such a task has a promising prospect in building an interactive intelligent traffic system (ITS). Current solutions mainly focus on extracting both text and image features and mapping them to t… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  29. arXiv:2304.10410  [pdf, other

    cs.CV cs.AI cs.RO

    Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review

    Authors: Shanliang Yao, Runwei Guan, Xiaoyu Huang, Zhuoxiao Li, Xiangyu Sha, Yong Yue, Eng Gee Lim, Hyungjoon Seo, Ka Lok Man, Xiaohui Zhu, Yutao Yue

    Abstract: Driven by deep learning techniques, perception technology in autonomous driving has developed rapidly in recent years, enabling vehicles to accurately detect and interpret surrounding environment for safe and efficient navigation. To achieve accurate and robust perception capabilities, autonomous vehicles are often equipped with multiple sensors, making sensor fusion a crucial part of the percepti… ▽ More

    Submitted 23 August, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted by IEEE Transactions on Intelligent Vehicles (T-IV)

    Journal ref: IEEE Transactions on Intelligent Vehicles 2023

  30. arXiv:2304.03153  [pdf, other

    cs.IR cs.CL

    Zero-Shot Next-Item Recommendation using Large Pretrained Language Models

    Authors: Lei Wang, Ee-Peng Lim

    Abstract: Large language models (LLMs) have achieved impressive zero-shot performance in various natural language processing (NLP) tasks, demonstrating their capabilities for inference without training examples. Despite their success, no research has yet explored the potential of LLMs to perform next-item recommendations in the zero-shot setting. We have identified two major challenges that must be addresse… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Technical Report

  31. arXiv:2304.01933  [pdf, other

    cs.CL

    LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models

    Authors: Zhiqiang Hu, Lei Wang, Yihuai Lan, Wanyu Xu, Ee-Peng Lim, Lidong Bing, Xing Xu, Soujanya Poria, Roy Ka-Wei Lee

    Abstract: The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by finetuning open-access LLMs with task-specific data (e.g., ChatDoctor) or instruction data (e.g., Alpaca). Among the various fine-tuning methods, adapter-based parameter-efficient fine-tuning (PEFT) is undoubtedly one of the most… ▽ More

    Submitted 9 October, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: EMNLP 2023. The code of our framework can be found at https://github.com/AGI-Edgerunners/LLM-Adapters. We will keep all of the code open-source and continue to update the framework with new adapters, LLMs, and tasks

  32. arXiv:2303.11899  [pdf, other

    cs.AI

    Large-Scale Traffic Signal Control Using Constrained Network Partition and Adaptive Deep Reinforcement Learning

    Authors: Hankang Gu, Shangbo Wang, Xiaoguang Ma, Dongyao Jia, Guoqiang Mao, Eng Gee Lim, Cheuk Pong Ryan Wong

    Abstract: Multi-agent Deep Reinforcement Learning (MADRL) based traffic signal control becomes a popular research topic in recent years. To alleviate the scalability issue of completely centralized RL techniques and the non-stationarity issue of completely decentralized RL techniques on large-scale traffic networks, some literature utilizes a regional control approach where the whole network is firstly part… ▽ More

    Submitted 7 September, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

  33. arXiv:2212.10278  [pdf, other

    cs.CV

    Fully and Weakly Supervised Referring Expression Segmentation with End-to-End Learning

    Authors: Hui Li, Mingjie Sun, Jimin Xiao, Eng Gee Lim, Yao Zhao

    Abstract: Referring Expression Segmentation (RES), which is aimed at localizing and segmenting the target according to the given language expression, has drawn increasing attention. Existing methods jointly consider the localization and segmentation steps, which rely on the fused visual and linguistic features for both steps. We argue that the conflict between the purpose of identifying an object and genera… ▽ More

    Submitted 17 December, 2022; originally announced December 2022.

  34. arXiv:2210.08452  [pdf, other

    cs.CV

    Efficient Cross-Modal Video Retrieval with Meta-Optimized Frames

    Authors: Ning Han, Xun Yang, Ee-Peng Lim, Hao Chen, Qianru Sun

    Abstract: Cross-modal video retrieval aims to retrieve the semantically relevant videos given a text as a query, and is one of the fundamental tasks in Multimedia. Most of top-performing methods primarily leverage Visual Transformer (ViT) to extract video features [1, 2, 3], suffering from high computational complexity of ViT especially for encoding long videos. A common and simple solution is to uniformly… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

  35. arXiv:2210.07592  [pdf, other

    cs.RO cs.GR

    TSP-Bot: Robotic TSP Pen Art using High-DoF Manipulators

    Authors: Daeun Song, Eunjung Lim, Jiyoon Park, Minjung Jung, Young J. Kim

    Abstract: TSP art is an art form for drawing an image using piecewise-continuous line segments. We present TSP-Bot, a robotic pen drawing system capable of creating complicated TSP pen art on a planar surface using multiple colors. The system begins by converting a colored raster image into a set of points that represent the image's tone, which can be controlled by adjusting the point density. Next, the sys… ▽ More

    Submitted 10 April, 2024; v1 submitted 14 October, 2022; originally announced October 2022.

  36. arXiv:2210.06787  [pdf, other

    cs.LG cs.HC

    Observed Adversaries in Deep Reinforcement Learning

    Authors: Eugene Lim, Harold Soh

    Abstract: In this work, we point out the problem of observed adversaries for deep policies. Specifically, recent work has shown that deep reinforcement learning is susceptible to adversarial attacks where an observed adversary acts under environmental constraints to invoke natural but adversarial observations. This setting is particularly relevant for HRI since HRI-related robots are expected to perform the… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Report number: AIHRI/2022/7817

  37. arXiv:2209.01352  [pdf, other

    cs.CL

    Improving Compositional Generalization in Math Word Problem Solving

    Authors: Yunshi Lan, Lei Wang, Jing Jiang, Ee-Peng Lim

    Abstract: Compositional generalization refers to a model's capability to generalize to newly composed input data based on the data components observed during training. It has triggered a series of compositional generalization analysis on different tasks as generalization is an important aspect of language and problem solving skills. However, the similar discussion on math word problems (MWPs) is limited. In… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

  38. arXiv:2209.01347  [pdf, other

    cs.IR

    Explanation Guided Contrastive Learning for Sequential Recommendation

    Authors: Lei Wang, Ee-Peng Lim, Zhiwei Liu, Tianxiang Zhao

    Abstract: Recently, contrastive learning has been applied to the sequential recommendation task to address data sparsity caused by users with few item interactions and items with few user adoptions. Nevertheless, the existing contrastive learning-based methods fail to ensure that the positive (or negative) sequence obtained by some random augmentation (or sequence sampling) on a given anchor user sequence r… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: CIKM 2022

  39. arXiv:2206.07515  [pdf

    eess.SP cs.AI cs.LG

    A Deep Learning Network for the Classification of Intracardiac Electrograms in Atrial Tachycardia

    Authors: Zerui Chen, Sonia Xhyn Teo, Andrie Ochtman, Shier Nee Saw, Nicholas Cheng, Eric Tien Siang Lim, Murphy Lyu, Hwee Kuan Lee

    Abstract: A key technology enabling the success of catheter ablation treatment for atrial tachycardia is activation mapping, which relies on manual local activation time (LAT) annotation of all acquired intracardiac electrogram (EGM) signals. This is a time-consuming and error-prone procedure, due to the difficulty in identifying the signal activation peaks for fractionated signals. This work presents a Dee… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 34 pages, 10 figures

    ACM Class: J.3

  40. arXiv:2203.12428  [pdf

    cs.CV

    An Attention-based Method for Action Unit Detection at the 3rd ABAW Competition

    Authors: Duy Le Hoai, Eunchae Lim, Eunbin Choi, Sieun Kim, Sudarshan Pant, Guee-Sang Lee, Soo-Huyng Kim, Hyung-Jeong Yang

    Abstract: Facial Action Coding System is an approach for modeling the complexity of human emotional expression. Automatic action unit (AUえーゆー) detection is a crucial research area in human-computer interaction. This paper describes our submission to the third Affective Behavior Analysis in-the-wild (ABAW) competition 2022. We proposed a method for detecting facial action units in the video. At the first stage,… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

  41. arXiv:2203.05787  [pdf, other

    cs.CV

    Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection

    Authors: Siyue Yu, Jimin Xiao, Bingfeng Zhang, Eng Gee Lim

    Abstract: Co-salient object detection, with the target of detecting co-existed salient objects among a group of images, is gaining popularity. Recent works use the attention mechanism or extra information to aggregate common co-salient features, leading to incomplete even incorrect responses for target objects. In this paper, we aim to mine comprehensive co-salient features with democracy and reduce backgro… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: accepted by cvpr2022

  42. arXiv:2112.11207  [pdf

    cs.CY cs.CL cs.LG stat.AP

    How are cities pledging net zero? A computational approach to analyzing subnational climate strategies

    Authors: Siddharth Sachdeva, Angel Hsu, Ian French, Elwin Lim

    Abstract: Cities have become primary actors on climate change and are increasingly setting goals aimed at net-zero emissions. The rapid proliferation of subnational governments "racing to zero" emissions and articulating their own climate mitigation plans warrants closer examination to understand how these actors intend to meet these goals. The scattered, incomplete and heterogeneous nature of city climate… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 14 pages, 6 figures, submitted to nature urban sustainability

  43. arXiv:2109.10604  [pdf, other

    cs.CL

    NOAHQA: Numerical Reasoning with Interpretable Graph Question Answering Dataset

    Authors: Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim

    Abstract: While diverse question answering (QA) datasets have been proposed and contributed significantly to the development of deep learning models for QA tasks, the existing datasets fall short in two aspects. First, we lack QA datasets covering complex questions that involve answers as well as the reasoning processes to get the answers. As a result, the state-of-the-art QA research on numerical reasoning… ▽ More

    Submitted 14 October, 2021; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021. Code will be released at: https://github.com/Don-Joey/NoahQA

  44. arXiv:2109.00799  [pdf, other

    cs.CL

    MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

    Authors: Yihuai Lan, Lei Wang, Qiyuan Zhang, Yunshi Lan, Bing Tian Dai, Yan Wang, Dongxiang Zhang, Ee-Peng Lim

    Abstract: Developing automatic Math Word Problem (MWP) solvers has been an interest of NLP researchers since the 1960s. Over the last few years, there are a growing number of datasets and deep learning-based methods proposed for effectively solving MWPs. However, most existing methods are benchmarked soly on one or two datasets, varying in different configurations, which leads to a lack of unified, standard… ▽ More

    Submitted 17 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: 9 pages, 2 figures

  45. arXiv:2107.05194  [pdf, other

    cs.HC

    Monoscopic vs. Stereoscopic Views and Display Types in the Teleoperation of Unmanned Ground Vehicles for Object Avoidance

    Authors: Yiming Luo, Jialin Wang, Hai-Ning Liang, Shan Luo, Eng Gee Lim

    Abstract: Virtual reality (VR) head-mounted displays (HMD) have recently been used to provide an immersive, first-person vision/view in real-time for manipulating remotely-controlled unmanned ground vehicles (UGV). The teleoperation of UGV can be challenging for operators when it is done in real time. One big challenge is for operators to perceive quickly and rapidly the distance of objects that are around… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  46. arXiv:2107.04279  [pdf, other

    cs.CV

    Fast Pixel-Matching for Video Object Segmentation

    Authors: Siyue Yu, Jimin Xiao, BingFeng Zhang, Eng Gee Lim

    Abstract: Video object segmentation, aiming to segment the foreground objects given the annotation of the first frame, has been attracting increasing attentions. Many state-of-the-art approaches have achieved great performance by relying on online model updating or mask-propagation techniques. However, most online models require high computational cost due to model fine-tuning during inference. Most mask-pr… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: Accepted by Signal Processing: Image Communication

  47. arXiv:2106.04053  [pdf, other

    cs.CV cs.MM

    Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding

    Authors: Mingjie Sun, Jimin Xiao, Eng Gee Lim, Si Liu, John Y. Goulermas

    Abstract: In this paper, we are tackling the weakly-supervised referring expression grounding task, for the localization of a referent object in an image according to a query sentence, where the mapping between image regions and queries are not available during the training stage. In traditional methods, an object region that best matches the referring expression is picked out, and then the query sentence i… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: TPAMI

  48. arXiv:2106.00489  [pdf, other

    cs.RO cs.LG

    Extended Tactile Perception: Vibration Sensing through Tools and Grasped Objects

    Authors: Tasbolat Taunyazov, Luar Shui Song, Eugene Lim, Hian Hian See, David Lee, Benjamin C. K. Tee, Harold Soh

    Abstract: Humans display the remarkable ability to sense the world through tools and other held objects. For example, we are able to pinpoint impact locations on a held rod and tell apart different textures using a rigid probe. In this work, we consider how we can enable robots to have a similar capacity, i.e., to embody tools and extend perception using standard grasped objects. We propose that vibro-tacti… ▽ More

    Submitted 29 September, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 9 pages, 7 figures. This version adds additional related work and updated results

    Journal ref: IROS 2021

  49. arXiv:2105.12342  [pdf, ps, other

    math.OC cs.LG econ.EM eess.SY stat.ML

    A data-driven approach to beating SAA out-of-sample

    Authors: Jun-ya Gotoh, Michael Jong Kim, Andrew E. B. Lim

    Abstract: While solutions of Distributionally Robust Optimization (DRO) problems can sometimes have a higher out-of-sample expected reward than the Sample Average Approximation (SAA), there is no guarantee. In this paper, we introduce a class of Distributionally Optimistic Optimization (DOO) models, and show that it is always possible to ``beat" SAA out-of-sample if we consider not just worst-case (DRO) mod… ▽ More

    Submitted 11 June, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: 25 pages, 2 page bibliography, 2 Figures, 12 page Appendix

    MSC Class: 90C17; 90C31; 93B35; 90C47; 90B50; 62G35; 62K25;

  50. arXiv:2105.06960  [pdf, ps, other

    cs.LG stat.ML

    Thompson Sampling for Gaussian Entropic Risk Bandits

    Authors: Ming Liang Ang, Eloise Y. Y. Lim, Joel Q. L. Chang

    Abstract: The multi-armed bandit (MAB) problem is a ubiquitous decision-making problem that exemplifies exploration-exploitation tradeoff. Standard formulations exclude risk in decision making. Risknotably complicates the basic reward-maximising objectives, in part because there is no universally agreed definition of it. In this paper, we consider an entropic risk (ER) measure and explore the performance of… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:2011.08046