Search | arXiv e-print repository

Showing 1–50 of 74 results for author: Zhuo, Y

  1. arXiv:2407.15356  [pdf, other]

    cs.CV cs.AI

    X-Recon: Learning-based Patient-specific High-Resolution CT Reconstruction from Orthogonal X-Ray Images

    Authors: Yunpeng Wang, Kang Wang, Yaoyao Zhuo, Weiya Shi, Fei Shan, Lei Liu

    Abstract: Rapid and accurate diagnosis of pneumothorax, utilizing chest X-ray and computed tomography (CT), is crucial for assisted diagnosis. Chest X-ray is commonly used for initial localization of pneumothorax, while CT ensures accurate quantification. However, CT scans involve high radiation doses and can be costly. To achieve precise quantitative diagnosis while minimizing radiation exposure, we propos…

    Submitted 21 July, 2024; originally announced July 2024.

  2. arXiv:2407.08956  [pdf, other]

    cs.CR cs.SE

    DeCE: Deceptive Cross-Entropy Loss Designed for Defending Backdoor Attacks

    Authors: Guang Yang, Yu Zhou, Xiang Chen, Xiangyu Zhang, Terry Yue Zhuo, David Lo, Taolue Chen

    Abstract: Code Language Models (CLMs), particularly those leveraging deep learning, have achieved significant success in code intelligence domain. However, the issue of security, particularly backdoor attacks, is often overlooked in this process. The previous research has focused on designing backdoor attacks for CLMs, but effective defenses have not been adequately addressed. In particular, existing defens…

    Submitted 20 August, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Under Review; Waiting for updates

  3. arXiv:2406.15877  [pdf, other]

    cs.SE cs.AI cs.CL

    BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

    Authors: Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, Thong Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman Jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu, et al. (8 additional authors not shown)

    Abstract: Automated software engineering has been greatly empowered by the recent advances in Large Language Models (LLMs) for programming. While current benchmarks have shown that LLMs can perform various software engineering tasks like human developers, the majority of their evaluations are limited to short and self-contained algorithmic tasks. Solving challenging and practical programming tasks requires…

    Submitted 26 June, 2024; v1 submitted 22 June, 2024; originally announced June 2024.

    Comments: 44 pages, 14 figures, 7 tables, built with love by the BigCode community :)

  4. IG2: Integrated Gradient on Iterative Gradient Path for Feature Attribution

    Authors: Yue Zhuo, Zhiqiang Ge

    Abstract: Feature attribution explains Artificial Intelligence (AI) at the instance level by providing importance scores of input features' contributions to model prediction. Integrated Gradients (IG) is a prominent path attribution method for deep neural networks, involving the integration of gradients along a path from the explained input (explicand) to a counterfactual instance (baseline). Current IG var…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024

  5. arXiv:2405.20595  [pdf, other]

    eess.SP

    Multi-Beam Integrated Sensing and Communication: State-of-the-Art, Challenges and Opportunities

    Authors: Yinxiao Zhuo, Tianqi Mao, Haojin Li, Chen Sun, Zhaocheng Wang, Zhu Han, Sheng Chen

    Abstract: Integrated sensing and communication (ISAC) has been envisioned as a critical enabling technology for the next-generation wireless communication, which can realize location/motion detection of surroundings with communication devices. This additional sensing capability leads to a substantial network quality gain and expansion of the service scenarios. As the system evolves to millimeter wave (mmWav…

    Submitted 30 May, 2024; originally announced May 2024.

  6. arXiv:2404.15247  [pdf, other]

    cs.CL cs.AI cs.LG cs.SE

    XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts

    Authors: Yifeng Ding, Jiawei Liu, Yuxiang Wei, Terry Yue Zhuo, Lingming Zhang

    Abstract: We introduce XFT, a simple yet powerful training scheme, by simply merging upcycled Mixture-of-Experts (MoE) to unleash the performance limit of instruction-tuned code Large Language Models (LLMs). While vanilla sparse upcycling fails to improve instruction tuning, XFT introduces a shared expert mechanism with a novel routing weight normalization strategy into sparse upcycling, which significantly…

    Submitted 6 June, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  7. arXiv:2404.00399  [pdf, other]

    cs.CL cs.AI cs.LG

    Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

    Authors: Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak, et al. (20 additional authors not shown)

    Abstract: Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and StarCoder aim to democratize access to pretrained models for collaborative community development. However, such existing models face challenges: limited multilingual capabilities, continual pretraining causing catastrophic forgetting, where…

    Submitted 23 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Preprint

  8. arXiv:2402.19173  [pdf, other]

    cs.SE cs.AI

    StarCoder 2 and The Stack v2: The Next Generation

    Authors: Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, et al. (41 additional authors not shown)

    Abstract: The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of the digital commons of their source code archive. Alongside the SWH repositories spanning 619 programming languages, we carefully select other high-quality data…

    Submitted 29 February, 2024; originally announced February 2024.

  9. arXiv:2401.00788  [pdf, other]

    cs.CL cs.AI cs.SE

    Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

    Authors: Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von Werra, Harm de Vries, Qian Liu, Niklas Muennighoff

    Abstract: The high cost of full-parameter fine-tuning (FFT) of Large Language Models (LLMs) has led to a series of parameter-efficient fine-tuning (PEFT) methods. However, it remains unclear which methods provide the best cost-performance trade-off at different model scales. We introduce Astraios, a suite of 28 instruction-tuned OctoCoder models using 7 tuning methods and 4 model sizes up to 16 billion para…

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: 25 pages (12 main), 19 figures, 8 tables

  10. arXiv:2312.05803  [pdf, other]

    cs.CV

    Transformer-based Selective Super-Resolution for Efficient Image Refinement

    Authors: Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-sun Seo, Yu Cao

    Abstract: Conventional super-resolution methods suffer from two drawbacks: substantial computational cost in upscaling an entire large image, and the introduction of extraneous or potentially detrimental information for downstream computer vision tasks during the refinement of the background. To solve these issues, we propose a novel transformer-based algorithm, Selective Super-Resolution (SSR), which parti…

    Submitted 10 December, 2023; originally announced December 2023.

  11. arXiv:2312.05562  [pdf, other]

    cs.SE

    Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models

    Authors: Guang Yang, Yu Zhou, Xiang Chen, Xiangyu Zhang, Terry Yue Zhuo, Taolue Chen

    Abstract: Large Language Models (LLMs) have demonstrated remarkable potential in code generation. The integration of Chain of Thought (CoT) reasoning can further boost their performance. However, current CoT methods often require manual writing or LLMs with over 100 billion parameters to generate, impeding their applicability in resource-constrained scenarios. In this study, we investigate lightweight Langu…

    Submitted 4 August, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: ACCEPT IN TSE

  12. arXiv:2311.05316  [pdf, other]

    cs.LG cs.AI

    ABIGX: A Unified Framework for eXplainable Fault Detection and Classification

    Authors: Yue Zhuo, Jinchuan Qian, Zhihuan Song, Zhiqiang Ge

    Abstract: For explainable fault detection and classification (FDC), this paper proposes a unified framework, ABIGX (Adversarial fault reconstruction-Based Integrated Gradient eXplanation). ABIGX is derived from the essentials of previous successful fault diagnosis methods, contribution plots (CP) and reconstruction-based contribution (RBC). It is the first explanation framework that provides variable contri…

    Submitted 9 November, 2023; originally announced November 2023.

  13. arXiv:2311.02274  [pdf, other]

    cs.CV

    Patch-based Selection and Refinement for Early Object Detection

    Authors: Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-Sun Seo, Yu Cao

    Abstract: Early object detection (OD) is a crucial task for the safety of many dynamic systems. Current OD algorithms have limited success for small objects at a long distance. To improve the accuracy and efficiency of such a task, we propose a novel set of algorithms that divide the image into patches, select patches with objects at various scales, elaborate the details of a small object, and detect it as…

    Submitted 3 November, 2023; originally announced November 2023.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 729-738

  14. Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?

    Authors: Xiaoxi Kang, Lizhen Qu, Lay-Ki Soon, Adnan Trakic, Terry Yue Zhuo, Patrick Charles Emerton, Genevieve Grant

    Abstract: Large Language Models (LLMs), such as ChatGPT, have drawn a lot of attentions recently in the legal domain due to its emergent ability to tackle a variety of legal tasks. However, it is still unknown if LLMs are able to analyze a legal case and perform reasoning in the same manner as lawyers. Therefore, we constructed a novel corpus consisting of scenarios pertain to Contract Acts Malaysia and Aus…

    Submitted 2 November, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

    Report number: 2023.findings-emnlp.929

    Journal ref: 2023.findings-emnlp.929

  15. arXiv:2310.08843  [pdf]

    stat.AP

    A Longitudinal Analysis about the Effect of Air Pollution on Astigmatism for Children and Young Adults

    Authors: Lin An, Qiuyue Hu, Jieying Guan, Yingting Zhu, Chenyao Jiang, Xiaoyun Zhong, Shuyue Ma, Dongmei Yu, Canyang Zhang, Yehong Zhuo, Peiwu Qin

    Abstract: Purpose: This study aimed to investigate the correlation between air pollution and astigmatism, considering the detrimental effects of air pollution on respiratory, cardiovascular, and eye health. Methods: A longitudinal study was conducted with 127,709 individuals aged 4-27 years from 9 cities in Guangdong Province, China, spanning from 2019 to 2021. Astigmatism was measured using cylinder values…

    Submitted 13 October, 2023; originally announced October 2023.

  16. arXiv:2310.04688  [pdf, other]

    cs.CV

    PatchProto Networks for Few-shot Visual Anomaly Classification

    Authors: Jian Wang, Yue Zhuo

    Abstract: The visual anomaly diagnosis can automatically analyze the defective products, which has been widely applied in industrial quality inspection. The anomaly classification can classify the defective products into different categories. However, the anomaly samples are hard to access in practice, which impedes the training of canonical machine learning models. This paper studies a practical issue that…

    Submitted 7 October, 2023; originally announced October 2023.

  17. arXiv:2309.08674  [pdf, other]

    cs.CL cs.AI

    Fake News Detectors are Biased against Texts Generated by Large Language Models

    Authors: Jinyan Su, Terry Yue Zhuo, Jonibek Mansurov, Di Wang, Preslav Nakov

    Abstract: The spread of fake news has emerged as a critical challenge, undermining trust and posing threats to society. In the era of Large Language Models (LLMs), the capability to generate believable fake content has intensified these concerns. In this study, we present a novel paradigm to evaluate fake news detectors in scenarios involving both human-written and LLM-generated misinformation. Intriguingly…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: The first two authors contributed equally

  18. arXiv:2309.07804  [pdf, other]

    cs.SE cs.CL

    Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names?

    Authors: Terry Yue Zhuo, Xiaoning Du, Zhenchang Xing, Jiamou Sun, Haowei Quan, Li Li, Liming Zhu

    Abstract: Recent breakthroughs in pre-trained code models, such as CodeBERT and Codex, have shown their superior performance in various downstream tasks. The correctness and unambiguity of API usage among these code models are crucial for achieving desirable program functionalities, requiring them to learn various API fully qualified names structurally and semantically. Recent studies reveal that even state…

    Submitted 14 September, 2023; originally announced September 2023.

  19. arXiv:2308.07124  [pdf, other]

    cs.CL cs.AI

    OctoPack: Instruction Tuning Code Large Language Models

    Authors: Niklas Muennighoff, Qian Liu, Armel Zebaze, Qinkai Zheng, Binyuan Hui, Terry Yue Zhuo, Swayam Singh, Xiangru Tang, Leandro von Werra, Shayne Longpre

    Abstract: Finetuning large language models (LLMs) on instructions leads to vast performance improvements on natural language tasks. We apply instruction tuning using code, leveraging the natural structure of Git commits, which pair code changes with human instructions. We compile CommitPack: 4 terabytes of Git commits across 350 programming languages. We benchmark CommitPack against other natural and synthe…

    Submitted 18 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: 60 pages (9 main), 40 figures, 19 tables

  20. arXiv:2307.12328  [pdf, other]

    cs.SE

    A First Look at On-device Models in iOS Apps

    Authors: Han Hu, Yujin Huang, Qiuyuan Chen, Terry Yue Zhuo, Chunyang Chen

    Abstract: Powered by the rising popularity of deep learning techniques on smartphones, on-device deep learning models are being used in vital fields like finance, social media, and driving assistance. Because of the transparency of the Android platform and the on-device models inside, on-device models on Android smartphones have been proven to be extremely vulnerable. However, due to the challenge in ac…

    Submitted 27 July, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: 30 pages, 7 pages, journal paper

  21. arXiv:2306.05540  [pdf, other]

    cs.CL cs.AI

    DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text

    Authors: Jinyan Su, Terry Yue Zhuo, Di Wang, Preslav Nakov

    Abstract: With the rapid progress of large language models (LLMs) and the huge amount of text they generated, it becomes more and more impractical to manually distinguish whether a text is machine-generated. Given the growing use of LLMs in social media and education, it prompts us to develop methods to detect machine-generated text, preventing malicious usage such as plagiarism, misinformation, and propaga…

    Submitted 23 May, 2023; originally announced June 2023.

    Comments: machine-generated text, large language models, LLMs, zero-shot

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

  22. arXiv:2305.19915  [pdf, other]

    cs.CL cs.AI cs.SE

    Source Code Data Augmentation for Deep Learning: A Survey

    Authors: Terry Yue Zhuo, Zhou Yang, Zhensu Sun, Yufei Wang, Li Li, Xiaoning Du, Zhenchang Xing, David Lo

    Abstract: The increasingly popular adoption of deep learning models in many critical source code tasks motivates the development of data augmentation (DA) techniques to enhance training data and improve various capabilities (e.g., robustness and generalizability) of these models. Although a series of DA methods have been proposed and tailored for source code models, there lacks a comprehensive survey and ex…

    Submitted 13 November, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: ongoing work; 89 publications

  23. arXiv:2305.17497  [pdf, other]

    cs.CL

    FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing

    Authors: Zhuang Li, Yuyang Chai, Terry Yue Zhuo, Lizhen Qu, Gholamreza Haffari, Fei Li, Donghong Ji, Quan Hung Tran

    Abstract: Textual scene graph parsing has become increasingly important in various vision-language applications, including image caption evaluation and image retrieval. However, existing scene graph parsers that convert image captions into scene graphs often suffer from two types of errors. First, the generated scene graphs fail to capture the true semantics of the captions or the corresponding images, resu…

    Submitted 1 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 9 pages, ACL 2023 (findings)

  24. arXiv:2305.06161  [pdf, other]

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle…

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  25. arXiv:2304.14317  [pdf, other]

    cs.AI cs.CL cs.SE

    ICE-Score: Instructing Large Language Models to Evaluate Code

    Authors: Terry Yue Zhuo

    Abstract: Recent advancements in the field of natural language generation have facilitated the use of large language models to assess the quality of generated text. Although these models have shown promising results in tasks such as machine translation and summarization, their applicability in code intelligence tasks remains limited without human involvement. The complexity of programming concepts required…

    Submitted 22 January, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: Accepted to Findings of EACL 2024

  26. arXiv:2303.15868  [pdf]

    eess.IV

    Displacement field calculation of large-scale structures using computer vision with physical constraints

    Authors: Yapeng Guo, Peng Zhong, Yi Zhuo, Fanzeng Meng, Hao Di, Shunlong Li

    Abstract: Because of the advantages of easy deployment, low cost and non-contact, computer vision-based structural displacement acquisition technique has received wide attention and research in recent years. However, the displacement field acquisition of large-scale structures is a challenging topic due to the contradiction of camera field of view and resolution. This paper presents a large-scale structural…

    Submitted 31 March, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  27. arXiv:2303.15865  [pdf]

    eess.SP

    Chloride Ion Erosion of Pre-Stressed Concrete Bridges in Cold Regions

    Authors: Hongtao Cui, Yi Zhuo, Dongyuan Ke, Zhonglong Li, Shunlong Li

    Abstract: The erosion of chloride ions in concrete bridges will accelerate the corrosion of reinforcement, which is an important reason for the decline of bridge durability. The erosion process of chloride ion, especially deicing salt solution in cold regions, is complex and has many influencing factors. It is very important to use accurate and effective methods to analyze the chloride ion erosion process i…

    Submitted 28 March, 2023; originally announced March 2023.

  28. arXiv:2303.15701  [pdf]

    eess.SP

    Local track irregularity identification based on multi-sensor time-frequency features of high-speed railway bridge accelerations

    Authors: Ye Mo, Yi Zhuo, Shunlong Li

    Abstract: Shortwave track diseases are generally reflected in the form of local track irregularity. Such diseases will greatly impact the train-track-bridge interaction (TTBI) dynamic system, seriously affecting train safety. Therefore, a method is proposed to detect and localize local track irregularities based on multi-sensor time-frequency features of high-speed railway bridge accelerations. Continuous…

    Submitted 27 March, 2023; originally announced March 2023.

  29. arXiv:2303.13388  [pdf]

    eess.SP

    Damage detection of high-speed railway box girder using train-induced dynamic responses

    Authors: Xin Wang, Yi Zhuo, Shunlong Li

    Abstract: This paper proposes a damage detection method based on the train-induced responses of high-speed railway box girder. Under the coupling effects of bending and torsion, the traditional damage detection method based on the Euler beam theory cannot be applied. In this research, the box girder section is divided into different components based on the plate element analysis method. The strain responses…

    Submitted 23 March, 2023; originally announced March 2023.

  30. arXiv:2302.04116  [pdf, other]

    cs.CR cs.AI cs.CL

    Training-free Lexical Backdoor Attacks on Language Models

    Authors: Yujin Huang, Terry Yue Zhuo, Qiongkai Xu, Han Hu, Xingliang Yuan, Chunyang Chen

    Abstract: Large-scale language models have achieved tremendous success across various natural language processing (NLP) applications. Nevertheless, language models are vulnerable to backdoor attacks, which inject stealthy triggers into models for steering them to undesirable behaviors. Most existing backdoor attacks, such as data poisoning, require further (re)training or fine-tuning language models to lear…

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: Accepted to International World Wide Web Conference 2023, Security, Privacy & Trust Track

  31. arXiv:2302.02362  [pdf, ps, other]

    cond-mat.supr-con cond-mat.mes-hall

    Electrically controllable thermal transport in Josephson junctions based on buckled two-dimensional materials

    Authors: Yu-Hao Zhuo, Biao Wu, Gang Ouyang, Hai Li

    Abstract: We investigate the thermal transport properties in superconductor-antiferromagnet-superconductor and superconductor-ferromagnet-superconductor junctions based on buckled two-dimensional materials (BTDMs). Owing to the unique buckled sublattice structures of BTDMs, in both junctions the phase dependence of the thermal conductance can be effectively controlled by perpendicular electric fields. The u…

    Submitted 5 February, 2023; originally announced February 2023.

  32. arXiv:2301.12868  [pdf, other]

    cs.CL

    On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex

    Authors: Terry Yue Zhuo, Zhuang Li, Yujin Huang, Fatemeh Shiri, Weiqing Wang, Gholamreza Haffari, Yuan-Fang Li

    Abstract: Semantic parsing is a technique aimed at constructing a structured representation of the meaning of a natural-language question. Recent advancements in few-shot language models trained on code have demonstrated superior performance in generating these representations compared to traditional unimodal language models, which are trained on downstream tasks. Despite these advancements, existing fine-t…

    Submitted 9 March, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted at EACL2023 (main)

  33. arXiv:2301.12867  [pdf, other]

    cs.CL cs.SE

    Red teaming ChatGPT via Jailbreaking: Bias, Robustness, Reliability and Toxicity

    Authors: Terry Yue Zhuo, Yujin Huang, Chunyang Chen, Zhenchang Xing

    Abstract: Recent breakthroughs in natural language processing (NLP) have permitted the synthesis and comprehension of coherent text in an open-ended way, therefore translating the theoretical algorithms into practical applications. The large language models (LLMs) have significantly impacted businesses such as report summarization software and copywriters. Observations indicate, however, that LLMs may exhib…

    Submitted 29 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Technical Report

  34. arXiv:2301.03988  [pdf, other]

    cs.SE cs.AI cs.LG

    SantaCoder: don't reach for the stars!

    Authors: Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Michael Lappert, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Terry Yue Zhuo, et al. (16 additional authors not shown)

    Abstract: The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Personally Identifiable Information (PII) redaction pipeline, the experiments conducted to de-risk the model architecture, and the experiments investigat…

    Submitted 24 February, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  35. arXiv:2210.12401  [pdf, other]

    cs.CL

    PcMSP: A Dataset for Scientific Action Graphs Extraction from Polycrystalline Materials Synthesis Procedure Text

    Authors: Xianjun Yang, Ya Zhuo, Julia Zuo, Xinlu Zhang, Stephen Wilson, Linda Petzold

    Abstract: Scientific action graphs extraction from materials synthesis procedures is important for reproducible research, machine automation, and material prediction. But the lack of annotated data has hindered progress in this field. We demonstrate an effort to annotate Polycrystalline Materials Synthesis Procedures (PcMSP) from 305 open access scientific articles for the construction of synthesis action g…

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022

  36. arXiv:2210.05556  [pdf, other]

    cs.CV cs.CL

    ViLPAct: A Benchmark for Compositional Generalization on Multimodal Human Activities

    Authors: Terry Yue Zhuo, Yaqing Liao, Yuecheng Lei, Lizhen Qu, Gerard de Melo, Xiaojun Chang, Yazhou Ren, Zenglin Xu

    Abstract: We introduce ViLPAct, a novel vision-language benchmark for human activity planning. It is designed for a task where embodied AI agents can reason and forecast future actions of humans based on video clips about their initial activities and intents in text. The dataset consists of 2.9k videos from Charades extended with intents via crowdsourcing, a multi-choice question test set, and four strong…

    Submitted 9 March, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted at EACL2023 (Findings)

  37. arXiv:2209.07351  [pdf, other]

    cs.CL

    Rethinking Round-Trip Translation for Machine Translation Evaluation

    Authors: Terry Yue Zhuo, Qiongkai Xu, Xuanli He, Trevor Cohn

    Abstract: Automatic evaluation on low-resource language translation suffers from a deficiency of parallel corpora. Round-trip translation could be served as a clever and straightforward technique to alleviate the requirement of the parallel evaluation corpus. However, there was an observation of obscure correlations between the evaluation scores by forward and round-trip translations in the era of statistic…

    Submitted 15 May, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted to Findings of ACL 2023

  38. arXiv:2203.10854  [pdf, other]

    cs.CL

    Paraphrasing Techniques for Maritime QA system

    Authors: Fatemeh Shiri, Terry Yue Zhuo, Zhuang Li, Van Nguyen, Shirui Pan, Weiqing Wang, Reza Haffari, Yuan-Fang Li

    Abstract: There has been an increasing interest in incorporating Artificial Intelligence (AI) into Defence and military systems to complement and augment human intelligence and capabilities. However, much work still needs to be done toward achieving an effective human-machine partnership. This work is aimed at enhancing human-machine communications by developing a capability for automatically translating hu…

    Submitted 9 March, 2023; v1 submitted 21 March, 2022; originally announced March 2022.

    Comments: 8 pages. The first three authors contribute equally

  39. arXiv:2111.07901  [pdf]

    physics.med-ph

    Free-breathing 3D Cardiac T1 Mapping with Transmit B1 Correction at 3T

    Authors: Paul Kyu Han, Thibault Marin, Yanis Djebra, Vanessa Landes, Yue Zhuo, Georges El Fakhri, Chao Ma

    Abstract: Purpose: To develop a cardiac T1 mapping method for free-breathing 3D T1 mapping of the whole heart at 3T with transmit B1 (B1+) correction Methods: A free-breathing, ECG-gated inversion recovery sequence with spoiled gradient-echo readout was developed and optimized for cardiac T1 mapping at 3T. High-frame rate dynamic images were reconstructed from sparse (k,t)-space data acquired along a stack-…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 33 pages, 10 figures, 3 supplementary figures, 1 supplementary table

  40. Deep learning-based GTV contouring modeling inter- and intra- observer variability in sarcomas

    Authors: Thibault Marin, Yue Zhuo, Rita Maria Lahoud, Fei Tian, Xiaoyue Ma, Fangxu Xing, Maryam Moteabbed, Xiaofeng Liu, Kira Grogg, Nadya Shusharina, Jonghye Woo, Chao Ma, Yen-Lin E. Chen, Georges El Fakhri

    Abstract: Background and purpose: The delineation of the gross tumor volume (GTV) is a critical step for radiation therapy treatment planning. The delineation procedure is typically performed manually which exposes two major issues: cost and reproducibility. Delineation is a time-consuming process that is subject to inter- and intra-observer variability. While methods have been proposed to predict GTV conto…

    Submitted 10 October, 2021; originally announced October 2021.

  41. arXiv:2110.01502  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Optical spectroscopy and ultrafast pump-probe study of a quasi-one-dimensional charge density wave in CuTe

    Authors: R. S. Li, L. Yue, Q. Wu, S. X. Xu, Q. M. Liu, Z. X. Wang, T. C. Hu, X. Y. Zhuo, L. Y. Shi, S. J. Zhang, D. Wu, T. Dong, N. L. Wang

    Abstract: CuTe is a two-dimensional (2D) layered material, yet forming a quasi-one-dimensional (quasi-1D) charge-density-wave (CDW) along the a-axis in the ab-plane at high transition temperature $T_{CDW}=335$ K. However, the anisotropic properties of CuTe remain to be explored. Here we performed combined transport, polarized infrared reflectivity, and ultrafast pump-probe spectroscopy to investigate the un…

    Submitted 1 March, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: 6 pages, 4 figures

    Journal ref: Phys.Rev.B 105,115102(2022)

  42. arXiv:2109.12491  [pdf, other]

    econ.GN

    Smartphone Data Reveal Neighborhood-Level Racial Disparities in Police Presence

    Authors: M. Keith Chen, Katherine L. Christensen, Elicia John, Emily Owens, Yilin Zhuo

    Abstract: While extensive, research on policing in America has focused on documented actions such as stops and arrests -- less is known about patrolling and presence. We map the movements of over ten thousand police officers across twenty-one of America's largest cities by combining anonymized smartphone data with station and precinct boundaries. Police spend considerably more time in Black neighborhoods, a…

    Submitted 9 March, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

  43. arXiv:2105.06717  [pdf, other]

    cs.AI cs.CL

    Neural-Symbolic Commonsense Reasoner with Relation Predictors

    Authors: Farhad Moghimifar, Lizhen Qu, Yue Zhuo, Gholamreza Haffari, Mahsa Baktashmotlagh

    Abstract: Commonsense reasoning aims to incorporate sets of commonsense facts, retrieved from Commonsense Knowledge Graphs (CKG), to draw conclusions about ordinary situations. The dynamic nature of commonsense knowledge postulates models capable of performing multi-hop reasoning over new situations. This feature also results in having large-scale sparse Knowledge Graphs, where such reasoning process is need…

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: ACL2021

  44. arXiv:2105.05328  [pdf, other]

    cs.LG stat.ML

    Comparing interpretability and explainability for feature selection

    Authors: Jack Dunn, Luca Mingardi, Ying Daisy Zhuo

    Abstract: A common approach for feature selection is to examine the variable importance scores for a machine learning model, as a way to understand which features are the most relevant for making predictions. Given the significance of feature selection, it is crucial for the calculated importance scores to reflect reality. Falsely overestimating the importance of irrelevant features can lead to false discov…

    Submitted 11 May, 2021; originally announced May 2021.

  45. PyArmadillo: a streamlined linear algebra library for Python

    Authors: Jason Rumengan, Terry Yue Zhuo, Conrad Sanderson

    Abstract: PyArmadillo is a linear algebra library for the Python language, with the aim of closely mirroring the programming interface of the widely used Armadillo C++ library, which in turn is deliberately similar to Matlab. PyArmadillo hence facilitates algorithm prototyping with Matlab-like syntax directly in Python, and relatively straightforward conversion of PyArmadillo-based Python code into performa…

    Submitted 20 October, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    MSC Class: 15-04; 62-04; 65-04; 68-04 ACM Class: G.4; D.3; D.2.3

    Journal ref: Journal of Open Source Software, Vol. 6, No. 66, 2021

  46. arXiv:2103.11852  [pdf, other]

    cs.LG cs.CY

    Detecting Racial Bias in Jury Selection

    Authors: Jack Dunn, Ying Daisy Zhuo

    Abstract: To support the 2019 U.S. Supreme Court case "Flowers v. Mississippi", APM Reports collated historical court records to assess whether the State exhibited a racial bias in striking potential jurors. This analysis used backward stepwise logistic regression to conclude that race was a significant factor; however, this method for selecting relevant features is only a heuristic, and additionally cannot…

    Submitted 22 March, 2021; originally announced March 2021.

  47. arXiv:2102.06509  [pdf, other]

    cs.LG

    Interpretable Predictive Maintenance for Hard Drives

    Authors: Maxime Amram, Jack Dunn, Jeremy J. Toledano, Ying Daisy Zhuo

    Abstract: Existing machine learning approaches for data-driven predictive maintenance are usually black boxes that claim high predictive power yet cannot be understood by humans. This limits the ability of humans to use these models to derive insights and understanding of the underlying failure mechanisms, and also limits the degree of confidence that can be placed in such a system to perform well on future…

    Submitted 12 February, 2021; originally announced February 2021.

  48. arXiv:2101.07995  [pdf, ps, other]

    cs.CV

    FedNS: Improving Federated Learning for collaborative image classification on mobile clients

    Authors: Yaoxin Zhuo, Baoxin Li

    Abstract: Federated Learning (FL) is a paradigm that aims to support loosely connected clients in learning a global model collaboratively with the help of a centralized server. The most popular FL algorithm is Federated Averaging (FedAvg), which is based on taking a weighted average of the client models, with the weights determined largely based on dataset sizes at the clients. In this paper, we propose a new…

    Submitted 20 January, 2021; originally announced January 2021.

  49. arXiv:2012.02279  [pdf, other]

    cs.LG

    Optimal Policy Trees

    Authors: Maxime Amram, Jack Dunn, Ying Daisy Zhuo

    Abstract: We propose an approach for learning optimal tree-based prescription policies directly from data, combining methods for counterfactual estimation from the causal inference literature with recent advances in training globally-optimal decision trees. The resulting method, Optimal Policy Trees, yields interpretable prescription policies, is highly scalable, and handles both discrete and continuous tre…

    Submitted 3 December, 2020; originally announced December 2020.

  50. arXiv:2011.00777  [pdf, other]

    cs.CL

    COSMO: Conditional SEQ2SEQ-based Mixture Model for Zero-Shot Commonsense Question Answering

    Authors: Farhad Moghimifar, Lizhen Qu, Yue Zhuo, Mahsa Baktashmotlagh, Gholamreza Haffari

    Abstract: Commonsense reasoning refers to the ability of evaluating a social situation and acting accordingly. Identification of the implicit causes and effects of a social context is the driving capability which can enable machines to perform commonsense reasoning. The dynamic world of social interactions requires context-dependent on-demand systems to infer such underlying information. However, current ap…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: COLING2020