(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 80 results for author: Lyu, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.05715  [pdf, other

    cs.AI cs.SE

    Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking

    Authors: Zhi-Cun Lyu, Xin-Ye Li, Zheng Xie, Ming Li

    Abstract: Code generation has been greatly enhanced by the profound advancements in Large Language Models (LLMs) recently. Nevertheless, such LLM-based code generation approaches still struggle to generate error-free code in a few tries when faced with complex problems. To address this, the prevailing strategy is to sample a huge number of candidate programs, with the hope of any one in them could work. How… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: Accepted by Frontier of Computer Science

  2. arXiv:2408.05542  [pdf, other

    cs.SE

    You Augment Me: Exploring ChatGPT-based Data Augmentation for Semantic Code Search

    Authors: Yanlin Wang, Lianghong Guo, Ensheng Shic, Wenqing Chen, Jiachi Chen, Wanjun Zhong, Menghan Wang, Hui Li, Hongyu Zhang, Ziyu Lyu, Zibin Zheng

    Abstract: Code search plays a crucial role in software development, enabling developers to retrieve and reuse code using natural language queries. While the performance of code search models improves with an increase in high-quality data, obtaining such data can be challenging and expensive. Recently, large language models (LLMs) such as ChatGPT have made remarkable progress in both natural and programming… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: Accepted at ICSME 2023

  3. arXiv:2408.03877  [pdf, other

    cs.LG cs.AI

    Knowledge Probing for Graph Representation Learning

    Authors: Mingyu Zhao, Xingyu Huang, Ziyu Lyu, Yanlin Wang, Lixin Cui, Lu Bai

    Abstract: Graph learning methods have been extensively applied in diverse application areas. However, what kind of inherent graph properties e.g. graph proximity, graph structural information has been encoded into graph representation learning for downstream tasks is still under-explored. In this paper, we propose a novel graph probing framework (GraphProbe) to investigate and interpret whether the family o… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  4. arXiv:2407.16347  [pdf, other

    cs.CL

    FACTTRACK: Time-Aware World State Tracking in Story Outlines

    Authors: Zhiheng Lyu, Kevin Yang, Lingpeng Kong, Daniel Klein

    Abstract: While accurately detecting and correcting factual contradictions in language model outputs has become increasingly important as their capabilities improve, doing so is highly challenging. We propose a novel method, FACTTRACK, for tracking atomic facts and addressing factual contradictions. Crucially, FACTTRACK also maintains time-aware validity intervals for each fact, allowing for change over tim… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 22 pages

  5. arXiv:2406.17520  [pdf, other

    cs.CV cs.RO

    Tell Me Where You Are: Multimodal LLMs Meet Place Recognition

    Authors: Zonglin Lyu, Juexiao Zhang, Mingxuan Lu, Yiming Li, Chen Feng

    Abstract: Large language models (LLMs) exhibit a variety of promising capabilities in robotics, including long-horizon planning and commonsense reasoning. However, their performance in place recognition is still underexplored. In this work, we introduce multimodal LLMs (MLLMs) to visual place recognition (VPR), where a robot must localize itself using visual observations. Our key design is to use vision-bas… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  6. arXiv:2406.15252  [pdf, other

    cs.CV cs.AI

    VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

    Authors: Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Yuchen Lin, Wenhu Chen

    Abstract: The recent years have witnessed great advances in video generation. However, the development of automatic video metrics is lagging significantly behind. None of the existing metric is able to provide reliable scores over generated videos. The main barrier is the lack of large-scale human-annotated dataset. In this paper, we release VideoFeedback, the first large-scale dataset containing human-prov… ▽ More

    Submitted 24 June, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  7. arXiv:2406.14283  [pdf, other

    cs.AI

    Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

    Authors: Chaojie Wang, Yanchen Deng, Zhiyi Lyu, Liang Zeng, Jujie He, Shuicheng Yan, Bo An

    Abstract: Large Language Models (LLMs) have demonstrated impressive capability in many natural language tasks. However, the auto-regressive generation process makes LLMs prone to produce errors, hallucinations and inconsistent statements when performing multi-step reasoning. In this paper, by casting multi-step reasoning of LLMs as a heuristic search problem, we aim to alleviate the pathology by introducing… ▽ More

    Submitted 22 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  8. arXiv:2406.09383  [pdf, other

    cs.CV

    Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset

    Authors: Yiming Li, Zhiheng Li, Nuo Chen, Moonjun Gong, Zonglin Lyu, Zehong Wang, Peili Jiang, Chen Feng

    Abstract: Large-scale datasets have fueled recent advancements in AI-based autonomous vehicle research. However, these datasets are usually collected from a single vehicle's one-time pass of a certain location, lacking multiagent interactions or repeated traversals of the same place. Such information could lead to transformative enhancements in autonomous vehicles' perception, prediction, and planning capab… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR 2024

  9. arXiv:2405.12155  [pdf, other

    cs.IT

    Embracing Radiance Field Rendering in 6G: Over-the-Air Training and Inference with 3D Contents

    Authors: Guanlin Wu, Zhonghao Lyu, Juyong Zhang, Jie Xu

    Abstract: The efficient representation, transmission, and reconstruction of three-dimensional (3D) contents are becoming increasingly important for sixth-generation (6G) networks that aim to merge virtual and physical worlds for offering immersive communication experiences. Neural radiance field (NeRF) and 3D Gaussian splatting (3D-GS) have recently emerged as two promising 3D representation techniques base… ▽ More

    Submitted 18 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 16 pages,7 figures

  10. arXiv:2405.05953  [pdf, other

    cs.CV

    Frame Interpolation with Consecutive Brownian Bridge Diffusion

    Authors: Zonglin Lyu, Ming Li, Jianbo Jiao, Chen Chen

    Abstract: Recent work in Video Frame Interpolation (VFI) tries to formulate VFI as a diffusion-based conditional image generation problem, synthesizing the intermediate frame given a random noise and neighboring frames. Due to the relatively high resolution of videos, Latent Diffusion Models (LDMs) are employed as the conditional generation model, where the autoencoder compresses images into latent represen… ▽ More

    Submitted 6 August, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: corrected typo

  11. arXiv:2405.05549  [pdf, other

    cs.IT eess.SP

    Intelligent Reflecting Surface Aided AirComp: Multi-Timescale Design and Performance Analysis

    Authors: Guangji Chen, Jun Li, Qingqing Wu, Meng Hua, Kaitao Meng, Zhonghao Lyu

    Abstract: The integration of intelligent reflecting surface (IRS) into over-the-air computation (AirComp) is an effective solution for reducing the computational mean squared error (MSE) via its high passive beamforming gain. Prior works on IRS aided AirComp generally rely on the full instantaneous channel state information (I-CSI), which is not applicable to large-scale systems due to its heavy signalling… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE Journal for possible publication

  12. arXiv:2404.11055  [pdf, other

    cs.CL

    On the Causal Nature of Sentiment Analysis

    Authors: Zhiheng Lyu, Zhijing Jin, Fernando Gonzalez, Rada Mihalcea, Bernhard Schoelkopf, Mrinmaya Sachan

    Abstract: Sentiment analysis (SA) aims to identify the sentiment expressed in a text, such as a product review. Given a review and the sentiment associated with it, this paper formulates SA as a combination of two tasks: (1) a causal discovery task that distinguishes whether a review "primes" the sentiment (Causal Hypothesis C1), or the sentiment "primes" the review (Causal Hypothesis C2); and (2) the tradi… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: An enhanced version of our previous exploration in arXiv:2305.01764

  13. arXiv:2404.00836  [pdf, ps, other

    cs.IT cs.DC cs.LG

    Rethinking Resource Management in Edge Learning: A Joint Pre-training and Fine-tuning Design Paradigm

    Authors: Zhonghao Lyu, Yuchen Li, Guangxu Zhu, Jie Xu, H. Vincent Poor, Shuguang Cui

    Abstract: In some applications, edge learning is experiencing a shift in focusing from conventional learning from scratch to new two-stage learning unifying pre-training and task-specific fine-tuning. This paper considers the problem of joint communication and computation resource management in a two-stage edge learning system. In this system, model pre-training is first conducted at an edge server via cent… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  14. arXiv:2403.16133  [pdf, other

    cs.AI cs.LG

    SSHPool: The Separated Subgraph-based Hierarchical Pooling

    Authors: Zhuo Xu, Lixin Cui, Ming Li, Yue Wang, Ziyu Lyu, Hangyuan Du, Lu Bai, Philip S. Yu, Edwin R. Hancock

    Abstract: In this paper, we develop a novel local graph pooling method, namely the Separated Subgraph-based Hierarchical Pooling (SSHPool), for graph classification. We commence by assigning the nodes of a sample graph into different clusters, resulting in a family of separated subgraphs. We individually employ the local graph convolution units as the local structure to further compress each subgraph into a… ▽ More

    Submitted 13 August, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  15. arXiv:2403.11990  [pdf, other

    cs.CV

    GetMesh: A Controllable Model for High-quality Mesh Generation and Manipulation

    Authors: Zhaoyang Lyu, Ben Fei, Jinyi Wang, Xudong Xu, Ya Zhang, Weidong Yang, Bo Dai

    Abstract: Mesh is a fundamental representation of 3D assets in various industrial applications, and is widely supported by professional softwares. However, due to its irregular structure, mesh creation and manipulation is often time-consuming and labor-intensive. In this paper, we propose a highly controllable generative model, GetMesh, for mesh generation and manipulation across different categories. By ta… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  16. arXiv:2402.11185  [pdf, other

    cs.LG cs.NE

    Minimally Supervised Topological Projections of Self-Organizing Maps for Phase of Flight Identification

    Authors: Zimeng Lyu, Pujan Thapa, Travis Desell

    Abstract: Identifying phases of flight is important in the field of general aviation, as knowing which phase of flight data is collected from aircraft flight data recorders can aid in the more effective detection of safety or hazardous events. General aviation flight data for phase of flight identification is usually per-second data, comes on a large scale, and is class imbalanced. It is expensive to manual… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  17. arXiv:2401.11795  [pdf, other

    cs.GR cs.CG math.DG math.NA

    Spherical Density-Equalizing Map for Genus-0 Closed Surfaces

    Authors: Zhiyuan Lyu, Lok Ming Lui, Gary P. T. Choi

    Abstract: Density-equalizing maps are a class of mapping methods in which the shape deformation is driven by prescribed density information. In recent years, they have been widely used for data visualization on planar domains and planar parameterization of open surfaces. However, the theory and computation of density-equalizing maps for closed surfaces are much less explored. In this work, we develop a nove… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  18. arXiv:2401.06923  [pdf, other

    cs.LG cs.NE

    Minimally Supervised Learning using Topological Projections in Self-Organizing Maps

    Authors: Zimeng Lyu, Alexander Ororbia, Rui Li, Travis Desell

    Abstract: Parameter prediction is essential for many applications, facilitating insightful interpretation and decision-making. However, in many real life domains, such as power systems, medicine, and engineering, it can be very expensive to acquire ground truth labels for certain datasets as they may require extensive and expensive laboratory testing. In this work, we introduce a semi-supervised learning ap… ▽ More

    Submitted 15 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  19. arXiv:2312.17047  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Inconsistency of cross-validation for structure learning in Gaussian graphical models

    Authors: Zhao Lyu, Wai Ming Tai, Mladen Kolar, Bryon Aragam

    Abstract: Despite numerous years of research into the merits and trade-offs of various model selection criteria, obtaining robust results that elucidate the behavior of cross-validation remains a challenging endeavor. In this paper, we highlight the inherent limitations of cross-validation when employed to discern the structure of a Gaussian graphical model. We provide finite-sample bounds on the probabilit… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Preliminary version; 47 pages, 15 figures

  20. arXiv:2312.16391  [pdf, other

    cs.RO

    Toward Spatial Temporal Consistency of Joint Visual Tactile Perception in VR Applications

    Authors: Fuqiang Zhao, Kehan Zhang, Qian Liu, Zhuoyi Lyu

    Abstract: With the development of VR technology, especially the emergence of the metaverse concept, the integration of visual and tactile perception has become an expected experience in human-machine interaction. Therefore, achieving spatial-temporal consistency of visual and tactile information in VR applications has become a necessary factor for realizing this experience. The state-of-the-art vibrotactile… ▽ More

    Submitted 28 December, 2023; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: This paper is accepted by the IEEE Haptic Symposium 2024

  21. arXiv:2312.16242  [pdf, other

    cs.LG

    Revisiting Knowledge Distillation under Distribution Shift

    Authors: Songming Zhang, Ziyu Lyu, Xiaofeng Chen

    Abstract: Knowledge distillation transfers knowledge from large models into small models, and has recently made remarkable achievements. However, few studies has investigated the mechanism of knowledge distillation against distribution shift. Distribution shift refers to the data distribution drifts between training and testing phases. In this paper, we reconsider the paradigm of knowledge distillation by r… ▽ More

    Submitted 7 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Code: [this http URL.](https://github.com/ZZhangsm/OOKD)

  22. arXiv:2312.04350  [pdf, other

    cs.CL cs.AI cs.LG

    CLadder: Assessing Causal Reasoning in Language Models

    Authors: Zhijing Jin, Yuen Chen, Felix Leeb, Luigi Gresele, Ojasv Kamal, Zhiheng Lyu, Kevin Blin, Fernando Gonzalez Adauto, Max Kleiman-Weiner, Mrinmaya Sachan, Bernhard Schölkopf

    Abstract: The ability to perform causal reasoning is widely considered a core feature of intelligence. In this work, we investigate whether large language models (LLMs) can coherently reason about causality. Much of the existing work in natural language processing (NLP) focuses on evaluating commonsense causal reasoning in LLMs, thus failing to assess whether a model can perform causal inference in accordan… ▽ More

    Submitted 17 January, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023; updated with CLadder dataset v1.5

  23. arXiv:2311.15598  [pdf, other

    math.ST cs.LG cs.SI stat.ME stat.ML

    Optimal Clustering of Discrete Mixtures: Binomial, Poisson, Block Models, and Multi-layer Networks

    Authors: Zhongyuan Lyu, Ting Li, Dong Xia

    Abstract: In this paper, we first study the fundamental limit of clustering networks when a multi-layer network is present. Under the mixture multi-layer stochastic block model (MMSBM), we show that the minimax optimal network clustering error rate, which takes an exponential form and is characterized by the Renyi divergence between the edge probability distributions of the component networks. We propose a… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  24. arXiv:2311.14960  [pdf, other

    cs.CV

    Point Cloud Pre-training with Diffusion Models

    Authors: Xiao Zheng, Xiaoshui Huang, Guofeng Mei, Yuenan Hou, Zhaoyang Lyu, Bo Dai, Wanli Ouyang, Yongshun Gong

    Abstract: Pre-training a model and then fine-tuning it on downstream tasks has demonstrated significant success in the 2D image and NLP domains. However, due to the unordered and non-uniform density characteristics of point clouds, it is non-trivial to explore the prior knowledge of point clouds and pre-train a point cloud backbone. In this paper, we propose a novel pre-training method called Point cloud Di… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  25. arXiv:2310.17661  [pdf, other

    eess.SP cs.NI

    An Overview on IEEE 802.11bf: WLAN Sensing

    Authors: Rui Du, Haocheng Hua, Hailiang Xie, Xianxin Song, Zhonghao Lyu, Mengshi Hu, Narengerile, Yan Xin, Stephen McCann, Michael Montemurro, Tony Xiao Han, Jie Xu

    Abstract: With recent advancements, the wireless local area network (WLAN) or wireless fidelity (Wi-Fi) technology has been successfully utilized to realize sensing functionalities such as detection, localization, and recognition. However, the WLANs standards are developed mainly for the purpose of communication, and thus may not be able to meet the stringent requirements for emerging sensing applications.… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 31 pages, 25 figures, this is a significant updated version of arXiv:2207.04859

  26. arXiv:2310.05541  [pdf, other

    cs.RO

    Collaborative Visual Place Recognition

    Authors: Yiming Li, Zonglin Lyu, Mingxuan Lu, Chao Chen, Michael Milford, Chen Feng

    Abstract: Visual place recognition (VPR) capabilities enable autonomous robots to navigate complex environments by discovering the environment's topology based on visual input. Most research efforts focus on enhancing the accuracy and robustness of single-robot VPR but often encounter issues such as occlusion due to individual viewpoints. Despite a number of research on multi-robot metric-based localization… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: https://ai4ce.github.io/CoVPR/

  27. arXiv:2308.15070  [pdf, other

    cs.CV

    DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

    Authors: Xinqi Lin, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Bo Dai, Fanghua Yu, Wanli Ouyang, Yu Qiao, Chao Dong

    Abstract: We present DiffBIR, a general restoration pipeline that could handle different blind image restoration tasks in a unified framework. DiffBIR decouples blind image restoration problem into two stages: 1) degradation removal: removing image-independent content; 2) information regeneration: generating the lost image content. Each stage is developed independently but they work seamlessly in a cascaded… ▽ More

    Submitted 12 April, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

  28. arXiv:2308.13386  [pdf, other

    cs.LG cs.AI

    TFDNet: Time-Frequency Enhanced Decomposed Network for Long-term Time Series Forecasting

    Authors: Yuxiao Luo, Ziyu Lyu, Xingyu Huang

    Abstract: Long-term time series forecasting is a vital task and has a wide range of real applications. Recent methods focus on capturing the underlying patterns from one single domain (e.g. the time domain or the frequency domain), and have not taken a holistic view to process long-term time series from the time-frequency domains. In this paper, we propose a Time-Frequency Enhanced Decomposed Network (TFDNe… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  29. arXiv:2308.09278  [pdf, other

    cs.CV

    MATLABER: Material-Aware Text-to-3D via LAtent BRDF auto-EncodeR

    Authors: Xudong Xu, Zhaoyang Lyu, Xingang Pan, Bo Dai

    Abstract: Based on powerful text-to-image diffusion models, text-to-3D generation has made significant progress in generating compelling geometry and appearance. However, existing methods still struggle to recover high-fidelity object materials, either only considering Lambertian reflectance, or failing to disentangle BRDF materials from the environment lights. In this work, we propose Material-Aware Text-t… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  30. arXiv:2308.05579  [pdf, other

    cs.CG cs.GR math.CV math.DG

    Bijective Density-Equalizing Quasiconformal Map for Multiply-Connected Open Surfaces

    Authors: Zhiyuan Lyu, Gary P. T. Choi, Lok Ming Lui

    Abstract: This paper proposes a novel method for computing bijective density-equalizing quasiconformal (DEQ) flattening maps for multiply-connected open surfaces. In conventional density-equalizing maps, shape deformations are solely driven by prescribed constraints on the density distribution, defined as the population per unit area, while the bijectivity and local geometric distortions of the mappings are… ▽ More

    Submitted 15 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Journal ref: SIAM Journal on Imaging Sciences, 17(1), 706-755 (2024)

  31. arXiv:2307.09751  [pdf, other

    cs.IR cs.AI

    Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community

    Authors: Qingyao Ai, Ting Bai, Zhao Cao, Yi Chang, Jiawei Chen, Zhumin Chen, Zhiyong Cheng, Shoubin Dong, Zhicheng Dou, Fuli Feng, Shen Gao, Jiafeng Guo, Xiangnan He, Yanyan Lan, Chenliang Li, Yiqun Liu, Ziyu Lyu, Weizhi Ma, Jun Ma, Zhaochun Ren, Pengjie Ren, Zhiqiang Wang, Mingwen Wang, Ji-Rong Wen, Le Wu , et al. (8 additional authors not shown)

    Abstract: The research field of Information Retrieval (IR) has evolved significantly, expanding beyond traditional search to meet diverse user information needs. Recently, Large Language Models (LLMs) have demonstrated exceptional capabilities in text understanding, generation, and knowledge inference, opening up exciting avenues for IR research. LLMs not only facilitate generative retrieval but also offer… ▽ More

    Submitted 26 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 17 pages

  32. arXiv:2306.16806  [pdf, ps, other

    cs.LO math.CT

    Free dcpo-algebras via directed spaces

    Authors: Yuxu Chen, Hui Kou, Zhenchao Lyu

    Abstract: Directed spaces are natural topological extensions of dcpos in domain theory and form a cartesian closed category. We will show that the D-completion of free algebras over a Scott space $ΣしぐまL$, on the context of directed spaces, are exactly the free dcpo-algebras over dcpo $L$, which reveals the close connection between directed powerspaces and powerdomains. By this result, we provide a topological… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 18 pages

    MSC Class: 54A10; 54A20; 06B35

  33. arXiv:2306.05836  [pdf, other

    cs.CL cs.AI cs.LG

    Can Large Language Models Infer Causation from Correlation?

    Authors: Zhijing Jin, Jiarui Liu, Zhiheng Lyu, Spencer Poff, Mrinmaya Sachan, Rada Mihalcea, Mona Diab, Bernhard Schölkopf

    Abstract: Causal inference is one of the hallmarks of human intelligence. While the field of CausalNLP has attracted much interest in the recent years, existing causal inference datasets in NLP primarily rely on discovering causality from empirical knowledge (e.g., commonsense knowledge). In this work, we propose the first benchmark dataset to test the pure causal inference skills of large language models (… ▽ More

    Submitted 17 April, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: ICLR 2024

  34. arXiv:2305.14597  [pdf, other

    cs.CL cs.AI cs.LG

    Voices of Her: Analyzing Gender Differences in the AI Publication World

    Authors: Yiwen Ding, Jiarui Liu, Zhiheng Lyu, Kun Zhang, Bernhard Schoelkopf, Zhijing Jin, Rada Mihalcea

    Abstract: While several previous studies have analyzed gender bias in research, we are still missing a comprehensive analysis of gender differences in the AI community, covering diverse topics and different development trends. Using the AI Scholar dataset of 78K researchers in the field of AI, we identify several gender differences: (1) Although female researchers tend to have fewer overall citations than m… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  35. Backpropagation-Free 4D Continuous Ant-Based Neural Topology Search

    Authors: AbdElRahman ElSaid, Karl Ricanek, Zeming Lyu, Alexander Ororbia, Travis Desell

    Abstract: Continuous Ant-based Topology Search (CANTS) is a previously introduced novel nature-inspired neural architecture search (NAS) algorithm that is based on ant colony optimization (ACO). CANTS utilizes a continuous search space to indirectly-encode a neural architecture search space. Synthetic ant agents explore CANTS' continuous search space based on the density and distribution of pheromones, stro… ▽ More

    Submitted 30 January, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2011.10831

    Journal ref: j.asoc.2023.110737

  36. arXiv:2305.01764  [pdf, other

    cs.CL cs.AI cs.LG stat.ME

    Psychologically-Inspired Causal Prompts

    Authors: Zhiheng Lyu, Zhijing Jin, Justus Mattern, Rada Mihalcea, Mrinmaya Sachan, Bernhard Schoelkopf

    Abstract: NLP datasets are richer than just input-output pairs; rather, they carry causal relations between the input and output variables. In this work, we take sentiment classification as an example and look into the causal relations between the review (X) and sentiment (Y). As psychology studies show that language can affect emotion, different psychological processes are evoked when a person first makes… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  37. arXiv:2304.02317  [pdf, ps, other

    cs.NI

    Semantic Communications for Image Recovery and Classification via Deep Joint Source and Channel Coding

    Authors: Zhonghao Lyu, Guangxu Zhu, Jie Xu, Bo Ai, Shuguang Cui

    Abstract: With the recent advancements in edge artificial intelligence (AI), future sixth-generation (6G) networks need to support new AI tasks such as classification and clustering apart from data recovery. Motivated by the success of deep learning, the semantic-aware and task-oriented communications with deep joint source and channel coding (JSCC) have emerged as new paradigm shifts in 6G from the convent… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

  38. arXiv:2304.01247  [pdf, other

    cs.CV

    Generative Diffusion Prior for Unified Image Restoration and Enhancement

    Authors: Ben Fei, Zhaoyang Lyu, Liang Pan, Junzhe Zhang, Weidong Yang, Tianyue Luo, Bo Zhang, Bo Dai

    Abstract: Existing image restoration methods mostly leverage the posterior distribution of natural images. However, they often assume known degradation and also require supervised training, which restricts their adaptation to complex real applications. In this work, we propose the Generative Diffusion Prior (GDP) to effectively model the posterior distributions in an unsupervised sampling manner. GDP utiliz… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 46 pages, 38 figures, accepted by CVPR2023

  39. arXiv:2303.14557  [pdf, other

    cs.HC

    Clo(o)k: A Clock That Looks

    Authors: Zhuoyue Lyu

    Abstract: What if a clock could do more than just tell time - what if it could actually see? This paper delves into the conceptualization, design, and construction of a timepiece with visual perception capabilities, featuring three applications that expand the possibilities of human-time interaction. Insights from an Open House showcase are also shared, highlighting the unique user experiences of this devic… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: CHI '23 Human Computer Interaction Across Borders (HCIxB) Workshop Papers

  40. arXiv:2303.07938  [pdf, other

    cs.CV

    Controllable Mesh Generation Through Sparse Latent Point Diffusion Models

    Authors: Zhaoyang Lyu, Jinyi Wang, Yuwei An, Ya Zhang, Dahua Lin, Bo Dai

    Abstract: Mesh generation is of great value in various applications involving computer graphics and virtual content, yet designing generative models for meshes is challenging due to their irregular data structure and inconsistent topology of meshes in the same category. In this work, we design a novel sparse latent point diffusion model for mesh generation. Our key insight is to regard point clouds as an in… ▽ More

    Submitted 14 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023. Project page is at https://slide-3d.github.io

  41. arXiv:2302.10347  [pdf, other

    cs.LG cs.NE

    Online Evolutionary Neural Architecture Search for Multivariate Non-Stationary Time Series Forecasting

    Authors: Zimeng Lyu, Alexander Ororbia, Travis Desell

    Abstract: Time series forecasting (TSF) is one of the most important tasks in data science given the fact that accurate time series (TS) predictive models play a major role across a wide variety of domains including finance, transportation, health care, and power systems. Real-world utilization of machine learning (ML) typically involves (pre-)training models on collected, historical data and then applying… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2202.13471

  42. arXiv:2302.04437  [pdf, other

    stat.ML cs.LG stat.AP

    rMultiNet: An R Package For Multilayer Networks Analysis

    Authors: Ting Li, Zhongyuan Lyu, Chenyu Ren, Dong Xia

    Abstract: This paper develops an R package rMultiNet to analyze multilayer network data. We provide two general frameworks from recent literature, e.g. mixture multilayer stochastic block model(MMSBM) and mixture multilayer latent space model(MMLSM) to generate the multilayer network. We also provide several methods to reveal the embedding of both nodes and layers followed by further data analysis methods,… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  43. A note on the category of c-spaces

    Authors: Z. Lyu, X. Xie, H. Kou

    Abstract: We prove that the category of c-spaces with continuous maps is not cartesian closed. As a corollary the category of locally finitary compact spaces with continuous maps is also not cartesian closed.

    Submitted 18 March, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 5 pages

    Journal ref: Electronic Notes in Theoretical Informatics and Computer Science, Volume 2 - Proceedings of ISDT 9 (March 21, 2023) entics:10362

  44. arXiv:2211.10872  [pdf, other

    cs.CV

    MetaMax: Improved Open-Set Deep Neural Networks via Weibull Calibration

    Authors: Zongyao Lyu, Nolan B. Gutierrez, William J. Beksi

    Abstract: Open-set recognition refers to the problem in which classes that were not seen during training appear at inference time. This requires the ability to identify instances of novel classes while maintaining discriminative capability for closed-set classification. OpenMax was the first deep neural network-based approach to address open-set recognition by calibrating the predictive scores of a standard… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: To be presented at the 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshop on Dealing with Novelty in Open Worlds (DNOW)

  45. arXiv:2211.02574  [pdf, other

    cs.IT

    Pushing AI to Wireless Network Edge: An Overview on Integrated Sensing, Communication, and Computation towards 6G

    Authors: Guangxu Zhu, Zhonghao Lyu, Xiang Jiao, Peixi Liu, Mingzhe Chen, Jie Xu, Shuguang Cui, Ping Zhang

    Abstract: Pushing artificial intelligence (AI) from central cloud to network edge has reached board consensus in both industry and academia for materializing the vision of artificial intelligence of things (AIoT) in the sixth-generation (6G) era. This gives rise to an emerging research area known as edge intelligence, which concerns the distillation of human-like intelligence from the huge amount of data sc… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  46. arXiv:2210.13785  [pdf, other

    stat.ML cs.LG

    Analysis of Estimating the Bayes Rule for Gaussian Mixture Models with a Specified Missing-Data Mechanism

    Authors: Ziyang Lyu

    Abstract: Semi-supervised learning (SSL) approaches have been successfully applied in a wide range of engineering and scientific fields. This paper investigates the generative model framework with a missingness mechanism for unclassified observations, as introduced by Ahfock and McLachlan(2020). We show that in a partially classified sample, a classifier using Bayes rule of allocation with a missing-data me… ▽ More

    Submitted 29 December, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: 24 pages

  47. arXiv:2210.09475  [pdf, other

    cs.LG

    FIMP: Foundation Model-Informed Message Passing for Graph Neural Networks

    Authors: Syed Asad Rizvi, Nazreen Pallikkavaliyaveetil, David Zhang, Zhuoyang Lyu, Nhi Nguyen, Haoran Lyu, Benjamin Christensen, Josue Ortega Caro, Antonio H. O. Fonseca, Emanuele Zappala, Maryam Bagherian, Christopher Averill, Chadi G. Abdallah, Amin Karbasi, Rex Ying, Maria Brbic, Rahul Madhav Dhodapkar, David van Dijk

    Abstract: Foundation models have achieved remarkable success across many domains, relying on pretraining over vast amounts of data. Graph-structured data often lacks the same scale as unstructured data, making the development of graph foundation models challenging. In this work, we propose Foundation-Informed Message Passing (FIMP), a Graph Neural Network (GNN) message-passing framework that leverages pretr… ▽ More

    Submitted 1 July, 2024; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: 16 pages (12 + 4 pages appendix). 5 figures and 4 tables

  48. arXiv:2208.05643  [pdf, ps, other

    cs.IT

    An Overview on Over-the-Air Federated Edge Learning

    Authors: Xiaowen Cao, Zhonghao Lyu, Guangxu Zhu, Jie Xu, Lexi Xu, Shuguang Cui

    Abstract: Over-the-air federated edge learning (Air-FEEL) has emerged as a promising solution to support edge artificial intelligence (AI) in future beyond 5G (B5G) and 6G networks. In Air-FEEL, distributed edge devices use their local data to collaboratively train AI models while preserving data privacy, in which the over-the-air model/gradient aggregation is exploited for enhancing the learning efficiency… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  49. arXiv:2207.04600  [pdf, other

    math.ST cs.IT cs.LG stat.ME

    Optimal Clustering by Lloyd Algorithm for Low-Rank Mixture Model

    Authors: Zhongyuan Lyu, Dong Xia

    Abstract: This paper investigates the computational and statistical limits in clustering matrix-valued observations. We propose a low-rank mixture model (LrMM), adapted from the classical Gaussian mixture model (GMM) to treat matrix-valued observations, which assumes low-rankness for population center matrices. A computationally efficient clustering method is designed by integrating Lloyd's algorithm and lo… ▽ More

    Submitted 6 June, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

  50. arXiv:2205.14969  [pdf, other

    cs.CV cs.AI

    Guided Diffusion Model for Adversarial Purification

    Authors: Jinyi Wang, Zhaoyang Lyu, Dahua Lin, Bo Dai, Hongfei Fu

    Abstract: With wider application of deep neural networks (DNNs) in various algorithms and frameworks, security threats have become one of the concerns. Adversarial attacks disturb DNN-based image classifiers, in which attackers can intentionally add imperceptible adversarial perturbations on input images to fool the classifiers. In this paper, we propose a novel purification approach, referred to as guided… ▽ More

    Submitted 28 June, 2022; v1 submitted 30 May, 2022; originally announced May 2022.