(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 111 results for author: Zhong, F

.
  1. arXiv:2407.06813  [pdf, other

    cs.AI cs.MA cs.SI

    Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy

    Authors: Zhenyu Guan, Xiangyu Kong, Fangwei Zhong, Yizhou Wang

    Abstract: Diplomacy is one of the most sophisticated activities in human society. The complex interactions among multiple parties/ agents involve various abilities like social reasoning, negotiation arts, and long-term strategy planning. Previous AI agents surely have proved their capability of handling multi-step games and larger action spaces on tasks involving multiple agents. However, diplomacy involves… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2406.06432  [pdf, other

    cs.CV

    SYM3D: Learning Symmetric Triplanes for Better 3D-Awareness of GANs

    Authors: Jing Yang, Kyle Fogarty, Fangcheng Zhong, Cengiz Oztireli

    Abstract: Despite the growing success of 3D-aware GANs, which can be trained on 2D images to generate high-quality 3D assets, they still rely on multi-view images with camera annotations to synthesize sufficient details from all viewing directions. However, the scarce availability of calibrated multi-view image datasets, especially in comparison to single-view images, has limited the potential of 3D GANs. M… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 11

  3. arXiv:2405.12069  [pdf, other

    cs.CV

    Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping

    Authors: Tianhao Wu, Jing Yang, Zhilin Guo, Jingyi Wan, Fangcheng Zhong, Cengiz Oztireli

    Abstract: By equipping the most recent 3D Gaussian Splatting representation with head 3D morphable models (3DMM), existing methods manage to create head avatars with high fidelity. However, most existing methods only reconstruct a head without the body, substantially limiting their application scenarios. We found that naively applying Gaussians to model the clothed chest and shoulders tends to result in blu… ▽ More

    Submitted 21 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Project Page: https://gaussian-head-shoulders.netlify.app/

  4. arXiv:2405.11231  [pdf, ps, other

    cond-mat.stat-mech

    Nucleation and growth manifest universal scaling, surely

    Authors: Fan Zhong

    Abstract: When a system is brought to a metastable state, nuclei of the equilibrium phase form and grow. This is the well-known nucleation and growth of first-order phase transitions. Near a critical point of a continuous phase transition, critical phenomena such as critical opalescence characterized by universal scaling emerge. These two sets of behavior are so completely different that it might appear abs… ▽ More

    Submitted 25 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: 5 pages, 2 figures

  5. arXiv:2405.02890  [pdf

    cond-mat.mtrl-sci

    Vacancies making jerky flow in complex alloys

    Authors: Zhida Liang, Fengxian Liu, Li Wang, Zihan You, Fanqi Zhong, Alan Cocks, Florian Pyczak

    Abstract: Longevity of materials, especially alloys, is crucial for enhancing the sustainability and efficiency of various applications, including gas turbines. Jerky flow, also known as dynamic strain aging effect, can indeed have a significant impact on the fatigue life of high-temperature components in gas turbines. In general, three jerky flow types, i.e., A, B and C, existed in superalloys. Type A and… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  6. arXiv:2405.01839  [pdf, other

    cs.AI cs.MA

    SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning

    Authors: Qian Long, Fangwei Zhong, Mingdong Wu, Yizhou Wang, Song-Chun Zhu

    Abstract: Multi-agent systems (MAS) need to adaptively cope with dynamic environments, changing agent populations, and diverse tasks. However, most of the multi-agent systems cannot easily handle them, due to the complexity of the state and task space. The social impact theory regards the complex influencing factors as forces acting on an agent, emanating from the environment, other agents, and the agent's… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: AAAI 2024 Cooperative Multi-Agent Systems Decision-Making and Learning (CMASDL) Workshop

  7. arXiv:2404.09857  [pdf, other

    cs.CV cs.AI cs.RO

    Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL

    Authors: Fangwei Zhong, Kui Wu, Hai Ci, Churan Wang, Hao Chen

    Abstract: Embodied visual tracking is to follow a target object in dynamic 3D environments using an agent's egocentric vision. This is a vital and challenging skill for embodied agents. However, existing methods suffer from inefficient training and poor generalization. In this paper, we propose a novel framework that combines visual foundation models (VFM) and offline reinforcement learning (offline RL) to… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  8. arXiv:2404.00219  [pdf, ps, other

    cond-mat.stat-mech

    Complete universal scaling in first-order phase transitions

    Authors: Fan Zhong

    Abstract: Phase transitions and critical phenomena are among the most intriguing phenomena in nature and society. They are classified as first-order phase transitions (FOPTs) and continuous ones. While the latter show marvelous phenomena of scaling and universality, whether the former behaves similarly is a long-standing controversial issue. Here we definitely demonstrate complete universal scaling in field… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: 6 pages, 1 figure

  9. arXiv:2402.02468  [pdf, other

    cs.AI cs.LG cs.MA

    Fast Peer Adaptation with Context-aware Exploration

    Authors: Long Ma, Yuanfei Wang, Fangwei Zhong, Song-Chun Zhu, Yizhou Wang

    Abstract: Fast adapting to unknown peers (partners or opponents) with different strategies is a key challenge in multi-agent games. To do so, it is crucial for the agent to efficiently probe and identify the peer's strategy, as this is the prerequisite for carrying out the best response in adaptation. However, it is difficult to explore the strategies of unknown peers, especially when the games are partiall… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  10. arXiv:2401.12737  [pdf

    physics.optics cond-mat.mtrl-sci physics.app-ph

    Controlling thermal emission with metasurfaces and its applications

    Authors: Qiongqiong Chu, Fan Zhong, Xiaohe Shang, Ye Zhang, Shining Zhu, Hui Liu

    Abstract: Thermal emission caused by the thermal motion of the charged particles is commonly broadband, un-polarized, and incoherent, like a melting pot of electromagnetic waves, which makes it unsuitable for infrared applications in many cases requiring specific thermal emission properties. Metasurfaces, characterized by two-dimensional subwavelength artificial nanostructures, have been extensively investi… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 28 pages, 10 figures

    Journal ref: Nanophotonics

  11. arXiv:2312.04574  [pdf, other

    cs.LG cs.AI cs.GR cs.NE

    Differentiable Visual Computing for Inverse Problems and Machine Learning

    Authors: Andrew Spielberg, Fangcheng Zhong, Konstantinos Rematas, Krishna Murthy Jatavallabhula, Cengiz Oztireli, Tzu-Mao Li, Derek Nowrouzezahrai

    Abstract: Originally designed for applications in computer graphics, visual computing (VC) methods synthesize information about physical and virtual worlds, using prescribed algorithms optimized for spatial computing. VC is used to analyze geometry, physically simulate solids, fluids, and other media, and render the world via optical techniques. These fine-tuned computations that operate explicitly on a giv… ▽ More

    Submitted 21 November, 2023; originally announced December 2023.

  12. arXiv:2311.15783  [pdf, other

    cs.GR

    Hypernetworks for Generalizable BRDF Representation

    Authors: Fazilet Gokbudak, Alejandro Sztrajman, Chenliang Zhou, Fangcheng Zhong, Rafal Mantiuk, Cengiz Oztireli

    Abstract: In this paper, we introduce a technique to estimate measured BRDFs from a sparse set of samples. Our approach offers accurate BRDF reconstructions that are generalizable to new materials. This opens the door to BDRF reconstructions from a variety of data sources. The success of our approach relies on the ability of hypernetworks to generate a robust representation of BRDFs and a set encoder that a… ▽ More

    Submitted 7 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  13. arXiv:2311.12090  [pdf, other

    cs.CV

    FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation

    Authors: Chenliang Zhou, Fangcheng Zhong, Param Hanji, Zhilin Guo, Kyle Fogarty, Alejandro Sztrajman, Hongyun Gao, Cengiz Oztireli

    Abstract: We propose FrePolad: frequency-rectified point latent diffusion, a point cloud generation pipeline integrating a variational autoencoder (VAE) with a denoising diffusion probabilistic model (DDPM) for the latent distribution. FrePolad simultaneously achieves high quality, diversity, and flexibility in point cloud cardinality for generation tasks while maintaining high computational efficiency. The… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  14. arXiv:2311.12040  [pdf

    q-bio.QM cs.AI cs.LG

    TransCDR: a deep learning model for enhancing the generalizability of cancer drug response prediction through transfer learning and multimodal data fusion for drug representation

    Authors: Xiaoqiong Xia, Chaoyu Zhu, Yuqi Shan, Fan Zhong, Lei Liu

    Abstract: Accurate and robust drug response prediction is of utmost importance in precision medicine. Although many models have been developed to utilize the representations of drugs and cancer cell lines for predicting cancer drug responses (CDR), their performances can be improved by addressing issues such as insufficient data modality, suboptimal fusion algorithms, and poor generalizability for novel dru… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 8 figures

  15. arXiv:2311.04146  [pdf, other

    astro-ph.IM astro-ph.GA

    Galaxy Spectra neural Network (GaSNet). II. Using Deep Learning for Spectral Classification and Redshift Predictions

    Authors: Fucheng Zhong, Nicola R. Napolitano, Caroline Heneka, Rui Li, Franz Erik Bauer, Nicolas Bouche, Johan Comparat, Young-Lo Kim, Jens-Kristian Krogager, Marcella Longhetti, Jonathan Loveday, Boudewijn F. Roukema, Benedict L. Rouse, Mara Salvato, Crescenzo Tortora, Roberto J. Assef, Letizia P. Cassarà, Luca Costantin, Scott Croom, Luke J M Davies, Alexander Fritz, Guillaume Guiglion, Andrew Humphrey, Emanuela Pompei, Claudio Ricci , et al. (3 additional authors not shown)

    Abstract: Large sky spectroscopic surveys have reached the scale of photometric surveys in terms of sample sizes and data complexity. These huge datasets require efficient, accurate, and flexible automated tools for data analysis and science exploitation. We present the Galaxy Spectra Network/GaSNet-II, a supervised multi-network deep learning tool for spectra classification and redshift prediction. GaSNet-… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 23 pages and 31 figures. The draft has been submitted to MNRAS

  16. arXiv:2309.07796  [pdf, other

    cs.CV

    For A More Comprehensive Evaluation of 6DoF Object Pose Tracking

    Authors: Yang Li, Fan Zhong, Xin Wang, Shuangbing Song, Jiachen Li, Xueying Qin, Changhe Tu

    Abstract: Previous evaluations on 6DoF object pose tracking have presented obvious limitations along with the development of this area. In particular, the evaluation protocols are not unified for different methods, the widely-used YCBV dataset contains significant annotation error, and the error metrics also may be biased. As a result, it is hard to fairly compare the methods, which has became a big obstacl… ▽ More

    Submitted 14 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

  17. arXiv:2308.12452  [pdf, other

    cs.CV cs.GR

    ARF-Plus: Controlling Perceptual Factors in Artistic Radiance Fields for 3D Scene Stylization

    Authors: Wenzhao Li, Tianhao Wu, Fangcheng Zhong, Cengiz Oztireli

    Abstract: The radiance fields style transfer is an emerging field that has recently gained popularity as a means of 3D scene stylization, thanks to the outstanding performance of neural radiance fields in 3D reconstruction and view synthesis. We highlight a research gap in radiance fields style transfer, the lack of sufficient perceptual controllability, motivated by the existing concept in the 2D image sty… ▽ More

    Submitted 6 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

  18. arXiv:2307.09582  [pdf, other

    cs.CV cs.GR

    Guided Linear Upsampling

    Authors: Shuangbing Song, Fan Zhong, Tianju Wang, Xueying Qin, Changhe Tu

    Abstract: Guided upsampling is an effective approach for accelerating high-resolution image processing. In this paper, we propose a simple yet effective guided upsampling method. Each pixel in the high-resolution image is represented as a linear interpolation of two low-resolution pixels, whose indices and weights are optimized to minimize the upsampling error. The downsampling can be jointly optimized in o… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: ACM SIGGRAPH

  19. arXiv:2306.08943  [pdf, other

    cs.LG math.NA

    Neural Fields with Hard Constraints of Arbitrary Differential Order

    Authors: Fangcheng Zhong, Kyle Fogarty, Param Hanji, Tianhao Wu, Alejandro Sztrajman, Andrew Spielberg, Andrea Tagliasacchi, Petra Bosilj, Cengiz Oztireli

    Abstract: While deep learning techniques have become extremely popular for solving a broad range of optimization problems, methods to enforce hard constraints during optimization, particularly on deep neural networks, remain underdeveloped. Inspired by the rich literature on meshless interpolation and its extension to spectral collocation methods in scientific computing, we develop a series of approaches fo… ▽ More

    Submitted 29 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  20. arXiv:2305.09470  [pdf, ps, other

    cs.RO eess.SY

    Integrated Planning and Control of Robotic Surgical Instruments for Tasks Autonomy

    Authors: Fangxun Zhong, Yun-Hui Liu

    Abstract: Agile maneuvers are essential for robot-enabled complex tasks such as surgical procedures. Prior explorations on surgery autonomy are limited to feasibility study of completing a single task without systematically addressing generic manipulation safety across different tasks. We present an integrated planning and control framework for 6-DoF robotic instruments for pipeline automation of surgical t… ▽ More

    Submitted 13 May, 2023; originally announced May 2023.

    Comments: 28 pages

  21. Learning Semantic-Agnostic and Spatial-Aware Representation for Generalizable Visual-Audio Navigation

    Authors: Hongcheng Wang, Yuxuan Wang, Fangwei Zhong, Mingdong Wu, Jianwei Zhang, Yizhou Wang, Hao Dong

    Abstract: Visual-audio navigation (VAN) is attracting more and more attention from the robotic community due to its broad applications, \emph{e.g.}, household robots and rescue robots. In this task, an embodied agent must search for and navigate to the sound source with egocentric visual and audio observations. However, the existing methods are limited in two aspects: 1) poor generalization to unheard sound… ▽ More

    Submitted 21 June, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

    Journal ref: The IEEE Robotics and Automation Letters 2023

  22. arXiv:2304.09142  [pdf, other

    astro-ph.CO astro-ph.IM

    Cosmology with Galaxy Cluster Properties using Machine Learning

    Authors: Lanlan Qiu, Nicola R. Napolitano, Stefano Borgani, Fucheng Zhong, Xiaodong Li, Mario Radovich, Weipeng Lin, Klaus Dolag, Crescenzo Tortora, Yang Wang, Rhea-Silvia Remus, Sirui Wu, Giuseppe Longo

    Abstract: [Abridged] Galaxy clusters are the most massive gravitationally-bound systems in the universe and are widely considered to be an effective cosmological probe. We propose the first Machine Learning method using galaxy cluster properties to derive unbiased constraints on a set of cosmological parameters, including Omega_m, sigma_8, Omega_b, and h_0. We train the machine learning model with mock cata… ▽ More

    Submitted 12 November, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: 19 pages, 20 figures. Revised version after the referee report. Resubmitted to A&A

    Journal ref: A&A 687, A1 (2024)

  23. Modal-graph 3D shape servoing of deformable objects with raw point clouds

    Authors: Bohan Yang, Congying Sui, Fangxun Zhong, Yun-Hui Liu

    Abstract: Deformable object manipulation (DOM) with point clouds has great potential as non-rigid 3D shapes can be measured without detecting and tracking image features. However, robotic shape control of deformable objects with point clouds is challenging due to: the unknown point-wise correspondences and the noisy partial observability of raw point clouds; the modeling difficulties of the relationship bet… ▽ More

    Submitted 28 November, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: This paper has been accepted by the International Journal of The International Journal of Robotics Research. SAGE copyright

  24. arXiv:2304.06002  [pdf

    cs.CV

    Fast vehicle detection algorithm based on lightweight YOLO7-tiny

    Authors: Bo Li, YiHua Chen, Hao Xu, Fei Zhong

    Abstract: The swift and precise detection of vehicles plays a significant role in intelligent transportation systems. Current vehicle detection algorithms encounter challenges of high computational complexity, low detection rate, and limited feasibility on mobile devices. To address these issues, this paper proposes a lightweight vehicle detection algorithm based on YOLOv7-tiny (You Only Look Once version s… ▽ More

    Submitted 17 April, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  25. arXiv:2304.03623  [pdf, other

    cs.RO cs.AI cs.CV

    RSPT: Reconstruct Surroundings and Predict Trajectories for Generalizable Active Object Tracking

    Authors: Fangwei Zhong, Xiao Bi, Yudi Zhang, Wei Zhang, Yizhou Wang

    Abstract: Active Object Tracking (AOT) aims to maintain a specific relation between the tracker and object(s) by autonomously controlling the motion system of a tracker given observations. AOT has wide-ranging applications, such as in mobile robots and autonomous driving. However, building a generalizable active tracker that works robustly across different scenarios remains a challenge, especially in unstru… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: AAAI 2023 (Oral)

  26. arXiv:2303.10083  [pdf, other

    cs.CV

    $αあるふぁ$Surf: Implicit Surface Reconstruction for Semi-Transparent and Thin Objects with Decoupled Geometry and Opacity

    Authors: Tianhao Wu, Hanxue Liang, Fangcheng Zhong, Gernot Riegler, Shimon Vainer, Cengiz Oztireli

    Abstract: Implicit surface representations such as the signed distance function (SDF) have emerged as a promising approach for image-based surface reconstruction. However, existing optimization methods assume solid surfaces and are therefore unable to properly reconstruct semi-transparent surfaces and thin structures, which also exhibit low opacity due to the blending effect with the background. While neura… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  27. arXiv:2303.03964  [pdf, other

    cs.GR cs.DS cs.HC cs.SI

    Force-Directed Graph Layouts Revisited: A New Force Based on the T-Distribution

    Authors: Fahai Zhong, Mingliang Xue, Jian Zhang, Fan Zhang, Rui Ban, Oliver Deussen, Yunhai Wang

    Abstract: In this paper, we propose the t-FDP model, a force-directed placement method based on a novel bounded short-range force (t-force) defined by Student's t-distribution. Our formulation is flexible, exerts limited repulsive forces for nearby nodes and can be adapted separately in its short- and long-range effects. Using such forces in force-directed graph layouts yields better neighborhood preservati… ▽ More

    Submitted 4 March, 2023; originally announced March 2023.

    Comments: To appear in IEEE Transactions on Visualization and Computer Graphics

  28. arXiv:2303.03767  [pdf, other

    cs.CV cs.LG cs.MA

    Proactive Multi-Camera Collaboration For 3D Human Pose Estimation

    Authors: Hai Ci, Mickel Liu, Xuehai Pan, Fangwei Zhong, Yizhou Wang

    Abstract: This paper presents a multi-agent reinforcement learning (MARL) scheme for proactive Multi-Camera Collaboration in 3D Human Pose Estimation in dynamic human crowds. Traditional fixed-viewpoint multi-camera solutions for human motion capture (MoCap) are limited in capture space and susceptible to dynamic occlusions. Active camera approaches proactively control camera poses to find optimal viewpoint… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: ICLR 2023 poster

  29. arXiv:2212.11076  [pdf, ps, other

    cond-mat.stat-mech

    Theory of Critical Phenomena with Long-Range Temporal Interaction

    Authors: Shaolong Zeng, Fan Zhong

    Abstract: We develop a systematic theory for the critical phenomena with memory in all spatial dimensions, including $d<d_c$, $d=d_c$, and $d>d_c$, the upper critical dimension. We show that the Hamiltonian plays a unique role in dynamics and the dimensional constant $\mathfrak{d}_t$ that embodies the intimate relationship between space and time is the fundamental ingredient of the theory. However, its valu… ▽ More

    Submitted 8 May, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: 32 pages, 1 figure. Version 2: 41 pages, 1 figure. Clarify the origin of the memory

    Journal ref: Physica Scripta 98, 075017 (2023)

  30. arXiv:2212.08641  [pdf, other

    cs.CV

    GFPose: Learning 3D Human Pose Prior with Gradient Fields

    Authors: Hai Ci, Mingdong Wu, Wentao Zhu, Xiaoxuan Ma, Hao Dong, Fangwei Zhong, Yizhou Wang

    Abstract: Learning 3D human pose prior is essential to human-centered AI. Here, we present GFPose, a versatile framework to model plausible 3D human poses for various applications. At the core of GFPose is a time-dependent score network, which estimates the gradient on each body joint and progressively denoises the perturbed 3D human pose to match a given task specification. During the denoising process, GF… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  31. arXiv:2211.14738  [pdf, other

    cs.RO

    Distilled Visual and Robot Kinematics Embeddings for Metric Depth Estimation in Monocular Scene Reconstruction

    Authors: Ruofeng Wei, Bin Li, Hangjie Mo, Fangxun Zhong, Yonghao Long, Qi Dou, Yun-Hui Liu, Dong Sun

    Abstract: Estimating precise metric depth and scene reconstruction from monocular endoscopy is a fundamental task for surgical navigation in robotic surgery. However, traditional stereo matching adopts binocular images to perceive the depth information, which is difficult to transfer to the soft robotics-based surgical systems due to the use of monocular endoscopy. In this paper, we present a novel framewor… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

  32. arXiv:2210.12505  [pdf, other

    hep-ph

    Sommerfeld effect in freeze-in dark matter

    Authors: Fucheng Zhong, Xinyu Wang

    Abstract: If two annihilation products of dark matter (DM) particles are non-relativistic and coupled to a light force mediator, their plane wave functions are modified due to multiple exchanges of the force mediators. This gives rise to the Sommerfeld effect (SE). We consider the attractive and repulsive force SE on the relic density in different phases of freeze-in DM. We find that in the pure freeze-in r… ▽ More

    Submitted 8 October, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: 28 pages, 11 figures. Comments/suggestions are welcome

  33. arXiv:2210.03919  [pdf, other

    cs.CV cs.AI cs.LG

    CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable, and Controllable Text-Guided Face Manipulation

    Authors: Chenliang Zhou, Fangcheng Zhong, Cengiz Oztireli

    Abstract: Recently introduced Contrastive Language-Image Pre-Training (CLIP) bridges images and text by embedding them into a joint latent space. This opens the door to ample literature that aims to manipulate an input image by providing a textual explanation. However, due to the discrepancy between image and text embeddings in the joint space, using text embeddings as the optimization target often introduc… ▽ More

    Submitted 7 May, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

  34. arXiv:2209.00853  [pdf, other

    cs.LG cs.AI

    TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification

    Authors: Mingdong Wu, Fangwei Zhong, Yulong Xia, Hao Dong

    Abstract: Object Rearrangement is to move objects from an initial state to a goal state. Here, we focus on a more practical setting in object rearrangement, i.e., rearranging objects from shuffled layouts to a normative target distribution without explicit goal specification. However, it remains challenging for AI agents, as it is hard to describe the target distribution (goal specification) for reward engi… ▽ More

    Submitted 16 January, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

  35. arXiv:2208.13422  [pdf

    cs.CV

    Light-YOLOv5: A Lightweight Algorithm for Improved YOLOv5 in Complex Fire Scenarios

    Authors: Hao Xu, Bo Li, Fei Zhong

    Abstract: Fire-detection technology is of great importance for successful fire-prevention measures. Image-based fire detection is one effective method. At present, object-detection algorithms are deficient in performing detection speed and accuracy tasks when they are applied in complex fire scenarios. In this study, a lightweight fire-detection algorithm, Light-YOLOv5 (You Only Look Once version five), is… ▽ More

    Submitted 1 December, 2022; v1 submitted 29 August, 2022; originally announced August 2022.

  36. arXiv:2208.02049  [pdf, other

    cs.CV

    AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy

    Authors: Ziyi Wang, Bo Lu, Yonghao Long, Fangxun Zhong, Tak-Hong Cheung, Qi Dou, Yunhui Liu

    Abstract: Computer-assisted minimally invasive surgery has great potential in benefiting modern operating theatres. The video data streamed from the endoscope provides rich information to support context-awareness for next-generation intelligent surgical systems. To achieve accurate perception and automatic manipulation during the procedure, learning based technique is a promising way, which enables advance… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted at MICCAI 2022

  37. arXiv:2207.12620  [pdf, other

    cs.CV

    Large-displacement 3D Object Tracking with Hybrid Non-local Optimization

    Authors: Xuhui Tian, Xinran Lin, Fan Zhong, Xueying Qin

    Abstract: Optimization-based 3D object tracking is known to be precise and fast, but sensitive to large inter-frame displacements. In this paper we propose a fast and effective non-local 3D tracking method. Based on the observation that erroneous local minimum are mostly due to the out-of-plane rotation, we propose a hybrid approach combining non-local and local optimizations for different parameters, resul… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  38. arXiv:2207.01249  [pdf, other

    cs.RO

    Model-Free 3D Shape Control of Deformable Objects Using Novel Features Based on Modal Analysis

    Authors: Bohan Yang, Bo Lu, Wei Chen, Fangxun Zhong, Yun-Hui Liu

    Abstract: Shape control of deformable objects is a challenging and important robotic problem. This paper proposes a model-free controller using novel 3D global deformation features based on modal analysis. Unlike most existing controllers using geometric features, our controller employs a physically-based deformation feature by decoupling 3D global deformation into low-frequency mode shapes. Although modal… ▽ More

    Submitted 18 April, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted by the IEEE Transactions on Robotics. The paper will appear in the IEEE Transactions on Robotics. IEEE copyright

  39. arXiv:2205.15838  [pdf, other

    cs.CV

    D$^2$NeRF: Self-Supervised Decoupling of Dynamic and Static Objects from a Monocular Video

    Authors: Tianhao Wu, Fangcheng Zhong, Andrea Tagliasacchi, Forrester Cole, Cengiz Oztireli

    Abstract: Given a monocular video, segmenting and decoupling dynamic objects while recovering the static environment is a widely studied problem in machine intelligence. Existing solutions usually approach this problem in the image domain, limiting their performance and understanding of the environment. We introduce Decoupled Dynamic Neural Radiance Field (D$^2$NeRF), a self-supervised approach that takes a… ▽ More

    Submitted 5 November, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

  40. Final bound-state formation effect on dark matter annihilation

    Authors: Xinyu Wang, Fucheng Zhong, Feng Luo

    Abstract: If the annihilation products of dark matter (DM) are non-relativistic and couples directly to a light force mediator, the non-perturbation effect like final state bound state (FBS) formation and final state Sommerfeld (FSS) effect must be considered. Non-relativistic region of final particles will appear when there is small mass split between DM and products, so we study those effects in the degen… ▽ More

    Submitted 21 May, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Version to be published in Chinese Physics C. 23 pages, 13 figures

  41. arXiv:2203.16245  [pdf, ps, other

    cond-mat.stat-mech

    Effective-Dimension Theory of Critical Phenomena above Upper Critical Dimensions

    Authors: Shaolong Zeng, Sue Ping Szeto, Fan Zhong

    Abstract: Phase transitions and critical phenomena are among the most intriguing phenomena in nature and their renormalization-group theory is one of the greatest achievements of theoretical physics. However, the predictions of the theory above an upper critical dimension $d_c$ seriously disagree with reality. In addition to its fundamental significance, the problem is also of practical importance because b… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: 6 pages, no figure

    Journal ref: A revised version was published in Physica Scripta 97, 125002 (2022)

  42. Theory of Critical Phenomena with Memory

    Authors: Shaolong Zeng, Sue ping Szeto, Fan Zhong

    Abstract: Memory is a ubiquitous characteristic of complex systems and critical phenomena are one of the most intriguing phenomena in nature. Here, we propose an Ising model with memory and develop a corresponding theory of critical phenomena with memory for complex systems and discovered a series of surprising novel results. We show that a naive theory of a usual Hamiltonian with a direct inclusion of a po… ▽ More

    Submitted 12 August, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: 6 pages, 2 figures. The new version has 8 pages, more references, more discussions on model, and used renormalization-group technique instead of power counting method

    Journal ref: Chin. Phys. Lett. 39, 120501 (2022)

  43. arXiv:2203.13572  [pdf, other

    cs.CV cs.RO

    A Visual Navigation Perspective for Category-Level Object Pose Estimation

    Authors: Jiaxin Guo, Fangxun Zhong, Rong Xiong, Yunhui Liu, Yue Wang, Yiyi Liao

    Abstract: This paper studies category-level object pose estimation based on a single monocular image. Recent advances in pose-aware generative models have paved the way for addressing this challenging task using analysis-by-synthesis. The idea is to sequentially update a set of latent variables, e.g., pose, shape, and appearance, of the generative model until the generated image best agrees with the observa… ▽ More

    Submitted 23 July, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

  44. arXiv:2203.13437  [pdf

    cs.CV

    BCOT: A Markerless High-Precision 3D Object Tracking Benchmark

    Authors: Jiachen Li, Bin Wang, Shiqiang Zhu, Xin Cao, Fan Zhong, Wenxuan Chen, Te Li, Jason Gu, Xueying Qin

    Abstract: Template-based 3D object tracking still lacks a high-precision benchmark of real scenes due to the difficulty of annotating the accurate 3D poses of real moving video objects without using markers. In this paper, we present a multi-view approach to estimate the accurate 3D poses of real moving objects, and then use binocular data to construct a new benchmark for monocular textureless 3D object tra… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  45. arXiv:2203.04647  [pdf, other

    cs.CV

    Normal and Visibility Estimation of Human Face from a Single Image

    Authors: Fuzhi Zhong, Rui Wang, Yuchi Huo, Hujun Bao

    Abstract: Recent work on the intrinsic image of humans starts to consider the visibility of incident illumination and encodes the light transfer function by spherical harmonics. In this paper, we show that such a light transfer function can be further decomposed into visibility and cosine terms related to surface normal. Such decomposition allows us to recover the surface normal in addition to visibility. W… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

  46. arXiv:2203.03570  [pdf, other

    cs.CV cs.GR cs.LG

    Kubric: A scalable dataset generator

    Authors: Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti, Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi , et al. (10 additional authors not shown)

    Abstract: Data is the driving force of machine learning, with the amount and quality of training data often being more important for the performance of a system than architecture and training details. But collecting, processing and annotating real data at scale is difficult, expensive, and frequently raises additional privacy, fairness and legal concerns. Synthetic data is a powerful tool with the potential… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 21 pages, CVPR2022

  47. arXiv:2203.02119  [pdf, other

    cs.RO cs.AI

    GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning

    Authors: Tianhao Wu, Fangwei Zhong, Yiran Geng, Hongchen Wang, Yongjian Zhu, Yizhou Wang, Hao Dong

    Abstract: Grasping moving objects, such as goods on a belt or living animals, is an important but challenging task in robotics. Conventional approaches rely on a set of manually defined object motion patterns for training, resulting in poor generalization to unseen object trajectories. In this work, we introduce an adversarial reinforcement learning framework for dynamic grasping, namely GraspARL. To be spe… ▽ More

    Submitted 14 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

  48. Galaxy Spectra neural Networks (GaSNets). I. Searching for strong lens candidates in eBOSS spectra using Deep Learning

    Authors: Fucheng Zhong, Rui Li, Nicola R. Napolitano

    Abstract: With the advent of new spectroscopic surveys from ground and space, observing up to hundreds of millions of galaxies, spectra classification will become overwhelming for standard analysis techniques. To prepare for this challenge, we introduce a family of deep learning tools to classify features in one-dimensional spectra. As the first application of these Galaxy Spectra neural Networks (GaSNets),… ▽ More

    Submitted 17 April, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Submitted to RAA, 33 pages,21 figures. Comments/suggestions are welcome. It is accepted with minor revisions by RAA

  49. arXiv:2201.02374  [pdf, other

    cs.GR

    As-Continuous-As-Possible Extrusion Fabrication of Surface Models

    Authors: Fanchao Zhong, Yonglai Xu, Haisen Zhao, Lin Lu

    Abstract: We propose a novel computational framework for optimizing the toolpath continuity in fabricating surface models on an extrusion-based 3D printer. Toolpath continuity has been a critical issue for extrusion-based fabrications that affects both quality and efficiency. Transfer moves cause non-smoothor bumpy surfaces and get worse for materials with large inertia like clay. For surface models, the ef… ▽ More

    Submitted 28 May, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: 16 pages, 23 figures

    ACM Class: I.3.5

  50. arXiv:2112.05098  [pdf, other

    q-bio.PE cond-mat.stat-mech physics.bio-ph

    Intraspecific predator interference promotes biodiversity in ecosystems

    Authors: Ju Kang, Shijie Zhang, Yiyuan Niu, Fan Zhong, Xin Wang

    Abstract: Explaining biodiversity is a fundamental issue in ecology. A long-standing puzzle lies in the paradox of the plankton: many species of plankton feeding on a limited variety of resources coexist, apparently flouting the competitive exclusion principle (CEP), which holds that the number of predator (consumer) species cannot exceed that of the resources at a steady state. Here, we present a mechanist… ▽ More

    Submitted 30 April, 2024; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: Main text 14 pages, 3 figures. Appendices 34 pages, 15 Appendix-figures