(Translated by https://www.hiragana.jp/)
Search | arXiv e-print repository
Skip to main content

Showing 1–50 of 162 results for author: Fua, P

.
  1. arXiv:2407.18381  [pdf, other

    cs.CV

    Neural Surface Detection for Unsigned Distance Fields

    Authors: Federico Stella, Nicolas Talabot, Hieu Le, Pascal Fua

    Abstract: Extracting surfaces from Signed Distance Fields (SDFs) can be accomplished using traditional algorithms, such as Marching Cubes. However, since they rely on sign flips across the surface, these algorithms cannot be used directly on Unsigned Distance Fields (UDFs). In this work, we introduce a deep-learning approach to taking a UDF and turning it locally into an SDF, so that it can be effectively t… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  2. arXiv:2407.14352  [pdf, ps, other

    cs.CV

    Vision-Based Power Line Cables and Pylons Detection for Low Flying Aircraft

    Authors: Jakub Gwizdała, Doruk Oner, Soumava Kumar Roy, Mian Akbar Shah, Ad Eberhard, Ivan Egorov, Philipp Krüsi, Grigory Yakushev, Pascal Fua

    Abstract: Power lines are dangerous for low-flying aircraft, especially in low-visibility conditions. Thus, a vision-based system able to analyze the aircraft's surroundings and to provide the pilots with a "second pair of eyes" can contribute to enhancing their safety. To this end, we have developed a deep learning approach to jointly detect power line cables and pylons from images captured at distances of… ▽ More

    Submitted 30 July, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Added several declarations at the end of the publication

  3. arXiv:2406.09250  [pdf, other

    cs.CV cs.AI cs.LG

    MirrorCheck: Efficient Adversarial Defense for Vision-Language Models

    Authors: Samar Fares, Klea Ziu, Toluwani Aremu, Nikita Durasov, Martin Takáč, Pascal Fua, Karthik Nandakumar, Ivan Laptev

    Abstract: Vision-Language Models (VLMs) are becoming increasingly vulnerable to adversarial attacks as various novel attack strategies are being proposed against these models. While existing defenses excel in unimodal contexts, they currently fall short in safeguarding VLMs against adversarial threats. To mitigate this vulnerability, we propose a novel, yet elegantly simple approach for detecting adversaria… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2405.13781  [pdf, other

    cs.CV

    Addressing the Elephant in the Room: Robust Animal Re-Identification with Unsupervised Part-Based Feature Alignment

    Authors: Yingxue Yu, Vidit Vidit, Andrey Davydov, Martin Engilberge, Pascal Fua

    Abstract: Animal Re-ID is crucial for wildlife conservation, yet it faces unique challenges compared to person Re-ID. First, the scarcity and lack of diversity in datasets lead to background-biased models. Second, animal Re-ID depends on subtle, species-specific cues, further complicated by variations in pose, background, and lighting. This study addresses background biases by proposing a method to systemat… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR workshop CV4Animals 2024

  5. arXiv:2405.10934  [pdf, other

    cs.CV

    Reconstruction of Manipulated Garment with Guided Deformation Prior

    Authors: Ren Li, Corentin Dumery, Zhantao Deng, Pascal Fua

    Abstract: Modeling the shape of garments has received much attention, but most existing approaches assume the garments to be worn by someone, which constrains the range of shapes they can assume. In this work, we address shape recovery when garments are being manipulated instead of worn, which gives rise to an even larger range of possible shapes. To this end, we leverage the implicit sewing patterns (ISP)… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  6. arXiv:2403.18820  [pdf, other

    cs.CV

    MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering

    Authors: Guoxing Sun, Rishabh Dabral, Pascal Fua, Christian Theobalt, Marc Habermann

    Abstract: Faithful human performance capture and free-view rendering from sparse RGB observations is a long-standing problem in Vision and Graphics. The main challenges are the lack of observations and the inherent ambiguities of the setting, e.g. occlusions and depth ambiguity. As a result, radiance fields, which have shown great promise in capturing high-frequency appearance and geometry details in dense… ▽ More

    Submitted 24 July, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Project page: https://vcai.mpi-inf.mpg.de/projects/MetaCap/

  7. arXiv:2403.17755  [pdf, other

    cs.AI cs.CR cs.CV

    DataCook: Crafting Anti-Adversarial Examples for Healthcare Data Copyright Protection

    Authors: Sihan Shang, Jiancheng Yang, Zhenglong Sun, Pascal Fua

    Abstract: In the realm of healthcare, the challenges of copyright protection and unauthorized third-party misuse are increasingly significant. Traditional methods for data copyright protection are applied prior to data distribution, implying that models trained on these data become uncontrollable. This paper introduces a novel approach, named DataCook, designed to safeguard the copyright of healthcare data… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  8. arXiv:2403.16732  [pdf, other

    cs.AI

    Enabling Uncertainty Estimation in Iterative Neural Networks

    Authors: Nikita Durasov, Doruk Oner, Jonathan Donier, Hieu Le, Pascal Fua

    Abstract: Turning pass-through network architectures into iterative ones, which use their own output as input, is a well-known approach for boosting performance. In this paper, we argue that such architectures offer an additional benefit: The convergence rate of their successive outputs is highly correlated with the accuracy of the value to which they converge. Thus, we can use the convergence rate as a use… ▽ More

    Submitted 30 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted at ICML 2024

  9. arXiv:2403.14066  [pdf, other

    eess.IV cs.CV

    LeFusion: Synthesizing Myocardial Pathology on Cardiac MRI via Lesion-Focus Diffusion Models

    Authors: Hantao Zhang, Jiancheng Yang, Shouhong Wan, Pascal Fua

    Abstract: Data generated in clinical practice often exhibits biases, such as long-tail imbalance and algorithmic unfairness. This study aims to mitigate these challenges through data synthesis. Previous efforts in medical imaging synthesis have struggled with separating lesion information from background context, leading to difficulties in generating high-quality backgrounds and limited control over the syn… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 13 pages

  10. arXiv:2403.09050  [pdf, other

    cs.CV

    CLOAF: CoLlisiOn-Aware Human Flow

    Authors: Andrey Davydov, Martin Engilberge, Mathieu Salzmann, Pascal Fua

    Abstract: Even the best current algorithms for estimating body 3D shape and pose yield results that include body self-intersections. In this paper, we present CLOAF, which exploits the diffeomorphic nature of Ordinary Differential Equations to eliminate such self-intersections while still imposing body shape constraints. We show that, unlike earlier approaches to addressing this issue, ours completely elimi… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: CVPR 2024, 13 pages

  11. arXiv:2402.11036  [pdf, other

    cs.CV cs.LG

    Occlusion Resilient 3D Human Pose Estimation

    Authors: Soumava Kumar Roy, Ilia Badanin, Sina Honari, Pascal Fua

    Abstract: Occlusions remain one of the key challenges in 3D body pose estimation from single-camera video sequences. Temporal consistency has been extensively used to mitigate their impact but the existing algorithms in the literature do not explicitly model them. Here, we apply this by representing the deforming body as a spatio-temporal graph. We then introduce a refinement network that performs graph c… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  12. arXiv:2402.02736  [pdf, other

    cs.CV cs.LG

    Using Motion Cues to Supervise Single-Frame Body Pose and Shape Estimation in Low Data Regimes

    Authors: Andrey Davydov, Alexey Sidnev, Artsiom Sanakoyeu, Yuhua Chen, Mathieu Salzmann, Pascal Fua

    Abstract: When enough annotated training data is available, supervised deep-learning algorithms excel at estimating human body pose and shape using a single camera. The effects of too little such data being available can be mitigated by using other information sources, such as databases of body shapes, to learn priors. Unfortunately, such sources are not always available either. We show that, in such cases,… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 21 pages; TMLR

  13. arXiv:2311.10356  [pdf, other

    cs.CV

    Garment Recovery with Shape and Deformation Priors

    Authors: Ren Li, Corentin Dumery, Benoît Guillard, Pascal Fua

    Abstract: While modeling people wearing tight-fitting clothing has made great strides in recent years, loose-fitting clothing remains a challenge. We propose a method that delivers realistic garment models from real-world images, regardless of garment shape or deformation. To this end, we introduce a fitting approach that utilizes shape and deformation priors learned from synthetic data to accurately captur… ▽ More

    Submitted 11 March, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  14. arXiv:2309.17329  [pdf, other

    cs.CV cs.AI cs.GR cs.LG eess.IV

    Efficient Anatomical Labeling of Pulmonary Tree Structures via Implicit Point-Graph Networks

    Authors: Kangxian Xie, Jiancheng Yang, Donglai Wei, Ziqiao Weng, Pascal Fua

    Abstract: Pulmonary diseases rank prominently among the principal causes of death worldwide. Curing them will require, among other things, a better understanding of the many complex 3D tree-shaped structures within the pulmonary system, such as airways, arteries, and veins. In theory, they can be modeled using high-resolution image stacks. Unfortunately, standard CNN approaches operating on dense voxel grid… ▽ More

    Submitted 5 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

  15. arXiv:2309.02777  [pdf, other

    cs.CV

    LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline

    Authors: Víctor M. Batlle, José M. M. Montiel, Pascal Fua, Juan D. Tardós

    Abstract: We propose a new approach to 3D reconstruction from sequences of images acquired by monocular endoscopes. It is based on two key insights. First, endoluminal cavities are watertight, a property naturally enforced by modeling them in terms of a signed distance function. Second, the scene illumination is variable. It comes from the endoscope's light sources and decays with the inverse of the squared… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 12 pages, 7 figures, 1 table, submitted to MICCAI 2023

  16. arXiv:2308.16139  [pdf, other

    cs.CV cs.DB cs.LG

    MedShapeNet -- A Large-Scale Dataset of 3D Medical Shapes for Computer Vision

    Authors: Jianning Li, Zongwei Zhou, Jiancheng Yang, Antonio Pepe, Christina Gsaxner, Gijs Luijten, Chongyu Qu, Tiezheng Zhang, Xiaoxi Chen, Wenxuan Li, Marek Wodzinski, Paul Friedrich, Kangxian Xie, Yuan Jin, Narmada Ambigapathy, Enrico Nasca, Naida Solak, Gian Marco Melito, Viet Duc Vu, Afaque R. Memon, Christopher Schlachta, Sandrine De Ribaupierre, Rajnikant Patel, Roy Eagleson, Xiaojun Chen , et al. (132 additional authors not shown)

    Abstract: Prior to the deep learning era, shape was commonly used to describe the objects. Nowadays, state-of-the-art (SOTA) algorithms in medical imaging are predominantly diverging from computer vision, where voxel grids, meshes, point clouds, and implicit surface models are used. This is seen from numerous shape-related publications in premier vision conferences as well as the growing popularity of Shape… ▽ More

    Submitted 12 December, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 16 pages

    MSC Class: 68T01

  17. arXiv:2308.10525  [pdf, other

    cs.CV

    LightDepth: Single-View Depth Self-Supervision from Illumination Decline

    Authors: Javier Rodríguez-Puigvert, Víctor M. Batlle, J. M. M. Montiel, Ruben Martinez-Cantin, Pascal Fua, Juan D. Tardós, Javier Civera

    Abstract: Single-view depth estimation can be remarkably effective if there is enough ground-truth depth data for supervised training. However, there are scenarios, especially in medicine in the case of endoscopies, where such data cannot be obtained. In such cases, multi-view self-supervision and synthetic-to-real transfer serve as alternative approaches, however, with a considerable performance reduction… ▽ More

    Submitted 19 September, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  18. arXiv:2307.08716  [pdf, other

    cs.CV

    Enforcing Topological Interaction between Implicit Surfaces via Uniform Sampling

    Authors: Hieu Le, Nicolas Talabot, Jiancheng Yang, Pascal Fua

    Abstract: Objects interact with each other in various ways, including containment, contact, or maintaining fixed distances. Ensuring these topological interactions is crucial for accurate modeling in many scenarios. In this paper, we propose a novel method to refine 3D object representations, ensuring that their surfaces adhere to a topological prior. Our key observation is that the object interaction can b… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

  19. arXiv:2305.14100  [pdf, other

    cs.CV

    ISP: Multi-Layered Garment Draping with Implicit Sewing Patterns

    Authors: Ren Li, Benoît Guillard, Pascal Fua

    Abstract: Many approaches to draping individual garments on human body models are realistic, fast, and yield outputs that are differentiable with respect to the body shape on which they are draped. However, they are either unable to handle multi-layered clothing, which is prevalent in everyday dress, or restricted to bodies in T-pose. In this paper, we introduce a parametric garment representation model tha… ▽ More

    Submitted 14 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  20. arXiv:2305.02116  [pdf, other

    cs.CV physics.flu-dyn

    Automatic Parameterization for Aerodynamic Shape Optimization via Deep Geometric Learning

    Authors: Zhen Wei, Pascal Fua, Michaël Bauerheim

    Abstract: We propose two deep learning models that fully automate shape parameterization for aerodynamic shape optimization. Both models are optimized to parameterize via deep geometric learning to embed human prior knowledge into learned geometric patterns, eliminating the need for further handcrafting. The Latent Space Model (LSM) learns a low-dimensional latent representation of an object from a dataset… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: 15 pages, to be appeared at AIAA Aviation Forum 2023

  21. arXiv:2303.05916  [pdf, other

    cs.CV

    GECCO: Geometrically-Conditioned Point Diffusion Models

    Authors: Michał J. Tyszkiewicz, Pascal Fua, Eduard Trulls

    Abstract: Diffusion models generating images conditionally on text, such as Dall-E 2 and Stable Diffusion, have recently made a splash far beyond the computer vision community. Here, we tackle the related problem of generating point clouds, both unconditionally, and conditionally with images. For the latter, we introduce a novel geometrically-motivated conditioning scheme based on projecting sparse image fe… ▽ More

    Submitted 25 September, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  22. arXiv:2212.14397  [pdf, other

    cs.CV

    AttEntropy: Segmenting Unknown Objects in Complex Scenes using the Spatial Attention Entropy of Semantic Segmentation Transformers

    Authors: Krzysztof Lis, Matthias Rottmann, Sina Honari, Pascal Fua, Mathieu Salzmann

    Abstract: Vision transformers have emerged as powerful tools for many computer vision tasks. It has been shown that their features and class tokens can be used for salient object segmentation. However, the properties of segmentation transformers remain largely unstudied. In this work we conduct an in-depth study of the spatial attentions of different backbone layers of semantic segmentation transformers and… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    ACM Class: I.4.6; I.4.8; I.5.4

  23. arXiv:2211.12829  [pdf, other

    cs.CV cs.LG

    Unsupervised 3D Keypoint Discovery with Multi-View Geometry

    Authors: Sina Honari, Chen Zhao, Mathieu Salzmann, Pascal Fua

    Abstract: Analyzing and training 3D body posture models depend heavily on the availability of joint labels that are commonly acquired through laborious manual annotation of body joints or via marker-based joint localization using carefully curated markers and capturing systems. However, such annotations are not always available, especially for people performing unusual activities. In this paper, we propose… ▽ More

    Submitted 7 February, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted in "3DV 2024"

  24. arXiv:2211.11546  [pdf, other

    cs.CV

    PartAL: Efficient Partial Active Learning in Multi-Task Visual Settings

    Authors: Nikita Durasov, Nik Dorndorf, Pascal Fua

    Abstract: Multi-task learning is central to many real-world applications. Unfortunately, obtaining labelled data for all tasks is time-consuming, challenging, and expensive. Active Learning (AL) can be used to reduce this burden. Existing techniques typically involve picking images to be annotated and providing annotations for all tasks. In this paper, we show that it is more effective to select not only… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  25. arXiv:2211.11435  [pdf, other

    cs.LG cs.CV

    ZigZag: Universal Sampling-free Uncertainty Estimation Through Two-Step Inference

    Authors: Nikita Durasov, Nik Dorndorf, Hieu Le, Pascal Fua

    Abstract: Whereas the ability of deep networks to produce useful predictions has been amply demonstrated, estimating the reliability of these predictions remains challenging. Sampling approaches such as MC-Dropout and Deep Ensembles have emerged as the most popular ones for this purpose. Unfortunately, they require many forward passes at inference time, which slows them down. Sampling-free approaches can be… ▽ More

    Submitted 26 May, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted to Transactions on Machine Learning Research (TMLR), ICML ABBI 2024

  26. arXiv:2211.11277  [pdf, other

    cs.CV

    DrapeNet: Garment Generation and Self-Supervised Draping

    Authors: Luca De Luigi, Ren Li, Benoît Guillard, Mathieu Salzmann, Pascal Fua

    Abstract: Recent approaches to drape garments quickly over arbitrary human bodies leverage self-supervision to eliminate the need for large training sets. However, they are designed to train one network per clothing item, which severely limits their generalization abilities. In our work, we rely on self-supervision to train a single network to drape multiple garments. This is achieved by predicting a 3D def… ▽ More

    Submitted 22 March, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

  27. arXiv:2210.15664  [pdf, other

    cs.CV cs.GR

    State of the Art in Dense Monocular Non-Rigid 3D Reconstruction

    Authors: Edith Tretschk, Navami Kairanda, Mallikarjun B R, Rishabh Dabral, Adam Kortylewski, Bernhard Egger, Marc Habermann, Pascal Fua, Christian Theobalt, Vladislav Golyanik

    Abstract: 3D reconstruction of deformable (or non-rigid) scenes from a set of monocular 2D image observations is a long-standing and actively researched area of computer vision and graphics. It is an ill-posed inverse problem, since -- without additional prior assumptions -- it permits infinitely many solutions leading to accurate projection to the input 2D images. Non-rigid reconstruction is a foundational… ▽ More

    Submitted 24 March, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: 36 pages, 18 figures, 3 tables; State-of-the-Art Report at EUROGRAPHICS 2023

    Journal ref: Computer Graphics Forum, 2023

  28. arXiv:2210.10771  [pdf, other

    cs.CV cs.LG

    Multi-view Tracking Using Weakly Supervised Human Motion Prediction

    Authors: Martin Engilberge, Weizhe Liu, Pascal Fua

    Abstract: Multi-view approaches to people-tracking have the potential to better handle occlusions than single-view ones in crowded scenes. They often rely on the tracking-by-detection paradigm, which involves detecting people first and then connecting the detections. In this paper, we argue that an even more effective approach is to predict people motion over time and infer people's presence in individual f… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023

  29. arXiv:2210.10756  [pdf, other

    cs.CV cs.LG

    Two-level Data Augmentation for Calibrated Multi-view Detection

    Authors: Martin Engilberge, Haixin Shi, Zhiye Wang, Pascal Fua

    Abstract: Data augmentation has proven its usefulness to improve model generalization and performance. While it is commonly applied in computer vision application when it comes to multi-view systems, it is rarely used. Indeed geometric data augmentation can break the alignment among views. This is problematic since multi-view data tend to be scarce and it is expensive to annotate. In this work we propose to… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023

  30. Perspective Aware Road Obstacle Detection

    Authors: Krzysztof Lis, Sina Honari, Pascal Fua, Mathieu Salzmann

    Abstract: While road obstacle detection techniques have become increasingly effective, they typically ignore the fact that, in practice, the apparent size of the obstacles decreases as their distance to the vehicle increases. In this paper, we account for this by computing a scale map encoding the apparent size of a hypothetical object at every image location. We then leverage this perspective map to (i) ge… ▽ More

    Submitted 19 June, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    ACM Class: I.4.6; I.4.8; I.5.4

    Journal ref: IEEE Robotics and Automation Letters ( Volume: 8, Issue: 4, April 2023, Pages: 2150-2157)

  31. arXiv:2209.10986  [pdf, other

    cs.RO cs.CV

    Learning to Simulate Realistic LiDARs

    Authors: Benoit Guillard, Sai Vemprala, Jayesh K. Gupta, Ondrej Miksik, Vibhav Vineet, Pascal Fua, Ashish Kapoor

    Abstract: Simulating realistic sensors is a challenging part in data generation for autonomous systems, often involving carefully handcrafted sensor design, scene properties, and physics modeling. To alleviate this, we introduce a pipeline for data-driven simulation of a realistic LiDAR sensor. We propose a model that learns a mapping between RGB images and corresponding LiDAR features such as raydrop or pe… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: IROS2022 paper

  32. arXiv:2209.10845  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    DIG: Draping Implicit Garment over the Human Body

    Authors: Ren Li, Benoît Guillard, Edoardo Remelli, Pascal Fua

    Abstract: Existing data-driven methods for draping garments over human bodies, despite being effective, cannot handle garments of arbitrary topology and are typically not end-to-end differentiable. To address these limitations, we propose an end-to-end differentiable pipeline that represents garments using implicit surfaces and learns a skinning field conditioned on shape and pose parameters of an articulat… ▽ More

    Submitted 24 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 16 pages, 9 figures, 5 tables, ACCV 2022

  33. arXiv:2208.03257  [pdf, other

    cs.CV

    3D Pose Based Feedback for Physical Exercises

    Authors: Ziyi Zhao, Sena Kiciroglu, Hugues Vinzant, Yuan Cheng, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua

    Abstract: Unsupervised self-rehabilitation exercises and physical training can cause serious injuries if performed incorrectly. We introduce a learning-based framework that identifies the mistakes made by a user and proposes corrective measures for easier and safer individual training. Our framework does not rely on hard-coded, heuristic rules. Instead, it learns them from data, which facilitates its adapta… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

    Comments: Video: https://youtu.be/W3kyyeHe0SI

  34. Enforcing connectivity of 3D linear structures using their 2D projections

    Authors: Doruk Oner, Hussein Osman, Mateusz Kozinski, Pascal Fua

    Abstract: Many biological and medical tasks require the delineation of 3D curvilinear structures such as blood vessels and neurites from image volumes. This is typically done using neural networks trained by minimizing voxel-wise loss functions that do not capture the topological properties of these structures. As a result, the connectivity of the recovered structures is often wrong, which lessens their use… ▽ More

    Submitted 24 December, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

  35. arXiv:2206.15328  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Neural Annotation Refinement: Development of a New 3D Dataset for Adrenal Gland Analysis

    Authors: Jiancheng Yang, Rui Shi, Udaranga Wickramasinghe, Qikui Zhu, Bingbing Ni, Pascal Fua

    Abstract: The human annotations are imperfect, especially when produced by junior practitioners. Multi-expert consensus is usually regarded as golden standard, while this annotation protocol is too expensive to implement in many real-world projects. In this study, we propose a method to refine human annotation, named Neural Annotation Refinement (NeAR). It is based on a learnable implicit function, which de… ▽ More

    Submitted 7 July, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022

  36. arXiv:2206.10241  [pdf, other

    cs.CV

    Deep Active Latent Surfaces for Medical Geometries

    Authors: Patrick M. Jensen, Udaranga Wickramasinghe, Anders B. Dahl, Pascal Fua, Vedrana A. Dahl

    Abstract: Shape priors have long been known to be effective when reconstructing 3D shapes from noisy or incomplete data. When using a deep-learning based shape representation, this often involves learning a latent representation, which can be either in the form of a single global vector or of multiple local ones. The latter allows more flexibility but is prone to overfitting. In this paper, we advocate a hy… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 14 pages, 9 figures, submitted for review

  37. arXiv:2203.15865  [pdf, other

    cs.CV

    On Triangulation as a Form of Self-Supervision for 3D Human Pose Estimation

    Authors: Soumava Kumar Roy, Leonardo Citraro, Sina Honari, Pascal Fua

    Abstract: Supervised approaches to 3D pose estimation from single images are remarkably effective when labeled data is abundant. However, as the acquisition of ground-truth 3D labels is labor intensive and time consuming, recent attention has shifted towards semi- and weakly-supervised learning. Generating an effective form of supervision with little annotations still poses major challenge in crowded scenes… ▽ More

    Submitted 28 June, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

  38. arXiv:2203.09836  [pdf, other

    cs.CV cs.RO

    Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation

    Authors: Yinlin Hu, Pascal Fua, Mathieu Salzmann

    Abstract: Most recent 6D object pose estimation methods, including unsupervised ones, require many real training images. Unfortunately, for some applications, such as those in space or deep under water, acquiring real images, even unannotated, is virtually impossible. In this paper, we propose a method that can be trained solely on synthetic images, or optionally using a few additional real ones. Given a ro… ▽ More

    Submitted 18 July, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: ECCV 2022

  39. arXiv:2112.04203  [pdf, other

    cs.CV

    Adversarial Parametric Pose Prior

    Authors: Andrey Davydov, Anastasia Remizova, Victor Constantin, Sina Honari, Mathieu Salzmann, Pascal Fua

    Abstract: The Skinned Multi-Person Linear (SMPL) model can represent a human body by mapping pose and shape parameters to body meshes. This has been shown to facilitate inferring 3D human pose and shape from images via different learning models. However, not all pose and shape parameter values yield physically-plausible or even realistic body meshes. In other words, SMPL is under-constrained and may thus le… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  40. Adjusting the Ground Truth Annotations for Connectivity-Based Learning to Delineate

    Authors: Doruk Oner, Leonardo Citraro, Mateusz Koziński, Pascal Fua

    Abstract: Deep learning-based approaches to delineating 3D structure depend on accurate annotations to train the networks. Yet, in practice, people, no matter how conscientious, have trouble precisely delineating in 3D and on a large scale, in part because the data is often hard to interpret visually and in part because the 3D interfaces are awkward to use. In this paper, we introduce a method that explicit… ▽ More

    Submitted 24 December, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Journal ref: IEEE Transactions on Medical Imaging ( Volume: 41, Issue: 12, December 2022)

  41. arXiv:2112.01176  [pdf, other

    cs.CV

    Overcoming the Domain Gap in Neural Action Representations

    Authors: Semih Günel, Florian Aymanns, Sina Honari, Pavan Ramdya, Pascal Fua

    Abstract: Relating animal behaviors to brain activity is a fundamental goal in neuroscience, with practical applications in building robust brain-machine interfaces. However, the domain gap between individuals is a major issue that prevents the training of general models that work on unlabeled subjects. Since 3D pose data can now be reliably extracted from multi-view video sequences without manual interve… ▽ More

    Submitted 19 January, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  42. arXiv:2112.00396  [pdf, other

    cs.CV

    Dyadic Human Motion Prediction

    Authors: Isinsu Katircioglu, Costa Georgantas, Mathieu Salzmann, Pascal Fua

    Abstract: Prior work on human motion forecasting has mostly focused on predicting the future motion of single subjects in isolation from their past pose sequence. In the presence of closely interacting people, however, this strategy fails to account for the dependencies between the different subject's motions. In this paper, we therefore introduce a motion prediction framework that explicitly reasons about… ▽ More

    Submitted 13 January, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: added reference for section 2

  43. arXiv:2111.14595  [pdf, other

    cs.CV

    Overcoming the Domain Gap in Contrastive Learning of Neural Action Representations

    Authors: Semih Günel, Florian Aymanns, Sina Honari, Pavan Ramdya, Pascal Fua

    Abstract: A fundamental goal in neuroscience is to understand the relationship between neural activity and behavior. For example, the ability to extract behavioral intentions from neural data, or neural decoding, is critical for developing effective brain machine interfaces. Although simple linear models have been applied to this challenge, they cannot identify important non-linear relationships. Thus, a se… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted into NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and Practice

  44. arXiv:2111.14549  [pdf, other

    cs.CV

    MeshUDF: Fast and Differentiable Meshing of Unsigned Distance Field Networks

    Authors: Benoit Guillard, Federico Stella, Pascal Fua

    Abstract: Unsigned Distance Fields (UDFs) can be used to represent non-watertight surfaces. However, current approaches to converting them into explicit meshes tend to either be expensive or to degrade the accuracy. Here, we extend the marching cube algorithm to handle UDFs, both fast and accurately. Moreover, our approach to surface extraction is differentiable, which is key to using pretrained UDF network… ▽ More

    Submitted 7 December, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  45. arXiv:2111.09301  [pdf, other

    cs.CV cs.AI

    Learning to Align Sequential Actions in the Wild

    Authors: Weizhe Liu, Bugra Tekin, Huseyin Coskun, Vibhav Vineet, Pascal Fua, Marc Pollefeys

    Abstract: State-of-the-art methods for self-supervised sequential action alignment rely on deep networks that find correspondences across videos in time. They either learn frame-to-frame mapping across sequences, which does not leverage temporal information, or assume monotonic alignment between each video pair, which ignores variations in the order of actions. As such, these methods are not able to deal wi… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

  46. arXiv:2111.06838  [pdf, other

    cs.CV

    Temporally-Consistent Surface Reconstruction using Metrically-Consistent Atlases

    Authors: Jan Bednarik, Noam Aigerman, Vladimir G. Kim, Siddhartha Chaudhuri, Shaifali Parashar, Mathieu Salzmann, Pascal Fua

    Abstract: We propose a method for unsupervised reconstruction of a temporally-consistent sequence of surfaces from a sequence of time-evolving point clouds. It yields dense and semantically meaningful correspondences between frames. We represent the reconstructed surfaces as atlases computed by a neural network, which enables us to establish correspondences between frames. The key to making these correspond… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: 21 pages. arXiv admin note: substantial text overlap with arXiv:2104.06950

  47. arXiv:2110.07766  [pdf, other

    eess.IV cs.CV

    3D Reconstruction of Curvilinear Structures with Stereo Matching DeepConvolutional Neural Networks

    Authors: Okan Altingövde, Anastasiia Mishchuk, Gulnaz Ganeeva, Emad Oveisi, Cecile Hebert, Pascal Fua

    Abstract: Curvilinear structures frequently appear in microscopy imaging as the object of interest. Crystallographic defects, i.e., dislocations, are one of the curvilinear structures that have been repeatedly investigated under transmission electron microscopy (TEM) and their 3D structural information is of great importance for understanding the properties of materials. 3D information of dislocations is of… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

  48. arXiv:2110.06295  [pdf, other

    cs.CV

    Persistent Homology with Improved Locality Information for more Effective Delineation

    Authors: Doruk Oner, Adélie Garin, Mateusz Koziński, Kathryn Hess, Pascal Fua

    Abstract: Persistent Homology (PH) has been successfully used to train networks to detect curvilinear structures and to improve the topological quality of their results. However, existing methods are very global and ignore the location of topological features. In this paper, we remedy this by introducing a new filtration function that fuses two earlier approaches: thresholding-based filtration, previously u… ▽ More

    Submitted 24 December, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

  49. arXiv:2109.13337  [pdf, other

    cs.LG cs.AI

    DEBOSH: Deep Bayesian Shape Optimization

    Authors: Nikita Durasov, Artem Lukoyanov, Jonathan Donier, Pascal Fua

    Abstract: Graph Neural Networks (GNNs) can predict the performance of an industrial design quickly and accurately and be used to optimize its shape effectively. However, to fully explore the shape space, one must often consider shapes deviating significantly from the training set. For these, GNN predictions become unreliable, something that is often ignored. For optimization techniques relying on Gaussian P… ▽ More

    Submitted 2 October, 2023; v1 submitted 28 September, 2021; originally announced September 2021.

  50. arXiv:2109.10767  [pdf, other

    cs.CV cs.AI cs.CG

    HybridSDF: Combining Deep Implicit Shapes and Geometric Primitives for 3D Shape Representation and Manipulation

    Authors: Subeesh Vasu, Nicolas Talabot, Artem Lukoianov, Pierre Baqué, Jonathan Donier, Pascal Fua

    Abstract: Deep implicit surfaces excel at modeling generic shapes but do not always capture the regularities present in manufactured objects, which is something simple geometric primitives are particularly good at. In this paper, we propose a representation combining latent and explicit parameters that can be decoded into a set of deep implicit and geometric shapes that are consistent with each other. As a… ▽ More

    Submitted 8 September, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: 18 pages, 21 figures, 3DV 2022