Search | arXiv e-print repository

Euclid preparation. Measuring detailed galaxy morphologies for Euclid with Machine Learning

Authors: Euclid Collaboration, B. Aussel, S. Kruk, M. Walmsley, M. Huertas-Company, M. Castellano, C. J. Conselice, M. Delli Veneri, H. Domínguez Sánchez, P. -A. Duc, U. Kuchner, A. La Marca, B. Margalef-Bentabol, F. R. Marleau, G. Stevens, Y. Toba, C. Tortora, L. Wang, N. Aghanim, B. Altieri, A. Amara, S. Andreon, N. Auricchio, M. Baldi, S. Bardelli , et al. (233 additional authors not shown)

Abstract: The Euclid mission is expected to image millions of galaxies with high resolution, providing an extensive dataset to study galaxy evolution. We investigate the application of deep learning to predict the detailed morphologies of galaxies in Euclid using Zoobot a convolutional neural network pretrained with 450000 galaxies from the Galaxy Zoo project. We adapted Zoobot for emulated Euclid images, g… ▽ More The Euclid mission is expected to image millions of galaxies with high resolution, providing an extensive dataset to study galaxy evolution. We investigate the application of deep learning to predict the detailed morphologies of galaxies in Euclid using Zoobot a convolutional neural network pretrained with 450000 galaxies from the Galaxy Zoo project. We adapted Zoobot for emulated Euclid images, generated based on Hubble Space Telescope COSMOS images, and with labels provided by volunteers in the Galaxy Zoo: Hubble project. We demonstrate that the trained Zoobot model successfully measures detailed morphology for emulated Euclid images. It effectively predicts whether a galaxy has features and identifies and characterises various features such as spiral arms, clumps, bars, disks, and central bulges. When compared to volunteer classifications Zoobot achieves mean vote fraction deviations of less than 12% and an accuracy above 91% for the confident volunteer classifications across most morphology types. However, the performance varies depending on the specific morphological class. For the global classes such as disk or smooth galaxies, the mean deviations are less than 10%, with only 1000 training galaxies necessary to reach this performance. For more detailed structures and complex tasks like detecting and counting spiral arms or clumps, the deviations are slightly higher, around 12% with 60000 galaxies used for training. In order to enhance the performance on complex morphologies, we anticipate that a larger pool of labelled galaxies is needed, which could be obtained using crowdsourcing. Finally, our findings imply that the model can be effectively adapted to new morphological labels. We demonstrate this adaptability by applying Zoobot to peculiar galaxies. In summary, our trained Zoobot CNN can readily predict morphological catalogues for Euclid images. △ Less

Submitted 15 February, 2024; originally announced February 2024.

Comments: 27 pages, 26 figures, 5 tables, submitted to A&A

arXiv:2311.10657 [pdf, other]

A BRAIN study to tackle image analysis with artificial intelligence in the ALMA 2030 era

Authors: Fabrizia Guglielmetti, Michele Delli Veneri, Ivano Baronchelli, Carmen Blanco, Andrea Dosi, Torsten Enßlin, Vishal Johnson, Giuseppe Longo, Jakob Roth, Felix Stoehr, Łukasz Tychoniec, Eric Villard

Abstract: An ESO internal ALMA development study, BRAIN, is addressing the ill-posed inverse problem of synthesis image analysis employing astrostatistics and astroinformatics. These emerging fields of research offer interdisciplinary approaches at the intersection of observational astronomy, statistics, algorithm development, and data science. In this study, we provide evidence of the benefits of employing… ▽ More An ESO internal ALMA development study, BRAIN, is addressing the ill-posed inverse problem of synthesis image analysis employing astrostatistics and astroinformatics. These emerging fields of research offer interdisciplinary approaches at the intersection of observational astronomy, statistics, algorithm development, and data science. In this study, we provide evidence of the benefits of employing these approaches to ALMA imaging for operational and scientific purposes. We show the potential of two techniques, RESOLVE and DeepFocus, applied to ALMA calibrated science data. Significant advantages are provided with the prospect to improve the quality and completeness of the data products stored in the science archive and overall processing time for operations. Both approaches evidence the logical pathway to address the incoming revolution in data rates dictated by the planned electronic upgrades. Moreover, we bring to the community additional products through a new package, ALMASim, to promote advancements in these fields, providing a refined ALMA simulator usable by a large community for training and/or testing new algorithms. △ Less

Submitted 17 November, 2023; originally announced November 2023.

Comments: 9 pages, 5 figures, MaxEnt2023 conference

arXiv:2303.09409 [pdf, other]

Repeating Outbursts from the Young Stellar Object Gaia23bab (= SPICY 97589)

Authors: Michael A. Kuhn, Robert A. Benjamin, Emille E. O. Ishida, Rafael S. de Souza, Julien Peloton, Michele Delli Veneri

Abstract: The light curve of Gaia23bab (= SPICY 97589) shows two significant ($ΔでるたG>2$ mag) brightening events, one in 2017 and an ongoing event starting in 2022. The source's quiescent spectral energy distribution indicates an embedded ($A_V>5$ mag) pre-main-sequence star, with optical accretion emission and mid-infrared disk emission. This characterization is supported by the source's membership in an embed… ▽ More The light curve of Gaia23bab (= SPICY 97589) shows two significant ($ΔでるたG>2$ mag) brightening events, one in 2017 and an ongoing event starting in 2022. The source's quiescent spectral energy distribution indicates an embedded ($A_V>5$ mag) pre-main-sequence star, with optical accretion emission and mid-infrared disk emission. This characterization is supported by the source's membership in an embedded cluster in the star-forming cloud DOBASHI 1604 at a distance of $900\pm45$~pc. Thus, the brightening events are probable accretion outbursts, likely of EX Lup-type. △ Less

Submitted 16 March, 2023; originally announced March 2023.

Comments: 4 pages and 1 figure. Submitted to Research Notes of the AAS

arXiv:2303.07943 [pdf, other]

doi 10.1093/mnras/stad1375

SKA Science Data Challenge 2: analysis and results

Authors: P. Hartley, A. Bonaldi, R. Braun, J. N. H. S. Aditya, S. Aicardi, L. Alegre, A. Chakraborty, X. Chen, S. Choudhuri, A. O. Clarke, J. Coles, J. S. Collinson, D. Cornu, L. Darriba, M. Delli Veneri, J. Forbrich, B. Fraga, A. Galan, J. Garrido, F. Gubanov, H. Håkansson, M. J. Hardcastle, C. Heneka, D. Herranz, K. M. Hess , et al. (83 additional authors not shown)

Abstract: The Square Kilometre Array Observatory (SKAO) will explore the radio sky to new depths in order to conduct transformational science. SKAO data products made available to astronomers will be correspondingly large and complex, requiring the application of advanced analysis techniques to extract key science findings. To this end, SKAO is conducting a series of Science Data Challenges, each designed t… ▽ More The Square Kilometre Array Observatory (SKAO) will explore the radio sky to new depths in order to conduct transformational science. SKAO data products made available to astronomers will be correspondingly large and complex, requiring the application of advanced analysis techniques to extract key science findings. To this end, SKAO is conducting a series of Science Data Challenges, each designed to familiarise the scientific community with SKAO data and to drive the development of new analysis techniques. We present the results from Science Data Challenge 2 (SDC2), which invited participants to find and characterise 233245 neutral hydrogen (Hi) sources in a simulated data product representing a 2000~h SKA MID spectral line observation from redshifts 0.25 to 0.5. Through the generous support of eight international supercomputing facilities, participants were able to undertake the Challenge using dedicated computational resources. Alongside the main challenge, `reproducibility awards' were made in recognition of those pipelines which demonstrated Open Science best practice. The Challenge saw over 100 participants develop a range of new and existing techniques, with results that highlight the strengths of multidisciplinary and collaborative effort. The winning strategy -- which combined predictions from two independent machine learning techniques to yield a 20 percent improvement in overall performance -- underscores one of the main Challenge outcomes: that of method complementarity. It is likely that the combination of methods in a so-called ensemble approach will be key to exploiting very large astronomical datasets. △ Less

Submitted 14 March, 2023; originally announced March 2023.

Comments: Under review by MNRAS; 28 pages, 16 figures

arXiv:2211.11462 [pdf, other]

doi 10.1093/mnras/stac3314

3D Detection and Characterisation of ALMA Sources through Deep Learning

Authors: Michele Delli Veneri, Lukasz Tychoniec, Fabrizia Guglielmetti, Giuseppe Longo, Eric Villard

Abstract: We present a Deep-Learning (DL) pipeline developed for the detection and characterization of astronomical sources within simulated Atacama Large Millimeter/submillimeter Array (ALMA) data cubes. The pipeline is composed of six DL models: a Convolutional Autoencoder for source detection within the spatial domain of the integrated data cubes, a Recurrent Neural Network (RNN) for denoising and peak d… ▽ More We present a Deep-Learning (DL) pipeline developed for the detection and characterization of astronomical sources within simulated Atacama Large Millimeter/submillimeter Array (ALMA) data cubes. The pipeline is composed of six DL models: a Convolutional Autoencoder for source detection within the spatial domain of the integrated data cubes, a Recurrent Neural Network (RNN) for denoising and peak detection within the frequency domain, and four Residual Neural Networks (ResNets) for source characterization. The combination of spatial and frequency information improves completeness while decreasing spurious signal detection. To train and test the pipeline, we developed a simulation algorithm able to generate realistic ALMA observations, i.e. both sky model and dirty cubes. The algorithm simulates always a central source surrounded by fainter ones scattered within the cube. Some sources were spatially superimposed in order to test the pipeline deblending capabilities. The detection performances of the pipeline were compared to those of other methods and significant improvements in performances were achieved. Source morphologies are detected with subpixel accuracies obtaining mean residual errors of $10^{-3}$ pixel ($0.1$ mas) and $10^{-1}$ mJy/beam on positions and flux estimations, respectively. Projection angles and flux densities are also recovered within $10\%$ of the true values for $80\%$ and $73\%$ of all sources in the test set, respectively. While our pipeline is fine-tuned for ALMA data, the technique is applicable to other interferometric observatories, as SKA, LOFAR, VLBI, and VLTI. △ Less

Submitted 21 November, 2022; originally announced November 2022.

arXiv:2210.01444 [pdf, other]

Bayesian and Machine Learning Methods in the Big Data era for astronomical imaging

Authors: Fabrizia Guglielmetti, Philipp Arras, Michele Delli Veneri, Torsten Enßlin, Giuseppe Longo, Łukasz Tychoniec, Eric Villard

Abstract: The Atacama Large Millimeter/submillimeter Array with the planned electronic upgrades will deliver an unprecedented amount of deep and high resolution observations. Wider fields of view are possible with the consequential cost of image reconstruction. Alternatives to commonly used applications in image processing have to be sought and tested. Advanced image reconstruction methods are critical to m… ▽ More The Atacama Large Millimeter/submillimeter Array with the planned electronic upgrades will deliver an unprecedented amount of deep and high resolution observations. Wider fields of view are possible with the consequential cost of image reconstruction. Alternatives to commonly used applications in image processing have to be sought and tested. Advanced image reconstruction methods are critical to meet the data requirements needed for operational purposes. Astrostatistics and astroinformatics techniques are employed. Evidence is given that these interdisciplinary fields of study applied to synthesis imaging meet the Big Data challenges and have the potentials to enable new scientific discoveries in radio astronomy and astrophysics. △ Less

Submitted 4 October, 2022; originally announced October 2022.

Comments: 8 pages, 5 figures, proceedings International Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, IHP, Paris, July 18-22, 2022

arXiv:2205.14153 [pdf, other]

doi 10.3847/2515-5172/ac74c7

How have astronomers cited other fields in the last decade?

Authors: Michele Delli Veneri, Rafael S. de Souza, Alberto Krone-Martins, Emille E. O. Ishida, Maria Luiza L. Dantas, Noble Kennamer

Abstract: We present a citation pattern analysis between astronomical papers and 13 other disciplines, based on the arXiv database over the past decade ($2010 - 2020$). We analyze 12,600 astronomical papers citing over 14,531 unique publications outside astronomy. Two striking patterns are unraveled. First, general relativity recently became the most cited field by astronomers, a trend highly correlated wit… ▽ More We present a citation pattern analysis between astronomical papers and 13 other disciplines, based on the arXiv database over the past decade ($2010 - 2020$). We analyze 12,600 astronomical papers citing over 14,531 unique publications outside astronomy. Two striking patterns are unraveled. First, general relativity recently became the most cited field by astronomers, a trend highly correlated with the discovery of gravitational waves. Secondly, the fast growth of referenced papers in computer science and statistics, the first with a notable 15-fold increase since 2015. Such findings confirm the critical role of interdisciplinary efforts involving astronomy, statistics, and computer science in recent astronomical research. △ Less

Submitted 26 May, 2022; originally announced May 2022.

Comments: Submitted to RNAAS

arXiv:2103.04116 [pdf]

doi 10.1038/s41598-021-85254-x

A novel approach to the classification of terrestrial drainage networks based on deep learning and preliminary results on Solar System bodies

Authors: Carlo Donadio, Massimo Brescia, Alessia Riccardo, Giuseppe Angora, Michele Delli Veneri, Giuseppe Riccio

Abstract: Several approaches were proposed to describe the geomorphology of drainage networks and the abiotic/biotic factors determining their morphology. There is an intrinsic complexity of the explicit qualification of the morphological variations in response to various types of control factors and the difficulty of expressing the cause-effect links. Traditional methods of drainage network classification… ▽ More Several approaches were proposed to describe the geomorphology of drainage networks and the abiotic/biotic factors determining their morphology. There is an intrinsic complexity of the explicit qualification of the morphological variations in response to various types of control factors and the difficulty of expressing the cause-effect links. Traditional methods of drainage network classification are based on the manual extraction of key characteristics, then applied as pattern recognition schemes. These approaches, however, have low predictive and uniform ability. We present a different approach, based on the data-driven supervised learning by images, extended also to extraterrestrial cases. With deep learning models, the extraction and classification phase is integrated within a more objective, analytical, and automatic framework. Despite the initial difficulties, due to the small number of training images available, and the similarity between the different shapes of the drainage samples, we obtained successful results, concluding that deep learning is a valid way for data exploration in geomorphology and related fields. △ Less

Submitted 6 March, 2021; originally announced March 2021.

Comments: Accepted, To be published on Scientific Reports (Nature Research Journal), 22 pages, 3 figures, 4 tables

Journal ref: Scientific Reports, 11, 5875 (2021)

arXiv:2007.01840 [pdf, other]

doi 10.1007/978-3-030-65867-0_11

Rejection criteria based on outliers in the KiDS photometric redshifts and PDF distributions derived by machine learning

Authors: Valeria Amaro, Stefano Cavuoti, Massimo Brescia, Giuseppe Riccio, Crescenzo Tortora, Maurizio D'Addona, Michele Delli Veneri, Nicola R. Napolitano, Mario Radovich, Giuseppe Longo

Abstract: The Probability Density Function (PDF) provides an estimate of the photometric redshift (zphot) prediction error. It is crucial for current and future sky surveys, characterized by strict requirements on the zphot precision, reliability and completeness. The present work stands on the assumption that properly defined rejection criteria, capable of identifying and rejecting potential outliers, can… ▽ More The Probability Density Function (PDF) provides an estimate of the photometric redshift (zphot) prediction error. It is crucial for current and future sky surveys, characterized by strict requirements on the zphot precision, reliability and completeness. The present work stands on the assumption that properly defined rejection criteria, capable of identifying and rejecting potential outliers, can increase the precision of zphot estimates and of their cumulative PDF, without sacrificing much in terms of completeness of the sample. We provide a way to assess rejection through proper cuts on the shape descriptors of a PDF, such as the width and the height of the maximum PDF's peak. In this work we tested these rejection criteria to galaxies with photometry extracted from the Kilo Degree Survey (KiDS) ESO Data Release 4, proving that such approach could lead to significant improvements to the zphot quality: e.g., for the clipped sample showing the best trade-off between precision and completeness, we achieve a reduction in outliers fraction of $\simeq 75\%$ and an improvement of $\simeq 6\%$ for NMAD, with respect to the original data set, preserving the $\simeq 93\%$ of its content. △ Less

Submitted 3 July, 2020; originally announced July 2020.

Comments: Preprint version of the manuscript to appear in the Volume "Intelligent Astrophysics" of the series "Emergence, Complexity and Computation", Book eds. I. Zelinka, D. Baron, M. Brescia, Springer Nature Switzerland, ISSN: 2194-7287

arXiv:2006.13905 [pdf, other]

doi 10.1007/978-3-030-65867-0_8

Periodic Astrometric Signal Recovery through Convolutional Autoencoders

Authors: Michele Delli Veneri, Louis Desdoigts, Morgan A. Schmitz, Alberto Krone-Martins, Emille E. O. Ishida, Peter Tuthill, Rafael S. de Souza, Richard Scalzo, Massimo Brescia, Giuseppe Longo, Antonio Picariello

Abstract: Astrometric detection involves a precise measurement of stellar positions, and is widely regarded as the leading concept presently ready to find earth-mass planets in temperate orbits around nearby sun-like stars. The TOLIMAN space telescope[39] is a low-cost, agile mission concept dedicated to narrow-angle astrometric monitoring of bright binary stars. In particular the mission will be optimised… ▽ More Astrometric detection involves a precise measurement of stellar positions, and is widely regarded as the leading concept presently ready to find earth-mass planets in temperate orbits around nearby sun-like stars. The TOLIMAN space telescope[39] is a low-cost, agile mission concept dedicated to narrow-angle astrometric monitoring of bright binary stars. In particular the mission will be optimised to search for habitable-zone planets around Alpha Centauri AB. If the separation between these two stars can be monitored with sufficient precision, tiny perturbations due to the gravitational tug from an unseen planet can be witnessed and, given the configuration of the optical system, the scale of the shifts in the image plane are about one millionth of a pixel. Image registration at this level of precision has never been demonstrated (to our knowledge) in any setting within science. In this paper we demonstrate that a Deep Convolutional Auto-Encoder is able to retrieve such a signal from simplified simulations of the TOLIMAN data and we present the full experimental pipeline to recreate out experiments from the simulations to the signal analysis. In future works, all the more realistic sources of noise and systematic effects present in the real-world system will be injected into the simulations. △ Less

Submitted 24 June, 2020; originally announced June 2020.

Comments: Preprint version of the manuscript to appear in the Volume "Intelligent Astrophysics" of the series "Emergence, Complexity and Computation", Book eds. I. Zelinka, D. Baron, M. Brescia, Springer Nature Switzerland, ISSN: 2194-7287

arXiv:1902.02522 [pdf, other]

doi 10.1093/mnras/stz856

Star Formation Rates for photometric samples of galaxies using machine learning methods

Authors: M. Delli Veneri, S. Cavuoti, M. Brescia, G. Longo, G. Riccio

Abstract: Star Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFRs are usually estimated via spectroscopic observations requiring large amounts of telescope time. We explore an alternative approach based on the photometric estimation of global SFRs for large samples of galaxies, by using methods such as automatic parameter space optimisation, and supervised Mach… ▽ More Star Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFRs are usually estimated via spectroscopic observations requiring large amounts of telescope time. We explore an alternative approach based on the photometric estimation of global SFRs for large samples of galaxies, by using methods such as automatic parameter space optimisation, and supervised Machine Learning models. We demonstrate that, with such approach, accurate multi-band photometry allows to estimate reliable SFRs. We also investigate how the use of photometric rather than spectroscopic redshifts, affects the accuracy of derived global SFRs. Finally, we provide a publicly available catalogue of SFRs for more than 27 million galaxies extracted from the Sloan Digital Sky survey Data Release 7. The catalogue is available through the Vizier facility at the following link ftp://cdsarc.u-strasbg.fr/pub/cats/J/MNRAS/486/1377. △ Less

Submitted 6 June, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

Journal ref: MNRAS 2019 486(1) 1377-1391

arXiv:1805.06338 [pdf, other]

Stellar formation rates in galaxies using Machine Learning models

Authors: Michele Delli Veneri, Stefano Cavuoti, Massimo Brescia, Giuseppe Riccio, Giuseppe Longo

Abstract: Global Stellar Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFR's are usually estimated via spectroscopic observations which require too much previous telescope time and therefore cannot match the needs of modern precision cosmology. We therefore propose a novel method to estimate SFRs for large samples of galaxies using a variety of supervised ML mo… ▽ More Global Stellar Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFR's are usually estimated via spectroscopic observations which require too much previous telescope time and therefore cannot match the needs of modern precision cosmology. We therefore propose a novel method to estimate SFRs for large samples of galaxies using a variety of supervised ML models. △ Less

Submitted 23 January, 2019; v1 submitted 16 May, 2018; originally announced May 2018.

Comments: ESANN 2018 - Proceedings, ISBN-13 9782875870483

Showing 1–12 of 12 results for author: Veneri, M D