-
Euclid preparation. Measuring detailed galaxy morphologies for Euclid with Machine Learning
Authors:
Euclid Collaboration,
B. Aussel,
S. Kruk,
M. Walmsley,
M. Huertas-Company,
M. Castellano,
C. J. Conselice,
M. Delli Veneri,
H. Domínguez Sánchez,
P.-A. Duc,
U. Kuchner,
A. La Marca,
B. Margalef-Bentabol,
F. R. Marleau,
G. Stevens,
Y. Toba,
C. Tortora,
L. Wang,
N. Aghanim,
B. Altieri,
A. Amara,
S. Andreon,
N. Auricchio,
M. Baldi,
S. Bardelli, et al. (233 additional authors not shown)
Abstract:
The Euclid mission is expected to image millions of galaxies with high resolution, providing an extensive dataset to study galaxy evolution. We investigate the application of deep learning to predict the detailed morphologies of galaxies in Euclid using Zoobot, a convolutional neural network pretrained with 450000 galaxies from the Galaxy Zoo project. We adapted Zoobot for emulated Euclid images, generated based on Hubble Space Telescope COSMOS images, and with labels provided by volunteers in the Galaxy Zoo: Hubble project. We demonstrate that the trained Zoobot model successfully measures detailed morphology for emulated Euclid images. It effectively predicts whether a galaxy has features, and it identifies and characterises various features such as spiral arms, clumps, bars, disks, and central bulges. When compared to volunteer classifications, Zoobot achieves mean vote fraction deviations of less than 12% and an accuracy above 91% for the confident volunteer classifications across most morphology types. However, the performance varies depending on the specific morphological class. For the global classes such as disk or smooth galaxies, the mean deviations are less than 10%, with only 1000 training galaxies necessary to reach this performance. For more detailed structures and complex tasks like detecting and counting spiral arms or clumps, the deviations are slightly higher, around 12% with 60000 galaxies used for training. In order to enhance the performance on complex morphologies, we anticipate that a larger pool of labelled galaxies is needed, which could be obtained using crowdsourcing. Finally, our findings imply that the model can be effectively adapted to new morphological labels. We demonstrate this adaptability by applying Zoobot to peculiar galaxies. In summary, our trained Zoobot CNN can readily predict morphological catalogues for Euclid images.
Submitted 15 February, 2024;
originally announced February 2024.
-
A BRAIN study to tackle image analysis with artificial intelligence in the ALMA 2030 era
Authors:
Fabrizia Guglielmetti,
Michele Delli Veneri,
Ivano Baronchelli,
Carmen Blanco,
Andrea Dosi,
Torsten Enßlin,
Vishal Johnson,
Giuseppe Longo,
Jakob Roth,
Felix Stoehr,
Łukasz Tychoniec,
Eric Villard
Abstract:
An ESO internal ALMA development study, BRAIN, is addressing the ill-posed inverse problem of synthesis image analysis by employing astrostatistics and astroinformatics. These emerging fields of research offer interdisciplinary approaches at the intersection of observational astronomy, statistics, algorithm development, and data science. In this study, we provide evidence of the benefits of applying these approaches to ALMA imaging for operational and scientific purposes. We show the potential of two techniques, RESOLVE and DeepFocus, applied to ALMA calibrated science data. They offer significant advantages, with the prospect of improving the quality and completeness of the data products stored in the science archive and the overall processing time for operations. Both approaches evidence the logical pathway to address the incoming revolution in data rates dictated by the planned electronic upgrades. Moreover, we bring to the community additional products through a new package, ALMASim, to promote advancements in these fields, providing a refined ALMA simulator usable by a large community for training and/or testing new algorithms.
Submitted 17 November, 2023;
originally announced November 2023.
-
Repeating Outbursts from the Young Stellar Object Gaia23bab (= SPICY 97589)
Authors:
Michael A. Kuhn,
Robert A. Benjamin,
Emille E. O. Ishida,
Rafael S. de Souza,
Julien Peloton,
Michele Delli Veneri
Abstract:
The light curve of Gaia23bab (= SPICY 97589) shows two significant ($ΔG>2$ mag) brightening events, one in 2017 and an ongoing event starting in 2022. The source's quiescent spectral energy distribution indicates an embedded ($A_V>5$ mag) pre-main-sequence star, with optical accretion emission and mid-infrared disk emission. This characterization is supported by the source's membership in an embedded cluster in the star-forming cloud DOBASHI 1604 at a distance of $900\pm45$ pc. Thus, the brightening events are probable accretion outbursts, likely of EX Lup-type.
Submitted 16 March, 2023;
originally announced March 2023.
-
SKA Science Data Challenge 2: analysis and results
Authors:
P. Hartley,
A. Bonaldi,
R. Braun,
J. N. H. S. Aditya,
S. Aicardi,
L. Alegre,
A. Chakraborty,
X. Chen,
S. Choudhuri,
A. O. Clarke,
J. Coles,
J. S. Collinson,
D. Cornu,
L. Darriba,
M. Delli Veneri,
J. Forbrich,
B. Fraga,
A. Galan,
J. Garrido,
F. Gubanov,
H. Håkansson,
M. J. Hardcastle,
C. Heneka,
D. Herranz,
K. M. Hess, et al. (83 additional authors not shown)
Abstract:
The Square Kilometre Array Observatory (SKAO) will explore the radio sky to new depths in order to conduct transformational science. SKAO data products made available to astronomers will be correspondingly large and complex, requiring the application of advanced analysis techniques to extract key science findings. To this end, SKAO is conducting a series of Science Data Challenges, each designed to familiarise the scientific community with SKAO data and to drive the development of new analysis techniques. We present the results from Science Data Challenge 2 (SDC2), which invited participants to find and characterise 233245 neutral hydrogen (HI) sources in a simulated data product representing a 2000 h SKA MID spectral line observation from redshifts 0.25 to 0.5. Through the generous support of eight international supercomputing facilities, participants were able to undertake the Challenge using dedicated computational resources. Alongside the main challenge, 'reproducibility awards' were made in recognition of those pipelines which demonstrated Open Science best practice. The Challenge saw over 100 participants develop a range of new and existing techniques, with results that highlight the strengths of multidisciplinary and collaborative effort. The winning strategy, which combined predictions from two independent machine learning techniques to yield a 20 percent improvement in overall performance, underscores one of the main Challenge outcomes: that of method complementarity. It is likely that the combination of methods in a so-called ensemble approach will be key to exploiting very large astronomical datasets.
Submitted 14 March, 2023;
originally announced March 2023.
-
3D Detection and Characterisation of ALMA Sources through Deep Learning
Authors:
Michele Delli Veneri,
Lukasz Tychoniec,
Fabrizia Guglielmetti,
Giuseppe Longo,
Eric Villard
Abstract:
We present a Deep-Learning (DL) pipeline developed for the detection and characterization of astronomical sources within simulated Atacama Large Millimeter/submillimeter Array (ALMA) data cubes. The pipeline is composed of six DL models: a Convolutional Autoencoder for source detection within the spatial domain of the integrated data cubes, a Recurrent Neural Network (RNN) for denoising and peak detection within the frequency domain, and four Residual Neural Networks (ResNets) for source characterization. The combination of spatial and frequency information improves completeness while decreasing spurious signal detection. To train and test the pipeline, we developed a simulation algorithm able to generate realistic ALMA observations, i.e. both sky model and dirty cubes. The algorithm always simulates a central source surrounded by fainter ones scattered within the cube. Some sources were spatially superimposed in order to test the pipeline's deblending capabilities. The detection performance of the pipeline was compared to that of other methods, and significant improvements were achieved. Source morphologies are detected with subpixel accuracies, obtaining mean residual errors of $10^{-3}$ pixel ($0.1$ mas) and $10^{-1}$ mJy/beam on positions and flux estimations, respectively. Projection angles and flux densities are also recovered within $10\%$ of the true values for $80\%$ and $73\%$ of all sources in the test set, respectively. While our pipeline is fine-tuned for ALMA data, the technique is applicable to other interferometric observatories, such as SKA, LOFAR, VLBI, and VLTI.
Submitted 21 November, 2022;
originally announced November 2022.
-
Bayesian and Machine Learning Methods in the Big Data era for astronomical imaging
Authors:
Fabrizia Guglielmetti,
Philipp Arras,
Michele Delli Veneri,
Torsten Enßlin,
Giuseppe Longo,
Łukasz Tychoniec,
Eric Villard
Abstract:
The Atacama Large Millimeter/submillimeter Array with the planned electronic upgrades will deliver an unprecedented amount of deep and high resolution observations. Wider fields of view are possible with the consequential cost of image reconstruction. Alternatives to commonly used applications in image processing have to be sought and tested. Advanced image reconstruction methods are critical to meet the data requirements needed for operational purposes. Astrostatistics and astroinformatics techniques are employed. Evidence is given that these interdisciplinary fields of study applied to synthesis imaging meet the Big Data challenges and have the potential to enable new scientific discoveries in radio astronomy and astrophysics.
Submitted 4 October, 2022;
originally announced October 2022.
-
How have astronomers cited other fields in the last decade?
Authors:
Michele Delli Veneri,
Rafael S. de Souza,
Alberto Krone-Martins,
Emille E. O. Ishida,
Maria Luiza L. Dantas,
Noble Kennamer
Abstract:
We present a citation pattern analysis between astronomical papers and 13 other disciplines, based on the arXiv database over the past decade ($2010 - 2020$). We analyze 12,600 astronomical papers citing over 14,531 unique publications outside astronomy. Two striking patterns are unraveled. First, general relativity recently became the most cited field by astronomers, a trend highly correlated with the discovery of gravitational waves. Second, the number of referenced papers in computer science and statistics has grown fast, the former with a notable 15-fold increase since 2015. Such findings confirm the critical role of interdisciplinary efforts involving astronomy, statistics, and computer science in recent astronomical research.
Submitted 26 May, 2022;
originally announced May 2022.
-
A novel approach to the classification of terrestrial drainage networks based on deep learning and preliminary results on Solar System bodies
Authors:
Carlo Donadio,
Massimo Brescia,
Alessia Riccardo,
Giuseppe Angora,
Michele Delli Veneri,
Giuseppe Riccio
Abstract:
Several approaches have been proposed to describe the geomorphology of drainage networks and the abiotic/biotic factors determining their morphology. The explicit qualification of the morphological variations in response to various types of control factors is intrinsically complex, as is expressing the cause-effect links. Traditional methods of drainage network classification are based on the manual extraction of key characteristics, then applied as pattern recognition schemes. These approaches, however, have low predictive and uniform ability. We present a different approach, based on data-driven supervised learning from images, extended also to extraterrestrial cases. With deep learning models, the extraction and classification phase is integrated within a more objective, analytical, and automatic framework. Despite the initial difficulties, due to the small number of training images available and the similarity between the different shapes of the drainage samples, we obtained successful results, concluding that deep learning is a valid way for data exploration in geomorphology and related fields.
Submitted 6 March, 2021;
originally announced March 2021.
-
Rejection criteria based on outliers in the KiDS photometric redshifts and PDF distributions derived by machine learning
Authors:
Valeria Amaro,
Stefano Cavuoti,
Massimo Brescia,
Giuseppe Riccio,
Crescenzo Tortora,
Maurizio D'Addona,
Michele Delli Veneri,
Nicola R. Napolitano,
Mario Radovich,
Giuseppe Longo
Abstract:
The Probability Density Function (PDF) provides an estimate of the photometric redshift (zphot) prediction error. It is crucial for current and future sky surveys, characterized by strict requirements on the zphot precision, reliability and completeness. The present work stands on the assumption that properly defined rejection criteria, capable of identifying and rejecting potential outliers, can increase the precision of zphot estimates and of their cumulative PDF, without sacrificing much in terms of completeness of the sample. We provide a way to assess rejection through proper cuts on the shape descriptors of a PDF, such as the width and the height of the maximum PDF's peak. In this work we tested these rejection criteria on galaxies with photometry extracted from the Kilo Degree Survey (KiDS) ESO Data Release 4, showing that such an approach could lead to significant improvements in the zphot quality: e.g., for the clipped sample showing the best trade-off between precision and completeness, we achieve a reduction in outlier fraction of $\simeq 75\%$ and an improvement of $\simeq 6\%$ for NMAD, with respect to the original data set, while preserving $\simeq 93\%$ of its content.
Submitted 3 July, 2020;
originally announced July 2020.
-
Periodic Astrometric Signal Recovery through Convolutional Autoencoders
Authors:
Michele Delli Veneri,
Louis Desdoigts,
Morgan A. Schmitz,
Alberto Krone-Martins,
Emille E. O. Ishida,
Peter Tuthill,
Rafael S. de Souza,
Richard Scalzo,
Massimo Brescia,
Giuseppe Longo,
Antonio Picariello
Abstract:
Astrometric detection involves a precise measurement of stellar positions, and is widely regarded as the leading concept presently ready to find Earth-mass planets in temperate orbits around nearby Sun-like stars. The TOLIMAN space telescope[39] is a low-cost, agile mission concept dedicated to narrow-angle astrometric monitoring of bright binary stars. In particular, the mission will be optimised to search for habitable-zone planets around Alpha Centauri AB. If the separation between these two stars can be monitored with sufficient precision, tiny perturbations due to the gravitational tug from an unseen planet can be witnessed and, given the configuration of the optical system, the scale of the shifts in the image plane is about one millionth of a pixel. Image registration at this level of precision has never been demonstrated (to our knowledge) in any setting within science. In this paper we demonstrate that a Deep Convolutional Auto-Encoder is able to retrieve such a signal from simplified simulations of the TOLIMAN data, and we present the full experimental pipeline to recreate our experiments, from the simulations to the signal analysis. In future works, all the more realistic sources of noise and systematic effects present in the real-world system will be injected into the simulations.
Submitted 24 June, 2020;
originally announced June 2020.
-
Star Formation Rates for photometric samples of galaxies using machine learning methods
Authors:
M. Delli Veneri,
S. Cavuoti,
M. Brescia,
G. Longo,
G. Riccio
Abstract:
Star Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFRs are usually estimated via spectroscopic observations requiring large amounts of telescope time. We explore an alternative approach based on the photometric estimation of global SFRs for large samples of galaxies, by using methods such as automatic parameter space optimisation and supervised Machine Learning models. We demonstrate that, with such an approach, accurate multi-band photometry allows reliable SFRs to be estimated. We also investigate how the use of photometric rather than spectroscopic redshifts affects the accuracy of derived global SFRs. Finally, we provide a publicly available catalogue of SFRs for more than 27 million galaxies extracted from the Sloan Digital Sky Survey Data Release 7. The catalogue is available through the VizieR facility at the following link: ftp://cdsarc.u-strasbg.fr/pub/cats/J/MNRAS/486/1377.
Submitted 6 June, 2019; v1 submitted 7 February, 2019;
originally announced February 2019.
-
Stellar formation rates in galaxies using Machine Learning models
Authors:
Michele Delli Veneri,
Stefano Cavuoti,
Massimo Brescia,
Giuseppe Riccio,
Giuseppe Longo
Abstract:
Global Stellar Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFRs are usually estimated via spectroscopic observations which require too much precious telescope time and therefore cannot match the needs of modern precision cosmology. We therefore propose a novel method to estimate SFRs for large samples of galaxies using a variety of supervised ML models.
Submitted 23 January, 2019; v1 submitted 16 May, 2018;
originally announced May 2018.