Search | arXiv e-print repository

Temporal Stamp Classifier: Classifying Short Sequences of Astronomical Alerts

Authors: Daniel Neira O., Pablo A. Estévez, Francisco Förster

Abstract: In this work, we propose a deep learning-based classification model of astronomical objects using alerts reported by the Zwicky Transient Facility (ZTF) survey. The model takes as inputs sequences of stamp images and metadata contained in each alert, as well as features from the All-WISE catalog. The proposed model, called temporal stamp classifier, is able to discriminate between three classes of… ▽ More In this work, we propose a deep learning-based classification model of astronomical objects using alerts reported by the Zwicky Transient Facility (ZTF) survey. The model takes as inputs sequences of stamp images and metadata contained in each alert, as well as features from the All-WISE catalog. The proposed model, called temporal stamp classifier, is able to discriminate between three classes of astronomical objects: Active Galactic Nuclei (AGN), Super-Novae (SNe) and Variable Stars (VS), with an accuracy of approximately 98% in the test set, when using 2 to 5 detections. The results show that the model performance improves with the addition of more detections. Simple recurrence models obtain competitive results with those of more complex models such as LSTM.We also propose changes to the original stamp classifier model, which only uses the first detection. The performance of the latter model improves with changes in the architecture and the addition of random rotations, achieving a 1.46% increase in test accuracy. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: Accepted in International Joint Conference on Neural Networks 2024

arXiv:2405.03078 [pdf, other]

ATAT: Astronomical Transformer for time series And Tabular data

Authors: G. Cabrera-Vives, D. Moreno-Cartagena, N. Astorga, I. Reyes-Jainaga, F. Förster, P. Huijse, J. Arredondo, A. M. Muñoz Arancibia, A. Bayo, M. Catelan, P. A. Estévez, P. Sánchez-Sáez, A. Álvarez, P. Castellanos, P. Gallardo, A. Moya, D. Rodriguez-Mancini

Abstract: The advent of next-generation survey instruments, such as the Vera C. Rubin Observatory and its Legacy Survey of Space and Time (LSST), is opening a window for new research in time-domain astronomy. The Extended LSST Astronomical Time-Series Classification Challenge (ELAsTiCC) was created to test the capacity of brokers to deal with a simulated LSST stream. We describe ATAT, the Astronomical Trans… ▽ More The advent of next-generation survey instruments, such as the Vera C. Rubin Observatory and its Legacy Survey of Space and Time (LSST), is opening a window for new research in time-domain astronomy. The Extended LSST Astronomical Time-Series Classification Challenge (ELAsTiCC) was created to test the capacity of brokers to deal with a simulated LSST stream. We describe ATAT, the Astronomical Transformer for time series And Tabular data, a classification model conceived by the ALeRCE alert broker to classify light-curves from next-generation alert streams. ATAT was tested in production during the first round of the ELAsTiCC campaigns. ATAT consists of two Transformer models that encode light curves and features using novel time modulation and quantile feature tokenizer mechanisms, respectively. ATAT was trained on different combinations of light curves, metadata, and features calculated over the light curves. We compare ATAT against the current ALeRCE classifier, a Balanced Hierarchical Random Forest (BHRF) trained on human-engineered features derived from light curves and metadata. When trained on light curves and metadata, ATAT achieves a macro F1-score of 82.9 +- 0.4 in 20 classes, outperforming the BHRF model trained on 429 features, which achieves a macro F1-score of 79.4 +- 0.1. The use of Transformer multimodal architectures, combining light curves and tabular data, opens new possibilities for classifying alerts from a new generation of large etendue telescopes, such as the Vera C. Rubin Observatory, in real-world brokering scenarios. △ Less

Submitted 16 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

arXiv:2304.08519 [pdf, other]

doi 10.1051/0004-6361/202346077

Persistent and occasional: searching for the variable population of the ZTF/4MOST sky using ZTF data release 11

Authors: P. Sánchez-Sáez, J. Arredondo, A. Bayo, P. Arévalo, F. E. Bauer, G. Cabrera-Vives, M. Catelan, P. Coppi, P. A. Estévez, F. Förster, L. Hernández-García, P. Huijse, R. Kurtev, P. Lira, A. M. Muñoz Arancibia, G. Pignata

Abstract: We present a variability, color and morphology based classifier, designed to identify transients, persistently variable, and non-variable sources, from the Zwicky Transient Facility (ZTF) Data Release 11 (DR11) light curves of extended and point sources. The main motivation to develop this model was to identify active galactic nuclei (AGN) at different redshift ranges to be observed by the 4MOST C… ▽ More We present a variability, color and morphology based classifier, designed to identify transients, persistently variable, and non-variable sources, from the Zwicky Transient Facility (ZTF) Data Release 11 (DR11) light curves of extended and point sources. The main motivation to develop this model was to identify active galactic nuclei (AGN) at different redshift ranges to be observed by the 4MOST ChANGES project. Still, it serves as a more general time-domain astronomy study. The model uses nine colors computed from CatWISE and PS1, a morphology score from PS1, and 61 single-band variability features computed from the ZTF DR11 g and r light curves. We trained two versions of the model, one for each ZTF band. We used a hierarchical local classifier per parent node approach, where each node was composed of a balanced random forest model. We adopted a 17-class taxonomy, including non-variable stars and galaxies, three transient classes, five classes of stochastic variables, and seven classes of periodic variables. The macro averaged precision, recall and F1-score are 0.61, 0.75, and 0.62 for the g-band model, and 0.60, 0.74, and 0.61, for the r-band model. When grouping the four AGN classes into one single class, its precision, recall, and F1-score are 1.00, 0.95, and 0.97, respectively, for both the g and r bands. We applied the model to all the sources in the ZTF/4MOST overlapping sky, avoiding ZTF fields covering the Galactic bulge, including 86,576,577 light curves in the g-band and 140,409,824 in the r-band. Only 0.73\% of the g-band light curves and 2.62\% of the r-band light curves were classified as stochastic, periodic, or transient with high probability ($P_{init}\geq0.9$). We found that, in general, more reliable results are obtained when using the g-band model. Using the latter, we identified 384,242 AGN candidates, 287,156 of which have $P_{init}\geq0.9$. △ Less

Submitted 17 April, 2023; originally announced April 2023.

Comments: Accepted for publication in Astronomy & Astrophysics. Abstract shortened for arXiv. Tables containing the classifications and features for the ZTF g and r bands, and the labeled sets will be available at CDS. Individual catalogs per class and band, as well as the labeled set catalogs, can be downloaded at Zenodo DOI:10.5281/zenodo.7826045

Journal ref: A&A 675, A195 (2023)

arXiv:2208.04310 [pdf, other]

doi 10.3847/1538-3881/ac912a

DELIGHT: Deep Learning Identification of Galaxy Hosts of Transients using Multi-resolution Images

Authors: Francisco Förster, Alejandra M. Muñoz Arancibia, Ignacio Reyes, Alexander Gagliano, Dylan Britt, Sara Cuellar-Carrillo, Felipe Figueroa-Tapia, Ava Polzin, Yara Yousef, Javier Arredondo, Diego Rodríguez-Mancini, Javier Correa-Orellana, Amelia Bayo, Franz E. Bauer, Márcio Catelan, Guillermo Cabrera-Vives, Raya Dastidar, Pablo A. Estévez, Giuliano Pignata, Lorena Hernandez-Garcia, Pablo Huijse, Esteban Reyes, Paula Sánchez-Sáez, Mauricio Ramirez, Daniela Grandón , et al. (3 additional authors not shown)

Abstract: We present DELIGHT, or Deep Learning Identification of Galaxy Hosts of Transients, a new algorithm designed to automatically and in real-time identify the host galaxies of extragalactic transients. The proposed algorithm receives as input compact, multi-resolution images centered at the position of a transient candidate and outputs two-dimensional offset vectors that connect the transient with the… ▽ More We present DELIGHT, or Deep Learning Identification of Galaxy Hosts of Transients, a new algorithm designed to automatically and in real-time identify the host galaxies of extragalactic transients. The proposed algorithm receives as input compact, multi-resolution images centered at the position of a transient candidate and outputs two-dimensional offset vectors that connect the transient with the center of its predicted host. The multi-resolution input consists of a set of images with the same number of pixels, but with progressively larger pixel sizes and fields of view. A sample of \nSample galaxies visually identified by the ALeRCE broker team was used to train a convolutional neural network regression model. We show that this method is able to correctly identify both relatively large ($10\arcsec < r < 60\arcsec$) and small ($r \le 10\arcsec$) apparent size host galaxies using much less information (32 kB) than with a large, single-resolution image (920 kB). The proposed method has fewer catastrophic errors in recovering the position and is more complete and has less contamination ($< 0.86\%$) recovering the cross-matched redshift than other state-of-the-art methods. The more efficient representation provided by multi-resolution input images could allow for the identification of transient host galaxies in real-time, if adopted in alert streams from new generation of large etendue telescopes such as the Vera C. Rubin Observatory. △ Less

Submitted 8 August, 2022; originally announced August 2022.

Comments: Submitted to The Astronomical Journal on Aug 5th, 2022. Comments and suggestions are welcome

arXiv:2205.06758 [pdf, other]

doi 10.3847/1538-4357/ac6f5a

Improving Astronomical Time-series Classification via Data Augmentation with Generative Adversarial Networks

Authors: Germán García-Jara, Pavlos Protopapas, Pablo A. Estévez

Abstract: Due to the latest advances in technology, telescopes with significant sky coverage will produce millions of astronomical alerts per night that must be classified both rapidly and automatically. Currently, classification consists of supervised machine learning algorithms whose performance is limited by the number of existing annotations of astronomical objects and their highly imbalanced class dist… ▽ More Due to the latest advances in technology, telescopes with significant sky coverage will produce millions of astronomical alerts per night that must be classified both rapidly and automatically. Currently, classification consists of supervised machine learning algorithms whose performance is limited by the number of existing annotations of astronomical objects and their highly imbalanced class distributions. In this work, we propose a data augmentation methodology based on Generative Adversarial Networks (GANs) to generate a variety of synthetic light curves from variable stars. Our novel contributions, consisting of a resampling technique and an evaluation metric, can assess the quality of generative models in unbalanced datasets and identify GAN-overfitting cases that the Fréchet Inception Distance does not reveal. We applied our proposed model to two datasets taken from the Catalina and Zwicky Transient Facility surveys. The classification accuracy of variable stars is improved significantly when training with synthetic data and testing with real data with respect to the case of using only real data. △ Less

Submitted 13 May, 2022; originally announced May 2022.

Comments: Accepted to ApJ on May 11, 2022

ACM Class: J.2.3

arXiv:2201.08482 [pdf, other]

doi 10.3847/1538-3881/ac9ab4

Deep Attention-Based Supernovae Classification of Multi-Band Light-Curves

Authors: Óscar Pimentel, Pablo A. Estévez, Francisco Förster

Abstract: In astronomical surveys, such as the Zwicky Transient Facility, supernovae (SNe) are relatively uncommon objects compared to other classes of variable events. Along with this scarcity, the processing of multi-band light-curves is a challenging task due to the highly irregular cadence, long time gaps, missing-values, few observations, etc. These issues are particularly detrimental to the analysis o… ▽ More In astronomical surveys, such as the Zwicky Transient Facility, supernovae (SNe) are relatively uncommon objects compared to other classes of variable events. Along with this scarcity, the processing of multi-band light-curves is a challenging task due to the highly irregular cadence, long time gaps, missing-values, few observations, etc. These issues are particularly detrimental to the analysis of transient events: SN-like light-curves. We offer three main contributions: 1) Based on temporal modulation and attention mechanisms, we propose a Deep attention model (TimeModAttn) to classify multi-band light-curves of different SN types, avoiding photometric or hand-crafted feature computations, missing-value assumptions, and explicit imputation/interpolation methods. 2) We propose a model for the synthetic generation of SN multi-band light-curves based on the Supernova Parametric Model, allowing us to increase the number of samples and the diversity of cadence. Thus, the TimeModAttn model is first pre-trained using synthetic light-curves. Then, a fine-tuning process is performed. The TimeModAttn model outperformed other Deep Learning models, based on Recurrent Neural Networks, in two scenarios: late-classification and early-classification. Also, the TimeModAttn model outperformed a Balanced Random Forest (BRF) classifier (trained with real data), increasing the balanced-$F_1$score from $\approx.525$ to $\approx.596$. When training the BRF with synthetic data, this model achieved similar performance to the TimeModAttn model proposed while still maintaining extra advantages. 3) We conducted interpretability experiments. High attention scores were obtained for observations earlier than and close to the SN brightness peaks. This also correlated with an early highly variability of the learned temporal modulation. △ Less

Submitted 25 November, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

Comments: Submitted to AJ on 14-Jan-2022

arXiv:2106.07660 [pdf, other]

doi 10.3847/1538-3881/ac1426

Searching for changing-state AGNs in massive datasets -- I: applying deep learning and anomaly detection techniques to find AGNs with anomalous variability behaviours

Authors: P. Sánchez-Sáez, H. Lira, L. Martí, N. Sánchez-Pi, J. Arredondo, F. E. Bauer, A. Bayo, G. Cabrera-Vives, C. Donoso-Oliva, P. A. Estévez, S. Eyheramendy, F. Förster, L. Hernández-García, A. M. Muñoz Arancibia, M. Pérez-Carrasco, M. Sepúlveda, J. R. Vergara

Abstract: The classic classification scheme for Active Galactic Nuclei (AGNs) was recently challenged by the discovery of the so-called changing-state (changing-look) AGNs (CSAGNs). The physical mechanism behind this phenomenon is still a matter of open debate and the samples are too small and of serendipitous nature to provide robust answers. In order to tackle this problem, we need to design methods that… ▽ More The classic classification scheme for Active Galactic Nuclei (AGNs) was recently challenged by the discovery of the so-called changing-state (changing-look) AGNs (CSAGNs). The physical mechanism behind this phenomenon is still a matter of open debate and the samples are too small and of serendipitous nature to provide robust answers. In order to tackle this problem, we need to design methods that are able to detect AGN right in the act of changing-state. Here we present an anomaly detection (AD) technique designed to identify AGN light curves with anomalous behaviors in massive datasets. The main aim of this technique is to identify CSAGN at different stages of the transition, but it can also be used for more general purposes, such as cleaning massive datasets for AGN variability analyses. We used light curves from the Zwicky Transient Facility data release 5 (ZTF DR5), containing a sample of 230,451 AGNs of different classes. The ZTF DR5 light curves were modeled with a Variational Recurrent Autoencoder (VRAE) architecture, that allowed us to obtain a set of attributes from the VRAE latent space that describes the general behaviour of our sample. These attributes were then used as features for an Isolation Forest (IF) algorithm, that is an anomaly detector for a "one class" kind of problem. We used the VRAE reconstruction errors and the IF anomaly score to select a sample of 8,809 anomalies. These anomalies are dominated by bogus candidates, but we were able to identify 75 promising CSAGN candidates. △ Less

Submitted 12 July, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

Comments: Accepted for publication in the Astronomical Journal (AJ)

Journal ref: AJ 162 206 (2021)

arXiv:2106.03736 [pdf, other]

doi 10.1093/mnras/stab1598

The effect of phased recurrent units in the classification of multiple catalogs of astronomical lightcurves

Authors: C. Donoso-Oliva, G. Cabrera-Vives, P. Protopapas, R. Carrasco-Davis, P. A. Estevez

Abstract: In the new era of very large telescopes, where data is crucial to expand scientific knowledge, we have witnessed many deep learning applications for the automatic classification of lightcurves. Recurrent neural networks (RNNs) are one of the models used for these applications, and the LSTM unit stands out for being an excellent choice for the representation of long time series. In general, RNNs as… ▽ More In the new era of very large telescopes, where data is crucial to expand scientific knowledge, we have witnessed many deep learning applications for the automatic classification of lightcurves. Recurrent neural networks (RNNs) are one of the models used for these applications, and the LSTM unit stands out for being an excellent choice for the representation of long time series. In general, RNNs assume observations at discrete times, which may not suit the irregular sampling of lightcurves. A traditional technique to address irregular sequences consists of adding the sampling time to the network's input, but this is not guaranteed to capture sampling irregularities during training. Alternatively, the Phased LSTM unit has been created to address this problem by updating its state using the sampling times explicitly. In this work, we study the effectiveness of the LSTM and Phased LSTM based architectures for the classification of astronomical lightcurves. We use seven catalogs containing periodic and nonperiodic astronomical objects. Our findings show that LSTM outperformed PLSTM on 6/7 datasets. However, the combination of both units enhances the results in all datasets. △ Less

Submitted 7 June, 2021; originally announced June 2021.

arXiv:2008.03311 [pdf, other]

doi 10.3847/1538-3881/abd5c1

Alert Classification for the ALeRCE Broker System: The Light Curve Classifier

Authors: P. Sánchez-Sáez, I. Reyes, C. Valenzuela, F. Förster, S. Eyheramendy, F. Elorrieta, F. E. Bauer, G. Cabrera-Vives, P. A. Estévez, M. Catelan, G. Pignata, P. Huijse, D. De Cicco, P. Arévalo, R. Carrasco-Davis, J. Abril, R. Kurtev, J. Borissova, J. Arredondo, E. Castillo-Navarrete, D. Rodriguez, D. Ruz-Mieres, A. Moya, L. Sabatini-Gacitúa, C. Sepúlveda-Cobo , et al. (1 additional authors not shown)

Abstract: We present the first version of the ALeRCE (Automatic Learning for the Rapid Classification of Events) broker light curve classifier. ALeRCE is currently processing the Zwicky Transient Facility (ZTF) alert stream, in preparation for the Vera C. Rubin Observatory. The ALeRCE light curve classifier uses variability features computed from the ZTF alert stream, and colors obtained from AllWISE and ZT… ▽ More We present the first version of the ALeRCE (Automatic Learning for the Rapid Classification of Events) broker light curve classifier. ALeRCE is currently processing the Zwicky Transient Facility (ZTF) alert stream, in preparation for the Vera C. Rubin Observatory. The ALeRCE light curve classifier uses variability features computed from the ZTF alert stream, and colors obtained from AllWISE and ZTF photometry. We apply a Balanced Random Forest algorithm with a two-level scheme, where the top level classifies each source as periodic, stochastic, or transient, and the bottom level further resolves each of these hierarchical classes, amongst 15 total classes. This classifier corresponds to the first attempt to classify multiple classes of stochastic variables (including core- and host-dominated active galactic nuclei, blazars, young stellar objects, and cataclysmic variables) in addition to different classes of periodic and transient sources, using real data. We created a labeled set using various public catalogs (such as the Catalina Surveys and {\em Gaia} DR2 variable stars catalogs, and the Million Quasars catalog), and we classify all objects with $\geq6$ $g$-band or $\geq6$ $r$-band detections in ZTF (868,371 sources as of 2020/06/09), providing updated classifications for sources with new alerts every day. For the top level we obtain macro-averaged precision and recall scores of 0.96 and 0.99, respectively, and for the bottom level we obtain macro-averaged precision and recall scores of 0.57 and 0.76, respectively. Updated classifications from the light curve classifier can be found at the \href{http://alerce.online}{ALeRCE Explorer website}. △ Less

Submitted 19 November, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

Comments: 39 pages, 24 figures, 5 tables, 4 apendices. Accepted for publication in the Astronomical Journal (AJ)

arXiv:2008.03309 [pdf, other]

doi 10.3847/1538-3881/ac0ef1

Alert Classification for the ALeRCE Broker System: The Real-time Stamp Classifier

Authors: Rodrigo Carrasco-Davis, Esteban Reyes, Camilo Valenzuela, Francisco Förster, Pablo A. Estévez, Giuliano Pignata, Franz E. Bauer, Ignacio Reyes, Paula Sánchez-Sáez, Guillermo Cabrera-Vives, Susana Eyheramendy, Márcio Catelan, Javier Arredondo, Ernesto Castillo-Navarrete, Diego Rodríguez-Mancini, Daniela Ruz-Mieres, Alberto Moya, Luis Sabatini-Gacitúa, Cristóbal Sepúlveda-Cobo, Ashish A. Mahabal, Javier Silva-Farfán, Ernesto Camacho-Iñiquez, Lluís Galbany

Abstract: We present a real-time stamp classifier of astronomical events for the ALeRCE (Automatic Learning for the Rapid Classification of Events) broker. The classifier is based on a convolutional neural network, trained on alerts ingested from the Zwicky Transient Facility (ZTF). Using only the \textit{science, reference} and \textit{difference} images of the first detection as inputs, along with the met… ▽ More We present a real-time stamp classifier of astronomical events for the ALeRCE (Automatic Learning for the Rapid Classification of Events) broker. The classifier is based on a convolutional neural network, trained on alerts ingested from the Zwicky Transient Facility (ZTF). Using only the \textit{science, reference} and \textit{difference} images of the first detection as inputs, along with the metadata of the alert as features, the classifier is able to correctly classify alerts from active galactic nuclei, supernovae (SNe), variable stars, asteroids and bogus classes, with high accuracy ($\sim$94\%) in a balanced test set. In order to find and analyze SN candidates selected by our classifier from the ZTF alert stream, we designed and deployed a visualization tool called SN Hunter, where relevant information about each possible SN is displayed for the experts to choose among candidates to report to the Transient Name Server database. From June 26th 2019 to February 28th 2021, we have reported 6846 SN candidates to date (11.8 candidates per day on average), of which 971 have been confirmed spectroscopically. Our ability to report objects using only a single detection means that 70\% of the reported SNe occurred within one day after the first detection. ALeRCE has only reported candidates not otherwise detected or selected by other groups, therefore adding new early transients to the bulk of objects available for early follow-up. Our work represents an important milestone toward rapid alert classifications with the next generation of large etendue telescopes, such as the Vera C. Rubin Observatory. △ Less

Submitted 3 June, 2021; v1 submitted 7 August, 2020; originally announced August 2020.

Comments: Submitted to AAS on Jun 30th. Comments welcome

arXiv:2008.03303 [pdf, other]

doi 10.3847/1538-3881/abe9bc

The Automatic Learning for the Rapid Classification of Events (ALeRCE) Alert Broker

Authors: F. Förster, G. Cabrera-Vives, E. Castillo-Navarrete, P. A. Estévez, P. Sánchez-Sáez, J. Arredondo, F. E. Bauer, R. Carrasco-Davis, M. Catelan, F. Elorrieta, S. Eyheramendy, P. Huijse, G. Pignata, E. Reyes, I. Reyes, D. Rodríguez-Mancini, D. Ruz-Mieres, C. Valenzuela, I. Alvarez-Maldonado, N. Astorga, J. Borissova, A. Clocchiatti, D. De Cicco, C. Donoso-Oliva, M. J. Graham , et al. (15 additional authors not shown)

Abstract: We introduce the Automatic Learning for the Rapid Classification of Events (ALeRCE) broker, an astronomical alert broker designed to provide a rapid and self--consistent classification of large etendue telescope alert streams, such as that provided by the Zwicky Transient Facility (ZTF) and, in the future, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). ALeRCE is a Chilean--l… ▽ More We introduce the Automatic Learning for the Rapid Classification of Events (ALeRCE) broker, an astronomical alert broker designed to provide a rapid and self--consistent classification of large etendue telescope alert streams, such as that provided by the Zwicky Transient Facility (ZTF) and, in the future, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). ALeRCE is a Chilean--led broker run by an interdisciplinary team of astronomers and engineers, working to become intermediaries between survey and follow--up facilities. ALeRCE uses a pipeline which includes the real--time ingestion, aggregation, cross--matching, machine learning (ML) classification, and visualization of the ZTF alert stream. We use two classifiers: a stamp--based classifier, designed for rapid classification, and a light--curve--based classifier, which uses the multi--band flux evolution to achieve a more refined classification. We describe in detail our pipeline, data products, tools and services, which are made public for the community (see \url{https://alerce.science}). Since we began operating our real--time ML classification of the ZTF alert stream in early 2019, we have grown a large community of active users around the globe. We describe our results to date, including the real--time processing of $9.7\times10^7$ alerts, the stamp classification of $1.9\times10^7$ objects, the light curve classification of $8.5\times10^5$ objects, the report of 3088 supernova candidates, and different experiments using LSST-like alert streams. Finally, we discuss the challenges ahead to go from a single-stream of alerts such as ZTF to a multi--stream ecosystem dominated by LSST. △ Less

Submitted 7 August, 2020; originally announced August 2020.

Comments: Submitted to AAS on Jun 29th. Preview for LSST PCW 2020. Comments welcome

arXiv:2005.07795 [pdf, other]

doi 10.1109/IJCNN48605.2020.9207719

RED: Deep Recurrent Neural Networks for Sleep EEG Event Detection

Authors: Nicolás I. Tapia, Pablo A. Estévez

Abstract: The brain electrical activity presents several short events during sleep that can be observed as distinctive micro-structures in the electroencephalogram (EEG), such as sleep spindles and K-complexes. These events have been associated with biological processes and neurological disorders, making them a research topic in sleep medicine. However, manual detection limits their study because it is time… ▽ More The brain electrical activity presents several short events during sleep that can be observed as distinctive micro-structures in the electroencephalogram (EEG), such as sleep spindles and K-complexes. These events have been associated with biological processes and neurological disorders, making them a research topic in sleep medicine. However, manual detection limits their study because it is time-consuming and affected by significant inter-expert variability, motivating automatic approaches. We propose a deep learning approach based on convolutional and recurrent neural networks for sleep EEG event detection called Recurrent Event Detector (RED). RED uses one of two input representations: a) the time-domain EEG signal, or b) a complex spectrogram of the signal obtained with the Continuous Wavelet Transform (CWT). Unlike previous approaches, a fixed time window is avoided and temporal context is integrated to better emulate the visual criteria of experts. When evaluated on the MASS dataset, our detectors outperform the state of the art in both sleep spindle and K-complex detection with a mean F1-score of at least 80.9% and 82.6%, respectively. Although the CWT-domain model obtained a similar performance than its time-domain counterpart, the former allows in principle a more interpretable input representation due to the use of a spectrogram. The proposed approach is event-agnostic and can be used directly to detect other types of sleep events. △ Less

Submitted 3 October, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

Comments: 8 pages, 5 figures. In proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN 2020)

arXiv:2005.07783 [pdf, other]

doi 10.1109/IJCNN48605.2020.9207269

On the Information Plane of Autoencoders

Authors: Nicolás I. Tapia, Pablo A. Estévez

Abstract: The training dynamics of hidden layers in deep learning are poorly understood in theory. Recently, the Information Plane (IP) was proposed to analyze them, which is based on the information-theoretic concept of mutual information (MI). The Information Bottleneck (IB) theory predicts that layers maximize relevant information and compress irrelevant information. Due to the limitations in MI estimati… ▽ More The training dynamics of hidden layers in deep learning are poorly understood in theory. Recently, the Information Plane (IP) was proposed to analyze them, which is based on the information-theoretic concept of mutual information (MI). The Information Bottleneck (IB) theory predicts that layers maximize relevant information and compress irrelevant information. Due to the limitations in MI estimation from samples, there is an ongoing debate about the properties of the IP for the supervised learning case. In this work, we derive a theoretical convergence for the IP of autoencoders. The theory predicts that ideal autoencoders with a large bottleneck layer size do not compress input information, whereas a small size causes compression only in the encoder layers. For the experiments, we use a Gram-matrix based MI estimator recently proposed in the literature. We propose a new rule to adjust its parameters that compensates scale and dimensionality effects. Using our proposed rule, we obtain experimental IPs closer to the theory. Our theoretical IP for autoencoders could be used as a benchmark to validate new methods to estimate MI in neural networks. In this way, experimental limitations could be recognized and corrected, helping with the ongoing debate on the supervised learning case. △ Less

Submitted 3 October, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

Comments: 8 pages, 9 figures. In proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN 2020)

arXiv:2005.07779 [pdf, other]

Transformation Based Deep Anomaly Detection in Astronomical Images

Authors: Esteban Reyes, Pablo A. Estévez

Abstract: In this work, we propose several enhancements to a geometric transformation based model for anomaly detection in images (GeoTranform). The model assumes that the anomaly class is unknown and that only inlier samples are available for training. We introduce new filter based transformations useful for detecting anomalies in astronomical images, that highlight artifact properties to make them more ea… ▽ More In this work, we propose several enhancements to a geometric transformation based model for anomaly detection in images (GeoTranform). The model assumes that the anomaly class is unknown and that only inlier samples are available for training. We introduce new filter based transformations useful for detecting anomalies in astronomical images, that highlight artifact properties to make them more easily distinguishable from real objects. In addition, we propose a transformation selection strategy that allows us to find indistinguishable pairs of transformations. This results in an improvement of the area under the Receiver Operating Characteristic curve (AUROC) and accuracy performance, as well as in a dimensionality reduction. The models were tested on astronomical images from the High Cadence Transient Survey (HiTS) and Zwicky Transient Facility (ZTF) datasets. The best models obtained an average AUROC of 99.20% for HiTS and 91.39% for ZTF. The improvement over the original GeoTransform algorithm and baseline methods such as One-Class Support Vector Machine, and deep learning based methods is significant both statistically and in practice. △ Less

Submitted 15 May, 2020; originally announced May 2020.

Comments: 8 pages, 6 figures, 4 tables. Accepted for publication in proceedings of the IEEE World Congress on Computational Intelligence (IEEE WCCI), Glasgow, UK, 19-24 July, 2020

arXiv:2003.05499 [pdf, other]

doi 10.3847/1538-3881/ab7338

Asteroids' Size Distribution and Colors from HiTS

Authors: J. Peña, C. Fuentes, F. Förster, J. Martínez-Palomera, G. Cabrera-Vives, J. C. Maureira, P. Huijse, P. A. Estévez, L. Galbany, S. González-Gaitán, Th. de Jaeger

Abstract: We report the observations of solar system objects during the 2015 campaign of the High cadence Transient Survey (HiTS). We found 5740 bodies (mostly Main Belt asteroids), 1203 of which were detected in different nights and in $g'$ and $r'$. Objects were linked in the barycenter system and their orbital parameters were computed assuming Keplerian motion. We identified 6 near Earth objects, 1738 Ma… ▽ More We report the observations of solar system objects during the 2015 campaign of the High cadence Transient Survey (HiTS). We found 5740 bodies (mostly Main Belt asteroids), 1203 of which were detected in different nights and in $g'$ and $r'$. Objects were linked in the barycenter system and their orbital parameters were computed assuming Keplerian motion. We identified 6 near Earth objects, 1738 Main Belt asteroids and 4 Trans-Neptunian objects. We did not find a $g'-r'$ color-size correlation for $14<H_{g'}<18$ ($1<D<10$ km) asteroids. We show asteroids' colors are disturbed by HiTS' 1.6 hour cadence and estimate that observations should be separated by at most 14 minutes to avoid confusion in future wide-field surveys like LSST. The size distribution for the Main Belt objects can be characterized as a simple power law with slope $\sim0.9$, steeper than in any other survey, while data from HiTS 2014's campaign is consistent with previous ones (slopes $\sim0.68$ at the bright end and $\sim0.34$ at the faint end). This difference is likely due to the ecliptic distribution of the Main Belt since 2015's campaign surveyed farther from the ecliptic than did 2014's and most previous surveys. △ Less

Submitted 13 March, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

Comments: 17 pages, 18 figures

Journal ref: The Astronomical Journal, Volume 159, Number 4, Page 148, Year 2020

arXiv:1809.06379 [pdf, other]

doi 10.1038/s41550-018-0563-4

The delay of shock breakout due to circumstellar material seen in most Type II Supernovae

Authors: F. Förster, T. J. Moriya, J. C. Maureira, J. P. Anderson, S. Blinnikov, F. Bufano, G. Cabrera-Vives, A. Clocchiatti, Th. de Jaeger, P. A. Estévez, L. Galbany, S. González-Gaitán, G. Gräfener, M. Hamuy, E. Hsiao, P. Huentelemu, P. Huijse, H. Kuncarayakti, J. Martínez-Palomera, G. Medina, F. Olivares E., G. Pignata, A. Razza, I. Reyes, J. San Martín , et al. (13 additional authors not shown)

Abstract: Type II supernovae (SNe) originate from the explosion of hydrogen-rich supergiant massive stars. Their first electromagnetic signature is the shock breakout, a short-lived phenomenon which can last from hours to days depending on the density at shock emergence. We present 26 rising optical light curves of SN II candidates discovered shortly after explosion by the High cadence Transient Survey (HiT… ▽ More Type II supernovae (SNe) originate from the explosion of hydrogen-rich supergiant massive stars. Their first electromagnetic signature is the shock breakout, a short-lived phenomenon which can last from hours to days depending on the density at shock emergence. We present 26 rising optical light curves of SN II candidates discovered shortly after explosion by the High cadence Transient Survey (HiTS) and derive physical parameters based on hydrodynamical models using a Bayesian approach. We observe a steep rise of a few days in 24 out of 26 SN II candidates, indicating the systematic detection of shock breakouts in a dense circumstellar matter consistent with a mass loss rate $\dot{M} > 10^{-4} M_\odot yr^{-1}$ or a dense atmosphere. This implies that the characteristic hour timescale signature of stellar envelope SBOs may be rare in nature and could be delayed into longer-lived circumstellar material shock breakouts in most Type II SNe. △ Less

Submitted 17 September, 2018; originally announced September 2018.

Comments: Published in Nature Astronomy (https://www.nature.com/articles/s41550-018-0563-4). 41 pages including methods. 5 figures in main text) + 8 figures in methods

Journal ref: Nature Astronomy, 2018

arXiv:1808.03626 [pdf, other]

doi 10.1109/IJCNN.2018.8489627

Enhanced Rotational Invariant Convolutional Neural Network for Supernovae Detection

Authors: Esteban Reyes, Pablo A. Estévez, Ignacio Reyes, Guillermo Cabrera-Vives, Pablo Huijse, Rodrigo Carrasco-Davis, Francisco Förster

Abstract: In this paper, we propose an enhanced CNN model for detecting supernovae (SNe). This is done by applying a new method for obtaining rotational invariance that exploits cyclic symmetry. In addition, we use a visualization approach, the layer-wise relevance propagation (LRP) method, which allows finding the relevant pixels in each image that contribute to discriminate between SN candidates and artif… ▽ More In this paper, we propose an enhanced CNN model for detecting supernovae (SNe). This is done by applying a new method for obtaining rotational invariance that exploits cyclic symmetry. In addition, we use a visualization approach, the layer-wise relevance propagation (LRP) method, which allows finding the relevant pixels in each image that contribute to discriminate between SN candidates and artifacts. We introduce a measure to assess quantitatively the effect of the rotational invariant methods on the LRP relevance heatmaps. This allows comparing the proposed method, CAP, with the original Deep-HiTS model. The results show that the enhanced method presents an augmented capacity for achieving rotational invariance with respect to the original model. An ensemble of CAP models obtained the best results so far on the HiTS dataset, reaching an average accuracy of 99.53%. The improvement over Deep-HiTS is significant both statistically and in practice. △ Less

Submitted 10 August, 2018; originally announced August 2018.

Comments: 8 pages, 5 figures. Accepted for publication in proceedings of the IEEE World Congress on Computational Intelligence (IEEE WCCI), Rio de Janeiro, Brazil, 8-13 July, 2018

arXiv:1807.03869 [pdf, other]

doi 10.1088/1538-3873/aaef12

Deep Learning for Image Sequence Classification of Astronomical Events

Authors: Rodrigo Carrasco-Davis, Guillermo Cabrera-Vives, Francisco Förster, Pablo A. Estévez, Pablo Huijse, Pavlos Protopapas, Ignacio Reyes, Jorge Martínez-Palomera, Cristóbal Donoso

Abstract: We propose a new sequential classification model for astronomical objects based on a recurrent convolutional neural network (RCNN) which uses sequences of images as inputs. This approach avoids the computation of light curves or difference images. This is the first time that sequences of images are used directly for the classification of variable objects in astronomy. The second contribution of th… ▽ More We propose a new sequential classification model for astronomical objects based on a recurrent convolutional neural network (RCNN) which uses sequences of images as inputs. This approach avoids the computation of light curves or difference images. This is the first time that sequences of images are used directly for the classification of variable objects in astronomy. The second contribution of this work is the image simulation process. We generate synthetic image sequences that take into account the instrumental and observing conditions, obtaining a realistic, set of movies for each astronomical object. The simulated dataset is used to train our RCNN classifier. This approach allows us to generate datasets to train and test our RCNN model for different astronomical surveys and telescopes. We aim at building a simulated dataset whose distribution is close enough to the real dataset, so that a fine tuning could match the distributions between real and simulated dataset. To test the RCNN classifier trained with the synthetic dataset, we used real-world data from the High cadence Transient Survey (HiTS) obtaining an average recall of 85%, improved to 94% after performing fine tuning with 10 real samples per class. We compare the results of our model with those of a light curve random forest classifier. The proposed RCNN with fine tuning has a similar performance on the HiTS dataset compared to the light curve classifier, trained on an augmented training set with 10 real samples per class. The RCNN approach presents several advantages in an alert stream classification scenario, such as a reduction of the data pre-processing, faster online evaluation and easier performance improvement using a few real data samples. These results encourage us to use this method for alert brokers systems that will process alert streams generated by new telescopes such as the Large Synoptic Survey Telescope. △ Less

Submitted 7 November, 2018; v1 submitted 10 July, 2018; originally announced July 2018.

Comments: 20 pages, 20 figures (corrected compilation errors). This is an Accepted Manuscript version of an article accepted for publication in Publications of the Astronomical Society of the Pacific. Nether the Astronomical Society of the Pacific nor IOP Publishing Ltd is responsible for any errors or omissions in this version of the manuscript or any version derived from it

arXiv:1806.03352 [pdf, other]

doi 10.3847/1538-3881/aaaaed

Asteroids in the High cadence Transient Survey

Authors: J. Peña, C. Fuentes, F. Förster, J. C. Maureira, J. San Martín, J. Littín, P. Huijse, G. Cabrera-Vives, P. A. Estévez, L. Galbany, S. González-Gaitán, J. Martínez, Th. de Jaeger, M. Hamuy

Abstract: We report on the serendipitous observations of Solar System objects imaged during the High cadence Transient Survey (HiTS) 2014 observation campaign. Data from this high cadence, wide field survey was originally analyzed for finding variable static sources using Machine Learning to select the most-likely candidates. In this work we search for moving transients consistent with Solar System objects… ▽ More We report on the serendipitous observations of Solar System objects imaged during the High cadence Transient Survey (HiTS) 2014 observation campaign. Data from this high cadence, wide field survey was originally analyzed for finding variable static sources using Machine Learning to select the most-likely candidates. In this work we search for moving transients consistent with Solar System objects and derive their orbital parameters. We use a simple, custom detection algorithm to link trajectories and assume Keplerian motion to derive the asteroid's orbital parameters. We use known asteroids from the Minor Planet Center (MPC) database to assess the detection efficiency of the survey and our search algorithm. Trajectories have an average of nine detections spread over 2 days, and our fit yields typical errors of $σしぐま_a\sim 0.07 ~{\rm AUえーゆー}$, $σしぐま_{\rm e} \sim 0.07 $ and $σしぐま_i\sim 0.^{\circ}5~ {\rm deg}$ in semi-major axis, eccentricity, and inclination respectively for known asteroids in our sample. We extract 7,700 orbits from our trajectories, identifying 19 near Earth objects, 6,687 asteroids, 14 Centaurs, and 15 trans-Neptunian objects. This highlights the complementarity of supernova wide field surveys for Solar System research and the significance of machine learning to clean data of false detections. It is a good example of the data--driven science that LSST will deliver. △ Less

Submitted 8 June, 2018; originally announced June 2018.

Comments: 9 pages, 7 figures

Journal ref: The Astronomical Journal, Volume 155, Year 2018, Page 135

arXiv:1709.07919 [pdf, other]

doi 10.1051/0004-6361/201731462

Proper motions in the VVV Survey: Results for more than 15 million stars across NGC 6544

Authors: R. Contreras Ramos, M. Zoccali, F. Rojas, A. Rojas-Arriagada, M. Gárate, P. Huijse, F. Gran, M. Soto, A. A. R. Valcarce, P. A. Estévez, D. Minniti

Abstract: Context: In the last six years, the VVV survey mapped 562 sq. deg. across the bulge and southern disk of the Galaxy. However, a detailed study of these regions, which includes $\sim 36$ globular clusters (GCs) and thousands of open clusters is by no means an easy challenge. High differential reddening and severe crowding along the line of sight makes highly hamper to reliably distinguish stars bel… ▽ More Context: In the last six years, the VVV survey mapped 562 sq. deg. across the bulge and southern disk of the Galaxy. However, a detailed study of these regions, which includes $\sim 36$ globular clusters (GCs) and thousands of open clusters is by no means an easy challenge. High differential reddening and severe crowding along the line of sight makes highly hamper to reliably distinguish stars belonging to different populations and/or systems. Aims: The aim of this study is to separate stars that likely belong to the Galactic GC NGC 6544 from its surrounding field by means of proper motion (PM) techniques. Methods: This work was based upon a new astrometric reduction method optimized for images of the VVV survey. Results: Photometry over the six years baseline of the survey allowed us to obtain a mean precision of $\sim0.51$ mas/yr, in each PM coordinate, for stars with Ks < 15 mag. In the area studied here, cluster stars separate very well from field stars, down to the main sequence turnoff and below, allowing us to derive for the first time the absolute PM of NGC 6544. Isochrone fitting on the clean and differential reddening corrected cluster color magnitude diagram yields an age of $\sim$ 11-13 Gyr, and metallicity [Fe/H] = -1.5 dex, in agreement with previous studies restricted to the cluster core. We were able to derive the cluster orbit assuming an axisymmetric model of the Galaxy and conclude that NGC 6544 is likely a halo GC. We have not detected tidal tail signatures associated to the cluster, but a remarkable elongation in the galactic center direction has been found. The precision achieved in the PM determination also allows us to separate bulge stars from foreground disk stars, enabling the kinematical selection of bona fide bulge stars across the whole survey area. Our results show that VVV data is perfectly suitable for this kind of analysis. △ Less

Submitted 22 September, 2017; originally announced September 2017.

Comments: 13 pages, 12 figures, accepted in A&A

arXiv:1709.03541 [pdf, other]

doi 10.3847/1538-4365/aab77c

Robust period estimation using mutual information for multi-band light curves in the synoptic survey era

Authors: Pablo Huijse, Pablo A. Estevez, Francisco Forster, Scott F. Daniel, Andrew J. Connolly, Pavlos Protopapas, Rodrigo Carrasco, Jose C. Principe

Abstract: The Large Synoptic Survey Telescope (LSST) will produce an unprecedented amount of light curves using six optical bands. Robust and efficient methods that can aggregate data from multidimensional sparsely-sampled time series are needed. In this paper we present a new method for light curve period estimation based on the quadratic mutual information (QMI). The proposed method does not assume a part… ▽ More The Large Synoptic Survey Telescope (LSST) will produce an unprecedented amount of light curves using six optical bands. Robust and efficient methods that can aggregate data from multidimensional sparsely-sampled time series are needed. In this paper we present a new method for light curve period estimation based on the quadratic mutual information (QMI). The proposed method does not assume a particular model for the light curve nor its underlying probability density and it is robust to non-Gaussian noise and outliers. By combining the QMI from several bands the true period can be estimated even when no single-band QMI yields the period. Period recovery performance as a function of average magnitude and sample size is measured using 30,000 synthetic multi-band light curves of RR Lyrae and Cepheid variables generated by the LSST Operations and Catalog simulators. The results show that aggregating information from several bands is highly beneficial in LSST sparsely-sampled time series, obtaining an absolute increase in period recovery rate up to 50%. We also show that the QMI is more robust to noise and light curve length (sample size) than the multiband generalizations of the Lomb Scargle and Analysis of Variance periodograms, recovering the true period in 10-30% more cases than its competitors. A python package containing efficient Cython implementations of the QMI and other methods is provided. △ Less

Submitted 11 September, 2017; originally announced September 2017.

Comments: Accepted for publication ApJ Supplement Series: Special Issue on Solar/Stellar Astronomy Big Data

arXiv:1701.00458 [pdf, ps, other]

doi 10.3847/1538-4357/836/1/97

Deep-HiTS: Rotation Invariant Convolutional Neural Network for Transient Detection

Authors: Guillermo Cabrera-Vives, Ignacio Reyes, Francisco Förster, Pablo A. Estévez, Juan-Carlos Maureira

Abstract: We introduce Deep-HiTS, a rotation invariant convolutional neural network (CNN) model for classifying images of transients candidates into artifacts or real sources for the High cadence Transient Survey (HiTS). CNNs have the advantage of learning the features automatically from the data while achieving high performance. We compare our CNN model against a feature engineering approach using random f… ▽ More We introduce Deep-HiTS, a rotation invariant convolutional neural network (CNN) model for classifying images of transients candidates into artifacts or real sources for the High cadence Transient Survey (HiTS). CNNs have the advantage of learning the features automatically from the data while achieving high performance. We compare our CNN model against a feature engineering approach using random forests (RF). We show that our CNN significantly outperforms the RF model reducing the error by almost half. Furthermore, for a fixed number of approximately 2,000 allowed false transient candidates per night we are able to reduce the miss-classified real transients by approximately 1/5. To the best of our knowledge, this is the first time CNNs have been used to detect astronomical transient events. Our approach will be very useful when processing images from next generation instruments such as the Large Synoptic Survey Telescope (LSST). We have made all our code and data available to the community for the sake of allowing further developments and comparisons at https://github.com/guille-c/Deep-HiTS. △ Less

Submitted 2 January, 2017; originally announced January 2017.

Journal ref: The Astrophysical Journal, 2017

arXiv:1609.03567 [pdf, other]

doi 10.3847/0004-637X/832/2/155

The High Cadence Transient Survey (HiTS) - I. Survey design and supernova shock breakout constraints

Authors: Francisco Förster, Juan C. Maureira, Jaime San Martín, Mario Hamuy, Jorge Martínez, Pablo Huijse, Guillermo Cabrera, Lluís Galbany, Thomas de Jaeger, Santiago González-Gaitán, Joseph P. Anderson, Hanindyo Kuncarayakti, Giuliano Pignata, Filomena Bufano, Jorge Littín, Felipe Olivares, Gustavo Medina, R. Chris Smith, A. Katherina Vivas, Pablo A. Estévez, Ricardo Muñoz, Eduardo Vera

Abstract: We present the first results of the High cadence Transient Survey (HiTS), a survey whose objective is to detect and follow up optical transients with characteristic timescales from hours to days, especially the earliest hours of supernova (SN) explosions. HiTS uses the Dark Energy Camera (DECam) and a custom made pipeline for image subtraction, candidate filtering and candidate visualization, whic… ▽ More We present the first results of the High cadence Transient Survey (HiTS), a survey whose objective is to detect and follow up optical transients with characteristic timescales from hours to days, especially the earliest hours of supernova (SN) explosions. HiTS uses the Dark Energy Camera (DECam) and a custom made pipeline for image subtraction, candidate filtering and candidate visualization, which runs in real-time to be able to react rapidly to the new transients. We discuss the survey design, the technical challenges associated with the real-time analysis of these large volumes of data and our first results. In our 2013, 2014 and 2015 campaigns we have detected more than 120 young SN candidates, but we did not find a clear signature from the short-lived SN shock breakouts (SBOs) originating after the core collapse of red supergiant stars, which was the initial science aim of this survey. Using the empirical distribution of limiting-magnitudes from our observational campaigns we measured the expected recovery fraction of randomly injected SN light curves which included SBO optical peaks produced with models from Tominaga et al. (2011) and Nakar & Sari (2010). From this analysis we cannot rule out the models from Tominaga et al. (2011) under any reasonable distributions of progenitor masses, but we can marginally rule out the brighter and longer-lived SBO models from Nakar & Sari (2010) under our best-guess distribution of progenitor masses. Finally, we highlight the implications of this work for future massive datasets produced by astronomical observatories such as LSST. △ Less

Submitted 12 September, 2016; originally announced September 2016.

Comments: 30 pages, 14 figures, accepted for publication in ApJ

arXiv:1509.07823 [pdf, other]

doi 10.1109/MCI.2014.2326100

Computational Intelligence Challenges and Applications on Large-Scale Astronomical Time Series Databases

Authors: Pablo Huijse, Pablo A. Estevez, Pavlos Protopapas, Jose C. Principe, Pablo Zegers

Abstract: Time-domain astronomy (TDA) is facing a paradigm shift caused by the exponential growth of the sample size, data complexity and data generation rates of new astronomical sky surveys. For example, the Large Synoptic Survey Telescope (LSST), which will begin operations in northern Chile in 2022, will generate a nearly 150 Petabyte imaging dataset of the southern hemisphere sky. The LSST will stream… ▽ More Time-domain astronomy (TDA) is facing a paradigm shift caused by the exponential growth of the sample size, data complexity and data generation rates of new astronomical sky surveys. For example, the Large Synoptic Survey Telescope (LSST), which will begin operations in northern Chile in 2022, will generate a nearly 150 Petabyte imaging dataset of the southern hemisphere sky. The LSST will stream data at rates of 2 Terabytes per hour, effectively capturing an unprecedented movie of the sky. The LSST is expected not only to improve our understanding of time-varying astrophysical objects, but also to reveal a plethora of yet unknown faint and fast-varying phenomena. To cope with a change of paradigm to data-driven astronomy, the fields of astroinformatics and astrostatistics have been created recently. The new data-oriented paradigms for astronomy combine statistics, data mining, knowledge discovery, machine learning and computational intelligence, in order to provide the automated and robust methods needed for the rapid detection and classification of known astrophysical objects as well as the unsupervised characterization of novel phenomena. In this article we present an overview of machine learning and computational intelligence applications to TDA. Future big data challenges and new lines of research in TDA, focusing on the LSST, are identified and discussed from the viewpoint of computational intelligence/machine learning. Interdisciplinary collaboration will be required to cope with the challenges posed by the deluge of astronomical data coming from the LSST. △ Less

Submitted 25 September, 2015; originally announced September 2015.

Journal ref: IEEE Computational Intelligence Magazine, vol. 9, n. 3, pp. 27-39, 2014

arXiv:1509.07577 [pdf, ps, other]

doi 10.1007/s00521-013-1368-0

A Review of Feature Selection Methods Based on Mutual Information

Authors: Jorge R. Vergara, Pablo A. Estévez

Abstract: In this work we present a review of the state of the art of information theoretic feature selection methods. The concepts of feature relevance, redundance and complementarity (synergy) are clearly defined, as well as Markov blanket. The problem of optimal feature selection is defined. A unifying theoretical framework is described, which can retrofit successful heuristic criteria, indicating the ap… ▽ More In this work we present a review of the state of the art of information theoretic feature selection methods. The concepts of feature relevance, redundance and complementarity (synergy) are clearly defined, as well as Markov blanket. The problem of optimal feature selection is defined. A unifying theoretical framework is described, which can retrofit successful heuristic criteria, indicating the approximations made by each method. A number of open problems in the field are presented. △ Less

Submitted 24 September, 2015; originally announced September 2015.

Journal ref: Neural Computing & Applications, vol. 24 (1), pp. 175-186, 2014

arXiv:1509.07093 [pdf, other]

doi 10.1007/s00521-013-1535-3

A review of learning vector quantization classifiers

Authors: David Nova, Pablo A. Estevez

Abstract: In this work we present a review of the state of the art of Learning Vector Quantization (LVQ) classifiers. A taxonomy is proposed which integrates the most relevant LVQ approaches to date. The main concepts associated with modern LVQ approaches are defined. A comparison is made among eleven LVQ classifiers using one real-world and two artificial datasets. In this work we present a review of the state of the art of Learning Vector Quantization (LVQ) classifiers. A taxonomy is proposed which integrates the most relevant LVQ approaches to date. The main concepts associated with modern LVQ approaches are defined. A comparison is made among eleven LVQ classifiers using one real-world and two artificial datasets. △ Less

Submitted 23 September, 2015; originally announced September 2015.

Comments: 14 pages

Journal ref: Neural Computing & Applications, vol. 25, pp. 511-524, 2014

arXiv:1412.1840 [pdf, ps, other]

doi 10.1088/0067-0049/216/2/25

A Novel, Fully Automated Pipeline for Period Estimation in the EROS 2 Data Set

Authors: Pavlos Protopapas, Pablo Huijse, Pablo A. Estevez, Pablo Zegers, Jose C. Principe

Abstract: We present a new method to discriminate periodic from non-periodic irregularly sampled lightcurves. We introduce a periodic kernel and maximize a similarity measure derived from information theory to estimate the periods and a discriminator factor. We tested the method on a dataset containing 100,000 synthetic periodic and non-periodic lightcurves with various periods, amplitudes and shapes genera… ▽ More We present a new method to discriminate periodic from non-periodic irregularly sampled lightcurves. We introduce a periodic kernel and maximize a similarity measure derived from information theory to estimate the periods and a discriminator factor. We tested the method on a dataset containing 100,000 synthetic periodic and non-periodic lightcurves with various periods, amplitudes and shapes generated using a multivariate generative model. We correctly identified periodic and non-periodic lightcurves with a completeness of 90% and a precision of 95%, for lightcurves with a signal-to-noise ratio (SNR) larger than 0.5. We characterize the efficiency and reliability of the model using these synthetic lightcurves and applied the method on the EROS-2 dataset. A crucial consideration is the speed at which the method can be executed. Using hierarchical search and some simplification on the parameter search we were able to analyze 32.8 million lightcurves in 18 hours on a cluster of GPGPUs. Using the sensitivity analysis on the synthetic dataset, we infer that 0.42% in the LMC and 0.61% in the SMC of the sources show periodic behavior. The training set, the catalogs and source code are all available in http://timemachine.iic.harvard.edu. △ Less

Submitted 4 December, 2014; originally announced December 2014.

Journal ref: The Astrophysical Journal Supplement Series, Volume 216, Number 2, 2015

arXiv:1212.2398 [pdf, other]

doi 10.1109/TSP.2012.2204260

An Information Theoretic Algorithm for Finding Periodicities in Stellar Light Curves

Authors: Pablo Huijse, Pablo A. Estevez, Pavlos Protopapas, Pablo Zegers, Jose C. Principe

Abstract: We propose a new information theoretic metric for finding periodicities in stellar light curves. Light curves are astronomical time series of brightness over time, and are characterized as being noisy and unevenly sampled. The proposed metric combines correntropy (generalized correlation) with a periodic kernel to measure similarity among samples separated by a given period. The new metric provide… ▽ More We propose a new information theoretic metric for finding periodicities in stellar light curves. Light curves are astronomical time series of brightness over time, and are characterized as being noisy and unevenly sampled. The proposed metric combines correntropy (generalized correlation) with a periodic kernel to measure similarity among samples separated by a given period. The new metric provides a periodogram, called Correntropy Kernelized Periodogram (CKP), whose peaks are associated with the fundamental frequencies present in the data. The CKP does not require any resampling, slotting or folding scheme as it is computed directly from the available samples. CKP is the main part of a fully-automated pipeline for periodic light curve discrimination to be used in astronomical survey databases. We show that the CKP method outperformed the slotted correntropy, and conventional methods used in astronomy for periodicity discrimination and period estimation tasks, using a set of light curves drawn from the MACHO survey. The proposed metric achieved 97.2% of true positives with 0% of false positives at the confidence level of 99% for the periodicity discrimination task; and 88% of hits with 11.6% of multiples and 0.4% of misses in the period estimation task. △ Less

Submitted 11 December, 2012; originally announced December 2012.

Journal ref: IEEE Transactions on Signal Processing, vol. 60, issue 10, pp. 5135-5145, October 2012

arXiv:1112.2962 [pdf]

doi 10.1109/LSP.2011.2141987

Period Estimation in Astronomical Time Series Using Slotted Correntropy

Authors: Pablo Huijse, Pablo A. Estévez, Pablo Zegers, José Príncipe, Pavlos Protopapas

Abstract: In this letter, we propose a method for period estimation in light curves from periodic variable stars using correntropy. Light curves are astronomical time series of stellar brightness over time, and are characterized as being noisy and unevenly sampled. We propose to use slotted time lags in order to estimate correntropy directly from irregularly sampled time series. A new information theoretic… ▽ More In this letter, we propose a method for period estimation in light curves from periodic variable stars using correntropy. Light curves are astronomical time series of stellar brightness over time, and are characterized as being noisy and unevenly sampled. We propose to use slotted time lags in order to estimate correntropy directly from irregularly sampled time series. A new information theoretic metric is proposed for discriminating among the peaks of the correntropy spectral density. The slotted correntropy method outperformed slotted correlation, string length, VarTools (Lomb-Scargle periodogram and Analysis of Variance), and SigSpec applications on a set of light curves drawn from the MACHO survey. △ Less

Submitted 13 December, 2011; originally announced December 2011.

Journal ref: IEEE Signal Processing Letters, vol. 18, no. 6, pp. 371-374, year 2011

Showing 1–29 of 29 results for author: Estévez, P A