Abstract
We present the first version of the Automatic Learning for the Rapid Classification of Events (ALeRCE) broker light curve classifier. ALeRCE is currently processing the Zwicky Transient ...Facility (ZTF) alert stream, in preparation for the Vera C. Rubin Observatory. The ALeRCE light curve classifier uses variability features computed from the ZTF alert stream and colors obtained from AllWISE and ZTF photometry. We apply a balanced random forest algorithm with a two-level scheme where the top level classifies each source as periodic, stochastic, or transient, and the bottom level further resolves each of these hierarchical classes among 15 total classes. This classifier corresponds to the first attempt to classify multiple classes of stochastic variables (including core- and host-dominated active galactic nuclei, blazars, young stellar objects, and cataclysmic variables) in addition to different classes of periodic and transient sources, using real data. We created a labeled set using various public catalogs (such as the Catalina Surveys and Gaia DR2 variable stars catalogs, and the Million Quasars catalog), and we classify all objects with ≥6
g
-band or ≥6
r
-band detections in ZTF (868,371 sources as of 2020 June 9), providing updated classifications for sources with new alerts every day. For the top level we obtain macro-averaged precision and recall scores of 0.96 and 0.99, respectively, and for the bottom level we obtain macro-averaged precision and recall scores of 0.57 and 0.76, respectively. Updated classifications from the light curve classifier can be found at the ALeRCE Explorer website (
http://alerce.online
).
Abstract
We introduce the Automatic Learning for the Rapid Classification of Events (ALeRCE) broker, an astronomical alert broker designed to provide a rapid and self-consistent classification of ...large etendue telescope alert streams, such as that provided by the Zwicky Transient Facility (ZTF) and, in the future, the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). ALeRCE is a Chilean-led broker run by an interdisciplinary team of astronomers and engineers working to become intermediaries between survey and follow-up facilities. ALeRCE uses a pipeline that includes the real-time ingestion, aggregation, cross-matching, machine-learning (ML) classification, and visualization of the ZTF alert stream. We use two classifiers: a stamp-based classifier, designed for rapid classification, and a light curve–based classifier, which uses the multiband flux evolution to achieve a more refined classification. We describe in detail our pipeline, data products, tools, and services, which are made public for the community (see
https://alerce.science
). Since we began operating our real-time ML classification of the ZTF alert stream in early 2019, we have grown a large community of active users around the globe. We describe our results to date, including the real-time processing of 1.5 × 10
8
alerts, the stamp classification of 3.4 × 10
7
objects, the light-curve classification of 1.1 × 10
6
objects, the report of 6162 supernova candidates, and different experiments using LSST-like alert streams. Finally, we discuss the challenges ahead in going from a single stream of alerts such as ZTF to a multistream ecosystem dominated by LSST.
Context. The VISTA Variables in the Vía Láctea (VVV) is a near-IR time-domain survey of the Galactic bulge and southern plane. One of the main goals of this survey is to reveal the 3D structure of ...the Milky Way through their variable stars. In particular, enormous numbers of RR Lyrae stars have been discovered in the inner regions of the bulge (−8° ≲ b ≲ −1°) by optical surveys such as OGLE and MACHO, but leaving an unexplored window of more than ~47 sq deg (−10.0° ≲ ℓ ≲ + 10.7° and − 10.3° ≲ b ≲ −8.0°) observed by the VVV Survey. Aims. Our goal is to characterize the RR Lyrae stars in the outer bulge in terms of their periods, amplitudes, Fourier coefficients, and distances in order to evaluate the 3D structure of the bulge in this area. The distance distribution of RR Lyrae stars will be compared to that of red clump stars, which is known to trace a X-shaped structure, in order to determine whether these two different stellar populations share the same Galactic distribution. Methods. A search for RR Lyrae stars was performed in more than ~47 sq deg at low Galactic latitudes (−10.3° ≲ b ≲ −8.0°). In the procedure the χ2 value and analysis of variance (AoV) statistic methods were used to determine the variability and periodic features of the light curves, respectively. To prevent misclassifications, the analysis was performed only on the fundamental mode RR Lyrae stars (RRab) owing to similarities found in the near-IR light curve shapes of contact eclipsing binaries (W UMa) and first overtone RR Lyrae stars (RRc). On the other hand, the red clump stars of the same analyzed tiles were selected, and cuts in the color-magnitude diagram were applied and the maximum distance restricted to ~20 kpc in order to construct a similar catalog in terms of distances and covered area compared to the RR Lyrae stars. Results. We report the detection of more than 1000 RR Lyrae ab-type stars in the VVV Survey located in the outskirts of the Galactic bulge. A few of them are possibly associated with the Sagittarius Dwarf Spheroidal Galaxy. We calculated colours, reddening, extinction, and distances of the detected RR Lyrae stars in order to determine the outer bulge 3D structure. Our main result is that, at the low galactic latitudes mapped here, the RR Lyrae stars trace a centrally concentrated spheroidal distribution. This is a noticeably different spatial distribution to the one traced by red clump stars known to follow a bar and X-shaped structure. We estimate the completeness of our sample at 80% for Ks ≤ 15 mag.
Abstract
We present a real-time stamp classifier of astronomical events for the Automatic Learning for the Rapid Classification of Events broker, ALeRCE. The classifier is based on a convolutional ...neural network, trained on alerts ingested from the Zwicky Transient Facility (ZTF). Using only the
science, reference,
and
difference
images of the first detection as inputs, along with the metadata of the alert as features, the classifier is able to correctly classify alerts from active galactic nuclei, supernovae (SNe), variable stars, asteroids, and bogus classes, with high accuracy (∼94%) in a balanced test set. In order to find and analyze SN candidates selected by our classifier from the ZTF alert stream, we designed and deployed a visualization tool called SN Hunter, where relevant information about each possible SN is displayed for the experts to choose among candidates to report to the Transient Name Server database. From 2019 June 26 to 2021 February 28, we have reported 6846 SN candidates to date (11.8 candidates per day on average), of which 971 have been confirmed spectroscopically. Our ability to report objects using only a single detection means that 70% of the reported SNe occurred within one day after the first detection. ALeRCE has only reported candidates not otherwise detected or selected by other groups, therefore adding new early transients to the bulk of objects available for early follow-up. Our work represents an important milestone toward rapid alert classifications with the next generation of large etendue telescopes, such as the Vera C. Rubin Observatory.
Abstract
The classic classification scheme for active galactic nuclei (AGNs) was recently challenged by the discovery of the so-called changing-state (changing-look) AGNs. The physical mechanism ...behind this phenomenon is still a matter of open debate and the samples are too small and of serendipitous nature to provide robust answers. In order to tackle this problem, we need to design methods that are able to detect AGNs right in the act of changing state. Here we present an anomaly-detection technique designed to identify AGN light curves with anomalous behaviors in massive data sets. The main aim of this technique is to identify CSAGN at different stages of the transition, but it can also be used for more general purposes, such as cleaning massive data sets for AGN variability analyses. We used light curves from the Zwicky Transient Facility data release 5 (ZTF DR5), containing a sample of 230,451 AGNs of different classes. The ZTF DR5 light curves were modeled with a Variational Recurrent Autoencoder (VRAE) architecture, that allowed us to obtain a set of attributes from the VRAE latent space that describes the general behavior of our sample. These attributes were then used as features for an Isolation Forest (IF) algorithm that is an anomaly detector for a “one class” kind of problem. We used the VRAE reconstruction errors and the IF anomaly score to select a sample of 8809 anomalies. These anomalies are dominated by bogus candidates, but we were able to identify 75 promising CSAGN candidates.
Context. The Vista Variables in the Vía Láctea (VVV) ESO Public Survey is a variability survey of the Milky Way bulge and an adjacent section of the disk carried out from 2010 on ESO Visible and ...Infrared Survey Telescope for Astronomy (VISTA). The VVV survey will eventually deliver a deep near-IR atlas with photometry and positions in five passbands (ZYJHKS) and a catalogue of 1−10 million variable point sources – mostly unknown – that require classifications. Aims. The main goal of the VVV Templates Project, which we introduce in this work, is to develop and test the machine-learning algorithms for the automated classification of the VVV light-curves. As VVV is the first massive, multi-epoch survey of stellar variability in the near-IR, the template light-curves that are required for training the classification algorithms are not available. In the first paper of the series we describe the construction of this comprehensive database of infrared stellar variability. Methods. First, we performed a systematic search in the literature and public data archives; second, we coordinated a worldwide observational campaign; and third, we exploited the VVV variability database itself on (optically) well-known stars to gather high-quality infrared light-curves of several hundreds of variable stars. Results. We have now collected a significant (and still increasing) number of infrared template light-curves. This database will be used as a training-set for the machine-learning algorithms that will automatically classify the light-curves produced by VVV. The results of such an automated classification will be covered in forthcoming papers of the series.
ABSTRACT
In this study, we introduce a novel moving-average model for analyzing stationary time-series observed irregularly in time. The process is strictly stationary and ergodic under normality and ...weakly stationary when normality is not assumed. Maximum likelihood (ML) estimation can be efficiently carried out through a Kalman algorithm obtained from the state-space representation of the model. The Kalman algorithm has order O(n) (where n is the number of observations in the sequence), from which it is possible to efficiently generate parameter estimators, linear predictors, and their mean-squared errors. Two procedures were developed for assessing parameter estimation errors: one based on the Hessian of the likelihood function and another one based on the bootstrap method. The behaviour of these estimators was assessed through Monte Carlo experiments. Both methods give accurate estimation performance, even with relatively small number of observations. Moreover, it is shown that for non-Gaussian data, specifically for the Student's t and generalized error distributions, the parameters of the model can be estimated precisely by ML. The proposed model is compared to the continuous autoregressive moving average (MA) models, showing better performance when the MA parameter is negative or close to one. We illustrate the implementation of the proposed model with light curves of variable stars from the OGLE and HIPPARCOS surveys and stochastic objects from Zwicky Transient Facility. The results suggest that the irregular MA model is a suitable alternative for modelling astronomical light curves, particularly when they have negative autocorrelation.
The classic classification scheme for active galactic nuclei (AGNs) was recently challenged by the discovery of the so-called changing-state (changing-look) AGNs. The physical mechanism behind this ...phenomenon is still a matter of open debate and the samples are too small and of serendipitous nature to provide robust answers. In order to tackle this problem, we need to design methods that are able to detect AGNs right in the act of changing state. Here we present an anomaly-detection technique designed to identify AGN light curves with anomalous behaviors in massive data sets. The main aim of this technique is to identify CSAGN at different stages of the transition, but it can also be used for more general purposes, such as cleaning massive data sets for AGN variability analyses. We used light curves from the Zwicky Transient Facility data release 5 (ZTF DR5), containing a sample of 230,451 AGNs of different classes. The ZTF DR5 light curves were modeled with a Variational Recurrent Autoencoder (VRAE) architecture, that allowed us to obtain a set of attributes from the VRAE latent space that describes the general behavior of our sample. These attributes were then used as features for an Isolation Forest (IF) algorithm that is an anomaly detector for a "one class" kind of problem. We used the VRAE reconstruction errors and the IF anomaly score to select a sample of 8809 anomalies. These anomalies are dominated by bogus candidates, but we were able to identify 75 promising CSAGN candidates.
The Vista Variables in the Vía Láctea (VVV) ESO Public Survey is an ongoing time-series, near-infrared (IR) survey of the Galactic bulge and an adjacent portion of the inner disk, covering 562 square ...degrees of the sky, using ESO's VISTA telescope. The survey has provided superb multi-color photometry in 5 broadband filters (Z, Y, J, H, and Ks), leading to the best map of the inner Milky Way ever obtained, particularly in the near-IR. The main part of the survey, which is focused on the variability in the Ks-band, is currently underway, with bulge fields observed between 34 and 73 times, and disk fields between 34 and 36 times. When the survey is complete, bulge (disk) fields will have been observed up to a total of 100 (60) times, providing unprecedented depth and time coverage in the near-IR. Here we provide a first overview of stellar variability in the VVV data.
Context. The Vista Variables in the Via Lactea (VVV) ESO Public Survey is a variability survey of the Milky Way bulge and an adjacent section of the disk carried out from 2010 on ESO Visible and ...Infrared Survey Telescope for Astronomy (VISTA). The VVV survey will eventually deliver a deep near-IR atlas with photometry and positions in five passbands (ZYJHK sub(S)) and a catalogue of 1-10 million variable point sources - mostly unknown - that require classifications. Aims. The main goal of the VVV Templates Project, which we introduce in this work, is to develop and test the machine-learning algorithms for the automated classification of the VVV light-curves. As VVV is the first massive, multi-epoch survey of stellar variability in the near-IR, the template light-curves that are required for training the classification algorithms are not available. In the first paper of the series we describe the construction of this comprehensive database of infrared stellar variability. Methods. First, we performed a systematic search in the literature and public data archives; second, we coordinated a worldwide observational campaign; and third, we exploited the VVV variability database itself on (optically) well-known stars to gather high-quality infrared light-curves of several hundreds of variable stars. Results. We have now collected a significant (and still increasing) number of infrared template light-curves. This database will be used as a training-set for the machine-learning algorithms that will automatically classify the light-curves produced by VVV. The results of such an automated classification will be covered in forthcoming papers of the series.