We study the problem of periodicity detection in massive data sets of photometric or radial velocity time series, as presented by ESA's Gaia mission. Periodicity detection hinges on the estimation of ...the false alarm probability of the extremum of the periodogram of the time series. We consider the problem of its estimation with two main issues in mind. First, for a given number of observations and signal-to-noise ratio, the rate of correct periodicity detections should be constant for all realized cadences of observations regardless of the observational time patterns, in order to avoid sky biases that are difficult to assess. Secondly, the computational loads should be kept feasible even for millions of time series. Using the Gaia case, we compare the F
M
method of Paltani and Schwarzenberg-Czerny, the Baluev method and the GEV method of Süveges, as well as a method for the direct estimation of a threshold. Three methods involve some unknown parameters, which are obtained by fitting a regression-type predictive model using easily obtainable covariates derived from observational time series. We conclude that the GEV and the Baluev methods both provide good solutions to the issues posed by a large-scale processing. The first of these yields the best scientific quality at the price of some moderately costly pre-processing. When this pre-processing is impossible for some reason (e.g. the computational costs are prohibitive or good regression models cannot be constructed), the Baluev method provides a computationally inexpensive alternative with slight biases in regions where time samplings exhibit strong aliases.
In this article we present an overview of the ESA Gaia mission and of the unprecedented impact that Gaia will have on the field of variable star research. We summarise the contents and impact of the ...first Gaia data release on the description of variability phenomena, with particular emphasis on pulsating star research. The Tycho-Gaia astrometric solution, although limited to 2.1 million stars, has been used in many studies related to pulsating stars. Furthermore a set of 3,194 Cepheids and RR Lyrae stars with their times series have been released. Finally we present the plans for the ongoing study of variable phenomena with Gaia and highlight some of the possible impacts of the second data release on variable, and specifically, pulsating stars.
Starting from a description of the Rubin Observatory Data Management System Architecture, and drawing on our experience with and involvement in a range of other projects including Gaia, SDSS, UKIRT, ...and JCMT, we derive a series of generic design patterns and lessons learned.
The Vera C. Rubin Observatory will advance many areas of astronomy over the next decade with its unique wide-fast-deep multi-color imaging survey, the Legacy Survey of Space and Time (LSST). The LSST ...will produce approximately 20TB of raw data per night, which will be automatically processed by the LSST Science Pipelines to generate science-ready data products -- processed images, catalogs and alerts. To ensure that these data products enable transformative science with LSST, stringent requirements have been placed on their quality and scientific fidelity, for example on image quality and depth, astrometric and photometric performance, and object recovery completeness. In this paper we introduce faro, a framework for automatically and efficiently computing scientific performance metrics on the LSST data products for units of data of varying granularity, ranging from single-detector to full-survey summary statistics. By measuring and monitoring metrics, we are able to evaluate trends in algorithmic performance and conduct regression testing during development, compare the performance of one algorithm against another, and verify that the LSST data products will meet performance requirements by comparing to specifications. We present initial results using faro to characterize the performance of the data products produced on simulated and precursor data sets, and discuss plans to use faro to verify the performance of the LSST commissioning data products.
We describe Rubin Observatory's experience with offering a data access facility (and associated services including our Science Platform) deployed on Google Cloud infrastructure as part of our ...pre-Operations Data Preview program.
Tens of millions of new variable objects are expected to be identified in over a billion time series from the Gaia mission. Crossmatching known variable sources with those from Gaia is crucial to ...incorporate current knowledge, understand how these objects appear in the Gaia data, train supervised classifiers to recognise known classes, and validate the results of the Variability Processing and Analysis Coordination Unit (CU7) within the Gaia Data Analysis and Processing Consortium (DPAC). The method employed by CU7 to crossmatch variables for the first Gaia data release includes a binary classifier to take into account positional uncertainties, proper motion, targeted variability signals, and artefacts present in the early calibration of the Gaia data. Crossmatching with a classifier makes it possible to automate all those decisions which are typically made during visual inspection. The classifier can be trained with objects characterized by a variety of attributes to ensure similarity in multiple dimensions (astrometry, photometry, time-series features), with no need for a-priori transformations to compare different photometric bands, or of predictive models of the motion of objects to compare positions. Other advantages as well as some disadvantages of the method are discussed. Implementation steps from the training to the assessment of the crossmatch classifier and selection of results are described.
The Large Synoptic Survey Telescope (LSST) is an ambitious astronomical survey with a similarly ambitious Data Management component. Data Management for LSST includes processing on both nightly and ...yearly cadences to generate transient alerts, deep catalogs of the static sky, and forced photometry light-curves for billions of objects at hundreds of epochs, spanning at least a decade. The algorithms running in these pipelines are individually sophisticated and interact in subtle ways. This paper provides an overview of those pipelines, focusing more on those interactions than the details of any individual algorithm.
The second Gaia data release is expected to contain data products from about 22 months of observation. Based on these data, we aim to provide an advance publication of a full-sky Gaia map of RR Lyrae ...stars. Although comprehensive, these data still contain a significant fraction of sources which are insufficiently sampled for Fourier series decomposition of the periodic light variations. The challenges in the identification of RR Lyrae candidates with (much) fewer than 20 field-of-view transits are described. General considerations of the results, their limitations, and interpretation are presented together with prospects for improvement in subsequent Gaia data releases.
In this article we present an overview of the ESA Gaia mission and of the unprecedented impact that Gaia will have on the field of variable star research. We summarise the contents and impact of the ...first Gaia data release on the description of variability phenomena, with particular emphasis on pulsating star research. The Tycho-Gaia astrometric solution, although limited to 2.1 million stars, has been used in many studies related to pulsating stars. Furthermore a set of 3,194 Cepheids and RR Lyrae stars with their times series have been released. Finally we present the plans for the ongoing study of variable phenomena with Gaia and highlight some of the possible impacts of the second data release on variable, and specifically, pulsating stars.