ABSTRACT
Dust extinction and reddening are ubiquitous in astronomical observations and are often a major source of systematic uncertainty. We present here a study of the correlation between ...extinction in the Milky Way and the equivalent width of the Na i D absorption doublet. Our sample includes more than 100 high‐resolution spectra from the Keck telescopes and nearly a million low‐resolution spectra from the Sloan Digital Sky Survey (SDSS). We measure the correlation to unprecedented precision, constrain its shape and derive an empirical relation between these quantities with a dispersion of the order of 0.15 mag in E(B − V). From the shape of the curve of growth we further show that a typical sight line through the Galaxy, as seen within the SDSS footprint, crosses about three dust clouds. We provide a brief guide on how to best estimate extinction to extragalactic sources such as supernovae, using the Na i D absorption feature, under a variety of circumstances.
Heritable variation in gene expression forms a crucial bridge between genomic variation and the biology of many traits. However, most expression quantitative trait loci (eQTLs) remain unidentified. ...We mapped eQTLs by transcriptome sequencing in 1012 yeast segregants. The resulting eQTLs accounted for over 70% of the heritability of mRNA levels, allowing comprehensive dissection of regulatory variation. Most genes had multiple eQTLs. Most expression variation arose from
-acting eQTLs distant from their target genes. Nearly all
-eQTLs clustered at 102 hotspot locations, some of which influenced the expression of thousands of genes. Fine-mapped hotspot regions were enriched for transcription factor genes. While most genes had a local eQTL, most of these had no detectable effects on the expression of other genes in
. Hundreds of non-additive genetic interactions accounted for small fractions of expression variation. These results reveal the complexity of genetic influences on transcriptome variation in unprecedented depth and detail.
ABSTRACT
We present Cyclic-Permutation Invariant Neural Networks, a novel class of neural networks (NNs) designed to be invariant to phase shifts of period-folded periodic sequences by means of ...‘symmetry padding’. In the context of periodic variable star light curves, initial phases are exogenous to the physical origin of the variability and should thus be immaterial to the downstream inference application. Although previous work utilizing NNs commonly operated on period-folded light curves, no approach to date has taken advantage of such a symmetry. Across three different data sets of variable star light curves, we show that two implementations of Cyclic-Permutation Invariant Networks—iTCN and iResNet—consistently outperform state-of-the-art non-invariant baselines and reduce overall error rates by between 4 to 22 per cent. Over a 10-class OGLE-III sample, the iTCN/iResNet achieves an average per-class accuracy of 93.4 per cent/93.3 per cent, compared to recurrent NN/random forest accuracies of 70.5 per cent/89.5 per cent in a recent study using the same data. Finding improvement on a non-astronomy benchmark, we suggest that the methodology introduced here should also be applicable to a wide range of science domains where periodic data abounds.
Mutations are the root source of genetic variation and underlie the process of evolution. Although the rates at which mutations occur vary considerably between species, little is known about ...differences within species, or the genetic and molecular basis of these differences. Here, we leveraged the power of the yeast
as a model system to uncover natural genetic variants that underlie variation in mutation rate. We developed a high-throughput fluctuation assay and used it to quantify mutation rates in seven natural yeast isolates and in 1040 segregant progeny from a cross between BY, a laboratory strain, and RM, a wine strain. We observed that mutation rate varies among yeast strains and is heritable (
= 0.49). We performed linkage mapping in the segregants and identified four quantitative trait loci underlying mutation rate variation in the cross. We fine-mapped two quantitative trait loci to the underlying causal genes,
and
, that contribute to mutation rate variation. These genes also underlie sensitivity to the DNA-damaging agents 4NQO and MMS, suggesting a connection between spontaneous mutation rate and mutagen sensitivity.
Abstract
Despite the utility of neural networks (NNs) for astronomical time-series classification, the proliferation of learning architectures applied to diverse data sets has thus far hampered a ...direct intercomparison of different approaches. Here we perform the first comprehensive study of variants of NN-based learning and inference for astronomical time series, aiming to provide the community with an overview on relative performance and, hopefully, a set of best-in-class choices for practical implementations. In both supervised and self-supervised contexts, we study the effects of different time-series-compatible layer choices, namely the dilated temporal convolutional neural network (dTCNs), long-short term memory NNs, gated recurrent units and temporal convolutional NNs (tCNNs). We also study the efficacy and performance of encoder-decoder (i.e., autoencoder) networks compared to direct classification networks, different pathways to include auxiliary (non-time-series) metadata, and different approaches to incorporate multi-passband data (i.e., multiple time series per source). Performance—applied to a sample of 17,604 variable stars (VSs) from the MAssive Compact Halo Objects (MACHO) survey across 10 imbalanced classes—is measured in training convergence time, classification accuracy, reconstruction error, and generated latent variables. We find that networks with recurrent NNs generally outperform dTCNs and, in many scenarios, yield to similar accuracy as tCNNs. In learning time and memory requirements, convolution-based layers perform better. We conclude by discussing the advantages and limitations of deep architectures for VS classification, with a particular eye toward next-generation surveys such as the Legacy Survey of Space and Time, the Roman Space Telescope, and Zwicky Transient Facility.
The mechanistic basis for how genetic variants cause differences in phenotypic traits is often elusive. We identified a quantitative trait locus in Caenorhabditis elegans that affects three seemingly ...unrelated phenotypic traits: lifetime fecundity, adult body size, and susceptibility to the human pathogen Staphyloccus aureus. We found a QTL for all three traits arises from variation in the neuropeptide receptor gene npr-1. Moreover, we found that variation in npr-1 is also responsible for differences in 247 gene expression traits. Variation in npr-1 is known to determine whether animals disperse throughout a bacterial lawn or aggregate at the edges of the lawn. We found that the allele that leads to aggregation is associated with reduced growth and reproductive output. The altered gene expression pattern caused by this allele suggests that the aggregation behavior might cause a weak starvation state, which is known to reduce growth rate and fecundity. Importantly, we show that variation in npr-1 causes each of these phenotypic differences through behavioral avoidance of ambient oxygen concentrations. These results suggest that variation in npr-1 has broad pleiotropic effects mediated by altered exposure to bacterial food.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Cosmic ray (CR) identification and replacement are critical components of imaging and spectroscopic reduction pipelines involving solid-state detectors. We present deepCR, a deep-learning-based ...framework for CR identification and subsequent image inpainting based on the predicted CR mask. To demonstrate the effectiveness of this framework, we train and evaluate models on Hubble Space Telescope (HST) ACS/WFC images of sparse extragalactic fields, globular clusters, and resolved galaxies. We demonstrate that at a false-positive rate of 0.5%, deepCR achieves close to 100% detection rates in both extragalactic and globular cluster fields, and 91% in resolved galaxy fields, which is a significant improvement over the current state-of-the-art method LACosmic. Compared with a multicore CPU implementation of LACosmic, deepCR CR mask predictions run up to 6.5 times faster on a CPU and 90 times faster on a single GPU. For image inpainting, the mean squared errors of deepCR predictions are 20 times lower in globular cluster fields, 5 times lower in resolved galaxy fields, and 2.5 times lower in extragalactic fields, compared with the best performing nonneural technique tested. We present our framework and the trained models as an open-source Python project , with a simple-to-use API. To facilitate reproducibility of the results we also provide a benchmarking codebase .
Linkage and association studies have mapped thousands of genomic regions that contribute to phenotypic variation, but narrowing these regions to the underlying causal genes and variants has proven ...much more challenging. Resolution of genetic mapping is limited by the recombination rate. We developed a method that uses CRISPR (clustered, regularly interspaced, short palindromic repeats) to build mapping panels with targeted recombination events. We tested the method by generating a panel with recombination events spaced along a yeast chromosome arm, mapping trait variation, and then targeting a high density of recombination events to the region of interest. Using this approach, we fine-mapped manganese sensitivity to a single polymorphism in the transporter Pmr1. Targeting recombination events to regions of interest allows us to rapidly and systematically identify causal variants underlying trait differences.
Full text
Available for:
BFBNIB, NMLJ, NUK, ODKLJ, PNG, SAZU, UL, UM, UPUK
For many traits, including susceptibility to common diseases in humans, causal loci uncovered by genetic-mapping studies explain only a minority of the heritable contribution to trait variation. ...Multiple explanations for this 'missing heritability' have been proposed. Here we use a large cross between two yeast strains to accurately estimate different sources of heritable variation for 46 quantitative traits, and to detect underlying loci with high statistical power. We find that the detected loci explain nearly the entire additive contribution to heritable variation for the traits studied. We also show that the contribution to heritability of gene-gene interactions varies among traits, from near zero to approximately 50 per cent. Detected two-locus interactions explain only a minority of this contribution. These results substantially advance our understanding of the missing heritability problem and have important implications for future studies of complex and quantitative traits.
Full text
Available for:
DOBA, IJS, IZUM, KILJ, KISLJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Genetic mapping studies of quantitative traits typically focus on detecting loci that contribute additively to trait variation. Genetic interactions are often proposed as a contributing factor to ...trait variation, but the relative contribution of interactions to trait variation is a subject of debate. Here we use a very large cross between two yeast strains to accurately estimate the fraction of phenotypic variance due to pairwise QTL-QTL interactions for 20 quantitative traits. We find that this fraction is 9% on average, substantially less than the contribution of additive QTL (43%). Statistically significant QTL-QTL pairs typically have small individual effect sizes, but collectively explain 40% of the pairwise interaction variance. We show that pairwise interaction variance is largely explained by pairs of loci at least one of which has a significant additive effect. These results refine our understanding of the genetic architecture of quantitative traits and help guide future mapping studies.