Quantifying “demographic independence” is a vital step in establishing potential conservation units for a species in that it effectively distinguishes migration from within‐population reproduction. ...This is an important aspect because it allows for an accurate estimate of recruitment. For example, populations may be designated as 'management units' (=MUs) if indeed population growth results from local demography rather than immigration. Of additional interest is the calculation of immigrant ancestry and ascertainment of the temporal context over which immigration occurred. This is because MUs depend largely upon local (self‐sustaining) birth and death rates, and the quantification of ancestry is necessary to validate demographic independence. Dispersal rate is also of immediate interest to conservation biologists, and can be assessed by quantifying genetic divergence among populations. The capacity with which to gauge these benchmarks has now been extended herein to genome‐wide molecular data, in an attempt to adjust an analytical tool that was until now intractable for the next generation sequencing data.
In this study, a popular legacy program for migrant detection (i.e. BayesAss3) has been modified to accept SNP (single nucleotide polymorphism) data. We validated BA3‐SNPs using empirical data to demonstrate its suitability for both high‐performance and desktop computing environments. We also facilitate high analytical throughput by presenting a binary search algorithm that automates MCMC (Markov chain Monte Carlo) parameter tuning.
Our BA3‐SNPs‐autotune program required five or fewer rounds of optimization for 99% of input files, with acceptable mixing parameters derived in 100% of our test cases. Runtime for BA3‐SNPs is a function of the number of loci analysed. Benchmarking yielded an average runtime <32 hr (10 million MCMC generations) for datasets containing thousands of SNPs.
The BA3 algorithm remains a viable option for analysing modern SNP datasets. Source code (C++ and Python) is released publicly under the GNU General Public License v3.0, and is available for download (Linux and Mac OSX) from the following URL: https://github.com/stevemussmann/.
Abstract
Background
Several cancer-susceptibility syndromes are reported to underlie pediatric rhabdomyosarcoma (RMS); however, to our knowledge there have been no systematic efforts to characterize ...the heterogeneous genetic etiologies of this often-fatal malignancy.
Methods
We performed exome-sequencing on germline DNA from 615 patients with newly diagnosed RMS consented through the Children’s Oncology Group. We compared the prevalence of cancer predisposition variants in 63 autosomal-dominant cancer predisposition genes in these patients with population controls (n = 9963). All statistical tests were 2-sided.
Results
We identified germline cancer predisposition variants in 45 RMS patients (7.3%; all FOXO1 fusion negative) across 15 autosomal dominant genes, which was statistically significantly enriched compared with controls (1.4%, P = 1.3 × 10–22). Specifically, 73.3% of the predisposition variants were found in predisposition syndrome genes previously associated with pediatric RMS risk, such as Li-Fraumeni syndrome (TP53) and neurofibromatosis type I (NF1). Notably, 5 patients had well-described oncogenic missense variants in HRAS (p.G12V and p.G12S) associated with Costello syndrome. Also, genetic etiology differed with histology, as germline variants were more frequent in embryonal vs alveolar RMS patients (10.0% vs 3.0%, P = .02). Although patients with a cancer predisposition variant tended to be younger at diagnosis (P = 9.9 × 10–4), 40.0% of germline variants were identified in those older than 3 years of age, which is in contrast to current genetic testing recommendations based on early age at diagnosis.
Conclusions
These findings demonstrate that genetic risk of RMS results from germline predisposition variants associated with a wide spectrum of cancer susceptibility syndromes. Germline genetic testing for children with RMS should be informed by RMS subtypes and not be limited to only young patients.
A community genomics approach to natural hybridization Zbinden, Zachery D; Douglas, Marlis R; Chafin, Tyler K ...
Proceedings of the Royal Society. B, Biological sciences,
05/2023, Letnik:
290, Številka:
1999
Journal Article
Recenzirano
Odprti dostop
Hybridization is a complicated, oft-misunderstood process. Once deemed unnatural and uncommon, hybridization is now recognized as ubiquitous among species. But hybridization rates within and among ...communities are poorly understood despite the relevance to ecology, evolution and conservation. To clarify, we examined hybridization across 75 freshwater fish communities within the Ozarks of the North American Interior Highlands (USA) by single nucleotide polymorphism (SNP) genotyping 33 species (
2865 individuals; double-digest restriction site-associated DNA sequencing (ddRAD)). We found evidence of hybridization (70 putative hybrids; 2.4% of individuals) among 18 species-pairs involving 73% (24/33) of study species, with the majority being concentrated within one family (Leuciscidae/minnows; 15 species; 66 hybrids). Interspecific genetic exchange-or introgression-was evident from 24 backcrossed individuals (10/18 species-pairs). Hybrids occurred within 42 of 75 communities (56%). Four selected environmental variables (species richness, protected area extent, precipitation (May and annually)) exhibited 73-78% accuracy in predicting hybrid occurrence via random forest classification. Our community-level assessment identified hybridization as spatially widespread and environmentally dependent (albeit predominantly within one diverse, omnipresent family). Our approach provides a more holistic survey of natural hybridization by testing a wide range of species-pairs, thus contrasting with more conventional evaluations.
Abstract
Background
Research on the molecular ecology of non-model organisms, while previously constrained, has now been greatly facilitated by the advent of reduced-representation sequencing ...protocols. However, tools that allow these large datasets to be efficiently parsed are often lacking, or if indeed available, then limited by the necessity of a comparable reference genome as an adjunct. This, of course, can be difficult when working with non-model organisms. Fortunately, pipelines are currently available that avoid this prerequisite, thus allowing data to be a priori parsed. An oft-used molecular ecology program (i.e., S
tructure
), for example, is facilitated by such pipelines, yet they are surprisingly absent for a second program that is similarly popular and computationally more efficient (i.e., A
dmixture
). The two programs differ in that A
dmixture
employs a maximum-likelihood framework whereas S
tructure
uses a Bayesian approach, yet both produce similar results. Given these issues, there is an overriding (and recognized) need among researchers in molecular ecology for bioinformatic software that will not only condense output from replicated A
dmixture
runs, but also infer from these data the optimal number of population clusters (
K
).
Results
Here we provide such a program (i.e., A
dmix
P
ipe
) that (a) filters SNPs to allow the delineation of population structure in A
dmixture
, then (b) parses the output for summarization and graphical representation via C
lumpak
. Our benchmarks effectively demonstrate how efficient the pipeline is for processing large, non-model datasets generated via double digest restriction-site associated DNA sequencing (ddRAD). Outputs not only parallel those from S
tructure
, but also visualize the variation among individual A
dmixture
runs, so as to facilitate selection of the most appropriate
K
-value.
Conclusions
A
dmix
P
ipe
successfully integrates A
dmixture
analysis with popular variant call format (VCF) filtering software to yield file types readily analyzed by C
lumpak
. Large population genomic datasets derived from non-model organisms are efficiently analyzed via the parallel-processing capabilities of A
dmixture
. A
dmix
P
ipe
is distributed under the GNU Public License and freely available for Mac OSX and Linux platforms at:
https://github.com/stevemussmann/admixturePipeline
.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
As anthropogenic changes continue to ecologically stress wildlife, obtaining measures of gene flow and genetic diversity are crucial for evaluating population trends and considering management and ...conservation strategies for small, imperiled populations. In our study, we conducted a molecular assessment to expand on previous work to elucidate patterns of diversity and connectivity in the remaining disjunct Eastern Massasauga Rattlesnake (Sistrurus catenatus) hibernacula in Illinois. We assayed genetic data for 327 samples collected during 1999-2015 from the Carlyle Lake study area across 21 microsatellite loci. We found hibernacula formed distinct genetic clusters corresponding to the three main study areas (Dam Recreation Areas, Eldon Hazlet State Park, and South Shore State Park). Genetic structuring and low estimates of dispersal indicated that connectivity among these study areas is limited and each is demographically independent. Hibernacula exhibited moderate levels of heterozygosity (0.60-0.73), but estimates of effective population size (5.2-41.0) were low and track census sizes generated via long-term mark-recapture data. Hibernacula at Carlyle Lake, which represent the only Eastern Massasauga remaining in Illinois, are vulnerable to future loss of genetic diversity through lack of gene flow as well as demographic and environmental stochastic processes. Our work highlights the need to include population-level genetic data in recovery planning and suggests that recovery efforts should focus on managing the three major study areas as separate conservation units in order to preserve and maintain long-term adaptive potential of these populations. Specific management goals should include improving connectivity among hibernacula, maintaining existing wet grassland habitat, and minimizing anthropogenic sources of mortality caused by habitat management (e.g., mowing, prescribed fire) and recreational activities. Our molecular study provides additional details about demographic parameters and connectivity at Carlyle Lake that can be used to guide recovery of Eastern Massasauga in Illinois and throughout its range.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Morphological data are a conduit for the recognition and description of species, and their acquisition has recently been broadened by geometric morphometric (GM) approaches that co-join the ...collection of digital data with exploratory 'big data' analytics. We employed this approach to dissect the Western Rattlesnake (Crotalus viridis) species-complex in North America, currently partitioned by mitochondrial (mt)DNA analyses into eastern and western lineages (two and seven subspecies, respectively). The GM data (i.e., 33 dorsal and 50 lateral head landmarks) were gleaned from 2,824 individuals located in 10 museum collections. We also downloaded and concatenated sequences for six mtDNA genes from the NCBI GenBank database. GM analyses revealed significant head shape differences attributable to size and subspecies-designation (but not their interactions). Pairwise shape distances among subspecies were significantly greater than those derived from ancestral character states via squared-change parsimony, with the greatest differences separating those most closely related. This, in turn, suggests the potential for historic character displacement as a diversifying force in the complex. All subspecies, save one, were significantly differentiated in a Bayesian discriminant function analysis (DFA), regardless of whether our priors were uniform or informative (i.e., mtDNA data). Finally, shape differences among sister-clades were significantly greater than expected by chance alone under a Brownian model of evolution, promoting the hypothesis that selection rather than drift was the driving force in the evolution of the complex. Lastly, we combine head shape and mtDNA data so as to derived an integrative taxonomy that produced robust boundaries for six OTUs (operational taxonomic units) of the C. viridis complex. We suggest these boundaries are concomitant with species-status and subsequently provide a relevant nomenclature for its recognition and representation.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Despite its imperative, biodiversity conservation is chronically underfunded, a deficiency that often forces management agencies to prioritize. Single-species recovery thus becomes a focus (often ...with socio-political implications), whereas a more economical approach would be the transition to multi-targeted management (= MTM). This challenge is best represented in Midwestern North America where biodiversity has been impacted by 300+ years of chronic anthropogenic disturbance such that native tall-grass prairie is now supplanted by an agroecosystem. Here, we develop an MTM with a population genetic metric to collaboratively manage three Illinois upland gamebirds: common pheasant (Phasianus colchicus; pheasant), northern bobwhite quail (Colinus virginianus; quail), and threatened-endangered (T&E) greater prairie chicken (Tympanuchus cupido pinnatus; prairie chicken). We first genotyped our study pheasant at 19 microsatellite DNA loci and identified three captive breeding stocks (N = 143; IL Department of Natural Resources) as being significantly bottlenecked, with relatedness >1st-cousin (μR = 0.158). 'Wild' (non-stocked) pheasant N = 543; 14 Pheasant-Habitat-Areas (PHAs) were also bottlenecked, significantly interrelated (μR = 0.150) and differentiated (μFST = 0.047), yet distinct from propagation stock. PHAs that encompassed significantly with larger areas also reflected greater effective population sizes (μNE = 43; P<0.007). We juxtaposed these data against previously published results for prairie chicken and quail, and found population genetic structure driven by drift, habitat/climate impacts, and gender-biased selection via hunter-harvest. Each species (hunter-harvested or T&E) is independently managed, yet their composite population genetic baseline provides the quantitative criteria needed for an upland game bird MTM. Its implementation would require agricultural plots to be rehabilitated/reclaimed using a land-sharing/sparing portfolio that differs markedly from the Conservation Reserve Program (CRP), where sequestered land decreases as agricultural prices escalate. Cost-savings for an MTM would accrue by synchronizing single-species management with a dwindling hunter-harvest program, and by eliminating propagation/stocking programs. This would sustain not only native grasslands and their resident species, but also accelerate conservation at the wildlife-agroecosystem interface.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Ecological restoration can promote biodiversity conservation in anthropogenically fragmented habitats, but effectiveness of these management efforts need to be statistically validated to determine ...'success.' One such approach is to gauge the extent of recolonization as a measure of landscape permeability and, in turn, population connectivity. In this context, we estimated dispersal and population connectivity in prairie vole (Microtus ochrogaster; N = 231) and meadow vole (M. pennsylvanicus; N = 83) within five tall-grass prairie restoration sites embedded within the agricultural matrix of midwestern North America. We predicted that vole dispersal would be constrained by the extent of agricultural land surrounding restored habitat patches, spatially isolating vole populations and resulting in significant genetic structure. We first employed genetic assignment tests based on 15 microsatellite DNA loci to validate field-derived species-designations, then tested reclassified samples with multivariate and Bayesian clustering to assay for spatial and temporal genetic structure. Population connectivity was further evaluated by calculating pairwise FST, then potential demographic effects explored by computing migration rates, effective population size (Ne), and average relatedness (r). Genetic species assignments reclassified 25% of initial field identifications (N = 11 M. ochrogaster; N = 67 M. pennsylvanicus). In M. ochrogaster population connectivity was high across the study area, reflected in little to no spatial or temporal genetic structure. In M. pennsylvanicus genetic structure was detected, but relatedness estimates identified it as kin-clustering instead, underscoring social behavior among populations rather than spatial isolation as the cause. Estimates of Ne and r were stable across years, reflecting high dispersal and demographic resilience. Combined, these metrics suggest the agricultural matrix is highly permeable for voles and does not impede dispersal. High connectivity observed confirms that the restored landscape is productive and permeable for specific management targets such as voles and also demonstrates population genetic assays as a tool to statistically evaluate effectiveness of conservation initiatives.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Patterns of multi-locus differentiation (i.e., genomic clines) often extend broadly across hybrid zones and their quantification can help diagnose how species boundaries are shaped by adaptive ...processes, both intrinsic and extrinsic. In this sense, the transitioning of loci across admixed individuals can be contrasted as a function of the genome-wide trend, in turn allowing an expansion of clinal theory across a much wider array of biodiversity. However, computational tools that serve to interpret and consequently visualize 'genomic clines' are limited, and users must often write custom, relatively complex code to do so. Here, we introduce the ClineHelpR R-package for visualizing genomic clines and detecting outlier loci using output generated by two popular software packages, bgc and Introgress. ClineHelpR bundles both input generation (i.e., filtering datasets and creating specialized file formats) and output processing (e.g., MCMC thinning and burn-in) with functions that directly facilitate interpretation and hypothesis testing. Tools are also provided for post-hoc analyses that interface with external packages such as ENMeval and RIdeogram. Our package increases the reproducibility and accessibility of genomic cline methods, thus allowing an expanded user base and promoting these methods as mechanisms to address diverse evolutionary questions in both model and non-model organisms. Furthermore, the ClineHelpR extended functionality can evaluate genomic clines in the context of spatial and environmental features, allowing users to explore underlying processes potentially contributing to the observed patterns and helping facilitate effective conservation management strategies.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK