Whole‐genome duplications have occurred in the recent ancestors of many plants, fish, and amphibians, resulting in a pervasiveness of paralogous loci and the potential for both disomic and tetrasomic ...inheritance in the same genome. Paralogs can be difficult to reliably genotype and are often excluded from genotyping‐by‐sequencing (GBS) analyses; however, removal requires paralogs to be identified which is difficult without a reference genome. We present a method for identifying paralogs in natural populations by combining two properties of duplicated loci: (i) the expected frequency of heterozygotes exceeds that for singleton loci, and (ii) within heterozygotes, observed read ratios for each allele in GBS data will deviate from the 1:1 expected for singleton (diploid) loci. These deviations are often not apparent within individuals, particularly when sequence coverage is low; but, we postulated that summing allele reads for each locus over all heterozygous individuals in a population would provide sufficient power to detect deviations at those loci. We identified paralogous loci in three species: Chinook salmon (Oncorhynchus tshawytscha) which retains regions with ongoing residual tetrasomy on eight chromosome arms following a recent whole‐genome duplication, mountain barberry (Berberis alpina) which has a large proportion of paralogs that arose through an unknown mechanism, and dusky parrotfish (Scarus niger) which has largely rediploidized following an ancient whole‐genome duplication. Importantly, this approach only requires the genotype and allele‐specific read counts for each individual, information which is readily obtained from most GBS analysis pipelines.
A whole genome duplication occurred in the ancestor of all salmonid fishes some 50-100 million years ago. Early inheritance studies with allozymes indicated that loci in the salmonid genome are ...inherited disomically in females. However, some pairs of duplicated loci showed patterns of inheritance in males indicating pairing and recombination between homeologous chromosomes. Nearly 20% of loci in the salmonid genome are duplicated and share the same alleles (isoloci), apparently due to homeologous recombination. Half-tetrad analysis revealed that isoloci tend to be telomeric. These results suggested that residual tetrasomic inheritance of isoloci results from homeologous recombination near chromosome ends and that continued disomic inheritance resulted from homologous pairing of centromeric regions. Many current genetic maps of salmonids are based on single nucleotide polymorphisms and microsatellites that are no longer duplicated. Therefore, long sections of chromosomes on these maps are poorly represented, especially telomeric regions. In addition, preferential multivalent pairing of homeologs from the same species in F1 hybrids results in an excess of nonparental gametes (so-called pseudolinkage). We consider how not including duplicated loci has affected our understanding of population and evolutionary genetics of salmonids, and we discuss how incorporating these loci will benefit our understanding of population genomics.
A common challenge for fisheries management is resolving the relative contribution of closely related populations where accuracy of genetic assignment may be limited. An overlooked method for ...increasing assignment accuracy is the use of multi-SNP (single nucleotide polymorphism) haplotypes rather than single-SNP genotypes. Haplotypes increase power for detecting population structure, and loci derived from next-generation sequencing methods often contain multiple SNPs. We evaluated the utility of multi-SNP haplotyping for mixture analysis in western Alaska Chinook salmon (Oncorhynchus tshawytscha). Multi-SNP haplotype data increased the accuracy of mixture analysis for closely related populations by up to seven percentage points relative to single-SNP genotype data for a set of 500 loci; 90% accuracy was achievable with as few as 150 loci with multi-SNP haplotypes but required at least 300 loci with single-SNP genotypes. Individual assignment to reporting groups showed an even greater increase in accuracy of up to 17 percentage points when multi-SNP haplotypes were used. Haplotyping multiple SNPs shows promise to improve the accuracy of assigning unknown fish to population of origin whenever haplotype data are available.
As the conservation challenges increase, new approaches are needed to help combat losses in biodiversity and slow or reverse the decline of threatened species. Genome‐editing technology is changing ...the face of modern biology, facilitating applications that were unimaginable only a decade ago. The technology has the potential to make significant contributions to the fields of evolutionary biology, ecology, and conservation, yet the fear of unintended consequences from designer ecosystems containing engineered organisms has stifled innovation. To overcome this gap in the understanding of what genome editing is and what its capabilities are, more research is needed to translate genome‐editing discoveries into tools for ecological research. Emerging and future genome‐editing technologies include new clustered regularly interspaced short palindromic repeats (CRISPR) targeted sequencing and nucleic acid detection approaches as well as species genetic barcoding and somatic genome‐editing technologies. These genome‐editing tools have the potential to transform the environmental sciences by providing new noninvasive methods for monitoring threatened species or for enhancing critical adaptive traits. A pioneering effort by the conservation community is required to apply these technologies to real‐world conservation problems.
Transformación de la Ecología y la Biología de la Conservación por medio de la Edición Genómica
Resumen
Conforme aumentan los retos de conservación, se necesitan nuevas estrategias para ayudar a combatir las pérdidas de biodiversidad y para disminuir o revertir la declinación de especies. La tecnología de edición genómica está cambiando el rostro de la biología moderna, facilitando aplicaciones que eran inimaginables hace una década. Esta tecnología tiene el potencial de contribuir significativamente en los campos de la biología evolutiva, la ecología y la conservación, aun así, el miedo a las consecuencias accidentales de los ecosistemas planeados que contienen organismos diseñados ha sofocado a la innovación. Para sobreponerse a este vacío en el entendimiento de lo que es la edición genómica y cuáles son sus capacidades se requiere de mayor investigación para traducir los descubrimientos de la edición genómica a herramientas para la investigación ecológica. Las tecnologías de edición genómica emergentes y futuras incluyen nuevas estrategias CRISPR enfocadas en la secuenciación y detección de ácidos nucleicos, así como tecnologías de definición del código de barras genético de las especies y de edición somática de genes. Estas herramientas de edición genómica tienen el potencial para transformar las ciencias ambientales al proporcionar nuevos métodos no invasivos para el monitoreo de especies amenazadas o para mejorar las características adaptativas más importantes. Se requiere de un esfuerzo vanguardista por parte de la comunidad conservadora para aplicar esta tecnología a los problemas de conservación en el mundo real.
摘要
随着生物多样性保护面临越来越多的挑战, 我们需要新的方法来对抗生物多样性丧失、减缓或逆转受胁迫物种的种群下降。基因组编辑技术推动了十年前人们还无法想象的应用, 正在改变着现代生物学的面貌。这项技术能为进化生物学、生态学和保护生物学领域做出重大贡献。然而, 对含有被工程改造过的生物有机体的生态系统是否会产生意外后果的担忧, 限制了这个方向的创新。为了帮助理解基因组编辑的概念及其功能, 还需要更多的研究将基因组编辑发现转变为生态学研究的工具。新兴及未来的基因组编辑技术包括新的 CRISPR 靶向测序和核酸检测方法、物种遗传条形码和体细胞基因组编辑技术等。这些基因组编辑工具可以通过提供新的非损伤性方法来监测受胁迫物种, 或改进关键的适应性性状, 为环境科学带来重大改变。因此, 要将这些技术应用于现实中的保护问题, 需要保护领域努力进行开拓。【翻译: 胡怡思; 审校: 聂永刚】
Article impact statement: Novel applications of CRISPR technology have the potential to make significant contributions to ecology and conservation.
Because of their high variability, microsatellites are still considered the marker of choice for studies on parentage and kinship in wild populations. Nevertheless, single nucleotide polymorphisms ...(SNPs) are becoming increasing popular in many areas of molecular ecology, owing to their high-throughput, easy transferability between laboratories and low genotyping error. An ongoing discussion concerns the relative power of SNPs compared to microsatellites--that is, how many SNP loci are needed to replace a panel of microsatellites? Here, we evaluate the assignment power of 80 SNPs (HE = 0.30, 80 independent alleles) and 11 microsatellites (HE = 0.85, 192 independent alleles) in a wild population of about 400 sockeye salmon with two commonly used software packages (Cervus3, Colony2) and, for SNPs only, a newly developed software (SNPPIT). Assignment success was higher for SNPs than for microsatellites, especially for parent pairs, irrespective of the method used. Colony2 assigned a larger proportion of offspring to at least one parent than the other methods, although Cervus and SNPPIT detected more parent pairs. Identification of full-sib groups without parental information from relatedness measures was possible using both marker systems, although explicit reconstruction of such groups in Colony2 was impossible for SNPs because of computation time. Our results confirm the applicability of SNPs for parentage analyses and refute the predictability of assignment success from the number of independent alleles.
In their recently corrected manuscript, “Breaking RAD: An evaluation of the utility of restriction site associated DNA sequencing for genome scans of adaptation”, Lowry et al. argue that genome scans ...using RADseq will miss many loci under selection due to a combination of sparse marker density and low levels of linkage disequilibrium in most species. We agree that marker density and levels of LD are important considerations when designing a RADseq study; however, we dispute that RAD‐based genome scans are as prone to failure as Lowry et al. suggest. Their arguments ignore the flexible nature of RADseq; the availability of different restriction enzymes and capacity for combining restriction enzymes ensures that a well‐designed study should be able to generate enough markers for efficient genome coverage. We further believe that simplifying assumptions about linkage disequilibrium in their simulations are invalid in many species. Finally, it is important to note that the alternative methods proposed by Lowry et al. have limitations equal to or greater than RADseq. The wealth of studies with positive impactful findings that have used RAD genome scans instead supports the argument that properly conducted RAD genome scans are an effective method for gaining insight into ecology and evolution, particularly for non‐model organisms and those with large or complex genomes.
Pink salmon (Oncorhynchus gorbuscha) adults are the smallest of the five Pacific salmon native to the western Pacific Ocean. Pink salmon are also the most abundant of these species and account for a ...large proportion of the commercial value of the salmon fishery worldwide. A two-year life history of pink salmon generates temporally isolated populations that spawn either in even-years or odd-years. To uncover the influence of this genetic isolation, reference genome assemblies were generated for each year-class and whole genome re-sequencing data was collected from salmon of both year-classes. The salmon were sampled from six Canadian rivers and one Japanese river. At multiple centromeres we identified peaks of Fst between year-classes that were millions of base-pairs long. The largest Fst peak was also associated with a million base-pair chromosomal polymorphism found in the odd-year genome near a centromere. These Fst peaks may be the result of a centromere drive or a combination of reduced recombination and genetic drift, and they could influence speciation. Other regions of the genome influenced by odd-year and even-year temporal isolation and tentatively under selection were mostly associated with genes related to immune function, organ development/maintenance, and behaviour.
Many studies exclude loci that exhibit linkage disequilibrium (LD); however, high LD can signal reduced recombination around genomic features such as chromosome inversions or sex-determining regions. ...Chromosome inversions and sex-determining regions are often involved in adaptation, allowing for the inheritance of co-adapted gene complexes and for the resolution of sexually antagonistic selection through sex-specific partitioning of genetic variants. Genomic features such as these can escape detection when loci with LD are removed; in addition, failing to account for these features can introduce bias to analyses. We examined patterns of LD using network analysis to identify an overlapping chromosome inversion and sex-determining region in chum salmon. The signal of the inversion was strong enough to show up as false population substructure when the entire dataset was analyzed, while the effect of the sex-determining region on population structure was only obvious after restricting analysis to the sex chromosome. Understanding the extent and geographic distribution of inversions is now a critically important part of genetic analyses of natural populations. Our results highlight the importance of analyzing and understanding patterns of LD in genomic dataset and the perils of excluding or ignoring loci exhibiting LD. Blindly excluding loci in LD would have prevented detection of the sex-determining region and chromosome inversion while failing to understand the genomic features leading to high-LD could have resulted in false interpretations of population structure.
Regions of the genome displaying elevated differentiation (genomic islands of divergence) are thought to play an important role in local adaptation, especially in populations experiencing high gene ...flow. However, the characteristics of these islands as well as the functional significance of genes located within them remain largely unknown. Here, we used data from thousands of SNPs aligned to a linkage map to investigate genomic islands of divergence in three ecotypes of sockeye salmon (Oncorhynchus nerka) from a single drainage in southwestern Alaska. We found ten islands displaying high differentiation among ecotypes. Conversely, neutral structure observed throughout the rest of the genome was low and not partitioned by ecotype. One island on linkage group So13 was particularly large and contained six SNPs with FST > 0.14 (average FST of neutral SNPs = 0.01). Functional annotation revealed that the peak of this island contained a nonsynonymous mutation in a gene involved in growth in other species (TULP4). The islands that we discovered were relatively small (80–402 Kb), loci found in islands did not show reduced levels of diversity, and loci in islands displayed slightly elevated linkage disequilibrium. These attributes suggest that the islands discovered here were likely generated by divergence hitchhiking; however, we cannot rule out the possibility that other mechanisms may have produced them. Our results suggest that islands of divergence serve an important role in local adaptation with gene flow and represent a significant advance towards understanding the genetic basis of ecotypic differentiation.
Recent advances in population genomics have made it possible to detect previously unidentified structure, obtain more accurate estimates of demographic parameters, and explore adaptive divergence, ...potentially revolutionizing the way genetic data are used to manage wild populations. Here, we identified 10 944 single‐nucleotide polymorphisms using restriction‐site‐associated DNA (RAD) sequencing to explore population structure, demography, and adaptive divergence in five populations of Chinook salmon (Oncorhynchus tshawytscha) from western Alaska. Patterns of population structure were similar to those of past studies, but our ability to assign individuals back to their region of origin was greatly improved (>90% accuracy for all populations). We also calculated effective size with and without removing physically linked loci identified from a linkage map, a novel method for nonmodel organisms. Estimates of effective size were generally above 1000 and were biased downward when physically linked loci were not removed. Outlier tests based on genetic differentiation identified 733 loci and three genomic regions under putative selection. These markers and genomic regions are excellent candidates for future research and can be used to create high‐resolution panels for genetic monitoring and population assignment. This work demonstrates the utility of genomic data to inform conservation in highly exploited species with shallow population structure.