We report a genome-wide association scan in over 6,000 Latin Americans for features of scalp hair (shape, colour, greying, balding) and facial hair (beard thickness, monobrow, eyebrow thickness). We ...found 18 signals of association reaching genome-wide significance (P values 5 × 10(-8) to 3 × 10(-119)), including 10 novel associations. These include novel loci for scalp hair shape and balding, and the first reported loci for hair greying, monobrow, eyebrow and beard thickness. A newly identified locus influencing hair shape includes a Q30R substitution in the Protease Serine S1 family member 53 (PRSS53). We demonstrate that this enzyme is highly expressed in the hair follicle, especially the inner root sheath, and that the Q30R substitution affects enzyme processing and secretion. The genome regions associated with hair features are enriched for signals of selection, consistent with proposals regarding the evolution of human hair.
The relationship between the evolution of genes and languages has been studied for over three decades. These studies rely on the assumption that languages, as many other cultural traits, evolve in a ...gene-like manner, accumulating heritable diversity through time and being subjected to evolutionary mechanisms of change. In the present work we used genetic data to evaluate South American linguistic classifications. We compared discordant models of language classifications to the current Native American genome-wide variation using realistic demographic models analyzed under an Approximate Bayesian Computation (ABC) framework. Data on 381 STRs spread along the autosomes were gathered from the literature for populations representing the five main South Amerindian linguistic groups: Andean, Arawakan, Chibchan-Paezan, Macro-Jê, and Tupí. The results indicated a higher posterior probability for the classification proposed by J.H. Greenberg in 1987, although L. Campbell's 1997 classification cannot be ruled out. Based on Greenberg's classification, it was possible to date the time of Tupí-Arawakan divergence (2.8 kya), and the time of emergence of the structure between present day major language groups in South America (3.1 kya).
To characterize the genetic basis of facial features in Latin Americans, we performed a genome-wide association study (GWAS) of more than 6000 individuals using 59 landmark-based measurements from ...two-dimensional profile photographs and ~9,000,000 genotyped or imputed single-nucleotide polymorphisms. We detected significant association of 32 traits with at least 1 (and up to 6) of 32 different genomic regions, more than doubling the number of robustly associated face morphology loci reported until now (from 11 to 23). These GWAS hits are strongly enriched in regulatory sequences active specifically during craniofacial development. The associated region in 1p12 includes a tract of archaic adaptive introgression, with a Denisovan haplotype common in Native Americans affecting particularly lip thickness. Among the nine previously unidentified face morphology loci we identified is the VPS13B gene region, and we show that variants in this region also affect midfacial morphology in mice.
Here we evaluate the accuracy of prediction for eye, hair and skin pigmentation in a dataset of > 6500 individuals from Mexico, Colombia, Peru, Chile and Brazil (including genome-wide SNP data and ...quantitative/categorical pigmentation phenotypes - the CANDELA dataset CAN). We evaluated accuracy in relation to different analytical methods and various phenotypic predictors. As expected from statistical principles, we observe that quantitative traits are more sensitive to changes in the prediction models than categorical traits. We find that Random Forest or Linear Regression are generally the best performing methods. We also compare the prediction accuracy of SNP sets defined in the CAN dataset (including 56, 101 and 120 SNPs for eye, hair and skin colour prediction, respectively) to the well-established HIrisPlex-S SNP set (including 6, 22 and 36 SNPs for eye, hair and skin colour prediction respectively). When training prediction models on the CAN data, we observe remarkably similar performances for HIrisPlex-S and the larger CAN SNP sets for the prediction of hair (categorical) and eye (both categorical and quantitative), while the CAN sets outperform HIrisPlex-S for quantitative, but not for categorical skin pigmentation prediction. The performance of HIrisPlex-S, when models are trained in a world-wide sample (although consisting of 80% Europeans, https://hirisplex.erasmusmc.nl), is lower relative to training in the CAN data (particularly for hair and skin colour). Altogether, our observations are consistent with common variation of eye and hair colour having a relatively simple genetic architecture, which is well captured by HIrisPlex-S, even in admixed Latin Americans (with partial European ancestry). By contrast, since skin pigmentation is a more polygenic trait, accuracy is more sensitive to prediction SNP set size, although here this effect was only apparent for a quantitative measure of skin pigmentation. Our results support the use of HIrisPlex-S in the prediction of categorical pigmentation traits for forensic purposes in Latin America, while illustrating the impact of training datasets on its accuracy.
•We investigate pigmentation traits in a cohort of ~6500 admixed individuals.•We compute prediction accuracy for pigmentation traits using various scenarios.•Random Forest and regression models are the most accurate, depending on trait.•The HIrisPlex-S set of SNPs shows competing accuracy levels for hair and skin color.•Using non-European datasets matters for prediction of skin pigmentation.
Ecological conditions in the Amazon rainforests are historically favorable for the transmission of numerous tropical diseases, especially vector-borne diseases. The high diversity of pathogens likely ...contributes to the strong selective pressures for human survival and reproduction in this region. However, the genetic basis of human adaptation to this complex ecosystem remains unclear. This study investigates the possible footprints of genetic adaptation to the Amazon rainforest environment by analyzing the genomic data of 19 native populations. The results based on genomic and functional analysis showed an intense signal of natural selection in a set of genes related to
infection, which is the pathogen responsible for Chagas disease, a neglected tropical parasitic disease native to the Americas that is currently spreading worldwide.
Spinocerebellar ataxia type 10 (SCA10) is an autosomal dominant neurodegenerative disorder characterized by progressive cerebellar ataxia and epilepsy. The disease is caused by a pentanucleotide ...ATTCT expansion in intron 9 of the
ATXN10
gene on chromosome 22q13.3. SCA10 has shown a geographical distribution throughout America with a likely degree of Amerindian ancestry from different countries so far. Currently available data suggest that SCA10 mutation might have spread out early during the peopling of the Americas. However, the ancestral origin of SCA10 mutation remains under speculation. Samples of SCA10 patients from two Latin American countries were analysed, being 16 families from Brazil (29 patients) and 21 families from Peru (27 patients) as well as 49 healthy individuals from Indigenous Quechua population and 51 healthy Brazilian individuals. Four polymorphic markers spanning a region of 5.2 cM harbouring the ATTCT expansion were used to define the haplotypes, which were genotyped by different approaches. Our data have shown that 19-CGGC-14 shared haplotype was found in 47% of Brazilian and in 63% of Peruvian families. Frequencies from both groups are not statistically different from Quechua controls (57%), but they are statistically different from Brazilian controls (12%) (
p
< 0.001). The most frequent expanded haplotype in Quechuas, 19-15-CGGC-14-10, is found in 50% of Brazilian and in 65% of Peruvian patients with SCA10. These findings bring valuable evidence that ATTCT expansion may have arisen in a Native American chromosome.