Abstract
Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in ...such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe of all possible uORFs based on coding gene sequence motifs and identified 1.3 million unique uORFs. To determine which of these are likely to be biologically relevant, we built a simple Bayesian classifier using 89 attributes of uORFs labeled as active in ribosome profiling experiments. This allowed us to extrapolate to a comprehensive catalog of likely functional uORFs. We validated our predictions using in vivo protein levels and ribosome occupancy from 46 individuals. This is a substantially larger catalog of functional uORFs than has previously been reported. Our ranked list of likely active uORFs allows researchers to test their hypotheses regarding the role of uORFs in health and disease. We demonstrate several examples of biological interest through the application of our catalog to somatic mutations in cancer and disease-associated germline variants in humans.
Background In the first year of roll-out, vaccination for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) prevented almost 20 million deaths from coronavirus disease 2019 (COVID-19). ...Yet, little is known about the factors influencing access to vaccination at the individual level within rural poor settings of low-income countries. The aim of this study was to examine determinants of vaccine receipt in rural India. Methods A census of a rural village in Tamil Nadu was undertaken from June 2021 to September 2022. We surveyed 775 participants from 262 households. Household-level data on socioeconomic status (SES), water, sanitation, and hygiene practices, and individual-level demographic information, travel history, and biomedical data, including anthropometry, vital signs, and comorbidities, were collected. Logistic regression models with 5-fold cross-validation were used to identify the biomedical, demographic, and socioeconomic determinants of vaccine receipt and the timing of receipt within the first 30 days of eligibility. Vaccine ineligible participants were excluded leaving 659 eligible participants. There were 650 eligible participants with complete biomedical, demographic, and socioeconomic data. Results There were 68.0% and 34.0% of individuals (N = 650) who had received one and two vaccine doses, respectively. Participants with household ownership of a permanent account number (PAN) or ration card were 2.15 (95% CI:1.32–3.52) or 3.02 (95% CI:1.72–5.29) times more likely to receive at least one vaccine dose compared to households with no ownership of such cards. Participants employed as housewives or self-employed non-agricultural workers were 65% (95% CI:0.19–0.67) or 59% (95% CI:0.22–0.76) less likely to receive at least one vaccine dose compared to salaried workers. Household PAN card ownership, occupation and age were linked to the timing of vaccine receipt. Participants aged ≤18 and 45–60 years were 17.74 (95% CI:5.07–62.03) and 5.51 (95% CI:2.74–11.10) times more likely to receive a vaccine within 30 days of eligibility compared to 19-44-year-olds. Biomedical factors including BMI, vital signs, comorbidities, and COVID-19 specific symptoms were not consistently associated with vaccine receipt or timing of receipt. No support was found that travel history, contact with COVID-19 cases, and hospital admissions influenced vaccine receipt or timing of receipt. Conclusion Factors linked to SES were linked to vaccine receipt, more so than biomedical factors which were targeted by vaccine policies. Future research should explore if government interventions including vaccine mandates, barriers to vaccine access, or peer influence linked to workplace or targeted vaccine promotion campaigns underpin these findings.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Understanding mechanisms of hepatocellular damage may lead to new treatments for liver disease, and genome-wide association studies (GWAS) of alanine aminotransferase (ALT) and aspartate ...aminotransferase (AST) serum activities have proven useful for investigating liver biology. Here we report 100 loci associating with both enzymes, using GWAS across 411,048 subjects in the UK Biobank. The rare missense variant SLC30A10 Thr95Ile (rs188273166) associates with the largest elevation of both enzymes, and this association replicates in the DiscovEHR study. SLC30A10 excretes manganese from the liver to the bile duct, and rare homozygous loss of function causes the syndrome hypermanganesemia with dystonia-1 (HMNDYT1) which involves cirrhosis. Consistent with hematological symptoms of hypermanganesemia, SLC30A10 Thr95Ile carriers have increased hematocrit and risk of iron deficiency anemia. Carriers also have increased risk of extrahepatic bile duct cancer. These results suggest that genetic variation in SLC30A10 adversely affects more individuals than patients with diagnosed HMNDYT1.
Variants predicted to result in the loss of function of human genes have attracted interest because of their clinical impact and surprising prevalence in healthy individuals. Here, we present ALoFT ...(annotation of loss-of-function transcripts), a method to annotate and predict the disease-causing potential of loss-of-function variants. Using data from Mendelian disease-gene discovery projects, we show that ALoFT can distinguish between loss-of-function variants that are deleterious as heterozygotes and those causing disease only in the homozygous state. Investigation of variants discovered in healthy populations suggests that each individual carries at least two heterozygous premature stop alleles that could potentially lead to disease if present as homozygotes. When applied to de novo putative loss-of-function variants in autism-affected families, ALoFT distinguishes between deleterious variants in patients and benign variants in unaffected siblings. Finally, analysis of somatic variants in >6500 cancer exomes shows that putative loss-of-function variants predicted to be deleterious by ALoFT are enriched in known driver genes.Variants causing loss of function (LoF) of human genes have clinical implications. Here, the authors present a method to predict disease-causing potential of LoF variants, ALoFT (annotation of Loss-of-Function Transcripts) and show its application to interpreting LoF variants in different contexts.
Significance Pseudogenes have long been considered nonfunctional elements. However, recent studies have shown they can potentially regulate the expression of protein-coding genes. Capitalizing on ...available functional-genomics data and the finished annotation of human, worm, and fly, we compared the pseudogene complements across the three phyla. We found that in contrast to protein-coding genes, pseudogenes are highly lineage specific, reflecting genome history more so than the conservation of essential biological functions. Specifically, the human pseudogene complement reflects a massive burst of retrotranspositional activity at the dawn of the primates, whereas the worm’s and fly's repertoire reflects a history of deactivated duplications. However, we also observe that pseudogenes across the three phyla have a consistent level of partial activity, with ∼15% being transcribed.
In primates and other animals, reverse transcription of mRNA followed by genomic integration creates retroduplications. Expressed retroduplications are either "retrogenes" coding for functioning ...proteins, or expressed "processed pseudogenes," which can function as noncoding RNAs. To date, little is known about the variation in retroduplications in terms of their presence or absence across individuals in the human population. We have developed new methodologies that allow us to identify "novel" retroduplications (i.e., those not present in the reference genome), to find their insertion points, and to genotype them. Using these methods, we catalogued and analyzed 174 retroduplication variants in almost one thousand humans, which were sequenced as part of Phase 1 of The 1000 Genomes Project Consortium. The accuracy of our data set was corroborated by (1) multiple lines of sequencing evidence for retroduplication (e.g., depth of coverage in exons vs. introns), (2) experimental validation, and (3) the fact that we can reconstruct a correct phylogenetic tree of human subpopulations based solely on retroduplications. We also show that parent genes of retroduplication variants tend to be expressed at the M-to-G1 transition in the cell cycle and that M-to-G1 expressed genes have more copies of fixed retroduplications than genes expressed at other times. These findings suggest that cell division is coupled to retrotransposition and, perhaps, is even a requirement for it.
A major challenge in genetic association studies is that most associated variants fall in the non-coding part of the human genome. We searched for variants associated with bone mineral density (BMD) ...after enriching the discovery cohort for loss-of-function (LoF) mutations by sequencing a subset of the Nord-Trøndelag Health Study, followed by imputation in the remaining sample (N = 19,705), and identified ten known BMD loci. However, one previously unreported variant, LoF mutation in MEPE, p.(Lys70IlefsTer26, minor allele frequency MAF = 0.8%), was associated with decreased ultradistal forearm BMD (P-value = 2.1 × 10
), and increased osteoporosis (P-value = 4.2 × 10
) and fracture risk (P-value = 1.6 × 10
). The MEPE LoF association with BMD and fractures was further evaluated in 279,435 UK (MAF = 0.05%, heel bone estimated BMD P-value = 1.2 × 10
, any fracture P-value = 0.05) and 375,984 Icelandic samples (MAF = 0.03%, arm BMD P-value = 0.12, forearm fracture P-value = 0.005). Screening for the MEPE LoF mutations before adulthood could potentially prevent osteoporosis and fractures due to the lifelong effect on BMD observed in the study. A key implication for precision medicine is that high-impact functional variants missing from the publicly available cosmopolitan panels could be clinically more relevant than polygenic risk scores.
Background The prevalence of Strongyloides stercoralis infection is estimated to be 30-100 million worldwide, although this an underestimate. Most cases remain undiagnosed due to the asymptomatic ...nature of the infection. We wanted to estimate the seroprevalence of S. stercoralis infection in a South Indian adult population. Methods To this end, we performed community-based screening of 2351 individuals (aged 18-65) in Kanchipuram District of Tamil Nadu between 2013 and 2020. Serological testing for S. stercoralis was performed using the NIE ELISA. Results Our data shows a seroprevalence of 33% (768/2351) for S. stercoralis infection which had a higher prevalence among males 36% (386/1069) than among females 29.8% (382/1282). Adults aged greater than or equal to55 (aOR = 1.65, 95% CI: 1.25-2.18) showed higher adjusted odds of association compared with other age groups. Eosinophil levels (39%) (aOR = 1.43, 95% CI: 1.19-1.74) and hemoglobin levels (24%) (aOR = 1.25, 95% CI: 1.11-1.53) were significantly associated with S. stercoralis infection. In contrast, low BMI (aOR = 1.15, 95% CI: 0.82-1.61) or the presence of diabetes mellitus (OR = 1.18, 95% CI: 0.83-1.69) was not associated with S. stercoralis seropositivity. Conclusions Our study provides evidence for a very high baseline prevalence of S. stercoralis infection in South Indian communities and this information could provide realistic and concrete planning of control measures.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Mice have been a long-standing model for human biology and disease. Here we characterize, by RNA sequencing, the transcriptional profiles of a large and heterogeneous collection of mouse tissues, ...augmenting the mouse transcriptome with thousands of novel transcript candidates. Comparison with transcriptome profiles in human cell lines reveals substantial conservation of transcriptional programmes, and uncovers a distinct class of genes with levels of expression that have been constrained early in vertebrate evolution. This core set of genes captures a substantial fraction of the transcriptional output of mammalian cells, and participates in basic functional and structural housekeeping processes common to all cell types. Perturbation of these constrained genes is associated with significant phenotypes including embryonic lethality and cancer. Evolutionary constraint in gene expression levels is not reflected in the conservation of the genomic sequences, but is associated with conserved epigenetic marking, as well as with characteristic post-transcriptional regulatory programme, in which sub-cellular localization and alternative splicing play comparatively large roles.
The availability of genome sequences of numerous organisms allows comparative study of pseudogenes in syntenic regions. Conservation of pseudogenes suggests that they might have a functional role in ...some instances.
We report the first large-scale comparative analysis of ribosomal protein pseudogenes in four mammalian genomes (human, chimpanzee, mouse and rat). To this end, we have assigned these pseudogenes in the four organisms using an automated pipeline and make the results available online. Each organism has a large number of ribosomal protein pseudogenes (approximately 1,400 to 2,800). The majority of them are processed (generated by retrotransposition). However, we do not see a correlation between the number of pseudogenes associated with a ribosomal protein gene and its mRNA abundance. Analysis of pseudogenes in syntenic regions between species shows that most are conserved between human and chimpanzee, but very few are conserved between primates and rodents. Interestingly, syntenic pseudogenes have a lower rate of nucleotide substitution than their surrounding intergenic DNA. Moreover, evidence from expressed sequence tags indicates that two pseudogenes conserved between human and mouse are transcribed. Detailed analysis shows that one of them, the pseudogene of RPS27, is likely to be a protein-coding gene. This is significant as previous reports indicated there are exactly 80 ribosomal protein genes encoded by the human genome.
Our analysis indicates that processed ribosomal protein pseudogenes abound in mammalian genomes, but few of these are conserved between primates and rodents. This highlights the large amount of recent retrotranspositional activity in mammals and a relatively larger amount of it in the rodent lineage.