RNA-seq is now the technology of choice for genome-wide differential gene expression experiments, but it is not clear how many biological replicates are needed to ensure valid biological ...interpretation of the results or which statistical tools are best for analyzing the data. An RNA-seq experiment with 48 biological replicates in each of two conditions was performed to answer these questions and provide guidelines for experimental design. With three biological replicates, nine of the 11 tools evaluated found only 20%-40% of the significantly differentially expressed (SDE) genes identified with the full set of 42 clean replicates. This rises to >85% for the subset of SDE genes changing in expression by more than fourfold. To achieve >85% for all SDE genes regardless of fold change requires more than 20 biological replicates. The same nine tools successfully control their false discovery rate at ≲5% for all numbers of replicates, while the remaining two tools fail to control their FDR adequately, particularly for low numbers of replicates. For future RNA-seq experiments, these results suggest that at least six biological replicates should be used, rising to at least 12 when it is important to identify SDE genes for all fold changes. If fewer than 12 replicates are used, a superior combination of true positive and false positive performances makes edgeR and DESeq2 the leading tools. For higher replicate numbers, minimizing false positives is more important and DESeq marginally outperforms the other tools.
High-throughput RNA sequencing (RNA-seq) is now the standard method to determine differential gene expression. Identifying differentially expressed genes crucially depends on estimates of read-count ...variability. These estimates are typically based on statistical models such as the negative binomial distribution, which is employed by the tools edgeR, DESeq and cuffdiff. Until now, the validity of these models has usually been tested on either low-replicate RNA-seq data or simulations.
A 48-replicate RNA-seq experiment in yeast was performed and data tested against theoretical models. The observed gene read counts were consistent with both log-normal and negative binomial distributions, while the mean-variance relation followed the line of constant dispersion parameter of ∼0.01. The high-replicate data also allowed for strict quality control and screening of 'bad' replicates, which can drastically affect the gene read-count distribution.
RNA-seq data have been submitted to ENA archive with project ID PRJEB5348.
g.j.barton@dundee.ac.uk.
The heartworm Dirofilaria immitis is an important parasite of dogs. Transmitted by mosquitoes in warmer climatic zones, it is spreading across southern Europe and the Americas at an alarming pace. ...There is no vaccine, and chemotherapy is prone to complications. To learn more about this parasite, we have sequenced the genomes of D. immitis and its endosymbiont Wolbachia. We predict 10,179 protein coding genes in the 84.2 Mb of the nuclear genome, and 823 genes in the 0.9‐Mb Wolbachia genome. The D. immitis genome harbors neither DNA transposons nor active retrotransposons, and there is very little genetic variation between two sequenced isolates from Europe and the United States. The differential presence of anabolic pathways such as heme and nucleotide biosynthesis hints at the intricate metabolic interrelationship between the heartworm and Wolbachia. Comparing the proteome of D. immitis with other nematodes and with mammalian hosts, we identify families of potential drug targets, immune modulators, and vaccine candidates. This genome sequence will support the development of new tools against dirofilariasis and aid efforts to combat related human pathogens, the causative agents of lymphatic filariasis and river blindness.—Godel, C., Kumar, S., Koutsovoulos, G., Ludin, P., Nilsson, D., Comandatore, F., Wrobel, N., Thompson, M., Schmid, C. D., Goto, S., Bringaud, F., Wolstenholme, A., Bandi, C., Epe, C., Kaminsky, R., Blaxter, M., Mäser, P. The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets. FASEB J. 26, 4650–4661 (2012). www.fasebj.org
Full text
Available for:
BFBNIB, FZAB, GIS, IJS, KILJ, NLZOH, NUK, OILJ, SAZU, SBCE, SBMB, UL, UM, UPUK
DNA methylation is a dynamic epigenetic mechanism that occurs at cytosine-phosphate-guanine dinucleotide (CpG) sites. Epigenome-wide association studies (EWAS) investigate the strength of association ...between methylation at individual CpG sites and health outcomes. Although blood methylation may act as a peripheral marker of common disease states, previous EWAS have typically focused only on individual conditions and have had limited power to discover disease-associated loci. This study examined the association of blood DNA methylation with the prevalence of 14 disease states and the incidence of 19 disease states in a single population of over 18,000 Scottish individuals.
DNA methylation was assayed at 752,722 CpG sites in whole-blood samples from 18,413 volunteers in the family-structured, population-based cohort study Generation Scotland (age range 18 to 99 years). EWAS tested for cross-sectional associations between baseline CpG methylation and 14 prevalent disease states, and for longitudinal associations between baseline CpG methylation and 19 incident disease states. Prevalent cases were self-reported on health questionnaires at the baseline. Incident cases were identified using linkage to Scottish primary (Read 2) and secondary (ICD-10) care records, and the censoring date was set to October 2020. The mean time-to-diagnosis ranged from 5.0 years (for chronic pain) to 11.7 years (for Coronavirus Disease 2019 (COVID-19) hospitalisation). The 19 disease states considered in this study were selected if they were present on the World Health Organisation's 10 leading causes of death and disease burden or included in baseline self-report questionnaires. EWAS models were adjusted for age at methylation typing, sex, estimated white blood cell composition, population structure, and 5 common lifestyle risk factors. A structured literature review was also conducted to identify existing EWAS for all 19 disease states tested. The MEDLINE, Embase, Web of Science, and preprint servers were searched to retrieve relevant articles indexed as of March 27, 2023. Fifty-four of approximately 2,000 indexed articles met our inclusion criteria: assayed blood-based DNA methylation, had >20 individuals in each comparison group, and examined one of the 19 conditions considered. First, we assessed whether the associations identified in our study were reported in previous studies. We identified 69 associations between CpGs and the prevalence of 4 conditions, of which 58 were newly described. The conditions were breast cancer, chronic kidney disease, ischemic heart disease, and type 2 diabetes mellitus. We also uncovered 64 CpGs that associated with the incidence of 2 disease states (COPD and type 2 diabetes), of which 56 were not reported in the surveyed literature. Second, we assessed replication across existing studies, which was defined as the reporting of at least 1 common site in >2 studies that examined the same condition. Only 6/19 disease states had evidence of such replication. The limitations of this study include the nonconsideration of medication data and a potential lack of generalizability to individuals that are not of Scottish and European ancestry.
We discovered over 100 associations between blood methylation sites and common disease states, independently of major confounding risk factors, and a need for greater standardisation among EWAS on human disease.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Epigenetic clocks can track both chronological age (cAge) and biological age (bAge). The latter is typically defined by physiological biomarkers and risk of adverse health outcomes, including ...all-cause mortality. As cohort sample sizes increase, estimates of cAge and bAge become more precise. Here, we aim to develop accurate epigenetic predictors of cAge and bAge, whilst improving our understanding of their epigenomic architecture.
First, we perform large-scale (N = 18,413) epigenome-wide association studies (EWAS) of chronological age and all-cause mortality. Next, to create a cAge predictor, we use methylation data from 24,674 participants from the Generation Scotland study, the Lothian Birth Cohorts (LBC) of 1921 and 1936, and 8 other cohorts with publicly available data. In addition, we train a predictor of time to all-cause mortality as a proxy for bAge using the Generation Scotland cohort (1214 observed deaths). For this purpose, we use epigenetic surrogates (EpiScores) for 109 plasma proteins and the 8 component parts of GrimAge, one of the current best epigenetic predictors of survival. We test this bAge predictor in four external cohorts (LBC1921, LBC1936, the Framingham Heart Study and the Women's Health Initiative study).
Through the inclusion of linear and non-linear age-CpG associations from the EWAS, feature pre-selection in advance of elastic net regression, and a leave-one-cohort-out (LOCO) cross-validation framework, we obtain cAge prediction with a median absolute error equal to 2.3 years. Our bAge predictor was found to slightly outperform GrimAge in terms of the strength of its association to survival (HR
= 1.47 1.40, 1.54 with p = 1.08 × 10
, and HR
= 1.52 1.44, 1.59 with p = 2.20 × 10
). Finally, we introduce MethylBrowsR, an online tool to visualise epigenome-wide CpG-age associations.
The integration of multiple large datasets, EpiScores, non-linear DNAm effects, and new approaches to feature selection has facilitated improvements to the blood-based epigenetic prediction of biological and chronological age.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
Newborn heel prick blood spots are routinely used to screen for inborn errors of metabolism and life-limiting inherited disorders. The potential value of secondary data from newborn blood spot ...archives merits ethical consideration and assessment of feasibility for public benefit. Early life exposures and behaviours set health trajectories in childhood and later life. The newborn blood spot is potentially well placed to create an unbiased and cost-effective population-level retrospective birth cohort study. Scotland has retained newborn blood spots for all children born since 1965, around 3 million in total. However, a moratorium on research access is currently in place, pending public consultation.
We conducted a Citizens' Jury as a first step to explore whether research use of newborn blood spots was in the public interest. We also assessed the feasibility and value of extracting research data from dried blood spots for predictive medicine.
Jurors delivered an agreed verdict that conditional research access to the newborn blood spots was in the public interest. The Chief Medical Officer for Scotland authorised restricted lifting of the current research moratorium to allow a feasibility study. Newborn blood spots from consented Generation Scotland volunteers were retrieved and their potential for both epidemiological and biological research demonstrated.
Through the Citizens' Jury, we have begun to identify under what conditions, if any, should researchers in Scotland be granted access to the archive. Through the feasibility study, we have demonstrated the potential value of research access for health data science and predictive medicine.
DNA methylation reflects health-related environmental exposures and genetic risk, providing insights into aetiological mechanisms and potentially predicting disease onset, progression and treatment ...response. An increasingly recognised need for large-scale, longitudinally-profiled samples collected world-wide has made the development of efficient and straightforward sample collection and storage procedures a pressing issue. An alternative to the low-temperature storage of EDTA tubes of venous blood samples, which are frequently the source of the DNA used in such studies, is to collect and store at room temperature blood samples using purpose built filter paper, such as Whatman FTA® cards. Our goal was to determine whether DNA stored in this manner can be used to generate DNA methylation profiles comparable to those generated using blood samples frozen in EDTA tubes.
DNA methylation profiles were obtained from matched EDTA tube and Whatman FTA® card whole-blood samples from 62 Generation Scotland: Scottish Family Health Study participants using the Infinium HumanMethylation450 BeadChip. Multiple quality control procedures were implemented, the relationship between the two sample types assessed, and epigenome-wide association studies (EWASs) performed for smoking status, age and the interaction between these variables and sample storage method.
Dried blood spot (DBS) DNA methylation profiles were of good quality and DNA methylation profiles from matched DBS and EDTA tube samples were highly correlated (mean
= 0.991) and could distinguish between participants. EWASs replicated established associations for smoking and age, with no evidence for moderation by storage method.
Our results support the use of Whatman FTA® cards for collecting and storing blood samples for DNA methylation profiling. This approach is likely to be particularly beneficial for large-scale studies and those carried out in areas where freezer access is limited. Furthermore, our results will inform consideration of the use of newborn heel prick DBSs for research use.
The syndrome of apparent mineralocorticoid excess arises from nonfunctional mutations in 11beta-hydroxysteroid dehydrogenase type 2 (11betaHSD2), an enzyme that inactivates cortisol and confers ...aldosterone specificity on the mineralocorticoid receptor. Loss of 11betaHSD2 permits glucocorticoids to activate the mineralocorticoid receptor, and the hypertension in the syndrome is presumed to arise from volume expansion secondary to renal sodium retention. An 11betaHSD2 null mouse was generated on an inbred C57BL/6J genetic background, allowing survival to adulthood. 11betaHSD2(-/-) mice had BP approximately 20 mmHg higher on average compared with wild-type mice but were volume contracted, not volume expanded as expected. Initially, impaired sodium excretion associated with increased activity of the epithelial sodium channel was observed. By 80 days of age, however, channel activity was abolished and 11betaHSD2(-/-) mice lost salt. Despite the natriuresis, hypertension remained but was not attributable to intrinsic vascular dysfunction. Instead, urinary catecholamine levels in 11betaHSD2(-/-) mice were double those in wild-type mice, and alpha1-adrenergic receptor blockade rescued the hypertensive phenotype, suggesting that vasoconstriction contributes to the sustained hypertension in this model. In summary, it is proposed that renal sodium retention remains a key event in apparent mineralocorticoid excess but that the accompanying hypertension changes from a renal to a vascular etiology over time.
Patients with congenital adrenal hyperplasia arising from mutations of 11β-hydroxylase, the final enzyme in the glucocorticoid biosynthetic pathway, exhibit glucocorticoid deficiency, adrenal ...hyperplasia driven by unsuppressed hypothalamo-pituitary-adrenal activity, and excess mineralocorticoid activity caused by the accumulation of deoxycorticosterone. A mouse model, in which exons 3-7 of Cyp11b1 (the gene encoding 11β-hydroxylase) were replaced with cDNA encoding enhanced cyan fluorescent protein, was generated to investigate the underlying disease mechanisms. Enhanced cyan fluorescent protein was expressed appropriately in the zona fasciculata of the adrenal gland, and targeted knock-out was confirmed by urinary steroid profiles and, immunocytochemically, by the absence of 11β-hydroxylase. The null mice exhibited glucocorticoid deficiency, mineralocorticoid excess, adrenal hyperplasia, mild hypertension, and hypokalemia. They also displayed glucose intolerance. Because rodents do not synthesize adrenal androgens, changes in reproductive function such as genital virilization of females were not anticipated. However, adult homozygote females were infertile, their ovaries showing an absence of corpora lutea and a central proliferation of disorganized steroidogenic tissue. Null females responded normally to superovulation, suggesting that raised systemic progesterone levels also contribute to infertility problems. The model reveals previously unrecognized phenotypic subtleties of congenital adrenal hyperplasia.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Chronic ACTH exposure is associated with adrenal hypertrophy and steroidogenesis. The underlying molecular processes in mice have been analysed by microarray, histological and immunohistochemical ...techniques. Synacthen infused for 2 weeks markedly increased adrenal mass and plasma corticosterone levels. Microarray analysis found greater than 2-fold changes in expression of 928 genes (P < 0.001; 397 up, 531 down). These clustered in pathways involved in signalling, sterol/lipid metabolism, cell proliferation/hypertrophy and apoptosis. Signalling genes included some implicated in adrenal adenomas but also upregulated genes associated with cyclic AMP and downregulated genes associated with aldosterone synthesis. Sterol metabolism genes were those promoting cholesterol supply (Scarb1, Sqle, Apoa1) and disposal (Cyp27a1, Cyp7b1). Oil red O staining showed lipid depletion consistent with reduced expression of genes involved in lipid synthesis. Genes involved in steroidogenesis (Star, Cyp11a1, Cyp11b1) were modestly affected (P < 0.05; <1.3-fold). Increased Ki67, Ccna2, Ccnb2 and Tk1 expression complemented immunohistochemical evidence of a 3-fold change in cell proliferation. Growth arrest genes, Cdkn1a and Cdkn1c, which are known to be active in hypertrophied cells, were increased >4-fold and cross-sectional area of fasciculata cells was 2-fold greater. In contrast, genes associated with apoptosis (eg Casp12, Clu,) were downregulated and apoptotic cells (Tunel staining) were fewer (P < 0.001) and more widely distributed throughout the cortex. In , long-term steroidogenesis with ACTH excess is sustained by genes controlling cholesterol supply and adrenal mass. ACTH effects on adrenal morphology and genes controlling cell hypertrophy, proliferation and apoptosis suggest the involvement of different cell types and separate molecular pathways.