Abstract
Markedly expanded tandem repeats (TRs) have been correlated with ~60 diseases. TR diversity has been considered a clue toward understanding missing heritability. However, haplotype-resolved ...long TRs remain mostly hidden or blacked out because their complex structures (TRs composed of various units and minisatellites containing >10-bp units) make them difficult to determine accurately with existing methods. Here, using a high-precision algorithm to determine complex TR structures from long, accurate reads of PacBio HiFi, an investigation of 270 Japanese control samples yields several genome-wide findings. Approximately 322,000 TRs are difficult to impute from the surrounding single-nucleotide variants. Greater genetic divergence of TR loci is significantly correlated with more events of younger replication slippage. Complex TRs are more abundant than single-unit TRs, and a tendency for complex TRs to consist of <10-bp units and single-unit TRs to be minisatellites is statistically significant at loci with ≥500-bp TRs. Of note, 8909 loci with extended TRs (>100b longer than the mode) contain several known disease-associated TRs and are considered candidates for association with disorders. Overall, complex TRs and minisatellites are found to be abundant and diverse, even in genetically small Japanese populations, yielding insights into the landscape of long TRs.
Proline is the most abundant amino acid in wine and beer, because the yeast Saccharomyces cerevisiae hardly assimilates proline during fermentation processes. Our previous studies showed that ...arginine induces endocytosis of the proline transporter Put4, resulting in inhibition of proline utilization. We here report a possible role of arginine sensing in the inhibition of proline utilization. We first found that two basic amino acids, ornithine, and lysine, inhibit proline utilization by inducing Put4 endocytosis in a manner similar to arginine, but citrulline does not. Our genetic screening revealed that the arginine transporter Can1 is involved in the inhibition of proline utilization by arginine. Intriguingly, the arginine uptake activity of Can1 was not required for the arginine‐dependent inhibition of proline utilization, suggesting that Can1 has a function beyond its commonly known function of transporting arginine. More importantly, our biochemical analyses revealed that Can1 activates signaling cascades of protein kinase A in response to extracellular arginine. Hence, we proposed that Can1 regulates proline utilization by functioning as a transceptor possessing the activity of both a transporter and receptor of arginine.
Proline is mainly transported to the cells by the proline‐specific transporter Put4. In the presence of basic amino acids, Put4 is endocytosed dependent on the ubiquitin ligase Rsp5 and its adaptor Art3, leading to the inhibition of proline utilization. The transceptor function of Can1 plays a role in the mechanism for sensing basic amino acids that regulate the inhibition of proline utilization.
Take‐away
Basic amino acids inhibit proline utilization by inducing Put4 endocytosis.
Can1 is involved in the inhibition of proline utilization in yeast.
Can1 is a transceptor acting as both a transporter and receptor of arginine.
Abstract
Motivation
Long tandem repeat expansions of more than 1000 nt have been suggested to be associated with diseases, but remain largely unexplored in individual human genomes because read ...lengths have been too short. However, new long-read sequencing technologies can produce single reads of 10 000 nt or more that can span such repeat expansions, although these long reads have high error rates, of 10–20%, which complicates the detection of repetitive elements. Moreover, most traditional algorithms for finding tandem repeats are designed to find short tandem repeats (<1000 nt) and cannot effectively handle the high error rate of long reads in a reasonable amount of time.
Results
Here, we report an efficient algorithm for solving this problem that takes advantage of the length of the repeat. Namely, a long tandem repeat has hundreds or thousands of approximate copies of the repeated unit, so despite the error rate, many short k-mers will be error-free in many copies of the unit. We exploited this characteristic to develop a method for first estimating regions that could contain a tandem repeat, by analyzing the k-mer frequency distributions of fixed-size windows across the target read, followed by an algorithm that assembles the k-mers of a putative region into the consensus repeat unit by greedily traversing a de Bruijn graph. Experimental results indicated that the proposed algorithm largely outperformed Tandem Repeats Finder, a widely used program for finding tandem repeats, in terms of sensitivity.
Availability and implementation
https://github.com/morisUtokyo/mTR.
was the first multicellular eukaryotic genome sequenced to apparent completion. Although this assembly employed a standard
strain (N2), it used sequence data from several laboratories, with DNA ...propagated in bacteria and yeast. Thus, the N2 assembly has many differences from any
available today. To provide a more accurate
genome, we performed long-read assembly of VC2010, a modern strain derived from N2. Our VC2010 assembly has 99.98% identity to N2 but with an additional 1.8 Mb including tandem repeat expansions and genome duplications. For 116 structural discrepancies between N2 and VC2010, 97 structures matching VC2010 (84%) were also found in two outgroup strains, implying deficiencies in N2. Over 98% of N2 genes encoded unchanged products in VC2010; moreover, we predicted ≥53 new genes in VC2010. The recompleted genome of
should be a valuable resource for genetics, genomics, and systems biology.
Centromeres and large-scale structural variants evolve and contribute to genome diversity during vertebrate speciation. Here, we perform de novo long-read genome assembly of three inbred medaka ...strains that are derived from geographically isolated subpopulations and undergo speciation. Using single-molecule real-time (SMRT) sequencing, we obtain three chromosome-mapped genomes of length ~734, ~678, and ~744Mbp with a resource of twenty-two centromeric regions of length 20-345kbp. Centromeres are positionally conserved among the three strains and even between four pairs of chromosomes that were duplicated by the teleost-specific whole-genome duplication 320-350 million years ago. The centromeres do not all evolve at a similar pace; rather, centromeric monomers in non-acrocentric chromosomes evolve significantly faster than those in acrocentric chromosomes. Using methylation sensitive SMRT reads, we uncover centromeres are mostly hypermethylated but have hypomethylated sub-regions that acquire unique sequence compositions independently. These findings reveal the potential of non-acrocentric centromere evolution to contribute to speciation.
ABSTRACT
Proline is a predominant amino acid in grape must, but it is poorly utilized by the yeast Saccharomyces cerevisiae in wine-making processes. This sometimes leads to a nitrogen deficiency ...during fermentation and proline accumulation in wine. In this study, we clarified that a glucose response is involved in an inhibitory mechanism of proline utilization in yeast. Our genetic screen showed that strains with a loss-of-function mutation on the CDC25 gene can utilize proline even under fermentation conditions. Cdc25 is a regulator of the glucose response consisting of the Ras/cAMP-dependent protein kinase A (PKA) pathway. Moreover, we found that activation of the Ras/PKA pathway is necessary for the inhibitory mechanism of proline utilization. The present data revealed that crosstalk exists between the carbon and proline metabolisms. Our study could hold promise for the development of wine yeast strains that can efficiently assimilate proline during the fermentation processes.
NUP98 and NUP214 form chimeric fusion proteins that assemble into phase-separated nuclear bodies containing CRM1, a nuclear export receptor. However, these nuclear bodies’ function in controlling ...gene expression remains elusive. Here, we demonstrate that the nuclear bodies of NUP98::HOXA9 and SET::NUP214 promote the condensation of mixed lineage leukemia 1 (MLL1), a histone methyltransferase essential for the maintenance of HOX gene expression. These nuclear bodies are robustly associated with MLL1/CRM1 and co-localized on chromatin. Furthermore, whole-genome chromatin-conformation capture analysis reveals that NUP98::HOXA9 induces a drastic alteration in high-order genome structure at target regions concomitant with the generation of chromatin loops and/or rearrangement of topologically associating domains in a phase-separation-dependent manner. Collectively, these results show that the phase-separated nuclear bodies of nucleoporin fusion proteins can enhance the activation of target genes by promoting the condensation of MLL1/CRM1 and rearrangement of the 3D genome structure.
Display omitted
•SET::NUP214 and NUP98:HOXA9 induce the condensation of MLL1/CRM1 at particular genomic loci•NUP98:HOXA9 alters TAD structures and 3D genome organization•NUP98:HOXA9 generates novel enhancers upon binding•NUP98:HOXA9 functions by phase separation and DNA binding
Oka et al. show that nucleoporin fusion forms phase-separated bodies with two functions: condensing functional molecules (e.g., MLL1) and organizing the 3D genome structure. These functions enable target gene activation (e.g., Hoxa and Hoxb cluster genes, and Meis1).
Proline is a pivotal and multifunctional amino acid that is used not only as a nitrogen source but also as a stress protectant and energy source. Therefore, proline metabolism is known to be ...important in maintaining cellular homeostasis. Here, we discovered that proline oxidation, catalyzed by the proline oxidase Put1, a mitochondrial flavin-dependent enzyme converting proline into ∆1-pyrroline-5-carboxylate, controls the chronological lifespan of the yeast Saccharomyces cerevisiae. Intriguingly, the yeast strain with PUT1 deletion showed a reduced chronological lifespan compared with the wild-type strain. The addition of proline to the culture medium significantly increased the longevity of wild-type cells but not that of PUT1-deleted cells. We next found that induction of the transcriptional factor Put3-dependent PUT1 and degradation of proline occur during the aging of yeast cells. Additionally, the lifespan of the PUT3-deleted strain, which is deficient in PUT1 induction, was shorter than that of the wild-type strain. More importantly, the oxidation of proline by Put1 helped maintain the mitochondrial membrane potential and ATP production through the aging period. These results indicate that mitochondrial energy metabolism is maintained through oxidative degradation of proline and that this process is important in regulating the longevity of yeast cells.
Sarcopenia is a muscle loss syndrome known as a risk factor of various carcinomas. The impact of sarcopenia and sarcopenia-related inflammatory/nutritional markers in metastatic urothelial carcinoma ...(mUC) treated with pembrolizumab was unknown, so this retrospective study of 27 patients was performed. Psoas muscle mass index (PMI) was calculated by bilateral psoas major muscle area at the L3 with computed tomography. The cut-off PMI value for sarcopenia was defined as ≤6.36 cm
/m
for men and ≤3.92 cm
/m
for women. Neutrophil-to-lymphocyte ratio (NLR) ≥ 4.0 and sarcopenia correlated with significantly shorter progression-free survival (PFS) (hazard ratio (HR) 3.81,
= 0.020; and HR 2.99,
= 0.027, respectively). Multivariate analyses identified NLR ≥ 4.0 and sarcopenia as independent predictors for PFS (HR 2.89,
= 0.025; and HR 2.79,
= 0.030, respectively). Prognostic nutrition index < 45, NLR ≥ 4.0 and sarcopenia were correlated with significantly worse for overall survival (OS) (HR 3.44,
= 0.046; HR 4.26,
= 0.024; and HR 3.92,
= 0.012, respectively). Multivariate analyses identified sarcopenia as an independent predictor for OS (HR 4.00,
= 0.026). Furthermore, a decrease in PMI ≥ 5% in a month was an independent predictor of PFS and OS (HR 12.8,
= 0.008; and HR 6.21,
= 0.036, respectively). Evaluation of sarcopenia and inflammatory/nutritional markers may help in the management of mUC with pembrolizumab.
Mitochondrial diseases (MDs) are occasionally difficult to diagnose. Growth differentiation factor 15 (GDF15) has been reported as a biomarker useful for not only diagnosing MDs, but also evaluating ...disease severity and therapeutic efficacy. To enable the measurement of serum GDF15 concentrations at medical institutions, we developed a new latex‐enhanced turbidimetric immunoassay (LTIA) as an automated diagnostic indication test for MDs. We also examined the equivalency of specificity and sensitivity in measuring serum GDF15 concentrations between a commercially available enzyme‐linked immunosorbent assay (ELISA) kit and a novel LTIA device in patients with MDs, disease controls, and healthy controls. A clinical performance study used a newly developed LTIA device and an existing ELISA kit to measure the concentrations of GDF15 in 35 MD patients, 111 disease controls, and 86 healthy controls. The median (first quartile‐third quartile) of serum GDF15 concentrations measured with the LTIA device was significantly higher (P < .001) in MD patients (1389.0 U/mL 869.5‐1776.0 U/mL) than in healthy controls (380.5 U/mL 330.2‐471.8 U/mL); the interquartile ranges did not overlap between MD patients and healthy controls. The areas under the curve in disease and healthy controls were 0.812 (95% confidence interval CI: 0.734‐0.886) and 0.951 (95% CI: 0.910‐0.992), respectively. The automated, high‐throughput technology‐based LTIA device has definite advantages over the ELISA kit in shorter processing time and lower estimated cost per sample measurement. The LTIA device of GDF15 may be a sufficiently reliable, frontline, diagnostic indicator of individuals with suspected MDs in the general population.