Schizophrenia is a serious psychiatric disorder with a broadly undiscovered genetic etiology. Recent studies of de novo mutations (DNMs) in schizophrenia and autism have reinforced the hypothesis ...that rare genetic variation contributes to risk. We carried out exome sequencing on 57 trios with sporadic or familial schizophrenia. In sporadic trios, we observed a ~3.5-fold increase in the proportion of nonsense DNMs (0.101 vs 0.031, empirical P=0.01, Benjamini-Hochberg-corrected P=0.044). These mutations were significantly more likely to occur in genes with highly ranked probabilities of haploinsufficiency (P=0.0029, corrected P=0.006). DNMs of potential functional consequence were also found to occur in genes predicted to be less tolerant to rare variation (P=2.01 × 10(-)(5), corrected P=2.1 × 10(-)(3)). Genes with DNMs overlapped with genes implicated in autism (for example, AUTS2, CHD8 and MECP2) and intellectual disability (for example, HUWE1 and TRAPPC9), supporting a shared genetic etiology between these disorders. Functionally CHD8, MECP2 and HUWE1 converge on epigenetic regulation of transcription suggesting that this may be an important risk mechanism. Our results were consistent in an analysis of additional exome-based sequencing studies of other neurodevelopmental disorders. These findings suggest that perturbations in genes, which function in the epigenetic regulation of brain development and cognition, could have a central role in the susceptibility to, pathogenesis and treatment of mental disorders.
We report here genome-wide analysis of the tumor suppressor p53 binding sites in normal human cells. 743 high-confidence ChIP-seq peaks representing putative genomic binding sites were identified in ...normal IMR90 fibroblasts using a reference chromatin sample. More than 40% were located within 2 kb of a transcription start site (TSS), a distribution similar to that documented for individually studied, functional p53 binding sites and, to date, not observed by previous p53 genome-wide studies. Nearly half of the high-confidence binding sites in the IMR90 cells reside in CpG islands, in marked contrast to sites reported in cancer-derived cells. The distinct genomic features of the IMR90 binding sites do not reflect a distinct preference for specific sequences, since the de novo developed p53 motif based on our study is similar to those reported by genome-wide studies of cancer cells. More likely, the different chromatin landscape in normal, compared with cancer-derived cells, influences p53 binding via modulating availability of the sites. We compared the IMR90 ChIPseq peaks to the recently published IMR90 methylome1 and demonstrated that they are enriched at hypomethylated DNA. Our study represents the first genome-wide, de novo mapping of p53 binding sites in normal human cells and reveals that p53 binding sites reside in distinct genomic landscapes in normal and cancer-derived human cells.
The incidence of Barrett's esophagus (BE)-associated esophageal adenocarcinoma (EAC) is increasing. Next-generation sequencing (NGS) provides an unprecedented opportunity to uncover genomic ...alterations during BE pathogenesis and progression to EAC, but treatment-naive surgical specimens are scarce. The objective of this study was to establish the feasibility of using widely available endoscopic mucosal biopsies for successful NGS, using samples obtained from a BE 'progressor'. Paired-end whole-genome NGS was performed on the Illumina platform using libraries generated from mucosal biopsies of normal squamous epithelium (NSE), BE and EAC obtained from a patient who progressed to adenocarcinoma during endoscopic surveillance. Selective validation studies, including Sanger sequencing, immunohistochemistry and functional assays, were performed to confirm the NGS findings. NGS identified somatic nonsense mutations of AT-rich interactive domain 1A (SWI like) (ARID1A) and PPIE and an additional 37 missense mutations in BE and/or EAC, which were confirmed by Sanger sequencing. ARID1A mutations were detected in 15% (3/20) high-grade dysplasia (HGD)/EAC patients. Immunohistochemistry performed on an independent archival cohort demonstrated ARID1A protein loss in 0% (0/76), 4.9% (2/40), 14.3% (4/28), 16.0% (8/50) and 12.2% (12/98) of NSE, BE, low-grade dysplasia, HGD and EAC tissues, respectively, and was inversely associated with nuclear p53 accumulation (P=0.028). Enhanced cell growth, proliferation and invasion were observed on ARID1A knockdown in EAC cells. In addition, genes downstream of ARID1A that potentially contribute to the ARID1A knockdown phenotype were identified. Our studies establish the feasibility of using mucosal biopsies for NGS, which should enable the comparative analysis of larger 'progressor' versus 'non-progressor' cohorts. Further, we identify ARID1A as a novel tumor-suppressor gene in BE pathogenesis, reiterating the importance of aberrant chromatin in the metaplasia-dysplasia sequence.
A balanced t(1;11) translocation that transects the Disrupted in schizophrenia 1 (DISC1) gene shows genome-wide significant linkage for schizophrenia and recurrent major depressive disorder (rMDD) in ...a single large Scottish family, but genome-wide and exome sequencing-based association studies have not supported a role for DISC1 in psychiatric illness. To explore DISC1 in more detail, we sequenced 528 kb of the DISC1 locus in 653 cases and 889 controls. We report 2718 validated single-nucleotide polymorphisms (SNPs) of which 2010 have a minor allele frequency of <1%. Only 38% of these variants are reported in the 1000 Genomes Project European subset. This suggests that many DISC1 SNPs remain undiscovered and are essentially private. Rare coding variants identified exclusively in patients were found in likely functional protein domains. Significant region-wide association was observed between rs16856199 and rMDD (P=0.026, unadjusted P=6.3 × 10(-5), OR=3.48). This was not replicated in additional recurrent major depression samples (replication P=0.11). Combined analysis of both the original and replication set supported the original association (P=0.0058, OR=1.46). Evidence for segregation of this variant with disease in families was limited to those of rMDD individuals referred from primary care. Burden analysis for coding and non-coding variants gave nominal associations with diagnosis and measures of mood and cognition. Together, these observations are likely to generalise to other candidate genes for major mental illness and may thus provide guidelines for the design of future studies.
MicroRNAs (miRNAs) are key regulators of gene expression and contribute to a variety of biological processes. Abnormal miRNA expression has been reported in various diseases including pathophysiology ...of breast cancer, where they regulate protumorigenic processes including vascular invasiveness, estrogen receptor status, chemotherapy resistance, invasion and metastasis. The miRBase sequence database, a public repository for newly discovered miRNAs, has grown rapidly with approximately >10,000 entries to date. Despite this rapid growth, many miRNAs have not yet been validated, and several others are yet to be identified. A lack of a full complement of miRNAs has imposed limitations on recognizing their important roles in cancer, including breast cancer. Using deep sequencing technology, we have identified 189 candidate novel microRNAs in human breast cancer cell lines with diverse tumorigenic potential. We further show that analysis of 500-nucleotide pri-microRNA secondary structure constitutes a reliable method to predict bona fide miRNAs as judged by experimental validation. Candidate novel breast cancer miRNAs with stem lengths of greater than 30 bp resulted in the generation of precursor and mature sequences in vivo. On the other hand, candidates with stem length less than 30 bp were less efficient in producing mature miRNA. This approach may be used to predict which candidate novel miRNA would qualify as bona fide miRNAs from deep sequencing data with approximately 90% accuracy.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
MaizeCODE is a project aimed at identifying and analyzing functional elements in the maize genome. In its initial phase, MaizeCODE assayed up to five tissues from four maize strains (B73, NC350, W22, ...TIL11) by RNA-Seq, Chip-Seq, RAMPAGE, and small RNA sequencing. To facilitate reproducible science and provide both human and machine access to the MaizeCODE data, we enhanced SciApps, a cloud-based portal, for analysis and distribution of both raw data and analysis results. Based on the SciApps workflow platform, we generated new components to support the complete cycle of MaizeCODE data management. These include publicly accessible scientific workflows for the reproducible and shareable analysis of various functional data, a RESTful API for batch processing and distribution of data and metadata, a searchable data page that lists each MaizeCODE experiment as a reproducible workflow, and integrated JBrowse genome browser tracks linked with workflows and metadata. The SciApps portal is a flexible platform that allows the integration of new analysis tools, workflows, and genomic data from multiple projects. Through metadata and a ready-to-compute cloud-based platform, the portal experience improves access to the MaizeCODE data and facilitates its analysis.
Pigeonpea (Cajanus cajan), an important food legume crop in the semi-arid regions of the world and the second most important pulse crop in India, has an average crop productivity of 780 kg/ha. The ...relatively low crop yields may be attributed to non-availability of improved cultivars, poor crop husbandry and exposure to a number of biotic and abiotic stresses in pigeonpea growing regions. Narrow genetic diversity in cultivated germplasm has further hampered the effective utilization of conventional breeding as well as development and utilization of genomic tools, resulting in pigeonpea being often referred to as an ‘orphan crop legume'. To enable genomics-assisted breeding in this crop, the pigeonpea genomics initiative (PGI) was initiated in late 2006 with funding from Indian Council of Agricultural Research under the umbrella of Indo-US agricultural knowledge initiative, which was further expanded with financial support from the US National Science Foundation's Plant Genome Research Program and the Generation Challenge Program. As a result of the PGI, the last 3 years have witnessed significant progress in development of both genetic as well as genomic resources in this crop through effective collaborations and coordination of genomics activities across several institutes and countries. For instance, 25 mapping populations segregating for a number of biotic and abiotic stresses have been developed or are under development. An 11X-genome coverage bacterial artificial chromosome (BAC) library comprising of 69,120 clones have been developed of which 50,000 clones were end sequenced to generate 87,590 BAC-end sequences (BESs). About 10,000 expressed sequence tags (ESTs) from Sanger sequencing and ca. 2 million short ESTs by 454/FLX sequencing have been generated. A variety of molecular markers have been developed from BESs, microsatellite or simple sequence repeat (SSR)-enriched libraries and mining of ESTs and genomic amplicon sequencing. Of about 21,000 SSRs identified, 6,698 SSRs are under analysis along with 670 orthologous genes using a GoldenGate SNP (single nucleotide polymorphism) genotyping platform, with large scale SNP discovery using Solexa, a next generation sequencing technology, is in progress. Similarly a diversity array technology array comprising of ca. 15,000 features has been developed. In addition, >600 unique nucleotide binding site (NBS) domain containing members of the NBS-leucine rich repeat disease resistance homologs were cloned in pigeonpea; 960 BACs containing these sequences were identified by filter hybridization, BES physical maps developed using high information content fingerprinting. To enrich the genomic resources further, sequenced soybean genome is being analyzed to establish the anchor points between pigeonpea and soybean genomes. In addition, Solexa sequencing is being used to explore the feasibility of generating whole genome sequence. In summary, the collaborative efforts of several research groups under the umbrella of PGI are making significant progress in improving molecular tools in pigeonpea and should significantly benefit pigeonpea genetics and breeding. As these efforts come to fruition, and expanded (depending on funding), pigeonpea would move from an ‘orphan legume crop' to one where genomics-assisted breeding approaches for a sustainable crop improvement are routine.
We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The ...centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization.
Celotno besedilo
Dostopno za:
DOBA, IJS, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Pichia pastoris is a methylotrophic yeast increasingly important in the production of therapeutic proteins. Expression vectors are based on the methanol-inducible AOX1 promoter and are integrated ...into the host chromosome. In most cases high copy number integration has been shown to be important for high-level expression. Since this occurs at low frequency during transformation, we previously used DNA dot blot screens to identify suitable clones. In this paper we report the use of vectors containing the Tn903 kanr gene conferring G418-resistance. Initial experiments demonstrated that copy number showed a tight correlation with drug-resistance. Using a G418 growth inhibition screen, we readily isolated a series of transformants, containing progressively increasing numbers (1 to 12) of a vector expressing HIV-1 ENV, which we used to examine the relationship between copy number and foreign mRNA levels. Northern blot analysis indicated that ENV mRNA levels from a single-copy clone were nearly as high as AOX1 mRNA, and increased progressively with increasing copy number so as to greatly exceed AOX1 mRNA. We have also developed protocols for the selection, using G418, of high copy number transformants following spheroplast transformation or electroporation. We anticipate that these protocols will simplify the use of Pichia as a biotechnological tool.