The Consensus Coding Sequence (CCDS) collaboration involves curators at multiple centers with a goal of producing a conservative set of high quality, protein-coding region annotations for the human ...and mouse reference genome assemblies. The CCDS data set reflects a 'gold standard' definition of best supported protein annotations, and corresponding genes, which pass a standard series of quality assurance checks and are supported by manual curation. This data set supports use of genome annotation information by human and mouse researchers for effective experimental design, analysis and interpretation. The CCDS project consists of analysis of automated whole-genome annotation builds to identify identical CDS annotations, quality assurance testing and manual curation support. Identical CDS annotations are tracked with a CCDS identifier (ID) and any future change to the annotated CDS structure must be agreed upon by the collaborating members. CCDS curation guidelines were developed to address some aspects of curation in order to improve initial annotation consistency and to reduce time spent in discussing proposed annotation updates. Here, we present the current status of the CCDS database and details on our procedures to track and coordinate our efforts. We also present the relevant background and reasoning behind the curation standards that we have developed for CCDS database treatment of transcripts that are nonsense-mediated decay (NMD) candidates, for transcripts containing upstream open reading frames, for identifying the most likely translation start codons and for the annotation of readthrough transcripts. Examples are provided to illustrate the application of these guidelines. DATABASE URL: http://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi.
Advances in high-throughput mass spectrometry are making proteomics an increasingly important tool in genome annotation projects. Peptides detected in mass spectrometry experiments can be used to ...validate gene models and verify the translation of putative coding sequences (CDSs). Here, we have identified peptides that cover 35% of the genes annotated by the GENCODE consortium for the human genome as part of a comprehensive analysis of experimental spectra from two large publicly available mass spectrometry databases. We detected the translation to protein of "novel" and "putative" protein-coding transcripts as well as transcripts annotated as pseudogenes and nonsense-mediated decay targets. We provide a detailed overview of the population of alternatively spliced protein isoforms that are detectable by peptide identification methods. We found that 150 genes expressed multiple alternative protein isoforms. This constitutes the largest set of reliably confirmed alternatively spliced proteins yet discovered. Three groups of genes were highly overrepresented. We detected alternative isoforms for 10 of the 25 possible heterogeneous nuclear ribonucleoproteins, proteins with a key role in the splicing process. Alternative isoforms generated from interchangeable homologous exons and from short indels were also significantly enriched, both in human experiments and in parallel analyses of mouse and Drosophila proteomics experiments. Our results show that a surprisingly high proportion (almost 25%) of the detected alternative isoforms are only subtly different from their constitutive counterparts. Many of the alternative splicing events that give rise to these alternative isoforms are conserved in mouse. It was striking that very few of these conserved splicing events broke Pfam functional domains or would damage globular protein structures. This evidence of a strong bias toward subtle differences in CDS and likely conserved cellular function and structure is remarkable and strongly suggests that the translation of alternative transcripts may be subject to selective constraints.
Alternative splicing (AS) has the potential to greatly expand the functional repertoire of mammalian transcriptomes. However, few variant transcripts have been characterized functionally, making it ...difficult to assess the contribution of AS to the generation of phenotypic complexity and to study the evolution of splicing patterns. We have compared the AS of 309 protein-coding genes in the human ENCODE pilot regions against their mouse orthologs in unprecedented detail, utilizing traditional transcriptomic and RNAseq data. The conservation status of every transcript has been investigated, and each functionally categorized as coding (separated into coding sequence CDS or nonsense-mediated decay NMD linked) or noncoding. In total, 36.7% of human and 19.3% of mouse coding transcripts are species specific, and we observe a 3.6 times excess of human NMD transcripts compared with mouse; in contrast to previous studies, the majority of species-specific AS is unlinked to transposable elements. We observe one conserved CDS variant and one conserved NMD variant per 2.3 and 11.4 genes, respectively. Subsequently, we identify and characterize equivalent AS patterns for 22.9% of these CDS or NMD-linked events in nonmammalian vertebrate genomes, and our data indicate that functional NMD-linked AS is more widespread and ancient than previously thought. Furthermore, although we observe an association between conserved AS and elevated sequence conservation, as previously reported, we emphasize that 30% of conserved AS exons display sequence conservation below the average score for constitutive exons. In conclusion, we demonstrate the value of detailed comparative annotation in generating a comprehensive set of AS transcripts, increasing our understanding of AS evolution in vertebrates. Our data supports a model whereby the acquisition of functional AS has occurred throughout vertebrate evolution and is considered alongside amino acid change as a key mechanism in gene evolution.
Defensins are important components of innate immunity to combat bacterial and viral infections, and can even elicit antitumor responses. Clusters of defensin (DEF) genes are located in a 2 Mb range ...of the human chromosome 8p23.1. This DEF locus, however, represents one of the regions in the euchromatic part of the final human genome sequence which contains segmental duplications, and recalcitrant gaps indicating high structural dynamics.
We find that inter- and intraindividual genetic variations within this locus prevent a correct automatic assembly of the human reference genome (NCBI Build 34) which currently even contains misassemblies. Manual clone-by-clone alignment and gene annotation as well as repeat and SNP/haplotype analyses result in an alternative alignment significantly improving the DEF locus representation. Our assembly better reflects the experimentally verified variability of DEF gene and DEF cluster copy numbers. It contains an additional DEF cluster which we propose to reside between two already known clusters. Furthermore, manual annotation revealed a novel DEF gene and several pseudogenes expanding the hitherto known DEF repertoire. Analyses of BAC and working draft sequences of the chimpanzee indicates that its DEF region is also complex as in humans and DEF genes and a cluster are multiplied. Comparative analysis of human and chimpanzee DEF genes identified differences affecting the protein structure. Whether this might contribute to differences in disease susceptibility between man and ape remains to be solved. For the determination of individual DEF gene repertoires we provide a molecular approach based on DEF haplotypes.
Complexity and variability seem to be essential genomic features of the human DEF locus at 8p23.1 and provides an ongoing challenge for the best possible representation in the human reference sequence. Dissection of paralogous sequence variations, duplicon SNPs ans multisite variations as well as haplotypes by sequencing based methods is the way for future studies of interindividual DEF locus variability and its disease association.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Abstract
The Human and Vertebrate Analysis and Annotation (HAVANA) group at the Wellcome Trust Sanger Institute produced the manually annotated geneset for the Encyclopedia of DNA Elements (ENCODE) ...pilot project and, as part of the Gencode subgroup, are reprising this role in the scale up to cover the whole human genome. Our manual annotation is checked computationally and validated experimentally. Loci and transcripts predicted to be absent from the initial annotation are identified by comparison with a number of state-of-the-art algorithms for identifying exons, splice sites, transcripts and pseudogenes. Where novel features are confirmed the annotation is updated. Annotated coding transcripts are analysed to assess their coding potential by investigating patterns of conservation within the coding sequence (CDS) and comparing predicted secondary structures of annotated CDSs to similar proteins with solved structures. Annotated coding transcripts are also checked against the current set of human Consensus CDSs (CCDS) to check agreement with other participating centres (EBI, NCBI, & UCSC).An initial round of annotation and analysis of chromosomes 21 and 22 has shown that while HAVANA annotation is both comprehensive and robust, it has benefitted from computational review. 13 novel non-coding loci, 27 novel splice variants and 6 extensions to existing variants were identified, many of which were found using supporting EST/mRNA sequences that were not present at the time of initial annotation. Fewer than 10 annotated CDSs required reclassification, no CCDS sequences required updating and 26 novel pseudogene were added. The annotation of human chromosome 2 is complete and we are currently annotating chromosomes 3 and 7. Data from all members of Gencode is distributed via DAS and is now visible in our Zmap annotation interface, allowing assessment of computational predictions contemporaneous with first-pass gene annotation.
To compare the classic clomiphene citrate (CC) and hMG regime for ovarian stimulation before IVF in women who received hMG post-long protocol down-regulation with either 3 mg triptorelin INN IM or ...150 mg buserelin acetate four times daily intranasally. Furthermore, if possible, to determine the preferred method of down-regulation.
A prospective study of 150 women randomized blind to the clinician to one of three alternative ovarian stimulation regimes when passing for the first time through an IVF program during 1992.
Triptorelin INN down-regulated significantly more quickly than buserelin acetate. The non-down-regulated group CC and hMG used significantly less hMG in a shorter time. In these women LH levels at hCG administration were significantly higher. No other intergroup differences were found. Pregnancy and take-home baby rates for the overall study were, respectively, 32%:25% (per cycle) and 42%:33%; (per ET) for the triptorelin INN group 28%:22% and 39%:31%; the CC group 32%:24% and 46%:34%; and the buserelin acetate group 34%:28% and 42%:34%.
Triptorelin INN and buserelin acetate were comparable in all parameters except down-regulation. The former was significantly quicker and more sure. In none of the clinical end points measured, however, was the classic CC and hMG non-down-regulation regime significantly less effective or troublesome than where down-regulation was used. These results therefore show that although indications for down-regulation before IVF exist, it should not to be used on all patients.
Careful manual annotation of the human reference sequence provides a solid basis for the identification of disease-associated genes. Toward this end, we focused on a medically relevant 2.6-Mb region ...of the human chromosome Xp11.4 between markers DXS9851 and DXS9751 and identified 16 transcription units according to the Vertebrate Genome Annotation (Vega) rules. In order to validate these annotations, we performed a comprehensive RT-PCR expression analysis and a human-mouse comparison. This revealed, despite the high overall genomic conservation of the region, remarkable differences of the gene content between human and mouse. Whereas 12 of 16 annotations were confirmed by RT-PCR in human tissues, for only seven genes mouse orthologs could be identified and found to be expressed. This indicates that a comprehensive and experimentally supported annotation effort of the human genome simultaneously highlights regions with striking differences in gene organization to other species and may indicate evolutionary events specific to the human lineage demanding further functional analyses.
M.A.
South Africa is a country characterised by inherent diversity of culture, language and race and is undergoing a process of reconstruction and development. This entails the establishment of a new ...national identity and socio-political structure based on integration, accommodation of difference and equity of access to the basic necessities for mental, physical, social and spiritual well-being for all its citizens within budgetary and personnel limitations. Mental health care needs and services have been shaped by socio-political and economic factors in the past. Further there has been unequal access to health care, which focused on psychiatric conditions and which provided curative, institutionalised care. Little provision was made for mental health care of children and mentally handicapped persons. A climate of unprecedented change and uncertainty also impacts on the mental health care of South Africans. A vital part of the Reconstruction and Development Programme is the reorientation of health care, which includes mental health, to a comprehensive health care approach with emphasis on primary preventive and promotive health care. An invaluable, economical yet untapped source of manpower to provide primary mental health care is the paraprofessional counsellor. Paraprofessional mental health workers can provide first level mental health care in the form of growth counselling which includes primary preventive and promotive care as well as some secondary and tertiary mental health care. In order to provide safe and effective care, this category of health worker needs to be carefully selected and trained and should work under the supervision of professional mental health care workers. The role of the latter needs to be re-evaluated in the context of South Africa. Effective and efficient care can be facilitated by the use of a simple yet comprehensive model for the selection and training of paraprofessionals and to guide their contribution to mental health care in South Africa. Two theoretical shifts have influenced mental health care. These are the paradigm shift from linear Cartesian-Newtonian thinking to the circular systems epistemology and the movement in psychological theories to ecosystemic theories. These developments have given rise to the bio-psycho-social model which does provide a basis for holistic health care but it excludes the spiritual dimension. This dissertation examines the concept of spirituality and argues for its inclusion in a truly holistic model of human functioning. The Pursuit of Wholeness model is proposed as a simply yet truly holistic tool to effect efficient, economical and ethical mental health care using , paraprofessionals. The model is described in detail and its use by paraprofessional counsellors in a growth counselling situation is discussed. An evaluative study was undertaken to test the use of the Pursuit of Wholeness model by paraprofessionals in three organisations. Results showed that the Pursuit of Wholeness model has the potential to enable paraprofessional counsellors through holistic growth counselling to contribute significantly to mental health care in South Africa.