The reference sequence for each human chromosome provides the framework for understanding genome function, variation and evolution. Here we report the finished sequence and biological annotation of ...human chromosome 1. Chromosome 1 is gene-dense, with 3,141 genes and 991 pseudogenes, and many coding sequences overlap. Rearrangements and mutations of chromosome 1 are prevalent in cancer and many other diseases. Patterns of sequence variation reveal signals of recent selection in specific genes that may contribute to human fitness, and also in regions where no function is evident. Fine-scale recombination occurs in hotspots of varying intensity along the sequence, and is enriched near genes. These and other studies of human biology and disease encoded within chromosome 1 are made possible with the highly accurate annotated sequence, as part of the completed set of chromosome sequences that comprise the reference human genome.
Chromosome 6 is a metacentric chromosome that constitutes about 6% of the human genome. The finished sequence comprises 166,880,988 base pairs, representing the largest chromosome sequenced so far. ...The entire sequence has been subjected to high-quality manual annotation, resulting in the evidence-supported identification of 1,557 genes and 633 pseudogenes. Here we report that at least 96% of the protein-coding genes have been identified, as assessed by multi-species comparative sequence analysis, and provide evidence for the presence of further, otherwise unsupported exons/genes. Among these are genes directly implicated in cancer, schizophrenia, autoimmunity and many other diseases. Chromosome 6 harbours the largest transfer RNA gene cluster in the genome; we show that this cluster co-localizes with a region of high transcriptional activity. Within the essential immune loci of the major histocompatibility complex, we find HLA-B to be the most polymorphic gene on chromosome 6 and in the human genome.
Agoraphobic avoidance of everyday situations is a common feature in many mental health disorders. Avoidance can be due to a variety of fears, including concerns about negative social evaluation, ...panicking, and harm from others. The result is inactivity and isolation. Behavioural avoidance tasks (BATs) provide an objective assessment of avoidance and
anxiety but are challenging to administer and lack standardisation. Our aim was to draw on the principles of BATs to develop a self-report measure of agoraphobia symptoms.
The scale was developed with 194 patients with agoraphobia in the context of psychosis, 427 individuals in the general population with high levels of agoraphobia, and 1094 individuals with low levels of agoraphobia. Factor analysis, item response theory, and receiver operating characteristic analyses were used. Validity was assessed against a BAT, actigraphy data, and an existing agoraphobia measure. Test-retest reliability was assessed with 264 participants.
An eight-item questionnaire with avoidance and distress response scales was developed. The avoidance and distress scales each had an excellent model fit and reliably assessed agoraphobic symptoms across the severity spectrum. All items were highly discriminative (avoidance:
= 1.24-5.43; distress:
= 1.60-5.48), indicating that small increases in agoraphobic symptoms led to a high probability of item endorsement. The scale demonstrated good internal reliability, test-retest reliability, and validity.
The Oxford Agoraphobic Avoidance Scale has excellent psychometric properties. Clinical cut-offs and score ranges are provided. This precise assessment tool may help focus attention on the clinically important problem of agoraphobic avoidance.
Chromosome 9 is highly structurally polymorphic. It contains the largest autosomal block of heterochromatin, which is heteromorphic in 6-8% of humans, whereas pericentric inversions occur in more ...than 1% of the population. The finished euchromatic sequence of chromosome 9 comprises 109,044,351 base pairs and represents >99.6% of the region. Analysis of the sequence reveals many intra- and interchromosomal duplications, including segmental duplications adjacent to both the centromere and the large heterochromatic block. We have annotated 1,149 genes, including genes implicated in male-to-female sex reversal, cancer and neurodegenerative disease, and 426 pseudogenes. The chromosome contains the largest interferon gene cluster in the human genome. There is also a region of exceptionally high gene and G + C content including genes paralogous to those in the major histocompatibility complex. We have also detected recently duplicated genes that exhibit different rates of sequence divergence, presumably reflecting natural selection.
The finished sequence of human chromosome 20 comprises 59,187,298 base pairs (bp) and represents 99.4% of the euchromatic DNA. A single contig of 26 megabases (Mb) spans the entire short arm, and ...five contigs separated by gaps totalling 320 kb span the long arm of this metacentric chromosome. An additional 234,339 bp of sequence has been determined within the pericentromeric region of the long arm. We annotated 727 genes and 168 pseudogenes in the sequence. About 64% of these genes have a 5' and a 3' untranslated region and a complete open reading frame. Comparative analysis of the sequence of chromosome 20 to whole-genome shotgun-sequence data of two other vertebrates, the mouse Mus musculus and the puffer fish Tetraodon nigroviridis, provides an independent measure of the efficiency of gene annotation, and indicates that this analysis may account for more than 95% of all coding exons and almost all genes.
Chromosome 13 is the largest acrocentric human chromosome. It carries genes involved in cancer including the breast cancer type 2 (BRCA2) and retinoblastoma (RB1) genes, is frequently rearranged in ...B-cell chronic lymphocytic leukaemia, and contains the DAOA locus associated with bipolar disorder and schizophrenia. We describe completion and analysis of 95.5 megabases (Mb) of sequence from chromosome 13, which contains 633 genes and 296 pseudogenes. We estimate that more than 95.4% of the protein-coding genes of this chromosome have been identified, on the basis of comparison with other vertebrate genome sequences. Additionally, 105 putative non-coding RNA genes were found. Chromosome 13 has one of the lowest gene densities (6.5 genes per Mb) among human chromosomes, and contains a central region of 38 Mb where the gene density drops to only 3.1 genes per Mb.
The finished sequence of human chromosome 10 comprises a total of 131,666,441 base pairs. It represents 99.4% of the euchromatic DNA and includes one megabase of heterochromatic sequence within the ...pericentromeric region of the short and long arm of the chromosome. Sequence annotation revealed 1,357 genes, of which 816 are protein coding, and 430 are pseudogenes. We observed widespread occurrence of overlapping coding genes (either strand) and identified 67 antisense transcripts. Our analysis suggests that both inter- and intrachromosomal segmental duplications have impacted on the gene count on chromosome 10. Multispecies comparative analysis indicated that we can readily annotate the protein-coding genes with current resources. We estimate that over 95% of all coding exons were identified in this study. Assessment of single base changes between the human chromosome 10 and chimpanzee sequence revealed nonsense mutations in only 21 coding genes with respect to the human sequence.
Comparative genomic sequence analysis is a powerful technique for identifying regulatory regions in genomic DNA. However, its utility largely depends on the evolutionary distances between the species ...involved. Here we describe the screening of a genomic BAC library from the stripe-faced dunnart,
Sminthopsis macroura, formerly known as the narrow-footed marsupial mouse. We isolated a clone containing the
LYL1 locus, completely sequenced the 60.6-kb insert, and compared it with orthologous human and mouse sequences. Noncoding homology was substantially reduced in the human/dunnart analysis compared with human/mouse, yet we could readily identify all promoters and exons. Human/mouse/dunnart alignments of the
LYL1 candidate promoter allowed us to identify putative transcription factor binding sites, revealing a pattern highly reminiscent of critical regulatory regions of the
LYL1 paralogue,
SCL. This newly identified
LYL1 promoter showed strong activity in myeloid progenitor cells and was bound in vivo by Fli1, Elf1, and Gata2—transcription factors all previously shown to bind to the
SCL stem cell enhancer. This study represents the first large-scale comparative analysis involving marsupial genomic sequence and demonstrates that such comparisons provide a powerful approach to characterizing mammalian regulatory elements.
Comparative genomic sequence analysis is a powerful technique for identifying regulatory regions in genomic DNA. However, its utility largely depends on the evolutionary distances between the species ...involved. Here we describe the screening of a genomic BAC library from the stripe-faced dunnart, Sminthopsis macroura, formerly known as the narrow-footed marsupial mouse. We isolated a clone containing the LYL1 locus, completely sequenced the 60.6-kb insert, and compared it with orthologous human and mouse sequences. Noncoding homology was substantially reduced in the human/dunnart analysis compared with human/mouse, yet we could readily identify all promoters and exons. Human/mouse/dunnart alignments of the LYL1 candidate promoter allowed us to identify putative transcription factor binding sites, revealing a pattern highly reminiscent of critical regulatory regions of the LYL1 paralogue, SCL. This newly identified LYL1 promoter showed strong activity in myeloid progenitor cells and was bound in vivo by Fli1, Elf1, and Gata2-transcription factors all previously shown to bind to the SCL stem cell enhancer. This study represents the first large-scale comparative analysis involving marsupial genomic sequence and demonstrates that such comparisons provide a powerful approach to characterizing mammalian regulatory elements.
A growing number of applications now involve the storage and retrieval of digital images, but it is accepted that there is limited value in storing those images if one cannot easily retrieve them. ...The limitations of methods based on text-labelling are by now well known and have led to a burgeoning of research projects to develop content-based search methods. We have shown that content-based techniques, and texture in particular, can be used to cluster images, giving a reasonable correlation with assignments made by visual inspection. Options for improving the accuracy of the clustering include: synthesis of purer examples; outlining regions within the image; appropriate weighting of the components of the texture feature vector; and experimenting with alternative classifiers, particularly where texture is important.