Background: Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's ...Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results: A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35% of the unique sequences had significant similarities to known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions: This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.
The ciliate protozoan Ichthyophthirius multifiliis (Ich) is an important parasite of freshwater fish that causes 'white spot disease' leading to significant losses. A genomic resource for large-scale ...studies of this parasite has been lacking. To study gene expression involved in Ich pathogenesis and virulence, our goal was to generate expressed sequence tags (ESTs) for the development of a powerful microarray platform for the analysis of global gene expression in this species. Here, we initiated a project to sequence and analyze over 10,000 ESTs.
We sequenced 10,368 EST clones using a normalized cDNA library made from pooled samples of the trophont, tomont, and theront life-cycle stages, and generated 9,769 sequences (94.2% success rate). Post-sequencing processing led to 8,432 high quality sequences. Clustering analysis of these ESTs allowed identification of 4,706 unique sequences containing 976 contigs and 3,730 singletons. These unique sequences represent over two million base pairs (~10% of Plasmodium falciparum genome, a phylogenetically related protozoan). BLASTX searches produced 2,518 significant (E-value < 10-5) hits and further Gene Ontology (GO) analysis annotated 1,008 of these genes. The ESTs were analyzed comparatively against the genomes of the related protozoa Tetrahymena thermophila and P. falciparum, allowing putative identification of additional genes. All the EST sequences were deposited by dbEST in GenBank (GenBank: EG957858-EG966289). Gene discovery and annotations are presented and discussed.
This set of ESTs represents a significant proportion of the Ich transcriptome, and provides a material basis for the development of microarrays useful for gene expression studies concerning Ich development, pathogenesis, and virulence.
Alternative splicing (AS) is a mechanism by which the coding diversity of the genome can be greatly increased. Rates of AS are known to vary according to the complexity of eukaryotic species ...potentially explaining the tremendous phenotypic diversity among species with similar numbers of coding genes. Little is known, however, about the nature or rate of AS in teleost fish. Here, we report the characteristics of AS in teleost fish and classification and frequency of five canonical AS types. We conducted both same-species and cross-species analysis utilizing the Genome Mapping and Alignment Program (GMAP) and an AS pipeline (ASpipe) to study AS in four genome-enabled species (Danio rerio, Oryzias latipes, Gasterosteus aculeatus, and Takifugu rubripes) and one species lacking a complete genome sequence, Ictalurus punctatus. AS frequency was lowest in the highly duplicated genome of zebrafish (17% of mapped genes). The compact genome of the pufferfish showed the highest occurrence of AS (~43% of mapped genes). An inverse correlation between AS frequency and genome size was consistent across all analyzed species. Cross-species comparisons utilizing zebrafish as the reference genome allowed the identification of additional putative AS genes not revealed by zebrafish transcripts. Approximately, 50% of AS genes identified by same-species comparisons were shared among two or more species. A searchable website, the Teleost Alternative Splicing Database, was created to allow easy identification and visualization of AS transcripts in the studied teleost genomes. Our results and associated database should further our understanding of alternative splicing as an important functional and evolutionary mechanism in the genomes of teleost fish.
Genome annotation projects, gene functional studies, and phylogenetic analyses for a given organism all greatly benefit from access to a validated full-length cDNA resource. While increasingly common ...in model species, full-length cDNA resources in aquaculture species are scarce.
Through in silico analysis of catfish (Ictalurus spp.) ESTs, a total of 10,037 channel catfish and 7,382 blue catfish cDNA clones were identified as potentially encoding full-length cDNAs. Of this set, a total of 1,169 channel catfish and 933 blue catfish full-length cDNA clones were selected for re-sequencing to provide additional coverage and ensure sequence accuracy. A total of 1,745 unique gene transcripts were identified from the full-length cDNA set, including 1,064 gene transcripts from channel catfish and 681 gene transcripts from blue catfish, with 416 transcripts shared between the two closely related species. Full-length sequence characteristics (ortholog conservation, UTR length, Kozak sequence, and conserved motifs) of the channel and blue catfish were examined in detail. Comparison of gene ontology composition between full-length cDNAs and all catfish ESTs revealed that the full-length cDNA set is representative of the gene diversity encoded in the catfish transcriptome.
This study describes the first catfish full-length cDNA set constructed from several cDNA libraries. The catfish full-length cDNA sequences, and data gleaned from sequence characteristics analysis, will be a valuable resource for ongoing catfish whole-genome sequencing and future gene-based studies of function and evolution in teleost fishes.
•White bass exhibit greater resistance to F. covae than hybrid striped bass.•F. covae elicits a cytokine-mediated response in white and hybrid striped bass gill.•White bass immune response is mounted ...earlier than the hybrid in response to F. covae.
Columnaris disease is a prevalent disease in freshwater environments caused by the ubiquitous aquatic pathogen Flavobacterium species. Adhesion to the external mucosal surfaces of fishes is the initial stage of infection, and the gills specifically have been identified as both a primary target and release site for this pathogen. Demonstrated here and in previous research, the hybrid striped bass (Morone chrysops x M. saxatilis), a prominent United States aquaculture product, is more susceptible to infection with F. covae than the maternal white bass (M. chrysops) parental species. To further elucidate the mechanisms underlying differences in resistance between these fish we examined gill gene expression profiles using RNA sequencing at different timepoints after F. covae infection. Patterns of differential gene expression and association with key enrichment terms indicate the effective immune response observed in white bass includes the up-regulation of multiple cytokines (IL-1β, IL-17C, TNF-α, G-CSF, IL-8, CCL16, CXCL9), hepcidins (HAMP and HAMP2), ribosomal subunit components (e.g., RPL13), and, interestingly, the down-regulation of leptin (LEPA) during initial (1 – 4 h) stages of infection. Conversely, up-regulation of the same genes was not detected in hybrid striped bass until 24 h after infection, indicating a delay in immune response mechanisms that is ultimately ineffective in protecting the host, as this was concurrent with the onset of mortality in these fish. Collectively, the presented results include several putative pathways and candidate genes for further investigation toward characterizing immune defense mechanisms underlying the resistance (white bass) and susceptibility (hybrid striped bass) for selective breeding efforts and/or biotechnological intervention.
Comparative mapping is a powerful tool to transfer genomic information from sequenced genomes to closely related species for which whole genome sequence data are not yet available. However, such an ...approach is still very limited in catfish, the most important aquaculture species in the United States. This project was initiated to generate additional BAC end sequences and demonstrate their applications in comparative mapping in catfish.
We reported the generation of 43,000 BAC end sequences and their applications for comparative genome analysis in catfish. Using these and the additional 20,000 existing BAC end sequences as a resource along with linkage mapping and existing physical map, conserved syntenic regions were identified between the catfish and zebrafish genomes. A total of 10,943 catfish BAC end sequences (17.3%) had significant BLAST hits to the zebrafish genome (cutoff value <or= e(-5)), of which 3,221 were unique gene hits, providing a platform for comparative mapping based on locations of these genes in catfish and zebrafish. Genetic linkage mapping of microsatellites associated with contigs allowed identification of large conserved genomic segments and construction of super scaffolds.
BAC end sequences and their associated polymorphic markers are great resources for comparative genome analysis in catfish. Highly conserved chromosomal regions were identified to exist between catfish and zebrafish. However, it appears that the level of conservation at local genomic regions are high while a high level of chromosomal shuffling and rearrangements exist between catfish and zebrafish genomes. Orthologous regions established through comparative analysis should facilitate both structural and functional genome analysis in catfish.
Construction of high-density genetic linkage maps is crucially important for quantitative trait loci (QTL) studies, and they are more useful when integrated with physical maps. Such integrated maps ...are valuable genome resources for fine mapping of QTL, comparative genomics, and accurate and efficient whole-genome assembly. Previously, we established both linkage maps and a physical map for channel catfish, Ictalurus punctatus, the dominant aquaculture species in the United States. Here we added 2030 BAC end sequence (BES)-derived microsatellites from 1481 physical map contigs, as well as markers from singleton BES, ESTs, anonymous microsatellites, and SNPs, to construct a second-generation linkage map. Average marker density across the 29 linkage groups reached 1.4 cM/marker. The increased marker density highlighted variations in recombination rates within and among catfish chromosomes. This work effectively anchored 44.8% of the catfish BAC physical map contigs, covering ~52.8% of the genome. The genome size was estimated to be 2546 cM on the linkage map, and the calculated physical distance per centimorgan was 393 Kb. This integrated map should enable comparative studies with teleost model species as well as provide a framework for ordering and assembling whole-genome scaffolds.
Trisomy mapping is a powerful method for assigning genes to chicken microchromosome 16 (GGA 16). The single chicken nucleolar organizer region (NOR), the 2 major histocompatibility complex regions ...(MHC-Y and MHC-B), and CD1 genes were all previously assigned to GGA 16 using trisomy mapping. Here, we combined array comparative genomic hybridization with trisomy mapping to screen unassigned genomic scaffolds (consigned temporarily to chrUn_random) for sequences originating from GGA 16. A number of scaffolds mapped to GGA 16. Among these were scaffolds that contain genes for olfactory (OR) and cysteine-rich domain scavenger (SRCR) receptors, along with a number of genes that encode putative immunoglobulin-like receptors and other molecules. We used high-resolution cytogenomic analyses to confirm assignment of OR and SRCR genes to GGA 16 and to pinpoint members of these gene families to the q-arm in partially overlapping regions between the centromere and the NOR. Southern blots revealed sequence polymorphism within the OR/SRCR region and linkage with the MHC-Y region, thereby providing evidence for conserved linkage between OR genes and the MHC within birds. This work localizes OR genes to the vicinity of the chicken MHC and assigns additional genes, including immune defense genes, to GGA 16.
BACKGROUND: Salmonella enterica serovar Typhimurium is a major foodborne pathogen worldwide. S. Typhimurium encodes type III secretion systems via Salmonella pathogenicity islands (SPI), producing ...the major effector proteins of virulence. Previously, we identified two genes of Salmonella pyruvate metabolism that were up-regulated during chicken cell infection: pyruvate formate lyase I (pflB) and bifunctional acetaldehyde-CoA/alcohol dehydrogenase (adhE). We were therefore interested in examining the role these genes may play in the transmission of Salmonella to humans. METHODS: Mutant strains of Salmonella with single gene deletions for pflB and adhE were created. Invasion and growth in human HCT-8 intestinal epithelial cells and THP-1 macrophages was examined. Quantitative PCR was performed on 19 SPI-1 genes. RESULTS: In HCT-8 cells, both mutant strains had significantly higher intracellular counts than the wild-type from 4 to 48 h post-infection. Various SPI-1 genes in the mutants were up-regulated over the wild-type as early as 1 h and lasting until 24 h post-infection. In THP-1 cells, no significant difference in internal Salmonella counts was observed; however, SPI-1 genes were largely down-regulated in the mutants during the time-course of infection. We also found five SPI-1 genes - hilA, hilC hilD, sicP and rtsA - which were up-regulated in at least one of the mutant strains in log-phase broth cultures alone. We have therefore identified a set of SPI-1 virulence genes whose regulation is effected by the central metabolism of Salmonella.
The complement system in vertebrates plays a crucial role in immune defense via recognition and removal of pathogens. Complement is tightly regulated by a group of both soluble and cell-associated ...proteins. Complement factor I is a soluble serine protease that regulates multiple pathways in complement activation. In this work, a complement factor I transcript was isolated and sequenced from channel catfish (
Ictalurus punctatus) liver after screening expressed sequence tags. The full-length cDNA is comprised of 2284
bp in length, encoding a polypeptide of 668 amino acids. The complement factor I protein was found to be well conserved, with similar domain structures and architecture from fish to mammals. The catfish complement factor I exists as a single-copied gene in the catfish genome. Expression analysis revealed that the catfish complement factor I is constitutively expressed in all tissues and leukocyte cell lines tested, indicating its importance as a regulatory enzyme throughout channel catfish. While expression of complement factor I is often found to be in the liver in mammals, it is constitutively expressed in channel catfish and carp throughout in various tissues and organs.