New contact measures for the protein docking problem Lenhof, Hans-Peter
Annual Conference on Research in Computational Molecular Biology: Proceedings of the first annual international conference on Computational molecular biology; 20-23 Jan. 1997,
01/1997
Conference Proceeding
Chrome obstructive pulmonary disease (COPD) significantly increases the risk of developing cancer. Biomarker studies frequently follow a case-control set-up in which patients diagnosed with a disease ...are compared to controls. Longitudinal cohort studies such as the COPD-centered German COPD and SYstemic consequences-COmorbidities NETwork (COSYCONET) study provide the patient and biomaterial base for discovering predictive molecular markers. We asked whether microRNA (miRNA) profiles in blood collected from COPD patients prior to a tumor diagnosis could support an early diagnosis of tumor development independent of the tumor type. From 2741 participants of COSYCONET diagnosed with COPD, we selected 534 individuals including 33 patients who developed cancer during the follow-up period of 54 months and 501 patients who did not develop cancer, but had similar age, gender and smoking history. Genome-wide miRNA profiles were generated and evaluated using machine learning techniques. For patients developing cancer we identified nine miRNAs with significantly decreased abundance (two-tailed unpaired t-test adjusted for multiple testing P < 0.05), including members of the miR-320 family. The identified miRNAs regulate different cancer-related pathways including the MAPK pathway (P = 2.3 x 10(-5)). We also observed the impact of confounding factors on the generated miRNA profiles, underlining the value of our matched analysis. For selected miRNAs, qRT-PCR analysis was applied to validate the results. In conclusion, we identified several miRNAs in blood of COPD patients, which could serve as candidates for biomarkers to help identify COPD patients at risk of developing cancer.
Background: High-throughput technologies have opened new avenues to study biological processes and pathways. The interpretation of the immense amount of data sets generated nowadays needs to be ...facilitated in order to enable biologists to identify complex gene networks and functional pathways. To cope with this task multiple computer-based programs have been developed. GeneTrail is a freely available online tool that screens comparative transcriptomic data for differentially regulated functional categories and biological pathways extracted from common data bases like KEGG, Gene Ontology (GO), TRANSPATH and TRANSFAC. Additionally, GeneTrail offers a feature that allows screening of individually defined biological categories that are relevant for the respective research topic. Results: We have set up GeneTrail for the use of Arabidopsis thaliana. To test the functionality of this tool for plant analysis, we generated transcriptome data of root and leaf responses to Fe deficiency and the Arabidopsis metal homeostasis mutant nas4x-1. We performed Gene Set Enrichment Analysis (GSEA) with eight meaningful pairwise comparisons of transcriptome data sets. We were able to uncover several functional pathways including metal homeostasis that were affected in our experimental situations. Representation of the differentially regulated functional categories in Venn diagrams uncovered regulatory networks at the level of whole functional pathways. Over-Representation Analysis (ORA) of differentially regulated genes identified in pairwise comparisons revealed specific functional plant physiological categories as major targets upon Fe deficiency and in nas4x-1. Conclusion: Here, we obtained supporting evidence, that the nas4x-1 mutant was defective in metal homeostasis. It was confirmed that nas4x-1 showed Fe deficiency in roots and signs of Fe deficiency and Fe sufficiency in leaves. Besides metal homeostasis, biotic stress, root carbohydrate, leaf photosystem and specific cell biological categories were discovered as main targets for regulated changes in response to - Fe and nas4x-1. Among 258 differentially expressed genes in response to - Fe and nas4x-1 five functional categories were enriched covering metal homeostasis, redox regulation, cell division and histone acetylation. We proved that GeneTrail offers a flexible and user-adapted way to identify functional categories in large-scale plant transcriptome data sets. The distinguished feature that allowed analysis of individually assembled functional categories facilitated the study of the Arabidopsis thaliana transcriptome.
Background: Genome wide association studies (GWAS) are applied to identify genetic loci, which are associated with complex traits and human diseases. Analogous to the evolution of gene expression ...analyses, pathway analyses have emerged as important tools to uncover functional networks of genome-wide association data. Usually, pathway analyses combine statistical methods with a priori available biological knowledge. To determine significance thresholds for associated pathways, correction for multiple testing and over-representation permutation testing is applied. Results: We systematically investigated the impact of three different permutation test approaches for over-representation analysis to detect false positive pathway candidates and evaluate them on genome-wide association data of Dilated Cardiomyopathy (DCM) and Ulcerative Colitis (UC). Our results provide evidence that the gold standard - permuting the case-control status - effectively improves specificity of GWAS pathway analysis. Although permutation of SNPs does not maintain linkage disequilibrium (LD), these permutations represent an alternative for GWAS data when case-control permutations are not possible. Gene permutations, however, did not add significantly to the specificity. Finally, we provide estimates on the required number of permutations for the investigated approaches. Conclusions: To discover potential false positive functional pathway candidates and to support the results from standard statistical tests such as the Hypergeometric test, permutation tests of case control data should be carried out. The most reasonable alternative was case-control permutation, if this is not possible, SNP permutations may be carried out. Our study also demonstrates that significance values converge rapidly with an increasing number of permutations. By applying the described statistical framework we were able to discover axon guidance, focal adhesion and calcium signaling as important DCM-related pathways and Intestinal immune network for IgA production as most significant UC pathway.
An animation of a fixed-radius all-nearest-neighbors algorithm Lenhof, Hans-Peter; Smid, Michiel
Annual Symposium on Computational Geometry: Proceedings of the tenth annual symposium on Computational geometry; 06-08 June 1994,
06/1994
Conference Proceeding
A polyhedral approach to RNA sequence structure alignment Lenhof, Hans-Peter; Reinert, Knut; Vingron, Martin
Annual Conference on Research in Computational Molecular Biology: Proceedings of the second annual international conference on Computational molecular biology; 22-25 Mar. 1998,
03/1998
Conference Proceeding
Enumerating the k closest pairs optimally Lenhof, H.-P.; Smid, M.
Proceedings., 33rd Annual Symposium on Foundations of Computer Science,
1992
Conference Proceeding
Let S be a set of n points in D-dimensional space, where D is a constant, and let k be an integer between 1 and (/sub 2//sup n/) An algorithm is given that computes the k closest pairs in the set S ...in O(nlogn+k) time, using O(n+k) space. The algorithm fits in the algebraic decision tree model and is, therefore, optimal.< >