Structural variations (SVs) or copy number variations (CNVs) greatly impact the functions of the genes encoded in the genome and are responsible for diverse human diseases. Although a number of ...existing SV detection algorithms can detect many types of SVs using whole genome sequencing (WGS) data, no single algorithm can call every type of SVs with high precision and high recall.
We comprehensively evaluate the performance of 69 existing SV detection algorithms using multiple simulated and real WGS datasets. The results highlight a subset of algorithms that accurately call SVs depending on specific types and size ranges of the SVs and that accurately determine breakpoints, sizes, and genotypes of the SVs. We enumerate potential good algorithms for each SV category, among which GRIDSS, Lumpy, SVseq2, SoftSV, Manta, and Wham are better algorithms in deletion or duplication categories. To improve the accuracy of SV calling, we systematically evaluate the accuracy of overlapping calls between possible combinations of algorithms for every type and size range of SVs. The results demonstrate that both the precision and recall for overlapping calls vary depending on the combinations of specific algorithms rather than the combinations of methods used in the algorithms.
These results suggest that careful selection of the algorithms for each type and size range of SVs is required for accurate calling of SVs. The selection of specific pairs of algorithms for overlapping calls promises to effectively improve the SV detection accuracy.
The BioBank Japan (BBJ) Project was launched in 2003 with the aim of providing evidence for the implementation of personalized medicine by constructing a large, patient-based biobank (BBJ). This ...report describes the study design and profile of BBJ participants who were registered during the first 5-year period of the project.
The BBJ is a registry of patients diagnosed with any of 47 target common diseases. Patients were enrolled at 12 cooperative medical institutes all over Japan from June 2003 to March 2008. Clinical information was collected annually via interviews and medical record reviews until 2013. We collected DNA from all participants at baseline and collected annual serum samples until 2013. In addition, we followed patients who reported a history of 32 of the 47 target diseases to collect survival data, including cause of death.
During the 5-year period, 200,000 participants were registered in the study. The total number of cases was 291,274 at baseline. Baseline data for 199,982 participants (53.1% male) were available for analysis. The average age at entry was 62.7 years for men and 61.5 years for women. Follow-up surveys were performed for participants with any of 32 diseases, and survival time data for 141,612 participants were available for analysis.
The BBJ Project has constructed the infrastructure for genomic research for various common diseases. This clinical information, coupled with genomic data, will provide important clues for the implementation of personalized medicine.
•The BioBank Japan Project (BBJ) enrolled 200,000 patients with 47 target diseases.•The BBJ is one of the largest patient-based biobanks in the world.•The BBJ may allow for personalized medicine in the future.
•For precision medicine, genetic, clinical and environmental information is crucial.•Biobank Japan project constructed a clinical database of 200,000 patients.•This supplement reported the ...epidemiological studies in the Biobank Japan.
Numerous genetic variants associated with hypertension and blood pressure are known, but there is a paucity of evidence from genetic studies of resistant hypertension, especially in Asian ...populations. To identify novel genetic loci associated with resistant hypertension in the Japanese population, we conducted a genome-wide association study with 2705 resistant hypertension cases and 21,296 mild hypertension controls, all from BioBank Japan. We identified one novel susceptibility candidate locus, rs1442386 on chromosome 18p11.3 (DLGAP1), achieving genome-wide significance (odds ratio (95% CI) = 0.85 (0.81-0.90), P = 3.75 × 10
) and 18 loci showing suggestive association, including rs62525059 of 8q24.3 (CYP11B2) and rs3774427 of 3p21.1 (CACNA1D). We further detected biological processes associated with resistant hypertension, including chemical synaptic transmission, regulation of transmembrane transport, neuron development and neurological system processes, highlighting the importance of the nervous system. This study provides insights into the etiology of resistant hypertension in the Japanese population.
The extent to which the biology of oncogenesis and ageing are shaped by factors that distinguish human populations is unknown. Haematopoietic clones with acquired mutations become common with ...advancing age and can lead to blood cancers
. Here we describe shared and population-specific patterns of genomic mutations and clonal selection in haematopoietic cells on the basis of 33,250 autosomal mosaic chromosomal alterations that we detected in 179,417 Japanese participants in the BioBank Japan cohort and compared with analogous data from the UK Biobank. In this long-lived Japanese population, mosaic chromosomal alterations were detected in more than 35.0% (s.e.m., 1.4%) of individuals older than 90 years, which suggests that such clones trend towards inevitability with advancing age. Japanese and European individuals exhibited key differences in the genomic locations of mutations in their respective haematopoietic clones; these differences predicted the relative rates of chronic lymphocytic leukaemia (which is more common among European individuals) and T cell leukaemia (which is more common among Japanese individuals) in these populations. Three different mutational precursors of chronic lymphocytic leukaemia (including trisomy 12, loss of chromosomes 13q and 13q, and copy-neutral loss of heterozygosity) were between two and six times less common among Japanese individuals, which suggests that the Japanese and European populations differ in selective pressures on clones long before the development of clinically apparent chronic lymphocytic leukaemia. Japanese and British populations also exhibited very different rates of clones that arose from B and T cell lineages, which predicted the relative rates of B and T cell cancers in these populations. We identified six previously undescribed loci at which inherited variants predispose to mosaic chromosomal alterations that duplicate or remove the inherited risk alleles, including large-effect rare variants at NBN, MRE11 and CTU2 (odds ratio, 28-91). We suggest that selective pressures on clones are modulated by factors that are specific to human populations. Further genomic characterization of clonal selection and cancer in populations from around the world is therefore warranted.
Obesity is a risk factor for a wide variety of health problems. In a genome-wide association study (GWAS) of body mass index (BMI) in Japanese people (n = 173,430), we found 85 loci significantly ...associated with obesity (P < 5.0 × 10
), of which 51 were previously unknown. We conducted trans-ancestral meta-analyses by integrating these results with the results from a GWAS of Europeans and identified 61 additional new loci. In total, this study identifies 112 novel loci, doubling the number of previously known BMI-associated loci. By annotating associated variants with cell-type-specific regulatory marks, we found enrichment of variants in CD19
cells. We also found significant genetic correlations between BMI and lymphocyte count (P = 6.46 × 10
, r
= 0.18) and between BMI and multiple complex diseases. These findings provide genetic evidence that lymphocytes are relevant to body weight regulation and offer insights into the pathogenesis of obesity.
Pathogenic variants in highly penetrant genes are useful for the diagnosis, therapy, and surveillance for hereditary breast cancer. Large-scale studies are needed to inform future testing and variant ...classification processes in Japanese. We performed a case-control association study for variants in coding regions of 11 hereditary breast cancer genes in 7051 unselected breast cancer patients and 11,241 female controls of Japanese ancestry. Here, we identify 244 germline pathogenic variants. Pathogenic variants are found in 5.7% of patients, ranging from 15% in women diagnosed <40 years to 3.2% in patients ≥80 years, with BRCA1/2, explaining two-thirds of pathogenic variants identified at all ages. BRCA1/2, PALB2, and TP53 are significant causative genes. Patients with pathogenic variants in BRCA1/2 or PTEN have significantly younger age at diagnosis. In conclusion, BRCA1/2, PALB2, and TP53 are the major hereditary breast cancer genes, irrespective of age at diagnosis, in Japanese women.
Display omitted
•Biliary tract cancers are clinically and genetically heterogeneous.•32 significantly mutated genes were identified, some negatively affecting prognosis.•A novel deletion of MUC17 at ...7q22.1 was detected.•Cell-of-origin predictions suggest hepatocyte-origin of hepatitis-related ICCs.•Deleterious germline mutations of cancer-predisposing genes were detected in 11% of patients with BTC.
Biliary tract cancers (BTCs) are clinically and pathologically heterogeneous and respond poorly to treatment. Genomic profiling can offer a clearer understanding of their carcinogenesis, classification and treatment strategy. We performed large-scale genome sequencing analyses on BTCs to investigate their somatic and germline driver events and characterize their genomic landscape.
We analyzed 412 BTC samples from Japanese and Italian populations, 107 by whole-exome sequencing (WES), 39 by whole-genome sequencing (WGS), and a further 266 samples by targeted sequencing. The subtypes were 136 intrahepatic cholangiocarcinomas (ICCs), 101 distal cholangiocarcinomas (DCCs), 109 peri-hilar type cholangiocarcinomas (PHCs), and 66 gallbladder or cystic duct cancers (GBCs/CDCs). We identified somatic alterations and searched for driver genes in BTCs, finding pathogenic germline variants of cancer-predisposing genes. We predicted cell-of-origin for BTCs by combining somatic mutation patterns and epigenetic features.
We identified 32 significantly and commonly mutated genes including TP53, KRAS, SMAD4, NF1, ARID1A, PBRM1, and ATR, some of which negatively affected patient prognosis. A novel deletion of MUC17 at 7q22.1 affected patient prognosis. Cell-of-origin predictions using WGS and epigenetic features suggest hepatocyte-origin of hepatitis-related ICCs. Deleterious germline mutations of cancer-predisposing genes such as BRCA1, BRCA2, RAD51D, MLH1, or MSH2 were detected in 11% (16/146) of BTC patients.
BTCs have distinct genetic features including somatic events and germline predisposition. These findings could be useful to establish treatment and diagnostic strategies for BTCs based on genetic information.
We here analyzed genomic features of 412 BTC samples from Japanese and Italian populations. A total of 32 significantly and commonly mutated genes were identified, some of which negatively affected patient prognosis, including a novel deletion of MUC17 at 7q22.1. Cell-of-origin predictions using WGS and epigenetic features suggest hepatocyte-origin of hepatitis-related ICCs. Deleterious germline mutations of cancer-predisposing genes were detected in 11% of patients with BTC. BTCs have distinct genetic features including somatic events and germline predisposition.
We report genome-wide association studies for hematological and biochemical traits from approximately 14,700 Japanese individuals. We identified 60 associations for 8 hematological traits and 29 ...associations for 12 biochemical traits at genome-wide significance levels (P < 5 x 10(-8)). Of these, 46 associations were new to this study and 43 replicated previous reports. We compared these associated loci with those reported in similar GWAS in European populations. When the minor allele frequency was >10% in the Japanese population, 32 (94.1%) and 31 (91.2%) of the 34 hematological loci previously reported to be associated in a European population were replicated with P-values less than 0.05 and 0.01, respectively, and 31 (73.8%) and 27 (64.3%) of the 42 European biochemical loci were replicated.