NUK - logo
E-viri
Recenzirano Odprti dostop
  • Aggregation of population‐b...
    Wiel, Laurens; Venselaar, Hanka; Veltman, Joris A.; Vriend, Gert; Gilissen, Christian

    Human mutation, November 2017, Letnik: 38, Številka: 11
    Journal Article

    Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population‐based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease‐causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta‐domains high‐frequency population variation re‐occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta‐domains can improve the interpretation of genetic variation. We developed a framework to map population variation and known pathogenic mutations onto 2,750 “meta‐domains.” These meta‐domains consist of 30,853 within‐human protein domain homologues. We find that population variation re‐occurs at the same positions across domain homologues more often than expected. Additionally, we observe that the presence of pathogenic variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. These meta‐domains aid in interpreting genetic variants in protein domains.