Akademska digitalna zbirka SLovenije - logo
E-viri
Celotno besedilo
Recenzirano Odprti dostop
  • ALLSorts: a RNA-Seq subtype...
    Schmidt, Breon; Brown, Lauren M.; Ryland, Georgina L.; Lonsdale, Andrew; Kosasih, Hansen J.; Ludlow, Louise E.; Majewski, Ian J.; Blombery, Piers; Ekert, Paul G.; Davidson, Nadia M.; Oshlack, Alicia

    Blood advances, 07/2022, Letnik: 6, Številka: 14
    Journal Article

    B-cell acute lymphoblastic leukemia (B-ALL) is the most common childhood cancer. Subtypes within B-ALL are distinguished by characteristic structural variants and mutations, which in some instances strongly correlate with responses to treatment. The World Health Organisation (WHO) recognises seven distinct classifications, or subtypes, as of 2016. However, recent studies have demonstrated that B-ALL can be segmented into 23 subtypes based on a combination of genomic features and gene expression profiles. A method to identify a patient's subtype would have clear utility. Despite this, no publically available classification methods using RNA-Seq exist for this purpose. Here we present ALLSorts: a publicly available method that uses RNA-Seq data to classify B-ALL samples to 18 known subtypes and five meta-subtypes. ALLSorts is the result of a hierarchical supervised machine learning algorithm applied to a training set of 1223 B-ALL samples aggregated from multiple cohorts. Validation revealed that ALLSorts can accurately attribute samples to subtypes and can attribute multiple subtypes to a sample. Furthermore, when applied to both paediatric and adult cohorts, ALLSorts was able to classify previously undefined samples into subtypes. ALLSorts is available and documented on GitHub (https://github.com/Oshlack/AllSorts/).