E-resources
Full text
Peer reviewed
  • m2ABQ—a proposed refinement...
    Aaltonen, H. L.; O’Reilly, M. K.; Linnau, K. F.; Dong, Q.; Johnston, S. K.; Jarvik, J. G.; Cross, N. M.

    Osteoporosis international, 01/2023, Volume: 34, Issue: 1
    Journal Article

    Summary Currently, there is no reproducible, widely accepted gold standard to classify osteoporotic vertebral body fractures (OVFs). The purpose of this study is to refine a method with clear rules to classify OVFs for machine learning purposes. The method was found to have moderate interobserver agreement that improved with training. Introduction The current methods to classify osteoporotic vertebral body fractures are considered ambiguous; there is no reproducible, accepted gold standard. The purpose of this study is to refine classification methodology by introducing clear, unambiguous rules and a refined flowchart to allow consistent classification of osteoporotic vertebral body fractures. Methods We developed a set of rules and refinements that we called m2ABQ to classify vertebrae into five categories. A fracture-enriched database of thoracic and lumbar spine radiographs of patients 65 years of age and older was retrospectively obtained from clinical institutional radiology records using natural language processing. Five raters independently classified each vertebral body using the m2ABQ system. After each annotation round, consensus sessions that included all raters were held to discuss and finalize a consensus annotation for each vertebral body where individual raters’ evaluations differed. This process led to further refinement and development of the rules. Results Each annotation round showed increase in Fleiss kappa both for presence vs absence of fracture 0.62 (0.56–0.68) to 0.70 (0.65–0.75), as well as for the whole m2ABQ scale 0.29 (0.25–0.33) to 0.54 (0.51–0.58). Conclusion The m2ABQ system demonstrates moderate interobserver agreement and practical feasibility for classifying osteoporotic vertebral body fractures. Future studies to compare the method to existing studies are warranted, as well as further development of its use in machine learning purposes.