Akademska digitalna zbirka SLovenije - logo
E-resources
Full text
Peer reviewed
  • Binarization of degraded do...
    Vo, Quang Nhat; Kim, Soo Hyung; Yang, Hyung Jeong; Lee, Gueesang

    Pattern recognition, February 2018, 2018-02-00, Volume: 74
    Journal Article

    •We propose a supervised binarization method based on the deep supervised networks.•The multi-scale deep supervised network for binarization has not been reported yet.•A hierarchical architecture is designed to distinguish text from background noises.•Different feature levels are dealt by the multi-scale architecture.•The performance results are considerably better than state-of-the-art methods. The binarization of degraded document images is a challenging problem in terms of document analysis. Binarization is a classification process in which intra-image pixels are assigned to either of the two following classes: foreground text and background. Most of the algorithms are constructed on low-level features in an unsupervised manner, and the consequent disenabling of full utilization of input-domain knowledge considerably limits distinguishing of background noises from the foreground. In this paper, a novel supervised-binarization method is proposed, in which a hierarchical deep supervised network (DSN) architecture is learned for the prediction of the text pixels at different feature levels. With higher-level features, the network can differentiate text pixels from background noises, whereby severe degradations that occur in document images can be managed. Alternatively, foreground maps that are predicted at lower-level features present a higher visual quality at the boundary area. Compared with those of traditional algorithms, binary images generated by our architecture have cleaner background and better-preserved strokes. The proposed approach achieves state-of-the-art results over widely used DIBCO datasets, revealing the robustness of the presented method.