E-resources
Full text
Peer reviewed
  • Contextual invariant-integr...
    Müller, Florian; Mertins, Alfred

    Speech Communication, 07/2011, Volume: 53, Issue: 6
    Journal Article

    ► A feature-extraction method based on invariant integration is presented.
    ► Experiments show superior performance compared to standard features.
    ► The new features benefit from combination with speaker-adaptation methods.

    This work presents a feature-extraction method based on the theory of invariant integration. The invariant-integration features (IIFs) are derived from an extended time period, and their computation has very low complexity. Recognition experiments show that the presented feature type outperforms cepstral coefficients computed with a mel filterbank (MFCCs) or a gammatone filterbank (GTCCs) in both matched and mismatched training-test conditions. Even without any speaker adaptation, the presented features yield higher accuracies than MFCCs combined with vocal tract length normalization (VTLN) in matched training-test conditions. The IIFs can also be combined successfully with additional speaker-adaptation methods to further increase accuracy. In addition to standard MFCCs, contextual MFCCs are introduced; their performance lies between that of MFCCs and that of IIFs.