ALL libraries (COBIB.SI union bibliographic/catalogue database)
  • Efficient noise robust feature extraction algorithms for distributed speech recognition (DSR) systems
    Kotnik, Bojan ; Vlaj, Damjan ; Horvat, Bogomir, 1936-
    The evolution of robust speech recognition systems that maintain a high level of recognition accuracy in difficult and dynamically-varying acoustical environments is becoming increasingly important ... as speech recognition technology becomes a more integral part of mobile applications. In distributed speech recognition (DSR) architecture the recogniser's front-end is located in the terminal and is connected over a data network to a remote back-end recognition server. The terminal performs the feature parameter extraction, or the front-end of the speech recognition system. These features are transmitted over a data channel to the remote back-end recogniser. DSR provides particular benefits for the applications of mobile devices such as improved recognition performance compared to using the voice channel and ubiquitous access from different networks with a guaranteed level of recognition performance. A feature extraction algorithm integrated into the DSR system is required to operate in real-time as well as with the lowest possible computational costs. In this paper, two innovative front-end processing techniques for noise robust speech recognition are presented and compared, time-domain based frame-attenuation (TD-FrAtt) and frequency-domain based frame-attenuation (FD-FrAtt). These techniques include different forms of frame-attenuation, improvement of spectral subtraction based on minimum statistics, as well as a mel-cepstrum feature extraction procedure. Tests are performed using the Slovenian SpeechDat II fixed telephone database and the Aurora 2 database together with the HTK speech recognition toolkit. The results obtained are especially encouraging for mobile DSR systems with limited sizes of available memory and processing power.
    Source: International journal of speech technology. - ISSN 1381-2416 (Vol. 6, iss. 3, July 2003, pp. 205-219)
    Type of material - article, component part
    Year - 2003
    Language - English
    COBISS.SI-ID - 7963670