-
Slovenian spontaneous speech recognition and acoustic modeling of filled pauses and onomatopoeasŽgank, Andrej ; Rotovnik, Tomaž, telekomunikacije ; Sepesy Maučec, MirjamThis paper is focused on acoustic modeling for spontaneous speech recognition.This topic is still a very challenging task for speech technology research community. The attributes of spontaneous ... speech can heavily degrade speech recognizer's accuracy and performance. Filled pauses and onomatopoeias present one of such important attributes of spontaneous speech, which can give considerably worse accuracy. Although filled pauses don't carry any semantic information, they are still very important from the modeling perspective. A novel acoustic modeling approach is proposed in this paper, where the filled pauses are modeled using the phonetic broad classes, which corresponds with their acoustic-phonetic properties. The phonetic broad classes are language dependent, and can be defined by an expert or in a data-driven way. The new filled pauses modeling approach is compared with three other implicit filled pauses modeling methods. All experiments were carried out using a context-dependent Hidden Markov Models based speech recognition system. For training and evaluation, the Slovenian BNSI Broadcast News speech and text database was applied. The database contains manually transcribed recordings of TV news shows. The evaluation of the proposed acoustic modeling approach was done on a set of spontaneous speech. The overall best filled pauses acoustic modeling approach improved the speech recognizer's word accuracy for 5.70% relatively in comparison to the baseline system, without influencing the recognition time.Source: WSEAS transactions on signal processing. - ISSN 1790-5052 (Vol. 4, iss. 7, Jul. 2008, str. 388-397)Type of material - article, component partPublish date - 2008Language - englishCOBISS.SI-ID - 12706070
Author
Žgank, Andrej |
Rotovnik, Tomaž, telekomunikacije |
Sepesy Maučec, Mirjam
Topics
avtomatsko razpoznavanje govora |
razpoznavanja slovenskega jezika |
akustični modeli |
onomatopoeja |
vremenska napoved |
speech recognition |
acoustic modeling |
filles pauses |
onomatopoeas |
Slovenian spontaneous speech |
broadcast news |
HMM




Shelf entry
Permalink
- URL:
Impact factor
Access to the JCR database is permitted only to users from Slovenia. Your current IP address is not on the list of IP addresses with access permission, and authentication with the relevant AAI accout is required.
Year | Impact factor | Edition | Category | Classification | ||||
---|---|---|---|---|---|---|---|---|
JCR | SNIP | JCR | SNIP | JCR | SNIP | JCR | SNIP |
Impact factor
Select the library membership card:
DRS, in which the journal is indexed
Database name | Field | Year |
---|
Links to authors' personal bibliographies | Links to information on researchers in the SICRIS system |
---|---|
Žgank, Andrej | 20032 |
Rotovnik, Tomaž, telekomunikacije | 21304 |
Sepesy Maučec, Mirjam | 18168 |
Select pickup location:
Material pickup by post
Notification
Subject headings in COBISS General List of Subject Headings
Select pickup location
Pickup location | Material status | Reservation |
---|
Please wait a moment.