Large vocabulary speech recognition of Slovenian language using morphological models

(UM)

Large vocabulary speech recognition of Slovenian language using morphological models

Sepesy Maučec, Mirjam ...

This paper concerns the development of automatic speech recognition system for Slovenian language. The large number of unique words in inflected languages is identified as the primary reason for ... performance degradation. This article discusses the statistical language models. A novel variation of the n-gram modelling theme is examined. Modelling units are chosen to be stems and endings instead of words. Only data-driven algorithms are employed to decompose words into stems and endings automatically. Significant reduction of OOV rate results when using stems and endings for modelling the Slovenian language. We as well discuss corpus-based topic-adapted language models. Language models are most often used in topic homogeneous environment. The problem of topic detection in highly inflected language is outlined, caused by appearance of several word forms derived from the same lemma. The problem is solved by using data-driven algorithms to group words of the same lemma into classes.

Source: The IEEE Region 8 EUROCON 2003 : computer as a tool : 22-24. September 2003, Faculty of Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia : proceedings (Vol. 2, str. 158-161)

Type of material - conference contribution ; adult, serious

Publish date - 2003

Language - english

COBISS.SI-ID - 8249110

Keep searching

Author
Sepesy Maučec, Mirjam | Rotovnik, Tomaž, telekomunikacije | Kačič, Zdravko | Horvat, Bogomir, 1936-

Holdings

source: The IEEE Region 8 EUROCON 2003 : computer as a tool : 22-24. September 2003, Faculty of Electrical Engineering, University of Ljubljana, Ljubljana, Slovenia : proceedings (Vol. 2, str. 158-161)

Access to the JCR database is permitted only to users from Slovenia. Your current IP address is not on the list of IP addresses with access permission, and authentication with the relevant AAI accout is required.

Year	Impact factor		Edition		Category		Classification
Year	JCR	SNIP	JCR	SNIP	JCR	SNIP	JCR	SNIP

Links to authors' personal bibliographies	Links to information on researchers in the SICRIS system
Sepesy Maučec, Mirjam	18168
Rotovnik, Tomaž, telekomunikacije	21304
Kačič, Zdravko	06821
Horvat, Bogomir, 1936-	03015

Source: Personal bibliographies and: SICRIS

The material from the parent unit is free. If the material is delivered to the pickup location from another unit, the library may charge you for this service.

Pickup location	Material status	Reservation

Upload image

Shelf entry

Adding material to shelf was successful.

Adding material to shelf failed.

It was not necessary to add the material to the shelf.

Permalink

E-mail

Impact factor

Select the library membership card:

DRS, in which the journal is indexed

Select pickup location:

Material pickup by post

Notification

Citations

Subject headings in COBISS General List of Subject Headings

Select pickup location

Reservation was successful.

Reservation failed.

Reservation...

Bibliographic data

Number of loans

Loan was successful

Loan failed

Loan was successful

Loan failed

Loan was successful

Loan failed

Loan was successful

Loan failed

Theme