Using the lemmatization technique for phonetic transcription in text-to-speech system

Kanis, Jakub; Müller, Luděk

Full metadata record

DC field	Value	Language
dc.contributor.author	Kanis, Jakub
dc.contributor.author	Müller, Luděk
dc.date.accessioned	2016-01-06T13:09:00Z
dc.date.available	2016-01-06T13:09:00Z
dc.date.issued	2004
dc.identifier.citation	KANIS, Jakub; MÜLLER, Luděk. Using the lemmatization technique for phonetic transcription in text-to-speech system. In: Text, speech and dialogue. Berlin: Springer, 2004, p. 355-361. (Lecture notes in computer science; 3206). ISBN 978-3-540-23049-6.	en
dc.identifier.isbn	978-3-540-23049-6
dc.identifier.uri	http://www.kky.zcu.cz/cs/publications/KanisJ_2004_Usingthe_1
dc.identifier.uri	http://hdl.handle.net/11025/17131
dc.description.abstract	Tento článek se zabývá technikou lemmatizace a jejím využitím pro fonetickou transkripci slov, jež jsou výjimkami z pravidelné fonetické transkripce. Lemmatizátor je založen na morfologii jazyka a používá slovník základních tvarů a množinu inverzních derivačních pravidel k nalezení lemmatizačních pravidel, která jsou nezbytná pro hledání základních tvarů slov. Dále je v článku popsán algoritmus lemmatizace a jeho nutné modifikace pro zajištění fonetické transkripce výjimek. Hlavním cílem navrženého systému je úspora paměti při uložení slovníku výjimek. Výsledky experimentů ukazují, že lze uspořit 18,3 % (Angličtina) až 98,4 % (Finština) velikosti plného slovníku výjimek. Navržená technika tedy může být s výhodou použita pro vysoce flexivní a aglutinační jazyky.	cs
dc.format	7 s.	cs
dc.format.mimetype	application/pdf
dc.language.iso	en	en
dc.publisher	Springer	en
dc.relation.ispartofseries	Lecture notes in computer science; 3206	en
dc.rights	© Jakub Kanis - Luděk Müller	cs
dc.subject	lemmatizace	cs
dc.subject	fonetická transkripce	cs
dc.subject	výjimky z fonetické transkripce	cs
dc.title	Using the lemmatization technique for phonetic transcription in text-to-speech system	en
dc.title.alternative	Využití techniky lemmatizace pro fonetickou transkripci v text-to-speech systému	cs
dc.type	článek	cs
dc.type	article	en
dc.rights.access	openAccess	en
dc.type.version	publishedVersion	en
dc.description.abstract-translated	This paper deals with a lemmatization technique and its use for the phonetic transcription of exceptional words, that is, words that are exceptions to regular phonetic transcription. The lemmatizer is based on the morphology of the language and uses a lexicon of base word forms together with a set of inverse derivation rules to acquire the lemmatization rules needed to find the base forms of words. The lemmatization algorithm and the modifications necessary for transcribing exceptional words are described. The main goal of the designed system is to save memory when storing the lexicon of exceptions. The experimental results show that between 18.3% (English) and 98.4% (Finnish) of the full exception lexicon size can be saved. Hence, the described technique can be applied with advantage to highly inflectional and agglutinative languages.	en
dc.subject.translated	lemmatization	en
dc.subject.translated	phonetic transcription	en
dc.subject.translated	exceptions to letter-to-sound conversion	en
dc.type.status	Peer-reviewed	en
Appears in collections:	Články / Articles (NTIS); Články / Articles (KKY)

Files in this item:

File	Description	Size	Format
KanisJ_2004_Usingthe_1.pdf	Full text	95.48 kB	Adobe PDF


Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/17131

All items in DSpace are protected by copyright, with all rights reserved.
