Reducing footprint of unit selection TTS system by excluding utterances from source speech corpus

Matoušek, Jindřich; Tihelka, Daniel; Hanzlíček, Zdeněk

Full metadata record

DC field	Value	Language
dc.contributor.author	Matoušek, Jindřich
dc.contributor.author	Tihelka, Daniel
dc.contributor.author	Hanzlíček, Zdeněk
dc.date.accessioned	2015-12-16T07:52:07Z	-
dc.date.available	2015-12-16T07:52:07Z	-
dc.date.issued	2009
dc.identifier.citation	MATOUŠEK, Jindřich; TIHELKA, Daniel; HANZLÍČEK, Zdeněk. Reducing footprint of unit selection TTS system by excluding utterances from source speech corpus. In: Speech processing 19th Czech – German workshop 29th September – 1st October 2009. Prague: Institute of Photonics and Electronics Academy of Sciences of the Czech Republic, 2009, p. 92-98. ISBN 978-80-86269-18-4.	en
dc.identifier.isbn	978-80-86269-18-4
dc.identifier.uri	http://www.kky.zcu.cz/cs/publications/MatousekJ_2009_ReducingFootprintof
dc.identifier.uri	http://hdl.handle.net/11025/17017
dc.format	8 p.	cs
dc.format.mimetype	application/pdf
dc.language.iso	en	en
dc.publisher	Institute of Photonics and Electronics Academy of Sciences of the Czech Republic	en
dc.rights	© Jindřich Matoušek - Daniel Tihelka - Zdeněk Hanzlíček	cs
dc.subject	syntéza řeči	cs
dc.subject	výběr jednotky	cs
dc.subject	korpus řeči	cs
dc.title	Reducing footprint of unit selection TTS system by excluding utterances from source speech corpus	en
dc.title.alternative	Snižování paměťových nároků systému TTS pracujícího na principu výběru jednotek vyhozením promluv ze zdrojového řečového korpusu	cs
dc.type	článek	cs
dc.type	article	en
dc.rights.access	openAccess	en
dc.type.version	publishedVersion	en
dc.description.abstract-translated	Current unit selection speech synthesis systems are capable of producing high-quality speech at the expense of enormous computational and storage requirements. In this paper, an existing large speech corpus employed for unit-selection-based synthesis of Czech speech is analysed. Subsequently, a procedure for excluding a number of utterances from the source speech corpus is proposed. The procedure is based on statistics of how all utterances are utilised during text-to-speech synthesis of a large amount of text. The exclusion of whole utterances was preferred over the exclusion of particular instances of speech units in order to preserve the main feature of the unit selection framework: selecting the longest possible sequence of contiguous speech units. After the exclusion, the footprint of the system was reduced by approximately 42 %. The resulting synthetic speech was then judged by means of 5-scale CCR listening tests and evaluated on average as only "slightly worse" than speech generated by the baseline (i.e. not reduced) system.	en
dc.subject.translated	speech synthesis	en
dc.subject.translated	unit selection	en
dc.subject.translated	speech corpus	en
dc.type.status	Peer-reviewed	en
Appears in collections:	Články / Articles (NTIS) Články / Articles (KIV)

Files attached to this record:

File	Description	Size	Format
MatousekJ_2009_ReducingFootprintof.pdf	Full text	742.59 kB	Adobe PDF

Use this identifier to cite or link to this record: http://hdl.handle.net/11025/17017

All records in DSpace are protected by copyright, all rights reserved.
