Design and recording of Czech sign language corpus for automatic sign language recognition

Campr, Pavel; Hrúz, Marek; Železný, Miloš

Title:	Design and recording of Czech sign language corpus for automatic sign language recognition
Other titles:	Návrh a záznam korpusu české znakové řeči pro automatické rozpoznávání znakové řeči
Authors:	Campr, Pavel; Hrúz, Marek; Železný, Miloš
Source document citation:	CAMPR, Pavel; HRÚZ, Marek; ŽELEZNÝ, Miloš. Design and recording of Czech sign language corpus for automatic sign language recognition. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08): 28-29-30 May 2008. Marrakech: ELRA, 2008, p. 678-681. ISBN 2-9517408-4-0.
Date of issue:	2008
Publisher:	ELRA
Document type:	article
URI:	http://hdl.handle.net/11025/16935 http://www.kky.zcu.cz/cs/publications/CamprP_2007_DesignandRecording
ISBN:	2-9517408-4-0
Keywords (Czech):	znaková řeč;rozpoznávání gest;korpus řeči;čeština
Keywords (English):	sign language;gesture recognition;speech corpus;Czech
Abstract (English):	In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech corpus. The corpus is intended for training and testing of an existing audio-visual speech recognition system. The name of the database is UWB-07-ICAVR, where ICAVR stands for Impaired Condition Audio Visual speech Recognition. The corpus consists of 10,000 utterances of continuous speech obtained from 50 speakers. The total length of the database is 25 hours. Each utterance is stored as a separate sentence. The corpus extends existing databases by covering conditions of variable illumination. We recorded 50 speakers, half of them men and half women. Recording was done with two cameras and two microphones. The database introduced in this paper can be used for testing visual parameterization in audio-visual speech recognition (AVSR). The corpus can easily be split into training and testing parts. Each speaker pronounced 200 sentences: the first 50 were the same for all speakers, and the rest were different. Six types of illumination were covered. The session for one speaker can fit on one DVD. All files are accompanied by visual labels. The labels specify the region of interest (the mouth and the area around it, specified by a bounding box). The actual pronunciation of each sentence is transcribed into a text file.
Rights:	© Pavel Campr, Marek Hrúz, Miloš Železný
Appears in collections:	Articles (NTIS); Articles (KIV)

Files attached to this record:

File	Description	Size	Format
CamprP_2007_DesignandRecording.pdf	Full text	1.72 MB	Adobe PDF

Use this identifier to cite or link to this record: http://hdl.handle.net/11025/16935

All records in DSpace are protected by copyright, all rights reserved.
