Kategorizace zákaznické zpětné vazby nezávislá na jazyce

Průcha, Pavel

Title:	Kategorizace zákaznické zpětné vazby nezávislá na jazyce
Other Titles:	Categorization of customer feedback independent of the language
Authors:	Průcha, Pavel
Advisor:	Steinberger Josef, Doc. Ing. Ph.D.
Referee:	Přibáň Pavel, Ing.
Issue Date:	2023
Publisher:	Západočeská univerzita v Plzni
Document type:	diplomová práce
URI:	http://hdl.handle.net/11025/53738
Keywords:	nlp;strojové učení;neuronové sítě
Keywords in different language:	nlp;machine learning;neural networks
Abstract:	Kategorizaci zákaznické zpětné vazby využívají různé společnosti pro zkvalitnění produktu nebo služby, kterou nabízí. Tato diplomová práce se zabývá kategorizací zákaznické zpětné vazby nezávislé na jazyce. Konverze mezi jazyky využívá transformaci vektorového prostoru pomocí transformační matice a strojový překlad. Datový korpus pro trénování a testování klasifikátorů je vytvořen z recenzí řetězce McDonald's, ve kterých je následně manuálně označený sentiment ve vybraných kategoriích. Tímto způsobem je vytvořený trénovací korpus z českých recenzí a testovací korpusy z anglických a německých recenzí. Data jsou v různých kombinacích testována primárně na neuronových sítích CNN a LSTM s vektorovou reprezentací textu word2vec a fasttext. Nejúspěšnější kombinací modelů je LSTM s fasttext, která je použita v demonstrátoru práce.
Abstract in different language:	Categorization of customer feedback is used by various companies to improve the quality of the product or service they offer. This thesis deals with language-independent categorization of customer feedback. Conversion between languages uses vector space transformation using a transformation matrix and machine translation. The data corpus for training and testing classifiers is created from reviews of the chain McDonald's, in which the sentiment in selected categories is then manually annotated. In this way, the training corpus is created from Czech reviews and the test corpus from English and German reviews. Data are tested in various combinations primarily on neural networks CNN and LSTM with word embeddings word2vec and fasttext. The most successful combination of models is LSTM with fasttext, which are used in the work demonstrator.
Rights:	Plný text práce je přístupný bez omezení
Appears in Collections:	Diplomové práce / Theses (KIV)

Files in This Item:

File	Description	Size	Format
A20N0102P_DP.pdf	Plný text práce	1,16 MB	Adobe PDF	View/Open
A20N0102Pposudek-op.pdf	Posudek oponenta práce	251,61 kB	Adobe PDF	View/Open
A20N0102Phodnoceni-ved.pdf	Posudek vedoucího práce	102,18 kB	Adobe PDF	View/Open
A20N0102Pobhajoba.pdf	Průběh obhajoby práce	528,28 kB	Adobe PDF	View/Open

Show full item record

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/53738

search

navigation