A Computerized approach for emotion recognition using voice

Kumara RADGD

dc.contributor.advisor	Karunarathne I
dc.contributor.author	Kumara RADGD
dc.date.accessioned	2021
dc.date.available	2021
dc.date.issued	2021
dc.identifier.citation	Kumara, R.A.D.G.D. (2021). A Computerized approach for emotion recognition using voice [Master's thesis, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.uom.lk/handle/123/20734
dc.identifier.uri	http://dl.lib.uom.lk/handle/123/20734
dc.description.abstract	The purpose of this work is to estimate a person's emotional state from a voice clip. It reviews previous machine learning research on emotion recognition from voice, reports the accuracy and recall of each study, and compares the present research with that related work. It then discusses in detail how machine learning can be used to recognize emotional state from voice. First, machine learning is explained through its categories, such as supervised and unsupervised learning algorithms, and through algorithms such as logistic regression, random forest, SVM, GaussianNB, decision tree, and k-NN. Python's SciPy library provides facilities for analyzing a voice signal and obtaining its frequency spectrum and amplitudes. The fast Fourier transform (FFT) is used to obtain the maximum amplitude across all frequencies and the frequency at which that maximum occurs. The mel scale is a scale of pitches that listeners perceive to be equally spaced from one another. The maximum mel value, the maximum amplitude across all frequencies, the frequency of the maximum amplitude, and the speaker's age and gender are used to determine the emotional state. Part of this research analyzes various types of voices according to the emotional situation. Machine learning models are then trained and tested to predict the emotional situation of a given voice. Elicited emotional speech databases are built by creating an emotional situation artificially and collecting data from the speaker. Emotional states are identified by the researcher after the features have been trained and evaluated with machine learning models. The best speech emotion recognition (SER) rates reported are between 75% and 82% for random forest and between 75% and 77% for decision tree. These results show that decision-making algorithms (random forest and decision tree) often perform better on our database. K-nearest neighbor (accuracy 50%), SVM (46%), GaussianNB (46%), and logistic regression (46%) were less accurate. We therefore conclude that rule-based algorithms are well suited to recognizing emotional state from voice.
Random forest is the best machine learning algorithm for recognizing emotional state from voice according to our experiment, with good test accuracy (between 80% and 90%).	en_US
dc.language.iso	en	en_US
dc.subject	EMOTION RECOGNITION	en_US
dc.subject	MACHINE LEARNING	en_US
dc.subject	DATA PREPROCESSING	en_US
dc.subject	VOICE CLIP	en_US
dc.subject	COMPUTER SCIENCE -Dissertation	en_US
dc.subject	INFORMATION TECHNOLOGY -Dissertation	en_US
dc.title	A Computerized approach for emotion recognition using voice	en_US
dc.type	Thesis-Abstract	en_US
dc.identifier.faculty	IT	en_US
dc.identifier.degree	MSc in Information Technology	en_US
dc.identifier.department	Department of Information Technology	en_US
dc.date.accept	2021
dc.identifier.accno	TH4572	en_US
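
The abstract describes three spectral features taken from each voice clip: the maximum amplitude across all frequencies, the frequency at which that maximum occurs, and a mel value. The following is a minimal sketch of how such features might be computed, not the thesis's actual code. It assumes NumPy's FFT routines rather than the SciPy calls the author used, a synthetic two-tone signal in place of a real recording, and the common 2595 * log10(1 + f/700) mel conversion applied to the peak frequency. The names `hz_to_mel` and `spectral_features` are hypothetical.

```python
import numpy as np


def hz_to_mel(freq_hz):
    """Convert Hz to mels with the common 2595 * log10(1 + f / 700) formula (assumed)."""
    return 2595.0 * np.log10(1.0 + freq_hz / 700.0)


def spectral_features(signal, sample_rate):
    """Return (max amplitude, frequency of max amplitude in Hz, that frequency in mels)."""
    signal = np.asarray(signal, dtype=float)
    # Magnitude of each frequency bin for a real-valued input signal.
    spectrum = np.abs(np.fft.rfft(signal))
    # Centre frequency in Hz of each bin returned by rfft.
    freqs = np.fft.rfftfreq(signal.size, d=1.0 / sample_rate)
    peak = int(np.argmax(spectrum))
    return float(spectrum[peak]), float(freqs[peak]), float(hz_to_mel(freqs[peak]))


# Synthetic one-second "clip": a 220 Hz tone plus a quieter 880 Hz tone.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
clip = np.sin(2 * np.pi * 220 * t) + 0.3 * np.sin(2 * np.pi * 880 * t)

max_amp, peak_hz, peak_mel = spectral_features(clip, sample_rate)
print(max_amp, peak_hz, peak_mel)
```

In a pipeline like the one the abstract outlines, these three values would be combined with the speaker's age and gender to form the feature vector passed to a classifier such as a random forest.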