dc.contributor.advisor |
Premaratne S.C |
|
dc.contributor.author |
Fernando W.W.E.N. |
|
dc.date.accessioned |
2021 |
|
dc.date.available |
2021 |
|
dc.date.issued |
2021 |
|
dc.identifier.citation |
Fernando, W. W. E. N. (2021). Identify hateful comments in Sinhala language on social media [Masters Theses, University of Moratuwa]. University of Moratuwa Institutional Repository. http://dl.lib.uom.lk/handle/123/17553 |
|
dc.identifier.uri |
http://dl.lib.uom.lk/handle/123/17553 |
|
dc.description.abstract |
In present, the spread of hate speech through social media has become a very serious problem, both globally and locally. The route causefor this is the increasing use of social media with the rapid expansionof computer science and information technology. Therefore, it is very important to use sameto control this kindof situations. Although there is a mechanism in place on social media to automatically controlsuch hate speech in English language, but it is still not seen in Sinhala Language.The reason for this is the lack of knowledge about the native languages such as Sinhala in the social media service providers. Therefore, the identification of hatefulcontentsin Sinhala language is an urgent and vitaltask that needs to be addressed. This research propose lexicon based and machine learning based approaches for the automatic identification of hatefulspeech in Sinhala on social media. With different pre-processing techniques and machine learning algorithms, machine learning algorithm based approach was conducted with four different approaches. These approaches were begun with 3000 comments which is equally divided into hateful and non-hateful. Using these comments, it was able to identify the most appropriate featured groups and model toidentify the hatefulspeech in Sinhala language on social media. |
en_US |
dc.language.iso |
en |
en_US |
dc.subject |
SOCIAL MEDIA – Language Use – Sinhala |
en_US |
dc.subject |
TEXT MINING |
en_US |
dc.subject |
MULTINOMIAL NAÏVE BAYES |
en_US |
dc.subject |
CONTENT IDENTIFICATION |
en_US |
dc.subject |
INFORMATION TECHNOLOGY- Dissertation |
en_US |
dc.subject |
COMPUTER SCIENCE – Dissertations |
en_US |
dc.title |
Identify hateful comments in Sinhala language on social media |
en_US |
dc.type |
Thesis-Abstract |
en_US |
dc.identifier.faculty |
IT |
en_US |
dc.identifier.degree |
MSc in Information Technology |
en_US |
dc.identifier.department |
Department of Information Technology |
en_US |
dc.date.accept |
2021 |
|
dc.identifier.accno |
TH4553 |
en_US |