Lexicon-Based Indonesian Local Language Abusive Words Dictionary to Detect Hate Speech in Social Media

Mardhiya Hayaty

Sumarni Adi

Anggit Dwi Hartanto

Resumen

Background: Hate speech is an expression to someone or a group of people that contain feelings of hate and/or anger at people or groups. On social media users are free to express themselves by writing harsh words and share them with a group of people so that it triggers separations and conflicts between groups. Currently, research has been conducted by several experts to detect hate speech in social media namely machine learning-based and lexicon-based, but the machine learning approach has a weakness namely the manual labelling process by an annotator in separating positive, negative or neutral opinions takes time long and tiringObjective: This study aims to produce a dictionary containing abusive words from local languages in Indonesia. Lexicon-base is very dependent on the language contained in dictionary words. Indonesia has thousands of tribes with 2500 local languages, and 80% of the population of Indonesia use local languages in communication, with the result that a significant challenge to detect hate speech of social media.Methods: Abusive words surveys are conducted by using proportionate stratified random sampling techniques in 4 major tribes on the island of Java, namely Betawi, Sundanese, Javanese, MadureseResults: The experimental results produce 250 abusive words dictionary from 4 major Indonesian tribes to detect hate speech in Indonesian social media by using the lexicon-based approach. Conclusion: A stratified random sampling technique has been conducted in 4 major Indonesian tribes to produce 250 abusive words for hate speech detection using the lexicon-based approach.

Acceso

PÁGINAS

pp. 9 - 17

NÚMERO

Volumen: 6 Número: 1 Parte: 0 (2020)

MATERIAS

ADMINISTRACIÓN
INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Emitter: International Journal of Engineering Technology
Information
Applied Sciences

DOI

http://dx.doi.org/10.20473/jisebi.6.1.9-17

Artículos similares

Social Media Hate Speech Detection Using Explainable Artificial Intelligence (XAI)

Acceso

Harshkumar Mehta and Kalpdrum Passi

Explainable artificial intelligence (XAI) characteristics have flexible and multifaceted potential in hate speech detection by deep learning models. Interpreting and explaining decisions made by complex artificial intelligence (AI) models to understand t... ver más

Revista: Algorithms

Detection of Racist Language in French Tweets

Acceso

Natalia Vanetik and Elisheva Mimoun

Toxic online content has become a major issue in recent years due to the exponential increase in the use of the internet. In France, there has been a significant increase in hate speech against migrant and Muslim communities following events such as Grea... ver más

Revista: Information

Risk Determination versus Risk Perception: A New Model of Reality for Human?Machine Autonomy

Acceso

William Lawless

We review the progress in developing a science of interdependence applied to the determinations and perceptions of risk for autonomous human?machine systems based on a case study of the Department of Defense?s (DoD) faulty determination of risk in a dron... ver más

Revista: Informatics

Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model

Acceso

Wassen Aldjanabi, Abdelghani Dahou, Mohammed A. A. Al-qaness, Mohamed Abd Elaziz, Ahmed Mohamed Helmi and Robertas Dama?evicius

As social media platforms offer a medium for opinion expression, social phenomena such as hatred, offensive language, racism, and all forms of verbal violence have increased spectacularly. These behaviors do not affect specific countries, groups, or comm... ver más

Revista: Informatics

Change Point Detection in Terrorism-Related Online Content Using Deep Learning Derived Indicators

Acceso

Ourania Theodosiadou, Kyriaki Pantelidou, Nikolaos Bastas, Despoina Chatzakou, Theodora Tsikrika, Stefanos Vrochidis and Ioannis Kompatsiaris

Given the increasing occurrence of deviant activities in online platforms, it is of paramount importance to develop methods and tools that allow in-depth analysis and understanding to then develop effective countermeasures. This work proposes a framework... ver más

Revista: Information

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas