Light

ananya-agarwal / automated-cyberbullying-detection-system Goto Github PK

View Code? Open in Web Editor NEW

0.0 1.0 0.0 3.67 MB

automated-cyberbullying-detection-system's Introduction

Automated Cyberbullying Detection System

Why this project?

60% of teenagers have experienced some sort of cyberbullying.
Overall, 36.5% of people feel they have been cyberbullied in their lifetime.
Increasing Cyberbullying since digital sphere (social media) has expanded and technology has advanced so its misuse has also increased.

Abstract

What is Cyberbullying or online bullying exactly: It is when someone bullies or harasses others on the internet, particularly on social media. Harmful bullying behaviour can include posting rumours, threats, sexual remarks, personal information of victim, or hate speeches.
Victims of cyberbullying may experience lower self-esteem, increased suicidal ideation, and various negative emotional responses, including anger and depression.

Project Overview

Develop a Machine Learning model, which will classify any text into 6 categories which are as follows: age-based cyberbullying, ethnicity-based cyberbullying, gender-based cyberbullying, religion-based cyberbullying, any other form of cyberbullying, and not cyberbullying.
Further, developing chatbots for various social media platforms like Discord (https://discord.com/) to try to detect cyberbullying using the above machine learning model, and take appropriate measures.
Dataset from Kaggle having 47693 sentences / tweets.

Tech Used:

Transfer Learning, Python, Google Colab, etc.

Python Libraries used during training of the ML model:

spacy, NLTK, scikit-learn, TensorFlow, Keras, NumPy, etc.

Path Followed

Tried different models and the best came out to be Long short-term memory (LSTM) with 85% accuracy.
Datset used to train the model was the Twitter based dataset picked up from Kaggle.
After training was done, object's state was saved in the binary Pickle file so that there is no need of training the model again and again.
Discord Bot that we have created will pick up the text from the Discord via the Python script and then run the pickle file over that text and flags the text on the basis of cyberbullying and thus take appropriate actions if cyberbullying is detected.
LSTM based ML model couldn't be integrated with the Discord Bot due to unavailability of the high processors and GPU so we have to got best accuracy as 78% for the Random Forest model which can be integrated with the Discord Bot.
We have used the concept of Transfer Learning where in we are changing the last layer of the model.

Challanges and Future Scope

Further increasing accuracy of the model as it requires to take care of the context in which words are used to be able to properly classify it.
Converting the code so that it can be used in case of videos, images and audio as well.
Difficulty in the initial phases due to pandemic.

UML Diagrams

automated-cyberbullying-detection-system's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.