Giter VIP home page Giter VIP logo

ssl_project's Introduction

Abusive Language Detection in Social Media

Ionescu Diana & Dumitrescu Andrei

Overview

This repository contains the final code and documentation for the SSL Project.
The topic of the project is Abusive Language Detection in Social Media.
The project offers multiple baselines and State-of-the-Art Models.
As requirments for running the experiments you need the following APIs: TensorFlow, Numpy, PyTorch, HuggingFace Transformers, Sklearn, Pandas, NLTK.

Repository Structure

The structure of the repository is the following:
* docs folder: contains all the documentation presented during this semester
* datasets folder: contains all the datasets used for this project
* plots folder: contains some graphs plotted during the first iteration experiments
* baselines.py, first_iteration.py, second_iteration.py files: contain the code for training and evaluating the baseline / first iteration / second iteration
* torch_dataset.py file: contains a custom dataset implementation
* data_reader.py, preprocess.py files: contain auxiliary functions used for data reading and processing
* utils.py file: contains auxiliary functions used for plotting graphs
* prediction.py file: contains code for predicting on different datasets

Use Case

Each of the files baselines.py, first_iteration.py, second_iteration.py contain functions that follow the template: run_X_experiments(...).
To train and evaluate the second iteration models simply run:

python second_iteration.py

The models from the baselines and first iteration will be saved in a folder called models.
The models from the second iteration will be saved in a folder called models_torch.
Make sure that these folders exist!
To run the prediction demo simply run:

python prediction.py

You can change the global parameters to run on any dataset you want.

ssl_project's People

Contributors

andreidumitrescu99 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.