Adrian Böck's Projects

hate-speech-detection-on-code-mixed-dataset-using-a-fusion-of-custom-and-pre-trained-models-with-pro

As user-generated content on social media networks grows, so does the volume of hate speech and offensive language. Automatically detecting such content is an active problem in computer science, and the natural language processing community has begun to address it. Because most hate speech is generated on social media, automatic detection must cope with non-standard spelling and grammar variations. In a multilingual community the content is often code-mixed, which makes the task harder still. In this article, we propose a model for code-mixed hate speech detection. The model embeds knowledge from both user-trained and multilingual pre-trained models. The proposed method also computes a profanity word list and augments it. Experimental results on code-mixed hate speech and offensive language detection benchmarks show that our method outperforms the existing baselines.

inqugen

Intelligent Question Generation System

lazynlp

Library to scrape and clean web pages to create massive datasets.

lingfeat

LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment

malaya

Natural Language Toolkit for bahasa Malaysia, https://malaya.readthedocs.io/

pawraphrase_public

Python training script that fine-tunes Google's T5 deep learning model on the PAWS dataset to teach it to paraphrase sentences.

pdf_text

Extracts tables from a PDF document and converts them to CSV format using the PyPDF2 and re libraries.

pdflayouttextstripper

Converts a PDF file into a text file while keeping the layout of the original PDF. Useful, for instance, for extracting the content of a table in a PDF file. This is a subclass of the PDFTextStripper class (from the Apache PDFBox library).

pyqtgraph

Fast data visualization and GUI tools for scientific / engineering applications

q-ai-frag

German NLP Question Generator (work in progress)

questgen.ai

Question generation using state-of-the-art Natural Language Processing algorithms
