Giter VIP home page Giter VIP logo

rumour-data-aug's Introduction

Training data augmentation for rumour detection using context-sensitive neural language model

Rumour Data set

Following Rumour Dataset are used in our experiment.

  • CrisisLexT26: References(labels) for the Boston marathon bombings are obtained from CrisisLexT26 corpus.

  • Twitter event datasets (2012-2016) : This is a Twitter corpus that is used as candidate tweets for data augmentation.

  • PHEME dataset: References(labels) for the five events(Ferguson unrest, Sydney siege, Ottawa shooting, Charlie hebdo attacks, and Germanwings plance crash) are obtained from this data.

Data Collection

Data collection is performed to collect social-temporal data (typically replies and retweets) for rumour source tweets.

Semantic Relatedness Computation

Semantic Relatedness computation is to locate various forms of rumours based on textual variations. Fine-tuned ELMo is employed to learn representation of tweets and pairwise cosine similarity are computed between reference rumour tweets and rumour candidate tweets.

Baseline Classification Model

We evaluated the effectiveness of our augmented rumour data in a state-of-the-art classification model for the task of rumour detection. You can find modified source code in Multitask4Veracity

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.