Giter VIP home page Giter VIP logo

detecting_fake_news's Introduction

Detecting_Fake_News

In this project I worked on one of the biggest problem, detecting the fake news with 52,000 article with %97 Accuracy.

I started with scraping news from NYT API and The Gueardian API to have data set labeled as real news, and dowloaded fakenews dataset from kaggle.com. At the end I scraped more than 200,000 articles. I wanted to be able to represent the real world in terms of the proportion of the real news and the fake news in my training. I had 12,000 fake news articles from kaggle.com so I dediced to have more real news, assuming there are more real news then fake in real world. Eventually I had 43,000 real news and 12,000 fakenews.

Real and the fake news articles had to be in certain topics. I decided to have only from: "US News","Politics","Business" and "World", assuming that mostly fake news would be from these topics.

What's here

Cleaning the data folder has the notebooks where I parse and clean the datasets. Modeling and Grid Search folder has modeling and the grod search for the best perfomant model Scraping data has notebooks to scrape data from NYT API and The Guardian API

detecting_fake_news's People

Contributors

genyunus avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.