Light

kevinwild / twitter_nlp Goto Github PK

View Code? Open in Web Editor NEW

0.0 1.0 0.0 281 KB

Python 15.92% HTML 4.88% CSS 54.69% JavaScript 24.51%

twitter_nlp's Introduction

Twitter Python NLP

Installation

Clone the repo: git clone https://github.com/kevinwild/twitter_nlp
Copy sample.env from root to .env and set your twitter API keys
Install dependencies:
pip install requests_oauthlib nltk emoji python-dotenv
Configure config.json

data_store_dir = folder for writing files
- raw_file_name = file name for raw data from API
- report_file_name = file name for final report
- confidence_level = the level of probability confidence that the AI has chosen the right tone for the tweet. If the tweets confidence number is lower then this number it will not be considered a valid tweet for the sentiment analysis
- tweet_save_limit = the confidence level of tweets that should be added to the final report.
- top_word_limit = the number of most frequent words used throughout the data set.
- search_url = the main search URL, other URLs must have similar return json shape.
- python_shell_cmd" = the command you use to run python scripts in your terminal
- query_params = list of query parameters used for each API request. Full list found here
  or more info here

Run the entire program or only a section. The program overwrites their output file and analyze_data.py always has to be ran after request_api.py

run entire program: python main.py
or
- generate raw data: python request_api.py
generate report: python analyze_data.py

Report Generates the following data:

total_tweets = number of total tweets
positive_count = number of tweets that are considered positive and within the confidence threshold
negative_count = number of tweets that are considered negative and within the confidence threshold
total_certain_positive = number of tweets that are considered positive and within the tweet_save_limit threshold
total_certain_negative = number of tweets that are considered negative and within the tweet_save_limit threshold
positive_tweets = An array of objects that show positive tweets and the probability it is considered positive
negative_tweets = An array of objects that show negative tweets and the probability it is considered negative
top_N_words = multi-dimensional array with the first index being the word and the second index showing the usage count
emoji_count = multi-dimensional array with the first index being the emoji and the second index showing the usage count
user_locations = multi-dimensional array with the first index being the location name and the second index showing the amount of times that location was used in a tweet

`

twitter_nlp's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.