maggieubc / cancer-sentiment-analysis
This project was run in Databricks using Spark to analyze recent news about cancer for sentiment evaluation. The goal of the project is to practice traditional NLP techniques such as tokenization, stop-word removal, count vectorization (CV), TF-IDF, and n-grams. The project also used AWS tools such as S3, Athena, and QuickSight to handle big data.
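
The NLP steps named above (tokenization, stop-word removal, n-grams, count vectors, TF-IDF) can be illustrated with a minimal plain-Python sketch. This is not the project's Spark code: the headlines, the stop-word list, and every function name below are hypothetical, and the smoothed IDF formula is one common variant chosen for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (real lists are much longer).
STOPWORDS = {"a", "an", "the", "for", "in", "of", "to", "is", "and"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stopwords(tokens):
    """Drop tokens that appear in the stop-word list."""
    return [t for t in tokens if t not in STOPWORDS]


def ngrams(tokens, n):
    """Return the contiguous n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def count_vectors(docs):
    """Count term occurrences per document (a sparse count vector)."""
    return [Counter(doc) for doc in docs]


def tf_idf(counts):
    """Weight raw counts by a smoothed IDF: log((N + 1) / (df + 1))."""
    n_docs = len(counts)
    doc_freq = Counter()
    for c in counts:
        doc_freq.update(c.keys())  # each document counts once per term
    return [
        {term: tf * math.log((n_docs + 1) / (doc_freq[term] + 1))
         for term, tf in c.items()}
        for c in counts
    ]


# Hypothetical headlines standing in for the news data.
headlines = [
    "New therapy shows promise for lung cancer",
    "Cancer screening rates fall in rural areas",
    "Study links diet to cancer risk",
]

docs = [remove_stopwords(tokenize(h)) for h in headlines]
bigrams = [ngrams(d, 2) for d in docs]
weights = tf_idf(count_vectors(docs))
```

In this sketch a term that occurs in every headline, such as "cancer", gets a weight of zero, while a term unique to one headline keeps a positive weight. That is the reason TF-IDF is often preferred over raw counts when the corpus is built around a single search keyword.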