Comments (6)
Would appreciate a swift response.
Thanks in advance
from birdspotter.
Update: I assumed that my JSON Lines file had a lot of stop words, so I ran a Python program that removes them.
This is the updated JSONL file
JSON FILE
{"created_at": "Sun Jan 10 23:57:57 +0000 2021", "id": 1348418836060643332, "id_str": "1348418836060643332", "text": "RT @randomsakuga: Key Animation: https://t.co/Rl0OlpzfN6Series: Rise Teenage Mutant Ninja Turtles (2019)https://t.co/LhTssYyf3A ", "truncated": false, "entities": {"hashtags": [], "symbols": [], "user_mentions": [{"screen_name": "randomsakuga", "name": "randomsakuga", "id": 835969639851126784, "id_str": "835969639851126784", "indices": [3, 16]}], "urls": [{"url": "https://t.co/Rl0OlpzfN6", "expanded_url": "https://pastebin.com/raw/tqwCF1Ue", "display_url": "pastebin.com/raw/tqwCF1Ue", "indices": [33, 56]}, {"url": "https://t.co/LhTssYyf3A", "expanded_url": "https://www.sakugabooru.com/post/show/107024", "display_url": "sakugabooru.com/post/show/1070", "indices": [114, 137]}]}, "source": "<a href='http://twitter.com/download/android' rel='nofollow'>Twitter Android</a>", "in_reply_to_status_id": null, "in_reply_to_status_id_str": "en", "in_reply_to_user_id": null, "in_reply_to_user_id_str": null, "in_reply_to_screen_name": null, "user": {"id": 2809289914, "id_str": "2809289914", "name": "peppy ", "screen_name": "fluttershoot", "location": null, "description": "hi peppy! personal twitter post everything! 
pfp @EliseraArt ", "url": null, "entities": {"description": {"urls": []}}, "protected": false, "followers_count": 628, "friends_count": 2396, "listed_count": 4, "created_at": "Sun Oct 05 23:02:50 +0000 2014", "favourites_count": 80025, "utc_offset": null, "time_zone": null, "geo_enabled": false, "verified": false, "statuses_count": 10299, "lang": null, "contributors_enabled": false, "is_translator": false, "is_translation_enabled": false, "profile_background_color": "C0DEED", "profile_background_image_url": "http://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_image_url_https": "https://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_tile": false, "profile_image_url": "http://pbs.twimg.com/profile_images/1386826629217927168/OcENzEZW_normal.jpg", "profile_image_url_https": "https://pbs.twimg.com/profile_images/1386826629217927168/OcENzEZW_normal.jpg", "profile_banner_url": "https://pbs.twimg.com/profile_banners/2809289914/1629929357", "profile_link_color": "1DA1F2", "profile_sidebar_border_color": "C0DEED", "profile_sidebar_fill_color": "DDEEF6", "profile_text_color": "333333", "profile_use_background_image": true, "has_extended_profile": true, "default_profile": true, "default_profile_image": false, "following": false, "follow_request_sent": false, "notifications": false, "translator_type": "null", "withheld_in_countries": []}, "geo": null, "coordinates": null, "place": null, "contributors": null, "retweeted_status": {"created_at": "Sun Jan 10 17:00:40 +0000 2021", "id": 1348313822810013708, "id_str": "1348313822810013708", "text": "Key Animation: https://t.co/Rl0OlpzfN6 Series: Rise Teenage Mutant Ninja Turtles (2019) https://t.co/iBxNCrJNYp", "truncated": true, "entities": {"hashtags": [], "symbols": [], "user_mentions": [], "urls": [{"url": "https://t.co/Rl0OlpzfN6", "expanded_url": "https://pastebin.com/raw/tqwCF1Ue", "display_url": "pastebin.com/raw/tqwCF1Ue", "indices": [15, 38]}, {"url": "https://t.co/iBxNCrJNYp", 
"expanded_url": "https://twitter.com/i/web/status/1348313822810013708", "display_url": "twitter.com/i/web/status/1", "indices": [96, 119]}]}, "source": "<a href='https://www.hootsuite.com' rel='nofollow'>Hootsuite Inc.</a>", "in_reply_to_status_id": null, "in_reply_to_status_id_str": null, "in_reply_to_user_id": null, "in_reply_to_user_id_str": null, "in_reply_to_screen_name": null, "user": {"id": 835969639851126784, "id_str": "835969639851126784", "name": "randomsakuga", "screen_name": "randomsakuga", "location": null, "description": "Providing good animation timeline. The medias taken @sakugabooru", "url": "https://t.co/vw6ZEEYgAU", "entities": {"url": {"urls": [{"url": "https://t.co/vw6ZEEYgAU", "expanded_url": "https://sakugabooru.com/post", "display_url": "sakugabooru.com/post", "indices": [0, 23]}]}, "description": {"urls": []}}, "protected": false, "followers_count": 258337, "friends_count": 33, "listed_count": 1618, "created_at": "Sun Feb 26 21:47:48 +0000 2017", "favourites_count": 6, "utc_offset": null, "time_zone": null, "geo_enabled": false, "verified": true, "statuses_count": 9668, "lang": null, "contributors_enabled": false, "is_translator": false, "is_translation_enabled": false, "profile_background_color": "000000", "profile_background_image_url": "http://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_image_url_https": "https://abs.twimg.com/images/themes/theme1/bg.png", "profile_background_tile": false, "profile_image_url": "http://pbs.twimg.com/profile_images/840302059362615296/TaVA2uei_normal.jpg", "profile_image_url_https": "https://pbs.twimg.com/profile_images/840302059362615296/TaVA2uei_normal.jpg", "profile_banner_url": "https://pbs.twimg.com/profile_banners/835969639851126784/1489177763", "profile_link_color": "ABB8C2", "profile_sidebar_border_color": "000000", "profile_sidebar_fill_color": "000000", "profile_text_color": "000000", "profile_use_background_image": false, "has_extended_profile": false, "default_profile": 
false, "default_profile_image": false, "following": false, "follow_request_sent": false, "notifications": false, "translator_type": "null", "withheld_in_countries": []}, "geo": null, "coordinates": null, "place": null, "contributors": null, "is_quote_status": false, "retweet_count": 2009, "favorite_count": 8910, "favorited": false, "retweeted": false, "possibly_sensitive": false, "possibly_sensitive_appealable": false, "lang": "en"}, "is_quote_status": false, "retweet_count": 2009, "favorite_count": 0, "favorited": false, "retweeted": false, "possibly_sensitive": false, "possibly_sensitive_appealable": false, "lang": "en"}
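For future readers: the stop-word removal mentioned in the update could be sketched roughly as below. This is not the program that was actually run (it was not posted); the `STOP_WORDS` set and the function names are hypothetical, and the sketch assumes only the `"text"` field should be cleaned, so that every line stays valid JSON.

```python
import json

# Hypothetical, deliberately tiny stop-word list (an assumption for illustration);
# a real run would likely use a fuller list.
STOP_WORDS = {"a", "an", "the", "of", "is", "to", "and", "in"}


def strip_stop_words(text):
    """Drop stop words from a whitespace-tokenised string (case-insensitive)."""
    return " ".join(w for w in text.split() if w.lower() not in STOP_WORDS)


def clean_jsonl_line(line):
    """Parse one JSONL record, clean only its "text" field, and re-serialise it."""
    tweet = json.loads(line)
    if isinstance(tweet.get("text"), str):
        tweet["text"] = strip_stop_words(tweet["text"])
    # json.dumps escapes embedded newlines, so the record stays on one line.
    return json.dumps(tweet)
```

Parsing each line and re-serialising it, rather than editing the raw text, leaves keys, IDs, and nested objects untouched.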
Found the solution.
Hi Jon,
Apologies that I didn't get around to helping you earlier.
Could you briefly describe what the underlying problem was, and your solution?
(For future reference)
Cheers
I am not an expert in using Tweepy, but from what I gather, some of the Tweet IDs led to broken statuses. I removed these broken IDs and the program works fine now.
P.S. Sorry for the late reply!
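For future reference, one way to weed out such records before handing a JSONL dump to a downstream tool is sketched below. The comment above does not say what a "broken status" looked like, so this sketch assumes it means a line that is not valid JSON or that lacks some expected top-level keys; `REQUIRED_KEYS` and `split_valid_and_broken` are hypothetical names, not part of Tweepy or BirdSpotter.

```python
import json

# Assumption: a usable tweet record carries at least these top-level keys.
REQUIRED_KEYS = ("id_str", "text", "user")


def split_valid_and_broken(lines):
    """Return (valid_records, broken_line_numbers) for an iterable of JSONL lines."""
    valid, broken = [], []
    for number, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # blank lines are ignored, not reported as broken
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            broken.append(number)
            continue
        if isinstance(record, dict) and all(key in record for key in REQUIRED_KEYS):
            valid.append(record)
        else:
            broken.append(number)
    return valid, broken
```

Usage would be along the lines of opening the dump with `open(path, encoding="utf-8")`, passing the file object to `split_valid_and_broken`, and writing the valid records back out with `json.dumps`, one per line.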