justmarkham / trump-lies Goto Github PK
View Code? Open in Web Editor NEWTutorial: Web scraping in Python with Beautiful Soup
Home Page: https://www.dataschool.io/python-web-scraping-of-president-trumps-lies/
Tutorial: Web scraping in Python with Beautiful Soup
Home Page: https://www.dataschool.io/python-web-scraping-of-president-trumps-lies/
C:\Python\pythonw.exe "C:/Users/nitin tyagi/PycharmProjects/untitled/venv/scr2.py"
Traceback (most recent call last):
File "C:/Users/nitin tyagi/PycharmProjects/untitled/venv/scr2.py", line 16, in
import pandas as pd
File "C:\Python\lib\site-packages\pandas_init_.py", line 42, in
from pandas.core.api import *
File "C:\Python\lib\site-packages\pandas\core\api.py", line 10, in
from pandas.core.groupby.groupby import Grouper
File "C:\Python\lib\site-packages\pandas\core\groupby_init_.py", line 2, in
from pandas.core.groupby.groupby import (
File "C:\Python\lib\site-packages\pandas\core\groupby\groupby.py", line 49, in
from pandas.core.frame import DataFrame
File "C:\Python\lib\site-packages\pandas\core\frame.py", line 74, in
from pandas.core.series import Series
File "", line 983, in _find_and_load
File "", line 967, in _find_and_load_unlocked
File "", line 677, in _load_unlocked
File "", line 724, in exec_module
File "", line 857, in get_code
File "", line 525, in _compile_bytecode
ValueError: bad marshal data (unknown type code)
Process finished with exit code 1
please help me
my code is
import requests
r = requests.get('https://www.nytimes.com/interactive/2017/06/23/opinion/trumps-lies.html')
from bs4 import BeautifulSoup
soup = BeautifulSoup(r.text, 'html.parser')
results = soup.find_all('span', attrs={'class':'short-desc'})
records = []
for result in results:
date = result.find('strong').text[0:-1] + ', 2017'
lie = result.contents[1][1:-2]
explanation = result.find('a').text[1:-1]
url = result.find('a')['href']
records.append((date, lie, explanation, url))
import pandas as pd
df = pd.DataFrame(records, columns=['date', 'lie', 'explanation', 'url'])
df['date'] = pd.to_datetime(df['date'])
df.to_csv('trump_lies.csv', index=False, encoding='utf-8')
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.