kelvinxuande / deprecated_glassdoor-scraper
Web scraping the popular job listing site "Glassdoor" with Python and BeautifulSoup. Implemented from scratch.
Hello,
When running the script (after cloning, without modifying config.json), I got the following error:
[ERROR] Assumptions invalid
Traceback (most recent call last):
  File "<path>/glassdoor-scraper-master/src/main.py", line 68, in <module>
    maxJobs, maxPages = extract_maximums(base_url)
  File "<path>/glassdoor-scraper-master/src/packages/page.py", line 31, in extract_maximums
    return(int(maxJobs), int(maxPages))
ValueError: invalid literal for int() with base 10: ''
I believe the Glassdoor frontend has changed, so the code in this project needs to be updated.
Best,
Gauthier
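The ValueError in the traceback above comes from calling int() on an empty string, which suggests the scraper found no text where it expected the job and page counts. A minimal sketch of a more defensive conversion follows. The helper names parse_count and safe_maximums are hypothetical and not part of the project, and the input strings are assumed examples rather than Glassdoor's actual markup.

```python
import re


def parse_count(text, default=None):
    """Return the first integer found in text, or default if there is none."""
    # Accept digits with optional thousands separators, e.g. "1,234".
    match = re.search(r"\d[\d,]*", text or "")
    if match is None:
        return default
    return int(match.group(0).replace(",", ""))


def safe_maximums(raw_jobs, raw_pages):
    """Convert two scraped strings to ints, failing with a readable message.

    Hypothetical stand-in for the int(maxJobs), int(maxPages) step shown
    in the traceback; it does not fetch or parse any page itself.
    """
    max_jobs = parse_count(raw_jobs)
    max_pages = parse_count(raw_pages)
    if max_jobs is None or max_pages is None:
        raise RuntimeError(
            "Could not read job/page counts; the page markup may have changed."
        )
    return max_jobs, max_pages


if __name__ == "__main__":
    print(safe_maximums("1,234 Jobs", "30"))
```

This does not fix a changed frontend; it only replaces the bare ValueError with an error that points at the likely cause.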
Hi, let me tell you this is great code and works very well. I'm trying to scrape Glassdoor job offers for Australia and had to implement only one minor change in encoding:
I want to add one more variable with salary range info. I modified the code successfully, but it gives me NA results (I ran a test with the same class as Location and it worked, so something is wrong with the span class name). The screenshots show the part I'm struggling to read.
Would you be able to advise me why it doesn't work this way?
P.S. Forgive me if this is the wrong place or the wrong way to make such a request. I'm a Python beginner and just created a GitHub account to be able to comment on this thread.
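One way to investigate the NA results described above is to check which element, and which class attribute, actually wraps the salary text in the HTML the script receives. The sketch below is a diagnostic aid only. It uses the standard library's html.parser instead of BeautifulSoup to stay dependency-free, and the function name, sample HTML, and class names are all made up for illustration.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not go on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class ClassFinder(HTMLParser):
    """Record the innermost open element around each text node containing needle."""

    def __init__(self, needle):
        super().__init__()
        self.needle = needle
        self.stack = []  # (tag, class attribute or None) for each open element
        self.hits = []

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append((tag, dict(attrs).get("class")))

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating sloppy markup.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        if self.needle in data and self.stack:
            self.hits.append(self.stack[-1])


def find_enclosing_elements(html, needle):
    """Return (tag, class) pairs for elements whose text contains needle."""
    finder = ClassFinder(needle)
    finder.feed(html)
    finder.close()
    return finder.hits


if __name__ == "__main__":
    # Made-up markup; real Glassdoor class names will differ.
    sample = ('<div class="jobInfo"><span class="loc">Sydney</span>'
              '<span class="css-1abc salary">$80K - $100K</span></div>')
    print(find_enclosing_elements(sample, "$"))
```

If the function returns a class value different from the one used in the scraper, the selector is the likely problem. If it returns an empty list, the salary text is not present in the HTML the script received, which would also produce NA.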