Giter VIP home page Giter VIP logo

comphy / beware-web-scraper Goto Github PK

View Code? Open in Web Editor NEW

This project forked from prankshaw/beware-web-scraper

0.0 0.0 0.0 135 KB

Web Scraping project including; C projects scraper from GitHub , ICC rankings scraper, YouTube Trending Scrapper, LinkedIn Profile Scraper, Wikipedia Image Scraper

Home Page: https://prankshaw.github.io/Beware-web-scraper/

License: MIT License

Python 100.00%

beware-web-scraper's Introduction

Visit The project here Contributions Welcome

https://prankshaw.github.io/Beware-web-scraper/

Build Status Documentation Status Code style: black codecov License: MIT Issues Open Forks Stars Twitter URL

Scrapers available

    C-project-scraper

    Scrapes the top projects for 'C' language from github. It can be extended to get projects in any language present on GitHub.

    ICC Rankings-Scraper

    Tells about top 100 ranked batsmen from all over the world for all 3 formats, i.e. Test cricket, One day International and T20 International.

    Youtube Trending-Scraper

    Scrapes all the information from trending section of youtune, including video name, description available and video liks

    LinkedIn-Scraper

    Automatically LogIn to the profile and scrapes the relavant information from profile, including name, location, title, connections and more

    Wikipedia Image-Scraper

    Scrapes links of all the images present in the given wikipedia page and prints them

These project use selenium driver.

To use project

Just fork the project and the install the prerequisities.
Simply run, if present in jupyter notebook, else follow below mentioned steps.
Python (I am using Python 3.x). After downloading python, pip all the requirements(if any).
Selenium Webdriver for Google Chrome: Chromedriver โ€“ Download it and place it anywhere on your machine.
pip install selenium
pip install pandas

Change path of 'chromedriver' with your own path.
Just run in IDLE and see the output

License

Licensed under MIT-license https://prankshaw.mit-license.org/

beware-web-scraper's People

Contributors

prankshaw avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.