Giter VIP home page Giter VIP logo

coronavirus-stats's Introduction

Top 10

CSV Data on Coronavirus (COVID-19)

This repository contains data in (CSV format) which are scraped from reliable sources (e.g. World Health Organisation).

Aggregate sites

Below are international stats, excluding China.

CSV direct link

Bar chart of the latest snapshot.

WHO & Government sites

From WHO (World Health Organisation) Situation reports

Data are scraped from these reports which are in PDF formats. New reports are released daily.

Globally confirmed cases

CSV direct link

Stats from China

This page has the realtime stats from China. Data are pulled several times a day by the pipeline.

All cases in China

CSV direct link

Stats from Australia

Data is pulled from Department of Health website.

Cases in Australia

CSV direct link

Stats from Singapore

Data are scraped from the MOH (Ministry of Health) local situation web page.

Cases in Singapore

CSV direct link

From US CDC (Centers for Disease Control and Prevention)

Cases in the US (data are scraped from here)

CSV direct link

Chart for US is not plotted due to change in the way stats are collected.

How it works

  • Jupyter notebooks are used for scraping data and output to CSV files
  • These notebooks are executed on a schedule by Github Actions pipeline to scrape new data
  • This pipeline also commits back new data to this repository

Development

  • Tools: Python3, Jupyter, Pandas, BeautifulSoup and related stuff (e.g. Selenium for web-scraping). It is recommended to start the development environment with this docker image, which is also used for the Github Actions build pipeline.
docker run  -p 8888:8888 -it -v $PWD:/stats -w /stats alext234/datascience:latest  bash 
pip install -r requirements.txt
  • Start Jupyter notebook from inside the container and then visit the browser at http://localhost:8888
jupyter notebook --allow-root --ip=0.0.0.0

Contributions

  • Feel free to create new issues for any potential data source worth scraping.
  • Pull requests are welcomed!

Repo status and stats

  • Stargazers

GitHub stars

  • Last update from pipeline

Last update

  • Pipeline status

Run notebooks and commit back data/charts

coronavirus-stats's People

Contributors

abboudalomari avatar actions-user avatar akatie avatar alext234 avatar alext234team avatar bb1231 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.