Light

saimmehmood / covid_19_risk Goto Github PK

View Code? Open in Web Editor NEW

0.0 1.0 3.0 9.99 MB

Utilizing trajectories to find risk of geographical areas being infected by COVID-19 (PostgreSQL/PostGIS approach)

Home Page: https://towardsdatascience.com/covid-19-risk-of-geographical-areas-being-infected-a81938a5e286

Python 100.00%

covid_19_risk's Introduction

COVID-19: Risk of Geographical Areas being infected

This experimental project was done as a use-case of my research paper (https://github.com/saimmehmood/semantic_relationships) to predict COVID-19 infection hotspots for a probable second wave of cases in Manhattan area.

Key Points

We began at exploring a geographical space and looking at movement path of people i.e, trajectories.
We divided geographical space into uniform grid cells and discussed the trade-off between having smaller or larger grid sizes.
Further, we elaborated on how dividing geographical space enables us to translate trajectories from a set of geolocation coordinates into a set of grid cells.
Lastly, we performed various Postgres/PostGIS queries to find grid cells through which trajectories are traversing.
In the end, python (numpy, pandas) enable us in putting trajectories belonging to each cell together and we rank them through a bash command.

Data Preparation

The dataset I’m using is released by NYC Taxi and Limousine Commission (TLC), which includes pickup & dropoff time, geo-coordinates, number of passengers, and several other features.

The data-set file contains 1,048,576 trip records containing pickup and dropoff points as pairs of (longitude, latitude) coordinates. - For the needs of this experiment, I rely on random sample S that includes 10,000 pairs.
For each pair of pickup and dropoff coordinates, I utilize Google Directions API (new_york_taxi.py) to create trajectories in NYC.

Conclusion

In essence, this work provides a simple, easy-to-follow proof-of-concept in finding areas that are at a high risk of being affected by COVID-19.

Link to medium post:

https://towardsdatascience.com/covid-19-risk-of-geographical-areas-being-infected-a81938a5e286

Link to the gists:

https://gist.github.com/saimmehmood

covid_19_risk's People

Contributors

Watchers

Forkers

fedmask xiang526 anabossler

covid_19_risk's Issues

hi, when running new_york_taxi, a error appeared: ModuleNotFoundError: No module named 'calculation', at the same time, I want to know how to plot the grid of the city in tne map

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.