Projects

Welcome to my projects overview, with a short description of every project. The technical requirements are listed at the end of this file.


Project 1: EDA Project

King County Real Estate

This notebook is about my first EDA project during the Data Science bootcamp at neue fische. The task was to recommend suitable properties for a pre-selected client. The dataset included information on the number of rooms and bathrooms, year of construction, size of the house and property, and budget. Before starting the EDA, each bootcamp participant chose one client, who came with a given budget and preferences regarding, among other things, size and number of rooms and bathrooms.

Notebook: Jupyter Notebook

Images: Images, plots and maps.

Dataset: I used the King County House Sales dataset. Although the focus is on EDA, the project was required to demonstrate an entire data science lifecycle using linear regression. The task was to perform an extensive EDA and to train an explanatory linear regression model, not only to explain the data but also to evaluate how well the model fits it.
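As a rough illustration of what "explanatory linear regression" and "evaluating the fit" can mean, the sketch below fits a one-predictor ordinary least squares line and reports R². It is not taken from the notebook: the function names and the five data points are hypothetical, made-up values, not rows from the King County dataset.

```python
from statistics import mean


def fit_simple_ols(x, y):
    """Fit y = intercept + slope * x by ordinary least squares."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def r_squared(x, y, intercept, slope):
    """Share of the variance in y that the fitted line explains."""
    y_bar = mean(y)
    ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - y_bar) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


# Made-up toy values for illustration only (living area vs. sale price).
sqft = [1000, 1500, 2000, 2500, 3000]
price = [200_000, 290_000, 410_000, 500_000, 600_000]

intercept, slope = fit_simple_ols(sqft, price)
r2 = r_squared(sqft, price, intercept, slope)
print(f"price = {intercept:.0f} + {slope:.1f} * sqft, R^2 = {r2:.3f}")
```

The slope is the explanatory part (estimated price change per additional square foot), while R² is one simple measure of how well the model fits the data.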

Stakeholder Presentation: PDF file



Project 2: Machine Learning

AirQo Ugandan Air Quality Forecast Challenge

This notebook is about my second project during the Data Science bootcamp at neue fische.

The task was to predict the air quality in the Ugandan capital Kampala, which is relevant for local TV stations, for example, and thereby to give early warning of harmful fine dust concentrations. The dataset included weather and wind data as well as measurement data from the external provider AirQo. The project comprised EDA and data visualization, a "small time series", the selection of different machine learning models, and a presentation of the results at the end of the project.

In contrast to the first project, this was group work, partly with pair programming and error analysis, as is part of the normal working day of a Data Analyst or Data Scientist. The bootcamp participants could choose from a set of given projects and form groups.

Notebook: Jupyter Notebook

Images: Images, plots and maps.

Dataset: The dataset comes from a challenge hosted on Zindi, a data science competition platform with the mission of building the data science ecosystem in Africa. The objective of the challenge is to accurately forecast air quality (measured as PM2.5 in µg/m³) for each hour of the coming 25 hours across five locations in Kampala, Uganda. Forecasts are based on the past 5 days of hourly air quality measurements at each site. Zindi provided .csv files with train and test data as well as metadata with location details. The metadata was not used in this project.
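One common way to frame such a forecasting task for a machine learning model is to cut an hourly series into pairs of "past window" and "future values". The sketch below shows that idea with the window length (5 days) and forecast length (25 hours) taken from the description above. It is an assumption about the framing, not the notebook's actual code: the function name `make_windows` and the synthetic series are hypothetical.

```python
def make_windows(series, input_len, horizon):
    """Split a series into (past window, future values) training pairs.

    Each pair holds `input_len` consecutive readings and the `horizon`
    readings that directly follow them.
    """
    pairs = []
    last_start = len(series) - input_len - horizon
    for start in range(last_start + 1):
        past = series[start:start + input_len]
        future = series[start + input_len:start + input_len + horizon]
        pairs.append((past, future))
    return pairs


HOURS_PER_DAY = 24
input_len = 5 * HOURS_PER_DAY  # past 5 days of hourly measurements
horizon = 25                   # hours to forecast, as stated in the description

# Synthetic hourly series for illustration only, not AirQo measurements.
series = [float(hour % HOURS_PER_DAY) for hour in range(200)]

pairs = make_windows(series, input_len, horizon)
print(len(pairs), len(pairs[0][0]), len(pairs[0][1]))
```

Each past window can then serve as the feature vector and the future values as the targets for whichever model is selected.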

Stakeholder Presentation: PDF file



Technical preparation

Requirements

pyenv
python==3.9.4

Setup

Use the following commands to create the virtual environment and install the dependencies:

python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
