
data-engineer-camp-project-1-arodriguez's Introduction


Financial Analytics ELT

Data Engineer Camp - Group Project #1 - Oct 2022


Objective

This is a data engineering pipeline that extracts, loads and transforms multiple sets of economic and financial data from the Alpha Vantage API.

Depending on the use case, the output of the pipeline could be used by data and business analysts, investment managers and other finance professionals, as well as data and finance journalists.

The data lets analysts investigate correlations between these data points and form hypotheses about causation. It also shows the financial and economic changes that coincided with black swan events such as the COVID-19 outbreak that began in late 2019 and Russia’s invasion of Ukraine in 2022.

Source datasets

The pipeline currently contains the following datasets:

  • Foreign Exchange (FX): daily exchange rates of different countries' currencies against the US Dollar. The pipeline currently defaults to AUD, EUR, JPY, RUB and GBP.

  • Digital & Crypto Currencies: daily exchange rates of different digital currencies against the US Dollar. The pipeline currently defaults to BTC, ETH and DOGE.

  • US Treasury yields for various maturities. The pipeline currently covers the 3-month, 2-year, 5-year, 7-year and 10-year maturities.
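As a rough illustration of how the default datasets above might map onto API requests, the sketch below builds one set of query parameters per dataset. The function and parameter names (`FX_DAILY`, `DIGITAL_CURRENCY_DAILY`, `TREASURY_YIELD`, `from_symbol`, `maturity` and so on) reflect our reading of Alpha Vantage's public documentation and are assumptions to verify there. `build_request_params` is a hypothetical helper, not a function from this repository.

```python
from typing import Dict, List

# Assumed endpoint and defaults; check parameter names against the
# Alpha Vantage documentation before relying on them.
BASE_URL = "https://www.alphavantage.co/query"
FX_CURRENCIES = ["AUD", "EUR", "JPY", "RUB", "GBP"]
CRYPTO_SYMBOLS = ["BTC", "ETH", "DOGE"]
TREASURY_MATURITIES = ["3month", "2year", "5year", "7year", "10year"]


def build_request_params(api_key: str) -> List[Dict[str, str]]:
    """Return one query-parameter dict per default dataset (hypothetical helper)."""
    params: List[Dict[str, str]] = []

    # FX rates: the quote direction (currency -> USD) is an assumption.
    for currency in FX_CURRENCIES:
        params.append({
            "function": "FX_DAILY",
            "from_symbol": currency,
            "to_symbol": "USD",
            "apikey": api_key,
        })

    # Digital currencies priced in USD.
    for symbol in CRYPTO_SYMBOLS:
        params.append({
            "function": "DIGITAL_CURRENCY_DAILY",
            "symbol": symbol,
            "market": "USD",
            "apikey": api_key,
        })

    # Treasury yields, one request per maturity.
    for maturity in TREASURY_MATURITIES:
        params.append({
            "function": "TREASURY_YIELD",
            "interval": "daily",
            "maturity": maturity,
            "apikey": api_key,
        })

    return params
```

Each dict could then be passed to `requests.get(BASE_URL, params=...)`. Keeping the parameter construction separate from the HTTP call makes it testable without network access.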

Architecture

  • Object-oriented Python extraction using requests and pandas, with integrated logging, testing, and YAML configuration for user input
  • Upsert load to Postgres with SQLAlchemy, and database transformations using modularized SQL with Jinja templating
  • Dockerized ELT pipeline served on AWS using ECS, ECR, S3, RDS, and IAM
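To make the upsert step more concrete, here is a minimal sketch of the kind of statement an upsert load into Postgres typically issues. It uses only the standard library to build an `INSERT ... ON CONFLICT ... DO UPDATE` string with `:name` bind placeholders, the style SQLAlchemy's `text()` accepts. The helper `build_upsert_sql` and the table and column names in the usage note are hypothetical; the repository's actual implementation may differ.

```python
from typing import Sequence


def build_upsert_sql(
    table: str,
    columns: Sequence[str],
    key_columns: Sequence[str],
) -> str:
    """Build a Postgres upsert statement (hypothetical helper).

    Identifiers are interpolated directly, so they must come from trusted
    configuration, never from user input. Row values are left as named
    bind parameters.
    """
    non_keys = [c for c in columns if c not in key_columns]
    column_list = ", ".join(columns)
    placeholders = ", ".join(f":{c}" for c in columns)
    conflict_target = ", ".join(key_columns)

    if non_keys:
        # On a key collision, overwrite the non-key columns with the new row.
        updates = ", ".join(f"{c} = EXCLUDED.{c}" for c in non_keys)
        action = f"DO UPDATE SET {updates}"
    else:
        # Nothing to update when every column is part of the key.
        action = "DO NOTHING"

    return (
        f"INSERT INTO {table} ({column_list}) "
        f"VALUES ({placeholders}) "
        f"ON CONFLICT ({conflict_target}) {action}"
    )
```

For example, `build_upsert_sql("fx_daily", ["date", "currency", "close"], ["date", "currency"])` yields a statement that inserts new rows and updates `close` for rows whose `(date, currency)` pair already exists, which is what keeps reruns of the pipeline idempotent. The target table needs a unique constraint or primary key on the conflict columns.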

[Figure: architecture diagram]

Instructions

Environment

The pipeline can be forked and run either locally or in Docker.

Preconditions

Pass in the secret variables to run the script. The script requires an Alpha Vantage API key (api_key, obtained by filling in Alpha Vantage's API key request form) and the credentials of the Postgres database to load the data into (target_db_user, target_db_password, target_db_server_name and target_db_database_name).
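One possible way to supply these values is through environment variables. The sketch below assumes that approach (the source does not say how the secrets are passed) and checks that all five names are present before building a Postgres connection URL. `load_secrets` and `postgres_url` are hypothetical helpers, and the URL omits the port, so the driver default would apply.

```python
import os
from typing import Dict, Mapping
from urllib.parse import quote_plus

# Secret names taken from the preconditions above.
REQUIRED = (
    "api_key",
    "target_db_user",
    "target_db_password",
    "target_db_server_name",
    "target_db_database_name",
)


def load_secrets(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Read the required secrets, failing early if any is missing or empty."""
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise KeyError(f"Missing secret variables: {', '.join(missing)}")
    return {name: env[name] for name in REQUIRED}


def postgres_url(secrets: Mapping[str, str]) -> str:
    """Build a connection URL; user and password are percent-encoded."""
    user = quote_plus(secrets["target_db_user"])
    password = quote_plus(secrets["target_db_password"])
    host = secrets["target_db_server_name"]
    database = secrets["target_db_database_name"]
    return f"postgresql://{user}:{password}@{host}/{database}"
```

Encoding the password matters because characters such as `@` or `/` would otherwise be parsed as URL delimiters. The resulting string is in the form SQLAlchemy's `create_engine` accepts.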

data-engineer-camp-project-1-arodriguez's People

Contributors

paulhalla, u4587426, infinit3labs, arodriguez2018
