Giter VIP home page Giter VIP logo

massivedata-course-resources's Introduction

Course-Resources

This github repository contains Course Resources and Instructions for Setup, Configuration and Deployment. This is the primary repository for the course.

There are also other repositories containing source code, scripts, installation and configuration instructions to accompany the course.

I will share the links below to those repositories:

Additional Resources and Repositories

Infrastructure, Data Sets, Tools, Postman Collections and Local DNS

In this current repository, there are instructions for configuring the environment, loading up the sample datasets, installing your IDE, sample Postman collections for interacting with REST endpoints and a utility script for configuring your local DNS.

Data Generators

The repository contains a set of micro services that simulates realtime purchase orders for customers of an online grocery store. There is simulation of:

  • new orders
  • order fullfillment and shipments
  • order deliveries
  • order returns
  • inventory replenishment by supplies and many more

Apache Spark Analysis of Bounded Data Sets

This repository contains a batch analysis job with Apache spark that processes a fixed, bounded data set of grocery products and performs a bulk enrichment of the product items. It pulls the set of data from a MongoDB database, performs a join of the datasets and then saves the merged resultset into a new MongoDB collection

Kafka Streams Analysis of Unbounded Data Streams

This repo contains a set of micro services that perform realtime analysis of unbounded streams of data to simulate realtime analysis, joins and enrichment of product and order data. It demonstrates how Kafka Streams can be used to perform realtime joins with fact streams and dimension streams.

KSQL Data Analysis of Realtime Data Streams

The repository is a collection of DDL and DML statements used to perform data analysis and stream processing via KSQL and KSQLDB.

Sales Data Web API

The repository contains a Spring Boot Java 11 application that contains micro services that report the data from the relational database as JSON. This will be used by the Angular Web Application to show reports and data visualization

Sales Data Reporting and Visualization

Check out this repo for a sample application that uses Node.js, Angular 11 and D3.js to create a web application that shows reports and real time data visualizations.

massivedata-course-resources's People

Contributors

izzyacademy avatar israelekpo avatar

Watchers

 avatar

Forkers

rossanomarcos

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.