This GitHub repository contains the course resources and the instructions for setup, configuration and deployment. It is the primary repository for the course.
Other repositories contain the source code, scripts, and installation and configuration instructions that accompany the course.
They are listed below:
- Infrastructure Setup Resources
- Data Sets
- Tools and Integrated Development Environment Setup
- Postman Collections
- Local DNS and Environment Variables
- Data Generators
- Apache Spark Data Batch Analysis
- Kafka Streams Real Time Streams Data Analysis
- KSQL Data Analysis
- Sales Data Web REST API
- Angular App for D3.js Visualization
This repository contains instructions for configuring the environment, loading the sample datasets and installing your IDE, along with sample Postman collections for interacting with the REST endpoints and a utility script for configuring your local DNS.
This repository contains a set of microservices that simulate real-time purchase orders for customers of an online grocery store. The simulated events include:
- new orders
- order fulfillment and shipments
- order deliveries
- order returns
- inventory replenishment by suppliers, and many more
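
As a rough illustration of what such a generator might do, the sketch below emits random events of the kinds listed above. It is not taken from the repository: the class name, the enum constants, the `customer=` field and the id range are all hypothetical, and the real services may publish events in a completely different shape.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

public class OrderEventGeneratorSketch {

    // Event kinds mirror the list above; the constant names are illustrative only.
    enum EventType { NEW_ORDER, FULFILLMENT, SHIPMENT, DELIVERY, RETURN, REPLENISHMENT }

    // Produces `count` pseudo-random events as plain strings.
    static List<String> generate(int count, long seed) {
        Random random = new Random(seed); // a fixed seed makes a run repeatable
        EventType[] types = EventType.values();
        List<String> events = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            EventType type = types[random.nextInt(types.length)];
            int customerId = 1 + random.nextInt(100); // hypothetical id range 1..100
            events.add(type + " customer=" + customerId);
        }
        return events;
    }

    public static void main(String[] args) {
        generate(5, 42L).forEach(System.out::println);
    }
}
```

Seeding the generator is a design choice for the sketch: it keeps test runs reproducible, whereas a live simulation would more likely use an unseeded source of randomness.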
This repository contains a batch analysis job built with Apache Spark that processes a fixed, bounded dataset of grocery products and performs a bulk enrichment of the product items. It pulls the data from a MongoDB database, joins the datasets and saves the merged result set into a new MongoDB collection.
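
The join step can be pictured with a small in-memory sketch. This is plain Java, not the Spark or MongoDB API, and it is not the repository's code: the field names (`productId`, `name`, `category`) and the choice of an inner join are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class BatchJoinSketch {

    // Inner hash join of two bounded datasets on a shared key.
    // Each document is modeled as a Map, loosely like a MongoDB document.
    static List<Map<String, String>> join(List<Map<String, String>> left,
                                          List<Map<String, String>> right,
                                          String key) {
        // Build phase: index the right-hand dataset by the join key.
        // If a key repeats on the right, the last document wins.
        Map<String, Map<String, String>> index = new HashMap<>();
        for (Map<String, String> doc : right) {
            index.put(doc.get(key), doc);
        }
        // Probe phase: look up each left-hand document and merge the match.
        List<Map<String, String>> merged = new ArrayList<>();
        for (Map<String, String> doc : left) {
            Map<String, String> match = index.get(doc.get(key));
            if (match == null) {
                continue; // inner join: drop documents with no match
            }
            Map<String, String> out = new LinkedHashMap<>(doc);
            out.putAll(match); // add the enrichment fields
            merged.add(out);
        }
        return merged;
    }

    public static void main(String[] args) {
        // Hypothetical sample data.
        List<Map<String, String>> products = List.of(
            Map.of("productId", "p1", "name", "Apples"),
            Map.of("productId", "p2", "name", "Oat Milk"));
        List<Map<String, String>> details = List.of(
            Map.of("productId", "p1", "category", "Produce"));
        System.out.println(join(products, details, "productId"));
    }
}
```

Because both inputs are bounded, the whole right-hand side can be indexed before any output is produced, which is the main contrast with the streaming joins described next.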
This repository contains a set of microservices that process unbounded streams of data to simulate real-time analysis, joins and enrichment of product and order data. It demonstrates how Kafka Streams can be used to perform real-time joins between fact streams and dimension streams.
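
One common way to model a fact-to-dimension join is to keep only the latest value per key from the dimension stream and look it up as each fact event arrives. The sketch below shows that idea in plain Java; it does not use the Kafka Streams API and is not the repository's code. The class, method names and the `orderId:name` output format are hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class StreamTableJoinSketch {

    // Latest known product name per product id (the dimension side).
    private final Map<String, String> productTable = new HashMap<>();
    // Enriched order events emitted so far.
    private final List<String> output = new ArrayList<>();

    // A dimension update replaces the current value for its key.
    void onProductUpdate(String productId, String name) {
        productTable.put(productId, name);
    }

    // A fact event is enriched with whatever the table holds at arrival time.
    void onOrder(String orderId, String productId) {
        String name = productTable.get(productId);
        if (name != null) { // inner join: skip orders for unknown products
            output.add(orderId + ":" + name);
        }
    }

    List<String> output() {
        return output;
    }

    public static void main(String[] args) {
        StreamTableJoinSketch join = new StreamTableJoinSketch();
        join.onOrder("o1", "p1");                 // dropped: p1 is not known yet
        join.onProductUpdate("p1", "Apples");
        join.onOrder("o2", "p1");                 // enriched with "Apples"
        join.onProductUpdate("p1", "Red Apples");
        join.onOrder("o3", "p1");                 // enriched with "Red Apples"
        System.out.println(join.output());        // prints [o2:Apples, o3:Red Apples]
    }
}
```

Unlike a batch join, the result depends on arrival order: an order that arrives before its product is known is never enriched in this sketch.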
This repository is a collection of DDL and DML statements used to perform data analysis and stream processing with KSQL and ksqlDB.
This repository contains a Spring Boot application (Java 11) whose microservices expose the data from the relational database as JSON. The Angular web application uses it to show reports and data visualizations.
Check out this repository for a sample web application built with Node.js, Angular 11 and D3.js that shows reports and real-time data visualizations.