This guide shows how to deploy a data streaming pipeline on AWS that enriches and transforms ingested logs using the Kinesis and Glue services.
In this PoC, a Python script generates logs that are ingested by a Kinesis Data Stream. Each simulated log contains a “port_number” field. A Kinesis Data Analytics application transforms the log data from the input Kinesis Data Stream and writes the curated logs to a second Kinesis Data Stream, enriching each record with a “tag” field whose value depends on the “port_number” field. A Kinesis Data Firehose delivery stream reads from the curated-log stream and delivers the data to an S3 bucket, where it is partitioned by year, month, day, and hour.
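The log generation and enrichment described above can be sketched as follows. This is a minimal, hypothetical example: the actual port-to-tag mapping and record schema used in the PoC may differ, and the names `PORT_TAGS`, `generate_log`, and `enrich` are illustrative.

```python
import json
import random

# Hypothetical port-to-tag mapping; the PoC's actual mapping may differ.
PORT_TAGS = {22: "ssh", 80: "http", 443: "https"}

def generate_log() -> dict:
    """Simulate a raw log record containing a 'port_number' field."""
    return {"port_number": random.choice(list(PORT_TAGS) + [8080])}

def enrich(record: dict) -> dict:
    """Add a 'tag' field derived from 'port_number', mirroring what the
    Kinesis Data Analytics application does to produce curated logs."""
    tagged = dict(record)
    tagged["tag"] = PORT_TAGS.get(record["port_number"], "other")
    return tagged

# A producer would serialize each record and put it onto the input stream,
# e.g. with boto3: kinesis.put_record(StreamName=..., Data=json.dumps(record),
# PartitionKey=str(record["port_number"])) -- this requires AWS credentials,
# so it is shown here only as a comment.
```

In the deployed pipeline this tagging runs inside Kinesis Data Analytics rather than in the producer script; the sketch just illustrates the mapping logic.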
A detailed walkthrough of the deployment steps can be found here (https://quip-amazon.com/9y2SAbT7CWRS#XQA9AAC7E4R).
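The year/month/day/hour partitioning in S3 corresponds to the hour-granularity key prefix that Firehose applies by default when delivering to S3 (based on the record's UTC arrival time). A small sketch of how such a prefix is formed:

```python
from datetime import datetime, timezone

def s3_prefix(ts: datetime) -> str:
    """Build the YYYY/MM/DD/HH/ key prefix under which Firehose
    places delivered objects in the S3 bucket (UTC)."""
    return ts.strftime("%Y/%m/%d/%H/")

# A record delivered at 2023-05-04 09:15 UTC would land under "2023/05/04/09/".
print(s3_prefix(datetime(2023, 5, 4, 9, 15, tzinfo=timezone.utc)))
```

This hour-level layout is what allows Athena (via the Glue table) to prune partitions when querying the curated logs.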
The stack deploys the following resources:

Logical ID | Type |
---|---|
AnalyticsApplication | AWS::KinesisAnalyticsV2::Application |
AnalyticsServiceExecutionRole | AWS::IAM::Role |
AthenaWorkgroup | AWS::Athena::WorkGroup |
FirehoseServiceExecutionRole | AWS::IAM::Role |
FirehoseStream | AWS::KinesisFirehose::DeliveryStream |
GlueDatabase | AWS::Glue::Database |
GlueTable | AWS::Glue::Table |
InputStream | AWS::Kinesis::Stream |
OutputStream | AWS::Kinesis::Stream |
S3Logs | AWS::S3::Bucket |