Giter VIP home page Giter VIP logo

ml-data-pipeline-kafka's Introduction

confluent-kafka-python

This repo help us to know how to publish and consume data to and from kafka confluent in json format.

Step 1: Create a conda environment

conda --version

Step2: Create a conda environment

conda create -p mlproj1 python==3.8 -y

Step3:

conda activate mlproj1/

OR

conda create --prefix ./env python=3.8 -y
conda activate ./env

OR

source activate ./env

Step4:

pip install -r requirements.txt

Below repo help you to obtain requried credentials

https://github.com/Big-Data-01/confluent-tutorial.git

Cluster Environment Variable

API_KEY
API_SECRET_KEY
BOOTSTRAP_SERVER

Schema related Environment Variable

SCHEMA_REGISTRY_API_KEY
SCHEMA_REGISTRY_API_SECRET
ENDPOINT_SCHEMA_URL

Data base related Environment Variable

MONGO_DB_URL

Update the credential in .env file and run below command to run your application in docker container

Create .env file in root dir of your project if it is not available paste the below content and update the credentials

API_KEY=xxxxxxx
API_SECRET_KEY=xxxxxxxxxxxx
BOOTSTRAP_SERVER=xxxxxxxx
SCHEMA_REGISTRY_API_KEY=xxxxxxx
SCHEMA_REGISTRY_API_SECRET=xxxxxxxxxxxxxx
ENDPOINT_SCHEMA_URL=xxxxxxxx
MONGO_DB_URL=xxxxxxxxxxxxxx

Build docker image

docker build -t data-pipeline:lts .

For linux or mac Run docker image

docker run -it -v $(pwd)/logs:/logs  --env-file=$(pwd)/.env data-pipeline:lts

Git repo commands

git init
git add README.md
git commit -m "first commit"
git branch -M main
git remote add origin https://github.com/pallavi176/ML-Data-Pipeline-Kafka.git
git push -u origin main

git restore --staged <file_name>

ml-data-pipeline-kafka's People

Contributors

pallavi176 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.