careeratg Goto Github PK
Type: Organization
Type: Organization
Introduction to the data pipeline management with Airflow. Airflow schedule and maintain numerous ETL processes running on a large scale Enterprise Data Warehouse.
A curated list of awesome ETL frameworks, libraries, and software.
Developing end to end ETL process evolving dashboard and KPI's
My first project working with TALEND and PowerBI
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Oracle Database Sample Schemas
Used Talend as an ETL and orchestrator for jobs to load Banking data files and load Teradata Warehouse
ETL workflow and data analysis. ETL-workflow using prefect and pygrametl (SCD, slow changing dimension). Product classification based on product name.
Getting Started with Talend Open Studio for Data Integration, published by Packt
This repository contains analysis of IMDB data from multiple sources and analysis of movies/cast/box office revenues, movie brands and franchises.
Designed and developed Dimensional Model and performed data profiling, ETL operations for staging using Alteryx. Created data integration workflow in Talend to load data in AzureSQL and BigQuery DW and visualized analytical data reports dashboard using Tableau and PowerBI to get the insights and single truth story of the data
Implemented an Analytical Data Architecture (ADA) to create a single source of truth (SSOT) from multiple data sources such as CSV files and SQL databases for IMDb and the Numbers datasets
Scripts from Office Hours sessions, blog posts or anything else that needed a script :-)
Analysis of New York State Police Department Arrests dataset. Created Dimensional Model for the provided dataset. Using Alteryx and Talend, built ETL pipelines to process, clean the data and create dimensions and facts in the destination database. Further, visualized the necessary details of the database using Tableau and PowerBI.
PyMongo with FastAPI CRUD application
🐍 Quick reference guide to common patterns & functions in PySpark.
Pyspark RDD, DataFrame and Dataset Examples in Python language
Python data repo, jupyter notebook, python scripts and data.
As part of my Data Engineering professional development, I am developing an array of data pipelines which extract, transform and load the data from various sources such as this Quandal API, CSV, JSON and database such as Google BigQuery, Microsft SQL Server and PostgreSQL, just to name a few.
This Project aims at creating a data warehouse for e-commerce based company, transforming data in ETL tools like Alteryx and Talend and then performing analytics as per user requirements.
Snowflake Cookbook, published by Packt
ETL Repository for stocks parsing
Talend ETL & ESB
Published by Packt
Talend with Big Data
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.