Vaquar Khan's Projects
pyspark notes
Code for PySpark Tutorial
Interview questions on Spark concepts
PySpark Cookbook, published by Packt
A toolset to streamline running spark python on EMR
Example project implementing best practices for PySpark ETL jobs and applications.
Pyspark RDD, DataFrame and Dataset Examples in Python language
Code examples on Apache Spark using python
PySpark for Beginners by Packt Pyblishing
Generic Python/PySpark Process
Repository for code examples from my youtube channel and medium articles working with data in python on AWS
Analysis of 60+ million NYC Yellow Taxi Trips in 2018 using PySpark. The analysis utilizes a Spark cluster that is setup using GCP.
Installation of Pyspark using pip and brief introduction
Source code for 'PySpark Recipes' by Raju Kumar Mishra
An example PySpark project with pytest
PySpark testing example project
PySpark-Tutorial provides basic algorithms using PySpark
Git Repository
Exemplo de como usar o pyspark com aws deequ para data quality
Yu Long's note about spark and pyspark
Ravi Azure ADB ADF Repository
Introduction to Spark with Jupyter Notebook
https://www.udemy.com/course/a-crash-course-in-pyspark/
My notebook on using Python with Jupyter Notebook, PySpark etc
Jupyter notebooks and datasets for the interesting pandas/python/data science video series.
Code along with the course 'Spark and Python for Big Data with PySpark' on Udemy https://www.udemy.com/course/spark-and-python-for-big-data-with-pyspark/