Omar ElMaria's Projects
This repo contains a full analysis of the sales, promo, and pricing trends of fashion products sold by the European E-commerce company "About You". It also contains an ARIMA machine learning model to predict order volume based on historical sales.
This repo contains a readme.md file that explains how to use Airflow at Delivery Hero
This repo contains several DAGs that use a wide variety of Airflow's operators and capabilities
This repo contains a readme.md file explaining how to install Airflow locally using Docker
This repo contains the DAGs that run on my local Airflow environment. I use the local environment to test my DAGs before deploying them to virtual machines via Kubernetes
This repo contains a Python-based web crawler that scrapes data on Luwak coffee products from amazon.de. It is designed to surpass Amazon's anti-bot mechanisms and crawl the most important info from the product pages successfully
This is a bot that notifies the user of available Anmledung (i.e., appointment registration) appointments in Berlin, Germany
[Delivery Hero] This is a BigQuery routine to detect changes in the ASA or scheme configuration throughout a particular time period. It lets you know which changes occurred and how many times they took place. This helps us troubleshoot data problems in experiments.
This repo contains a Big Query code that pulls raw data from a central data warehouse, cleans it, and aggregates it into meaningful KPIs. The queries feed a Data Studio dashboard that senior executives use it to keep track of the business performance
This repo contains a Jupyter notebook that cleans data in CSV files and publishes the data to BigQuery so it gets fed to a Looker Studio dashboard
This repo contains an R script that algorithmically finds the best distribution that fits several continuous, randomized variables
This repo contains a Jupyter notebook that analyzes the order and CVR of various price elasticity tests in Thailand
This repo contains a full ETL pipeline coded in R. The script extracts data from Airtable, cleans and aggregates the data into meaningful statistics, then stores the end result in a G-sheet, which feeds a Data Studio dashboard
This repo contains a Python script that crawls gig information from the "Data Processing" category on Fiverr
This repo contains a script that pulls Google Trends data from the Pytrends library and plots the popularity of several keywords
This repo contains the materials used in the GPT Python bootcamp
This script scrapes job listings on Indeed, a popular job platform. The code was modified to work on a Windows VM
This repo contains a Python script that opens the website of LATAM airlines, inputs some parameters in the flight search fields and scrapes some data off of the page using Python selenium
This repo contains an algorithm that identifies vendors whose customers have a higher willingness to pay. The inherent inelasticity of these vendors is utilized as part of a price differentiation strategy called "Loved Brands"
This repo contains a Python script that computes the match percentage of Loved Brands and Non-Loved Brands between different pipeline run dates
This repo contains Python code that analyzes 280+ AB tests to identify if there are differences between treatment scope and experiment level significance calculations
This repo contains a Python script that uses Scrapy to scrape motorcycle attributes off of a Polish website and enter them into an online importing cost estimation tool using Selenium