
Omar ElMaria's Projects

permanent_residence_appointment_finder

This repo contains a Selenium script that automatically checks for Consultation appointments on the Volkshochschule Berlin Mitte website (https://vhsmitte.flexappoint.de/#/). This website is used to book appointments for the "Leben in Deutschland" test, which is a prerequisite for obtaining permanent residence or citizenship in Germany.

python_scrapy_airflow_pipeline

This repo contains a full-fledged Python-based script that scrapes a JavaScript-rendered website, cleans the data, and pushes the results to a cloud-based database. The workflow is orchestrated on Airflow to run automatically.

scraping_with_r_selenium_and_rvest

This repo contains a multi-stage R-based script that scrapes a JavaScript-rendered e-commerce website using RSelenium and rvest. It also formats and cleans the data and stores it in a table for analysis purposes.

scrapy_playwright_example

This repo contains a scraping script that crawls a JavaScript-rendered website using the Scrapy framework and the scrapy-playwright package in Python.

scrapy_playwright_with_proxy_service

This repo contains the source code showing how to integrate a proxy service (ScraperAPI) with scrapy-playwright. The repo has two spiders, one for quotes.toscrape.com and the other for httpbin.org/ip.

skyscanner_crawler

This repo contains a Python script that crawls 5120 flight routes from the popular flight aggregator Skyscanner.

smart_vendor_clustering_in_gbq_and_r

This repo contains a GBQ script that clusters vendors according to their elasticity of demand and conversion rate trends. The goal is to identify vendors with lower price sensitivity than their peers to implement a differentiated pricing strategy. The R script analyzes the performance of an A/B/n test that was set up to validate the quality of the clusters by measuring how much incremental gross profit the differentiated pricing strategy yields.

surge_pricing_experiment_analysis_in_gbq_and_r

This repo contains a GBQ script that pulls, cleans, and aggregates data from a hybrid experiment (A/B and diff-in-diff). The R script contains logic that analyzes the performance and significance of the results according to key success metrics.

switchback_test_dag

This repo contains a data pipeline composed of Python and BigQuery scripts that extract, clean, and aggregate data, as well as perform statistical significance tests. The code is fully orchestrated on Airflow and feeds a Tableau dashboard that displays the success metrics of surge pricing switchback experiments.

vendor_analysis_for_subscription_benefits

This repo contains queries used to extract data about vendor performance in Delivery Hero's APAC markets. The data is used to simulate the impact on gross profit and GMV if free delivery beneficiaries were charged delivery fees. The simulation is done in a Tableau dashboard.

vendor_performance_scorecard_in_gbq

This repo contains a GBQ script that pulls operational performance metrics of an e-commerce platform's suppliers and ranks them against one another for benchmarking purposes.

wafid_crawling_bot

This repo contains a Python script that tracks the availability of medical appointments in the UAE on https://wafid.com/medical-status-search/.

wine_and_real_estate_listings_r_scraper

This repo contains two Rmd files. The first file scrapes wine listings under the brand name "mövenpick" using the rvest package. The second scrapes JavaScript-rendered apartment listings on the Swiss real estate website (homegate.ch) using RSelenium.

wolt_crawler

This repo contains a Python Selenium script that scrapes the restaurant name, subtitle, delivery fee, and promised order time from the restaurant listing page of Wolt (https://wolt.com/en/discovery/restaurants).
