Giter VIP home page Giter VIP logo

tech-spotlight's Introduction

Tech Spotlight

pdm-managed

Project Description

Welcome to Tech Spotlight. The goal of this project is to search through relevant job postings and find the frequency of technologies associated with those jobs.

This is achieved in two parts: a scraper that scrapes data off Indeed.com with four queries, job title, location, age of posting, and a number of jobs to scrape. This produces a CSV file which can be processed within a Kaggle notebook, that tells the story about what technologies, frameworks, and libraries were found and how frequently they were posted.

Enabling and user to get hyper focused insights about the job market and they are interested in, as it pertains to software development


Display

Example of output after csv file processed by kaggle notebook

Full Dataset from Term Detection

Technology_Frequencies_full_dataset

Popular Program languages

Popular_Program_lang

Top Forty Technologies

Top_Forty_Tech


Project Authors

Team Spotlight


Disclaimers

Possible known bug with Windows Native, in writing files.

Term Selection is not all inclusive. (Feel free to add more terms to the Tech_term list in the datasets folder to further customize your search!)

Make sure you're running python 3.10

This data is intended for personal research purpose and not as a single source of truth.

We cannot account for the validity of the data.

This is a learning project and is not 100% flushed out.


How to install/Use

Scraper

    git clone https://github.com/regex-rejects/tech-spotlight.git

Create visual python environment (venv)

example:

    python3 -m venv .venv (mac/wsl)
    py -m venv .venv (windows native)

activate virtual environment

example:

    source .venv/bin/activate (mac/wsl)
    .\.venv\Scripts\activate (windows native)
    pip install -r requirements.txt

Run web scraper:

    python tech_spotlight/scraper.py

You can input the CSV file this application has created into this notebook template to view your data


Kaggle Notebook

This is the kaggle notebook we produced from datasets we scraped during our project. It includes insights based on our interpretations of the data.

Indeed Past 7 Day 900 Listing Term Data 18 May 2022


Template notebook

Fork this notebook and upload your csv

We welcome you to use this tool to create your own datasets tailored to your specific interest in the market.

Upload output csv file to new notebook and use above code as reference


Version

Version 1.0


Technologies Used

Kaggle

Libraries

  • numpy
  • pandas
  • seaborn
  • matplotlib
  • requests
  • urllib
  • bs4
  • time
  • random
  • sys
  • csv

Acknowledgments

  • JB Tellez
  • David Hecker
  • Adam Owada
  • Aaron Imbrock
  • Joseph Streifel
  • Chloe Nott

tech-spotlight's People

Contributors

chrisyamas avatar eden-brekke avatar motobenny avatar nicholas-mercado avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.