ExpertDeveloperRecommendation

A tool designed to work as a search engine to find expert developers based on their skills and experience.

  • Built a search engine that finds expert developers using GitHub datasets.
  • Reduced 3 TB of data to about 600 MB by keeping only developer-specific information, such as the number of commits, the first and last commit, and the average time between commits.
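The reduction step described above can be illustrated with a minimal sketch. It assumes commits arrive as `(developer_id, timestamp)` pairs; the names `DeveloperSummary` and `summarize_commits` are hypothetical and are not taken from the project. Only a count and two timestamps are kept per developer, which is why the summary is so much smaller than the raw commit data.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Dict, Iterable, Optional, Tuple


@dataclass
class DeveloperSummary:
    """Compact per-developer record (hypothetical layout, not the project's schema)."""
    commit_count: int
    first_commit: datetime
    last_commit: datetime

    @property
    def avg_gap(self) -> Optional[timedelta]:
        # The mean gap between consecutive commits equals the total
        # span divided by the number of gaps; undefined for one commit.
        if self.commit_count < 2:
            return None
        return (self.last_commit - self.first_commit) / (self.commit_count - 1)


def summarize_commits(
    commits: Iterable[Tuple[str, datetime]]
) -> Dict[str, DeveloperSummary]:
    """Stream over commits once, keeping one small summary per developer."""
    summaries: Dict[str, DeveloperSummary] = {}
    for dev, ts in commits:
        s = summaries.get(dev)
        if s is None:
            summaries[dev] = DeveloperSummary(1, ts, ts)
        else:
            s.commit_count += 1
            s.first_commit = min(s.first_commit, ts)
            s.last_commit = max(s.last_commit, ts)
    return summaries
```

Because the average gap is derived from the count and the first and last timestamps, individual commit times never need to be stored, and the input does not have to be sorted.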

Overview

Abstract

Global and distributed software development increases the need to find developers with relevant expertise. At the same time, online collaborative development tools such as GitHub have become extremely popular and publicly provide very large datasets on projects and developers. A variety of algorithms, methodologies, and techniques already exist for using collaborative data to recommend expert developers based on knowledge and other criteria. We adapted and extended several of these methods to better match the large size of the available datasets. Specifically, we created a tool that can be used as a search engine for expert developers based on their skills and experience, both of which are extracted from their actual code contributions.

GitHub Dataset Properties

Property                  Size
Number of repositories    2.9 million
Number of commits         215 million
Number of file paths      2.3 billion
Number of file contents   163 million
Size on disk              3+ TB

Project Goals

A search engine that:

  • can be used to search for expert developers
  • provides search criteria for developers’ skills and experience
  • bases results on actual contributions
  • uses a very large dataset of possible experts (20+ million)
  • answers search queries reasonably fast
  • can be extended to include more criteria (code quality, bugs introduced, etc.)
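To make the goals above concrete, here is a small sketch of a skill-and-experience query. It is an illustration under assumptions, not the project's implementation: `DeveloperProfile`, its fields, and `search_experts` are hypothetical names, skill is approximated by commits per language, and experience by the number of days between first and last commit.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List


@dataclass
class DeveloperProfile:
    """Hypothetical search record derived from a developer's contributions."""
    login: str
    commits_by_language: Dict[str, int] = field(default_factory=dict)
    active_days: int = 0  # assumed: days between first and last commit


def search_experts(
    profiles: Iterable[DeveloperProfile],
    language: str,
    min_commits: int = 1,
    min_active_days: int = 0,
    limit: int = 10,
) -> List[DeveloperProfile]:
    """Filter by skill and experience, then rank by commits in the language."""
    matches = [
        p for p in profiles
        if p.commits_by_language.get(language, 0) >= min_commits
        and p.active_days >= min_active_days
    ]
    # Rank by contribution volume in the requested language, highest first.
    matches.sort(key=lambda p: p.commits_by_language.get(language, 0), reverse=True)
    return matches[:limit]
```

A linear scan like this only shows the shape of a query. With 20+ million candidates, keeping queries reasonably fast would likely require an index or a database. Further criteria such as code quality could be added as extra fields and filter arguments.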

For Details

  • ExpertDeveloperRecommendation (Report).pdf

Contributors

  • saimmehmood
