CRICKET-SCORECARD-AND-COMMENTARY-DATASET

Match scorecard and commentary data from 2017 onward for ODI, Test, T20, IPL, BBL and PSL matches. This dataset is also available on Kaggle: https://www.kaggle.com/raghuvansht/cricket-scorecard-and-commentary-dataset Because of their large size, the commentary files could not be added here; please visit the Kaggle dataset link for them.

Cricket has been played for ages. It was halted by the coronavirus pandemic and has since resumed. But the datasets available are ages old, so I, Raghuvansh Tahlan, an aspiring data scientist and avid cricket lover, bring you my personal cricket dataset, which covers matches from 2017 until pre-COVID (2020). The dataset has three parts: a CSV file with one row per match, batting and bowling scorecard CSV files for each match, and one CSV file per match containing ball-by-ball commentary. The data covers international matches (ODI, T20, Test) and three leagues: the Indian Premier League (IPL), the Big Bash League (BBL) and the Pakistan Super League (PSL). In total, the dataset covers over 1200 cricket matches.

UNDERSTANDING THE DATA

"INTERNATIONAL_MATCH.csv" contains one row per match with summary data for that match: the names of the teams, a unique ID for each team, the venue, the venue's unique ID, the date of the match, the result of the match and, most importantly, the "MATCH NUMBER". This number links each row to the corresponding scorecard and commentary files.

The "BATTING" folder contains the batting scorecard CSV files. Every file in this folder is named "XXBATTINGSCORECARD.csv", where "XX" is the "MATCH NUMBER". The "BOWLING" folder contains the bowling scorecard CSV files. Every file in this folder is named "XXBOWLINGSCORECARD.csv", where "XX" is the "MATCH NUMBER".

The "COMMENTARYINTLMATCH" folder contains the ball-by-ball commentary CSV files. Every file in this folder is named "XX_COMMENTARY.csv", where "XX" is the "MATCH NUMBER".

Let's understand the structure with an example. If a row in the "INTERNATIONAL_MATCH.csv" file has the match number "12345", then its batting scorecard is in the "BATTING" folder as "12345BATTINGSCORECARD.csv", its bowling scorecard is in the "BOWLING" folder as "12345BOWLINGSCORECARD.csv", and its commentary is in the "COMMENTARYINTLMATCH" folder as "12345_COMMENTARY.csv".
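The naming scheme above can be sketched in Python as a small helper that maps a match number to the three per-match files. This is only an illustration, not part of the dataset: the function names `match_file_paths` and `read_rows` and the root folder "dataset" are hypothetical, the folder and file names are copied from the description above (the commentary name follows the "XX_COMMENTARY.csv" pattern), and the actual files on Kaggle may be spelled slightly differently.

```python
import csv
from pathlib import Path


def match_file_paths(root, match_number):
    """Build the expected per-match file paths from a MATCH NUMBER.

    Folder and file names follow the README text; `root` is a
    hypothetical location where the dataset was unpacked.
    """
    root = Path(root)
    n = str(match_number)
    return {
        "batting": root / "BATTING" / f"{n}BATTINGSCORECARD.csv",
        "bowling": root / "BOWLING" / f"{n}BOWLINGSCORECARD.csv",
        "commentary": root / "COMMENTARYINTLMATCH" / f"{n}_COMMENTARY.csv",
    }


def read_rows(path):
    """Read a CSV file into a list of dicts; return [] if the file is missing."""
    path = Path(path)
    if not path.exists():
        return []
    with path.open(newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


if __name__ == "__main__":
    # Print each expected file name and how many rows could be read from it.
    for kind, path in match_file_paths("dataset", 12345).items():
        print(kind, path.name, len(read_rows(path)))
```

Returning an empty list for a missing file keeps the sketch usable when only part of the dataset is present, for example when the commentary files have not been downloaded from Kaggle.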

I would like people to create Kaggle Kernels and analyze the data. Please mention me when you use the data; I will try to keep the dataset as up to date as possible. I can be reached at: LinkedIn: https://www.linkedin.com/in/raghuvansh-tahlan/ Medium: https://medium.com/@raghuginnu
