shufinskiy / nba_data
NBA play-by-play data from stats.nba.com, data.nba.com, pbpstats.com, and also shots information with season 1996/97
License: Apache License 2.0
Thank you for compiling this dataset and for making it easily accessible!
I was just wondering about the availability of post-season data (as well as pre-season and All-Star Game data). What was the rationale behind only using regular-season data here?
Thank you again!
I'm still unsure how game time is denoted in the nbastats_XXXX.csv files. I consulted the data dictionary, but PCTIMESTRING does not appear to represent the time remaining in the quarter. At best, it looks like the cumulative time elapsed in the game multiplied by 60. Even so, there are obvious errors: in numerous games a quarter ends and the next quarter starts with a PCTIMESTRING value that is lower than the previous one.
Can you offer clarification on this, and more importantly, how to estimate the current game time for each event in the file?
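One way to reason about the question above is to write down what the conversion would look like if the clock were stored as time remaining in the period. The sketch below is only that: it assumes a clock string in "MM:SS" form counting down within each period, 12-minute regulation quarters, and 5-minute overtime periods. The function names are hypothetical, and whether the nbastats_XXXX.csv files actually store PCTIMESTRING this way is exactly what is in doubt here, so the input may need converting first.

```python
REGULATION_PERIODS = 4
REGULATION_SECONDS = 12 * 60  # assumed length of a regulation quarter
OVERTIME_SECONDS = 5 * 60     # assumed length of an overtime period


def period_length(period: int) -> int:
    """Length in seconds of the given period (1-4 regulation, 5+ overtime)."""
    if period <= REGULATION_PERIODS:
        return REGULATION_SECONDS
    return OVERTIME_SECONDS


def elapsed_seconds(period: int, clock: str) -> int:
    """Seconds elapsed since tip-off.

    `clock` is assumed to be an "MM:SS" string showing the time
    remaining in the current period.
    """
    minutes, seconds = clock.split(":")
    remaining = int(minutes) * 60 + int(seconds)
    # Total length of all periods that have already finished.
    completed = sum(period_length(p) for p in range(1, period))
    return completed + period_length(period) - remaining
```

With this definition the value never decreases from one period to the next, so a drop at a quarter boundary in the converted column would point to a problem in the source rows rather than in the arithmetic.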
Within datanba_po_2020.csv, the rows with a GAME_ID of 42000133 contain all the events of the NBA's gid 0041900133.
0042000133 should reference a game between NYK and ATL from 2021-05-28 (https://www.nba.com/game/nyk-vs-atl-0042000133/play-by-play), but the first event recorded for that game is:
Jump Ball Adebayo vs Turner (G Dragic gains possession)
This event comes from a game between MIA and IND from 2020-08-20 (https://www.nba.com/game/mia-vs-ind-0041900132/play-by-play).
Incredible work! I was curious if you had any insight into acquiring (or the process to acquire) play-by-play data from the early 90s?
Flagging that, from the NBA's perspective, game_id is a 10-digit character vector, but all the uncompressed CSV files in this repository have converted this variable to a numeric type and dropped the first two digits. This does make for more efficient compression, as the first two digits are always "0", but it distorts what a researcher would have received from calling these APIs themselves, because the variable is now of a different type.
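For anyone who needs the original form back, a minimal sketch of the reverse conversion follows. It assumes only what is described above: the ID is 10 characters long and the dropped leading characters are always "0". The helper name restore_game_id is hypothetical.

```python
def restore_game_id(value) -> str:
    """Return the 10-character, zero-padded form of a numerically stored game ID."""
    # int() normalizes inputs such as 42000133, "42000133" or 42000133.0
    # before the leading zeros are put back.
    return str(int(value)).zfill(10)


print(restore_game_id(42000133))  # prints 0042000133
```

Keeping the restored value as a string avoids losing the leading zeros again on the next round trip.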
And just for reference, the game_id is of the form XXXYYZZZZZ.