Context
Dataset is based on box score and standing statistics from the NBA.
Calculations such as number of possessions, floor impact counter, strength of schedule, and simple rating system are performed.
Finally, extracts are created based on a perspective:
-
teamBoxScore.csv communicates game data from each teams perspective
-
officialBoxScore.csv communicates game data from each officials perspective
-
playerBoxScore.csv communicates game data from each players perspective
-
standing.csv communicates standings data for each team every day during the season
Content
Data Sources
Box score and standing statistics were obtained by a Java application using RESTful APIs provided by xmlstats.
Calculation Sources
Another Java application performs advanced calculations on the box score and standing data.
Formulas for these calculations were primarily obtained from these sources:
Inspiration
Favoritism
Does a referee impact the number of fouls made against a player or the pace of a game?
Forcasting
Can the aggregated points scored by and against a team along with their strength of schedule be used to determine their projected winning percentage for the season?
Predicting the Past
For a given game, can games played earlier in the season help determine how a team will perform?
Lots of data elements and possibilities. Let your imagination roam!