Network Science Based Basketball Analytics

Network science based basketball analytics comprises various recent attempts to apply the perspective of networks to the analysis of basketball.

Overview

Traditional basketball statistics analyze individuals independently of their teammates or competitors, and traditional player positions are determined by individual attributes. In contrast, network based analytics are obtained by constructing team or league level player networks, in which individual players are nodes connected by ball movement or by some measure of similarity. Metrics are then obtained by calculating network properties such as degree, density, centrality, clustering and distance. This approach enriches the analysis of basketball with new individual and team level statistics and offers a new way of assigning a position to a player.
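
As a minimal sketch of how such a network might be assembled, the Python below counts passes between players to form weighted directed links and then derives two simple network properties. The pass log, the player labels and the function names are hypothetical and serve only as an illustration.

```python
from collections import Counter

def build_pass_network(passes):
    """Return weighted directed edges as {(passer, receiver): pass count}."""
    return Counter(passes)

def weighted_out_degree(edges):
    """Total number of passes made by each player."""
    out = Counter()
    for (passer, _receiver), weight in edges.items():
        out[passer] += weight
    return out

# Hypothetical pass log: one (passer, receiver) pair per pass.
passes = [("PG", "SG"), ("SG", "C"), ("PG", "SG"), ("C", "PG"), ("PG", "SF")]

edges = build_pass_network(passes)
out_degree = weighted_out_degree(edges)

# Density of a directed network: observed links / possible links.
nodes = {player for edge in edges for player in edge}
density = len(edges) / (len(nodes) * (len(nodes) - 1))
```

Here the point guard is the source of three of the five passes, and four of the twelve possible directed links are used.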

Team level statistics

The biggest contribution to team level metrics has been made by Arizona State University researchers led by Jennifer H. Fewell. Using data from the first round of the 2010 NBA playoffs, they constructed a network for each team, with players as nodes and ball movement between them as links. They distinguish a trade-off between division of labor and a team's unpredictability, which are not necessarily mutually exclusive and are measured by uphill downhill flux and team entropy respectively.

Team entropy - a measure of unpredictability and variation in a team's offense, with higher entropy meaning more variation. It is calculated by aggregating individual Shannon entropies, where unpredictability is measured as the uncertainty of ball movement between any two nodes.
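
The sketch below illustrates one way such an aggregate could be computed. It takes the Shannon entropy of each player's distribution of pass targets and simply sums the results. The plain sum is an assumption, since the text does not say how the individual entropies are aggregated, and the pass counts are made up.

```python
import math

def shannon_entropy(probs):
    """Shannon entropy in bits of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

def player_entropies(pass_counts):
    """Entropy of each passer's choice of receiver."""
    result = {}
    for passer, targets in pass_counts.items():
        total = sum(targets.values())
        if total == 0:
            continue  # a player with no recorded passes adds nothing
        result[passer] = shannon_entropy(c / total for c in targets.values())
    return result

def team_entropy(pass_counts):
    """Assumed aggregation: the plain sum of the individual entropies."""
    return sum(player_entropies(pass_counts).values())

# Hypothetical pass counts: passer -> {receiver: number of passes}.
pass_counts = {
    "A": {"B": 5, "C": 5},                  # even split between two targets
    "B": {"A": 10},                         # fully predictable
    "C": {"A": 2, "B": 2, "D": 2, "E": 2},  # even split between four targets
}

total_entropy = team_entropy(pass_counts)
```

Player B, who always passes to the same teammate, contributes zero entropy, while player C, who spreads passes evenly over four teammates, contributes the most.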

Uphill downhill flux - measures the division of labor, or the expertise in moving the ball to the player with the best shooting percentage. According to Fewell et al., it can be interpreted as the average change in potential shooting percentage per pass. The metric is calculated as the sum, over all edges, of the difference between the shooting percentages of the nodes at the ends of each edge, weighted by the probability of that edge:

$\mathrm {flux} =\sum _{i\neq j}p_{ij}(x_{j}-x_{i})$

where $p_{ij}$ is the probability of the link between players i and j, and $x_{i}$ and $x_{j}$ are their shooting percentages.
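
A small numerical illustration may help. The sketch assumes the flux is the probability-weighted sum $\sum _{i\neq j}p_{ij}(x_{j}-x_{i})$ over directed links from passer i to receiver j. The link probabilities and shooting percentages are invented for the example.

```python
def uphill_downhill_flux(pass_probs, shooting_pct):
    """Sum of p_ij * (x_j - x_i) over all directed links (i, j)."""
    return sum(
        p * (shooting_pct[j] - shooting_pct[i])
        for (i, j), p in pass_probs.items()
        if i != j
    )

# Hypothetical link probabilities (summing to 1) and shooting percentages.
pass_probs = {("A", "B"): 0.5, ("B", "C"): 0.3, ("C", "A"): 0.2}
shooting_pct = {"A": 0.40, "B": 0.50, "C": 0.60}

flux = uphill_downhill_flux(pass_probs, shooting_pct)
```

A positive value means that, on average, passes move the ball toward better shooters. In this example the two "uphill" links outweigh the one "downhill" link from C back to A.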

Other measures include

Team clustering coefficient - a direct application of the clustering coefficient. It measures how interconnected the players are: whether the ball moves through one node or in many ways between all the players.
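
To make the contrast concrete, the sketch below computes the average local clustering coefficient for two toy networks. Treating the pass network as undirected and unweighted is a simplifying assumption, and the adjacency data is hypothetical.

```python
from itertools import combinations

def local_clustering(adj, node):
    """Share of a node's neighbor pairs that are themselves linked."""
    neighbors = adj[node]
    k = len(neighbors)
    if k < 2:
        return 0.0
    linked = sum(1 for u, v in combinations(neighbors, 2) if v in adj[u])
    return 2.0 * linked / (k * (k - 1))

def average_clustering(adj):
    """Mean of the local clustering coefficients over all nodes."""
    return sum(local_clustering(adj, n) for n in adj) / len(adj)

# Star: every pass goes through player A.
star = {"A": {"B", "C", "D"}, "B": {"A"}, "C": {"A"}, "D": {"A"}}
# Triangle: everyone passes to everyone.
triangle = {"A": {"B", "C"}, "B": {"A", "C"}, "C": {"A", "B"}}

star_clustering = average_clustering(star)
triangle_clustering = average_clustering(triangle)
```

The star, where the ball always moves through one node, has a clustering of zero, while the fully connected triangle has the maximum value of one.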

Team degree centrality - like the previous metric, it measures whether there is one dominant player on the team. It is calculated by the formula

$C_{D}=\sum _{v\in V}{\frac {\deg(v^{*})-\deg(v)}{|V|-1}}$

where $\deg(v)$ is the degree of node v, $v^{*}$ is the node with the highest degree, and $|V|$ is the number of nodes.
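
The following sketch evaluates this quantity exactly as the formula is written in the text, as the sum of the gaps to the highest degree divided by the number of nodes minus one. The degree values are hypothetical.

```python
def team_degree_centrality(degrees):
    """Sum of (max degree - degree) over all nodes, divided by |V| - 1."""
    n = len(degrees)
    max_degree = max(degrees.values())
    return sum(max_degree - d for d in degrees.values()) / (n - 1)

# Hypothetical five-player teams.
hub_team = {"A": 4, "B": 1, "C": 1, "D": 1, "E": 1}       # one dominant player
balanced_team = {"A": 4, "B": 4, "C": 4, "D": 4, "E": 4}  # everyone equally connected

hub_score = team_degree_centrality(hub_team)
balanced_score = team_degree_centrality(balanced_team)
```

The balanced team scores zero because no player stands out, while the team built around a single hub scores higher.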

Combined low clustering and high degree centrality mean that the defense can double-team the dominant player, since without him the team has problems moving the ball.

Average path length - number of passes per play.

Path flow rate - number of passes per unit time. It measures how quickly the team moves the ball.
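
Both measures reduce to simple ratios, as the sketch below shows. It assumes each play is recorded as a pair of pass count and duration in seconds, and that the flow rate is total passes divided by total time rather than an average of per-play rates. The play data is invented.

```python
def average_path_length(plays):
    """Mean number of passes per play."""
    return sum(passes for passes, _seconds in plays) / len(plays)

def path_flow_rate(plays):
    """Total passes divided by total play time (passes per second)."""
    total_passes = sum(passes for passes, _seconds in plays)
    total_time = sum(seconds for _passes, seconds in plays)
    return total_passes / total_time

# Hypothetical plays: (number of passes, duration in seconds).
plays = [(3, 12.0), (5, 20.0), (1, 4.0)]

avg_length = average_path_length(plays)
flow_rate = path_flow_rate(plays)
```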

Deviation from maximum operating potential - with players as nodes, ball movement as links and true shooting percentage as efficiency, an analogy can be made to a traffic network. Each individual is assumed to have a skill curve f(x) that declines in the number of shots taken. Individual maximization of efficiency yields $f(x_{1})=f(x_{2})=\dots =f(x_{n})$, whereas maximum team efficiency is achieved by solving ${\frac {dF}{dx_{1}}}={\frac {dF}{dx_{2}}}=\dots ={\frac {dF}{dx_{n}}}$, where $F=x_{1}f(x_{1})+x_{2}f(x_{2})+\dots +x_{n}f(x_{n})$ and n is the number of players.

The difference between these two constitutes the team's deviation from the maximum potential.
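
A two-player worked example can make the gap visible. The sketch assumes each player has a linear skill curve f(x) = a - b*x, where x is the player's share of the team's shots and the two shares sum to one. The linear form, the parameter values and the restriction to interior solutions are all assumptions made for illustration.

```python
def skill(a, b, x):
    """Assumed linear skill curve: efficiency falls as the shot share x grows."""
    return a - b * x

def equilibrium_share(a1, b1, a2, b2):
    """Player 1's share when both players end up equally efficient."""
    # Solves a1 - b1*x1 = a2 - b2*(1 - x1).
    return (a1 - a2 + b2) / (b1 + b2)

def optimal_share(a1, b1, a2, b2):
    """Player 1's share that maximizes total team efficiency."""
    # Solves d/dx1 [x1*f1(x1)] = d/dx2 [x2*f2(x2)] with x2 = 1 - x1.
    return (a1 - a2 + 2 * b2) / (2 * (b1 + b2))

def team_efficiency(a1, b1, a2, b2, x1):
    """F = x1*f1(x1) + x2*f2(x2) with x2 = 1 - x1."""
    x2 = 1.0 - x1
    return x1 * skill(a1, b1, x1) + x2 * skill(a2, b2, x2)

# Hypothetical curves: player 1 starts more efficient but declines faster.
params = (0.7, 0.4, 0.5, 0.2)

x_eq = equilibrium_share(*params)
x_opt = optimal_share(*params)
deviation = team_efficiency(*params, x_opt) - team_efficiency(*params, x_eq)
```

With these numbers the equal-efficiency outcome gives player 1 two thirds of the shots, while the team optimum gives him only half, so the team loses a small amount of efficiency by over-using its better shooter.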

Individual Statistics

Success/Failure Ratio - the number of times the player (node) was involved in a successful play divided by the number of times the player was involved in an unsuccessful play. The metric is obtained from the team's play-by-play network.
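
The ratio can be sketched as a simple count over a play log. The log format (a list of involved players plus a success flag) and the choice to return infinity for a player with no unsuccessful plays are assumptions made for this example.

```python
from collections import Counter

def success_failure_ratio(plays):
    """Successful involvements divided by unsuccessful ones, per player."""
    successes, failures = Counter(), Counter()
    for players, success in plays:
        (successes if success else failures).update(players)
    ratios = {}
    for player in set(successes) | set(failures):
        if failures[player] == 0:
            ratios[player] = float("inf")  # never involved in a failed play
        else:
            ratios[player] = successes[player] / failures[player]
    return ratios

# Hypothetical play log: (players involved, whether the play succeeded).
plays = [
    (["A", "B"], True),
    (["A", "C"], False),
    (["A", "B"], True),
    (["B", "C"], False),
]

ratios = success_failure_ratio(plays)
```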

Under/over performance - the metric is calculated by mapping the bipartite player network. Players are connected if they were part of the same team, and the links are weighted by how successful the team was while the players played together. Node centrality measures are then compared to reference centrality distributions, obtained for each node by bootstrap-based randomization procedures, and p-values are calculated. For example, the p-value of player i is given by

$p_{i}={\frac {\sum _{k=1}^{J}I(\pi _{ik}^{*}\geq \pi _{i}^{0})}{J}}$

where $\pi _{ik}^{*}$ is the reference centrality score in iteration k, $\pi _{i}^{0}$ is the calculated centrality score, I is the indicator function and J is the number of iterations. High p-values indicate under-performance and low p-values indicate over-performance.
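
The p-value is the share of reference scores that are at least as large as the observed score, which the sketch below computes. The text does not describe the centrality measure or the randomization procedure in detail, so the weighted degree used here and the shuffling of team success values are stand-ins chosen for illustration. The rosters and success values are invented.

```python
import random

def empirical_p_value(reference_scores, observed):
    """Share of reference scores that are >= the observed score."""
    return sum(1 for s in reference_scores if s >= observed) / len(reference_scores)

def weighted_degree(player, rosters, success):
    """Hypothetical centrality: total success of the teams the player was on."""
    return sum(success[team] for team, members in rosters.items() if player in members)

def reference_scores(player, rosters, success, iterations, seed=0):
    """Stand-in randomization: shuffle success values across teams."""
    rng = random.Random(seed)
    teams = list(success)
    values = list(success.values())
    scores = []
    for _ in range(iterations):
        rng.shuffle(values)
        scores.append(weighted_degree(player, rosters, dict(zip(teams, values))))
    return scores

# Hypothetical rosters and team success values.
rosters = {"T1": {"A", "B"}, "T2": {"A", "C"}, "T3": {"B", "C"}, "T4": {"C", "D"}}
success = {"T1": 0.8, "T2": 0.7, "T3": 0.3, "T4": 0.2}

observed = weighted_degree("A", rosters, success)
p_a = empirical_p_value(reference_scores("A", rosters, success, 1000), observed)
```

Player A appears only on the two most successful teams, so few shuffled networks match the observed score and the resulting p-value is low, which the text reads as over-performance.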

Under-utilization - a player is under-utilized by the team if he has a low degree centrality but is over-performing.

Player Positions

New basketball positions were proposed by Stanford University student Muthu Alagappan, who, while working for the data visualization company Ayasdi, mapped a network of NBA players from one season, linking them by the similarity of their statistics. Based on the node clusters, players were grouped into 13 positions.

Offensive Ball-Handler - a player who specializes in scoring and ball handling but has low averages of steals and blocks.

Defensive Ball-Handler - a player who specializes in assisting and stealing the ball but is average in scoring and shooting.

Combo Ball-Handler - a player who is above average in both offense and defense but doesn't excel in either.

Shooting Ball-Handler - a player who is above average in shot attempts and points scored per game.

Role-Playing Ball-Handler - a player who plays few minutes and doesn't have a large impact on the team.

3-Point Rebounder - a big man and ball handler with above-average rebounds and three-point shots attempted and made.

Scoring Rebounder - a player with high scoring and rebound averages.

Paint Protector - a player valued for blocking and rebounding but with low average points scored.

Scoring Paint Protector - a player who is good at both offense and defense in the paint.

NBA 1st-Team - players with above-average numbers in most statistical categories.

NBA 2nd-Team - similar to NBA 1st-Team players, but a bit worse.

Role Player - similar to NBA 2nd-Team players, but worse.

One-of-a-Kind - players so exceptional that they could not be categorized.
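
The idea of linking players by statistical similarity and reading positions off the resulting clusters can be sketched in a much simplified form. The code below links players whose stat vectors lie within a distance threshold and treats connected components as groups. This is not the method used at Ayasdi, which the text does not describe in detail. The stat lines, the threshold and the function names are hypothetical, and a real analysis would normalize the statistics first.

```python
import math

def similarity_edges(stats, threshold):
    """Link every pair of players whose stat vectors are within the threshold."""
    names = sorted(stats)
    edges = []
    for index, a in enumerate(names):
        for b in names[index + 1:]:
            if math.dist(stats[a], stats[b]) <= threshold:
                edges.append((a, b))
    return edges

def connected_components(nodes, edges):
    """Group nodes that are reachable from one another."""
    adj = {n: set() for n in nodes}
    for a, b in edges:
        adj[a].add(b)
        adj[b].add(a)
    seen, groups = set(), []
    for start in sorted(nodes):
        if start in seen:
            continue
        stack, group = [start], set()
        while stack:
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(adj[node] - group)
        seen |= group
        groups.append(group)
    return groups

# Made-up per-game stat lines: (points, rebounds, assists).
stats = {
    "guard_1": (18.0, 3.0, 8.0),
    "guard_2": (17.0, 3.5, 7.5),
    "center_1": (10.0, 11.0, 1.5),
    "center_2": (11.0, 10.0, 2.0),
    "wing_1": (25.0, 7.0, 6.0),
}

groups = connected_components(stats, similarity_edges(stats, threshold=2.0))
```

In this toy data the two guards and the two centers each form a group, while the high-scoring wing is left on his own, loosely mirroring the "One-of-a-Kind" category.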

Source of the article : Wikipedia