|
|
|
ABSTRACT |
In preparation for the Olympics, there is a limited opportunity for coaches and athletes to interact regularly with team performance indicators providing important guidance to coaches for enhanced match success at the elite level. This study examined the relationship between match outcome and team performance indicators during men’s basketball tournaments at the Olympic Games. Twelve team performance indicators were collated from all men’s teams and matches during the basketball tournament of the 2004-2016 Olympic Games (n = 156). Linear and non-linear analyses examined the relationship between match outcome and team performance indicator characteristics; namely, binary logistic regression and a conditional interference (CI) classification tree. The most parsimonious logistic regression model retained ‘assists’, ‘defensive rebounds’, ‘field-goal percentage’, ‘fouls’, ‘fouls against’, ‘steals’ and ‘turnovers’ (delta AIC <0.01; Akaike weight = 0.28) with a classification accuracy of 85.5%. Conversely, four performance indicators were retained with the CI classification tree with an average classification accuracy of 81.4%. However, it was the combination of ‘field-goal percentage’ and ‘defensive rebounds’ that provided the greatest probability of winning (93.2%). Match outcome during the men’s basketball tournaments at the Olympic Games was identified by a unique combination of performance indicators. Despite the average model accuracy being marginally higher for the logistic regression analysis, the CI classification tree offered a greater practical utility for coaches through its resolution of non-linear phenomena to guide team success. |
Key words:
Team sport, classification tree, machine learning, performance analysis, non-linear analysis, athlete
|
Key
Points
- A unique combination of team performance indicators explained 93.2% of winning observations in men’s basketball at the Olympics.
- Monitoring of these team performance indicators may provide coaches with the capability to devise multiple game plans or strategies to enhance their likelihood of winning.
- Incorporation of machine learning techniques with team performance indicators may provide a valuable and strategic approach to explain patterns within multivariate datasets in sport science.
|
Basketball is an intermittent, anaerobic-dominant, team sport that is played by athletes across a range of levels (Scanlan et al., 2012; Tessitore et al., 2006). Several studies have reported on the key physical and physiological characteristics of basketball athletes (Hoare, 2000; Koklu et al., 2011). While these characteristics contribute to individual performance, the combination of all individual performances in a coherent manner ultimately results in team success (Gomez et al., 2009). Subsequently, team performance indicators during matches may provide a holistic foundation for coaches in the development of training and match strategies to enhance success. Basketball match success has been associated with a range of team performance indicators including successful 3-point (Ibanez et al., 2009; Jukic et al., 2000; Lorenzo et al., 2010), 2-point (Ibanez et al., 2009; Jukic et al., 2000; Lorenzo et al., 2010) and 1-point (free-throw) (Jukic et al., 2000; Sampaio et al., 2006; Trninic et al., 2002) shots, ‘defensive rebounds’ (Gomez et al., 2008; Ibanez et al., 2009; Trninic et al., 2002), ‘fouls’ (Sampaio et al., 2006), ‘turnovers’ (Lorenzo et al., 2010) and ‘assists’ (Gomez et al., 2008; Ibanez et al., 2009; Trninic et al., 2002). Gomez and colleagues (2008) reported that ‘defensive rebounds’ and ‘assists’ discriminated all wins and losses during the 2004-2005 Spanish Men’s Professional League. Other studies focussing on European basketball matches have also reported the importance of ‘defensive rebounds’, ‘assists’ and ‘field-goal percentage’ for team match success (Jukic et al., 2000; Trninic et al., 2002). During short-term, junior tournaments, a range of performance indicators including ‘2-point field-goal’ accuracy, ‘defensive rebounds’, ‘turnovers’ and ‘assists’ were acknowledged as discriminatory for wins or losses (Ibanez et al., 2009; Lorenzo et al., 2010). Despite this quantity of work, the contribution of similar team performance indicators to match success during the men’s basketball tournament at the Olympic Games has yet to be examined. Unlike season long competitions, players within teams for the Olympics have a limited opportunity to interact regularly with identification of these team performance indicators expected to provide vital direction to coaches in the design of training programs and match strategies to enhance match success likelihood. To identify performance indicators explanatory of a predetermined response (e.g. match outcome), sports performance analysts have become increasingly proficient with the use of machine learning techniques (Gomez et al., 2015a; Gomez et al., 2015b; Robertson et al., 2015). One of the benefits of machine learning techniques is its capability to resolve meaningful, non-linear interactions within multivariate datasets in contrast to traditional linear techniques (Morgan et al., 2013; Robertson et al., 2015). Accordingly, machine learning may assist coaches with the identification of flexible targets or performance indicator combinations that enhance the likelihood of team success. Classification trees have been shown to be an effective, machine learning technique to explain match outcome in elite Australian Football (AF) (Robertson et al., 2015) and rugby league (Woods et al., In press), as well as explaining the effectiveness of ball screens and inside passes in basketball (Courel-Ibáñez et al., 2016; Gomez et al., 2015a). However, such an approach has yet to be employed for team performance indicator combinations and match outcome during an elite basketball tournament. The aims of the current study were: 1) to examine the relationship between team performance indicator characteristics and match outcome during the men’s basketball tournament at the Olympic Games; 2) to compare the utility of linear and non-linear statistical approaches in the resolution of this relationship. Given the findings of others (Courel-Ibáñez et al., 2016; Gomez et al., 2015a; Robertson et al., 2015), it was hypothesised that: 1) distinctive performance indicator combinations would be explanatory of match outcome; and 2) the classification accuracy of both statistical approaches would yield similarity, however, the non-linear approach would offer greater practical utility given the non-linear interactions between team performance indicators.
Study designThis study was a retrospective analysis of publically available data from the official Olympic websites with all procedures considered in line with local, institutional, ethical approval. All matches (n = 156) undertaken within the men’s basketball tournament at the 2004 (Athens, n = 42), 2008 (Beijing, n = 38), 2012 (London, n = 38) and 2016 (Rio de Janeiro, n = 38) Olympic Games were examined. The team performance indicators for each match were downloaded and collated (Table 1) and a priori classified according to match outcome (win/loss). All team performance indicators were normalised for the number of ball possessions by each team per match, as previously described (Gomez et al., 2013; Lorenzo et al., 2010; Sampaio and Janeira, 2003). Briefly, ball possessions were calculated using the following equation: field-goal attempts – ‘offensive rebounds’ + ‘turnovers’ – (0.4 x free-throw attempts) (Oliver, 2004). Each match provided two datasets or observations (one per team) with a total of 312 observations (84 from 2004, 76 from 2008, 76 from 2012 and 76 from 2016) examined in the current study.
Statistical analysisDescriptive statistics (mean D SD) were calculated for each team performance indicator relative to match outcome (win/loss). All analyses and visualisations were conducted using R (version 3.2.2, Vienna, Austria). Prior to hypothesis testing, a Spearman’s correlation matrix was built to assess the level of collinearity between the team performance indicators. This screening process revealed collinearity between ‘field goal percentage’, ‘3-point percentage’, and ‘2-point percentage’ (r > 0.4). Accordingly, the latter two indicators were removed from further analyses. A multivariate analysis of variance (MANOVA) examined the main effect of match outcome on each performance indicator. Additionally, the effect size (and 90% confidence interval) of match outcome on the team performance indicators was calculated using Cohen’s d statistic, where an effect size (d) of < 0.2 was considered trivial, d = 0.20–0.49 considered small, d = 0.50–0.79 considered medium, and d > 0.79 large (Cohen, 1992). Binary logistic regression was then used to build linear probability models. Match outcome was coded as the response variable (0 = win, 1 = loss), while each performance indicator that was significantly different according to the MANOVA, was coded as explanatory variables. Model selection was performed using the delta Akaike Information Criterion (AIC) and Akaike weights (Burnham and Anderson, 2002). The delta AIC provides a measure of relative model parsimony with a value < 2 indicating high parsimony (Burnham and Anderson, 2002). Conversely, Akaike weights indicate the probability that the selected model is the best among the model set with a value closer to 1 indicating greater probability (Burnham and Anderson, 2002). Both delta AIC and Akaike weights were ascertained via the ‘dredge’ function in the MuMIn package (Burnham and Anderson, 2002). A recursively, partitioned, conditional interference (CI), classification tree was then grown to model the relationship between the same response and explanatory variables. This classification analysis enables the resolution of non-linear phenomena (Robertson and Joyce, 2015). A CI classification tree was chosen as its fitting algorithm estimated a regressive relationship through binary partitioning by testing the null hypothesis between a set of explanatory variables and a response variable (Hothorn et al., 2006). To prune the tree, a minimum node size of ten observations was chosen with partitioning occurring for the explanatory variables that had the greatest association with the response variable (i.e. estimated in accordance with a ‘P-value’). Thus, the partitioning stopped when the null hypothesis could not be rejected (i.e. p > 0.05). This analysis was performed via the ‘ctree’ function in the party package (Hothorn et al., 2006).
The descriptive statistics for each normalised, team performance indicator relative to match outcome are presented in Table 2 with most indicators significantly greater during wins compared to losses. ‘Field-goal percentage’ had the largest effect on match outcome (d > 1.1) with ‘defensive rebounds’ and ‘assists’ also exhibiting large effects (Table 2). As shown in Table 3, the best linear model retained ‘assists’, ‘defensive rebounds’, ‘field-goal percentage’, ‘fouls’, ‘fouls against’, ‘steals’ and ‘turnovers’. This reduced linear model successfully classified 85.2% and 85.8% of the a priori classified wins and losses, respectively, for an average model accuracy of 85.5%. The subsequent output from the CI classification tree is illustrated in Figure 1. As shown, four performance indicators were retained within the tree with an average classification accuracy of 81.4%. Seven terminal nodes were grown (numbers 5, 6, 7, 8, 11, 12 and 13) with the root node (number 1) partitioning the dataset based on ‘field-goal percentage’. The subsequent branching to the left of the tree denotes a loss (≤63.9 ‘field-goal percentage’) and the branching to the right denotes a win (>63.9 ‘field- goal percentage’). Following the branch to the right of the tree, node number 9 partitioned the data based on ‘defensive rebounds’. Of the 59 observations in terminal node 13 (>63.9% ‘field- goal’ and >34.7 ‘defensive rebounds’), the probability of winning was higher (93.2%) than the probability of losing (6.8%). This combination offered the greatest probability of winning from the performance indicator combinations for both linear and non-linear approaches. Node number 10 (≤34.7 ‘defensive rebounds’) split the data based on ‘steals’ with two terminal nodes generated (numbers 11 and 12). Following the branch to the left of the tree, node number 2 partitioned the data based on ‘defensive rebounds’, growing terminal node 8. Progressing further down the branch (≤40.8 ‘defensive rebounds’), node number 3 partitioned the data based on ‘turnovers’, growing terminal node 7. Lastly, node number 4 partitioned the data based on the number of ‘defensive rebounds’, resulting in the growth of two terminal nodes, numbered 5 and 6. Of the 61 observations in terminal node 5, the probability of losing (73.8%) was higher than the probability of winning (26.2%). This combination offered the lowest probability of winning from the performance indicator combinations.
The current study identified key-indicators of performance that were explanatory of success in men’s basketball at the Olympic Games. The best logistic regression model resolved a relatively higher classification accuracy in contrast to the CI classification tree. Although, the CI classification tree revealed a range of performance indicator combinations for match outcome in contrast to the outputs of the linear regression model. Of considerable note was the combination of ‘field-goal percentage’ and ‘defensive rebounds’ that explained 93.2% of the winning observations in terminal node 13. Conversely, a unique combination of ‘field-goal percentage’, ‘defensive rebounds’ and ‘turnovers’ offered the lowest probability of winning (6.8%) and the greatest probability of losing (93.2%, terminal node 7). Despite the average model accuracy being only marginally higher for the logistic regression analysis, the output from the CI classification tree offers coaches and analysts with greater practical utility. Further, it offers flexible and non-linear insights into the unique combinations of performance indicators required to increase (or decrease) a team’s probability of success within an Olympic basketball tournament. This practical model provides coaches with the capability to devise multiple game plans or strategies to enhance their likelihood of winning based upon the accumulation of these performance indicators. For example, if a team was unable to generate a ‘field-goal percentage’ of ~64%, then a coach could shift strategic focus toward the generation of more than 40 ‘defensive rebounds’ to preserve a higher than chance probability of winning (Figure 1). The results of this study have demonstrated the utility of non-linear, machine learning techniques to explain patterns within multivariate datasets in sport science. The resolution of ‘field-goal percentage’, ‘defensive rebounds’, ‘steals’ and ‘turnovers’ in both linear and non-linear models emphases their importance for match outcome during these Olympic basketball tournaments. These findings were in general agreement with those reported for other basketball competitions with Gomez et al. (2008) demonstrating that ‘defensive rebounds’ were a key contributor to winning basketball matches in the elite Spanish Men’s Professional League. Specifically ‘defensive rebounds’ was the essential discriminatory variable for balanced (final score difference of ≤12 points) and unbalanced (final score difference of >12 points) matches (Gomez et al., 2008). This indicator represents a substantial defensive action that decreases the number of possessions and subsequent shots for the opposition while increasing the number of possessions and shot opportunities for the securing team (Sampaio and Janeira, 2003). Others have also identified ‘defensive rebounds’ as a crucial performance indicator for wins in junior (Ibanez et al., 2009; Lorenzo et al., 2010) and senior (Garcia et al., 2013; Gomez et al., 2014; Sampaio et al., 2006) competitions. However, to our knowledge, the current study was the first to identify ‘steals’ as a key performance indicator for winning in an elite basketball competition. This performance indicator again reflects a defensive action that decreases the number of possessions and subsequent shots for the opposition, while increasing the number of possessions and shot opportunities for the team in possession. Accordingly, in addition to identifying and then selecting players capable of accruing a high field goal accuracy, coaches and analysts should focus on devising defensive strategies (i.e. ‘defensive rebounds’ and ‘steals’) during their preparation of national basketball teams during the limited, pre-Olympic stage. Such preparations would likely lead to enhanced team possessions and the probability of match success during the Olympic Games. The current study has provided promising findings but was not without limitations that requires acknowledgement. Firstly, the results were de-limited to men’s basketball matches of the most recent Olympic Games. Others have identified significant gender differences in performance indicators for elite basketball matches (Gomez et al., 2014; Sampaio et al., 2004) with further examination of female elite competition warranted. Secondly, data was retrieved from the last four Olympic Games only with this sample examined to enhance the robustness of modelling performed. Future studies may clarify the capability of the identified performance indicators to explain match outcome in prospective basketball tournaments at the Olympic Games. Next, both regular and playoff tournament matches were included into the modelling with potential differences possible between stages of competition. Previously, style of play and subsequent team performance indicators for winning varied with stage of competition during a full season (Garcia et al., 2013). The impact of such stage differences though may be minimal during short-term tournaments where match victory may have greater impact on progression through the tournament and ultimate tournament success. Finally, measures of athlete physical performance (Robertson et al., 2015) were not examined presently in conjunction with team match performance indicators. Future work examining the physical activity profiles of teams may clarify the impact of accumulated fatigue during Olympic tournaments (i.e. multiple matches over short durations) on the relationship between performance indicators and match outcome. Such future work may integrate the use of microtechnology (e.g. local positioning systems) to enable ‘multidimensional’ insight into successful and less successful team profiles during elite basketball competition when coupled with the current results.
ConclusionsUsing both linear and non-linear analyses, this study showed that: 1) distinctive combinations of ‘field-goal percentage’, ‘defensive rebounds’, ‘turnovers’ and ‘steals’ were explanatory of match outcome during a men’s Olympic basketball tournament; and 2) the classification accuracy of both statistical approaches yielded similarity, however, the non-linear approach (CI classification tree) was likely to offer coaches and analysts with greater practical utility given the interactions between multiple team performance indicators. Such analyses may provide important guidance when devising training and game strategies to increase the probability of winning during a men’s Olympic basketball tournament.
ACKNOWLEDGEMENTS |
No authors declare conflicts of interest. |
|
AUTHOR BIOGRAPHY |
|
Anthony S. Leicht |
Employment: Exercise physiologist at James Cook University |
Degree: PhD |
Research interests: Cardiac autonomic control; physiological responses to exercise and sport; performance analysis of elite basketball players/referees; fitness assessment |
E-mail: Anthony.Leicht@jcu.edu.au |
|
|
Miguel A. Gómez |
Employment: Reader at the Faculty of Physical Activity and Sport Sciences (Technical University of Madrid) |
Degree: PhD |
Research interests: Performance analysis and notational Analysis in Team and dual sports; teaching Physical Education; Research Methods and statistical analysis in sport sciences |
E-mail: magor_2@yahoo.es |
|
|
Carl T. Woods |
Employment: Skill acquisition scientist at James Cook University |
Degree: PhD |
Research interests: Application of multidimensional design in talent identification, and talent development in team sport, performance analysis to inform practice; specificity and application of statistics and machine learning to sport science |
E-mail: carl.woods@jcu.edu.au |
|
|
|
REFERENCES |
Burnham K.P., Anderson D.R. (2002) Model selection and multimodel inference: A practical information-theoretic approach. , -.
|
Cohen J (1992) A power primer. Psychological Bulletin 112, 155-159.
|
Courel-Ibáñez J., McRobert A.P., Toro E.O., Vélez D.C. (2016) Inside pass predicts ball possession effectiveness in NBA basketball. International Journal of Performance Analysis in Sport 16, 711-725.
|
Garcia J., Ibanez S.J., De Santos R.M., Leite N., Sampaio J. (2013) Identifying basketball performance indicators in regular season and playoff games. Journal of Human Kinetics 36, 161-168.
|
Gomez M.A., Battaglia O., Lorenzo A., Lorenzo J., Jimenez S., Sampaio J. (2015a) Effectiveness during ball screens in elite basketball games. Journal of Sports Sciences 33, 1844-1852.
|
Gomez M.A., Lorenzo A., Ibanez S.J., Sampaio J. (2013) Ball possession effectiveness in men’s and women’s elite basketball according to situational variables in different game periods. Journal of Sports Sciences 31, 1578-1587.
|
Gomez M.A., Lorenzo A., Ortega E., Sampaio J., Ibanez S.J. (2009) Game related statistics discriminating between starters and nonstarters players in Women’s National Basketball Association League (WNBA). Journal of Sports Science & Medicine 8, 278-283.
|
Gomez M.A., Lorenzo A., Sampaio J., Ibanez S.J., Ortega E. (2008) Game-related statistics that discriminated winning and losing teams from the Spanish men’s professional basketball teams. Collegium Antropologicum 32, 451-456.
|
Gomez M.A., Moral J., Lago-Penas C. (2015b) Multivariate analysis of ball possessions effectiveness in elite futsal. Journal of Sports Sciences 33, 2173-2181.
|
Gomez M.A., Perez J., Molik B., Szyman R.J., Sampaio J. (2014) Performance analysis of elite men’s and women’s wheelchair basketball teams. Journal of Sports Sciences 32, 1066-1075.
|
Hoare D.G. (2000) Predicting success in junior elite basketball players--the contribution of anthropometic and physiological attributes. Journal of Science and Medicine in Sport 3, 391-405.
|
Hothorn T., Hornik K., Zeileis A. (2006) Unbiased recursive partitioning: A conditional inference framework. Journal of Computational and Graphical Statistics 15, 651-674.
|
Ibanez S.J., Garcia J., Feu S., Lorenzo A., Sampaio J. (2009) Effects of consecutive basketball games on the game-related statistics that discriminate winner and losing teams. Journal of Sports Science & Medicine 8, 458-462.
|
Jukic I., Milanovic D., Vuleta D., Bracic M. (2000) Evaluation of variables of shooting for a goal recorded during the 1997 European Basketball Champioship in Barcelona. Kinesiology 32, 51-62.
|
Koklu Y., Alemdaroglu U., Kocak F.U., Erol A.E., Findikoglu G. (2011) Comparison of chosen physical fitness characteristics of Turkish professional basketball players by division and playing position. Journal of Human Kinetics 30, 99-106.
|
Lorenzo A., Gomez M.A., Ortega E., Ibanez S.J., Sampaio J. (2010) Game related statistics which discriminate between winning and losing under-16 male basketball games. Journal of Sports Science & Medicine 9, 664-668.
|
Morgan S., Williams M.D., Barnes C. (2013) Applying decision tree induction for identification of important attributes in one-versus-one player interactions: a hockey exemplar. Journal of Sports Sciences 31, 1031-107.
|
Oliver D (2004) Basketball on paper : rules and tools for performance analysis. , -.
|
Robertson S., Woods C., Gastin P. (2015) Predicting higher selection in elite junior Australian Rules football: The influence of physical performance and anthropometric attributes. Journal of Science and Medicine in Sport 18, 601-606.
|
Robertson S.J., Joyce D.G. (2015) Informing in-season tactical periodisation in team sport: development of a match difficulty index for Super Rugby. Journal of Sports Sciences 33, 99-107.
|
Sampaio J., Godoy S.I., Feu S. (2004) Discriminative power of basketball game-related statistics by level of competition and sex. Perceptual and Motor Skills 99, 1231-1238.
|
Sampaio J., Ibanez S., Lorenzo A., Gomez M. (2006) Discriminative game-related statistics between basketball starters and nonstarters when related to team quality and game outcome. Perceptual and Motor Skills 103, 486-94.
|
Sampaio J., Janeira M. (2003) Statistical analyses of basketball team performance: understanding teams’ wins and losses according to a different index of ball possessions. International Journal of Performance Analysis in Sport 3, 40-49.
|
Scanlan A.T., Dascombe B.J., Reaburn P., Dalbo V.J. (2012) The physiological and activity demands experienced by Australian female basketball players during competition. Journal of Science and Medicine in Sport 15, 341-347.
|
Tessitore A., Tiberi M., Cortis C., Rapisarda E., Meeusen R., Capranica L. (2006) Aerobic-anaerobic profiles, heart rate and match analysis in old basketball players. Gerontology 52, 214-222.
|
Trninic S., Dizdar D., Luksic E. (2002) Differences between winning and defeated top quality basketball teams in final tournaments of European club championship. Collegium Antropologicum 26, 521-531.
|
Woods C.T., Sinclair W., Robertson S. () (In press) Explaining match outcome and ladder position in the National Rugby League using team performance indicators. Journal of Science and Medicine in Sport , -.
|
|
|
|
|
|
|