|
|
|
| |
| ABSTRACT |
|
Agility, characterized by rapid whole-body movement in response to external stimuli, is a key performance determinant in team sports owing to its reliance on perception-action coupling. Despite its importance, existing evidence on training effects is disproportionately focused on change-of-direction ability, overlooking the perceptual-cognitive demands of true agility. This systematic review therefore sought to quantify the effects of agility-specific training, examine moderators and establish dose-response model in team-sport players, while accounting for perception-action coupling. Following PRISMA 2020 guidelines, systematic searches were conducted in Web of Science, Scopus and PubMed for peer-reviewed English-language studies. Risk of bias was appraised using a modified Cochrane Collaboration’s tool and study quality was evaluated via a tailored PEDro scale. Effect size (ES) was calculated using Hedge’s g, and dependencies among multiple ESs within studies were addressed through three-level meta-analysis. Subgroup analyses and both linear and non-linear meta-regressions were performed to examine potential moderators and establish dose-response models. Twenty-eight studies met the inclusion criteria, of which 26 contributed 53 ESs to quantitative analyses. Agility training produced a large improvement in reactive agility test (RAT) performance (ES = 0.65, p < 0.01), a moderate improvement in pre-planned agility test (PAT) performance (ES = 0.55, p < 0.01), and a non-significant moderate effect in reaction test (RT) outcomes (ES = 0.52, p > 0.05). Training effects were moderated by participants’ characteristics, with junior athletes (ES = 0.77, p < 0.05) and national level athletes (ES = 0.80, p < 0.05) demonstrating greater responses. However, female and mix-gender samples were underrepresented, and the evidence base was dominated by studies in soccer and basketball. Dose-response modelling revealed a curvilinear relationship between training duration and RAT (QM = 11.64, p < 0.01), peaking at 7-8 weeks and a positive linear association between training frequency and RAT (β = 0.172, p < 0.01). No significant relationship was observed between session time and RAT (p > 0.05), although most positive ESs clustered around 20-25 minutes per training session. Agility training exerts a large overall effect on RAT performance in team-sport players, with outcomes moderated by age and training status. Interventions of 7-8 weeks delivered at higher frequency (> 3 times/week) with 20-25 minutes session duration are frequently associated with favorable adaptions. These recommendations, however, should be interpreted cautiously given the moderate-to-low certainty of evidence, high within-study variability, dominance of soccer and basketball samples and potential risk of publication bias. |
| Key words:
Agility, Team-sport, Perception-action coupling, Meta-analysis, Dose-response
|
Key
Points
- Agility training yields large improvements in RAT and medium effects on PAT among team-sport players, with no significant gains in reaction test (RT). Training effects are larger in junior and national-level athletes.
- Perception-action coupling is crucial in agility training. While practitioners can use visual stimulus for controlled agility training gain, using human stimulus is encouraged to provide more sport-specific stimulus.
- 7-8 weeks agility training at a frequency higher than 3 times per week and a session duration of 20-25 minutes is associated with favorable adaptations in team-sport players.
|
Agility, defined as a rapid whole-body movement involving changes in velocity or direction in response to a stimulus (Sheppard and Young, 2006), is a key determinant of team-sport performance (Chaalali et al., 2016; Lucia et al., 2023). In contrast to change-of-direction (COD), which reflects pre-planned locomotor tasks, agility inherently requires perception-action coupling (Young et al., 2021), integrating perceptual-cognitive processes such as anticipation and decision-making with motor execution. This distinction underscores why agility cannot be fully captured by COD measures, although COD remains an important foundational component of agility performance (Sheppard and Young, 2006). The term “COD”, “agility” and “reactive agility” were used interchangeably in agility reviews. However, according to the definition of agility, using the term “reactive” is redundant (Young et al., 2015). Moreover, studies were either evaluating COD ability (Asadi et al., 2017; Forster et al., 2022; Sun et al., 2025), or validity of agility tests (Morral-Yepes et al., 2022), incorporating outcomes such as T-test, Illinois test or pro-agility. While trainability of agility was qualitatively discussed (Thieschäfer and Büsch, 2022), quantitative evidence of aggregated agility training outcomes in team sports remained absent. As suggested by Young et al., (2021), findings on COD should not be extrapolated to on-field agility due to distinct perception-action coupling differences. Furthermore, these reviews often did not account for hierarchical structure of data (Asadi et al., 2017; Sun et al., 2025), such as multiple effect sizes nested within studies, which can bias pooled estimates (Van den Noortgate et al., 2013). However, extracting only one effect size per study can also yield biased estimates. In addition, existing reviews have drawn heavily from soccer (Sun et al., 2025) and basketball (Zhang et al., 2025), with relatively fewer studies addressing other team sports, thereby limiting generalizability. A further limitation concerns the absence of dose-response modelling using linear and non-linear regression methods to identify optimal training parameters such as duration, frequency and session length. This is particularly important for practitioners, as understanding dose-response relationships enables coaches to prescribe training that maximizes adaptation while avoid overtraining. Other Factors such as participant characteristics (e.g. age, sex and competitive level) and stimulus type (e.g. light, video and human) have not been systematically explored in team sports. Therefore, the objectives of this systematic review were to :
- Quantify the effects of agility-specific training on reactive agility performance among team-sport players, with particular emphasis on perception-action coupling.
- Examine moderating factors including participants’ characteristics, training modalities and stimulus type, to provide tailored insights into athlete responsiveness.
- Establish dose-response models to identify potential thresholds of training duration, frequency and session length that optimize agility development.
By addressing these objectives, this review aims to advance the evidence base on agility training and provide practical guidance for coaches and practitioners in designing valid training interventions.
Registration of systematic protocolThis systematic review followed the Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) 2020 guidelines (Page et al., 2021) and was registered in PROSPERO (CRD420251060149). Registration occurred after article retrieval but before data extraction and analysis. Although this timing may increase the risk of selective reporting, analyses were conducted according to a pre-specified protocol to mitigate bias. Deviation from registered protocol was given in Supplementary Table S13.
Search strategySystematic searches were conducted on 20th April 2025 in Web of Science (WOS), Scopus and PubMed, with updates on 4th May 2025. Boolean search strings were developed iteratively from preliminary scoping, where key articles were identified (Sheppard and Young, 2006; Young et al., 2021; Young et al., 2015). Complete search strategies are provided in full for all databases in Table S1. Identical syntax was adapted across platforms to ensure comparability of retrieval. In addition, backward and forward citation tracking was performed in Google Scholar. Records were imported into Endnote (Version 21; Clarivate, Philadelphia, PA, USA) and deduplicated prior to screening. A sample search strategy in WOS is shown below: #1: ALL = (agility OR quickness OR change-of-direction OR reactive OR plyometric OR neuromuscular OR stimulu* OR SSG OR small-sided-game OR dual-task OR cognitive OR perceptual) AND (training OR intervention OR program*) #2: ALL = (athlete OR player OR recreational OR elite OR professional OR trained) AND (soccer OR football OR rugby OR basketball OR volleyball OR team-sport) #3: #1 AND #2
Study selection and eligibility criteriaInitial study retrieval was conducted by one author (Z.Z.). After identification and screening of articles, during the eligibility phase, two authors (Y.L. and Z.Z) reviewed the full texts based on the exclusion and inclusion criteria (Supplementary Table S2 and S2-1). When mutual agreement on studies cannot be achieved, decision was made in consultation with a third author (K.X.). All processes were conducted in Endnote 21.
Data extractionData were extracted by two authors (Z.Z. and Y.L.) independently and validated by the third author (K.X.) after cross checked by the first two authors. Data extracted were organized into Excel tables (Microsoft Corporation, Redmond, WA, USA) for the following information: 1) study authors and year; 2) number of participants in each group, participants’ age, sex, training status, and types of sport; 3) intervention duration (weeks), frequency (times/week), session time (minutes/session), and types of stimulus; 4) agility performance outcomes including reactive agility test (RAT), pre-planned agility test (PAT), and reaction test (RT) using means and standard deviations (SD) of pre- and post- agility training in the experimental and control groups. When data were missing, the corresponding authors were contacted for raw data, if there was no response the study was excluded. Specifically, one study was excluded for no response (Martin-Niedecken et al., 2023).
Coding of studiesOutcomes were classified according to Sheppard & Young, (2006), agility measurement was coded into simple agility (tests with no temporal or spatial uncertainty), temporal agility (tests with temporal uncertainty but pre-planned movement), spatial agility (tests with spatial uncertainty, but timing of movement is pre-planned), and universal agility in which the test has both spatial and temporal uncertainty, restricting athletes from anticipating where or when to react. However, simple agility and temporal agility do not require movement change and perceptual response at the same time. Like shuttle run (simple agility) and 100m race (temporal agility). Therefore, these two were not included for RAT. Spatial agility and universal agility tests were included in RAT since they require external stimuli and movement simultaneously. In addition, we included the outcomes from PAT and RT as secondary outcome. Age was grouped based on Cobley’s classification (Cobley et al., 2009), as it reflects key developmental stages like pre-puberty, early puberty, late puberty/adolescence and adulthood. Participants were categorized into child (< 11 years), junior (11-14 years), adolescent (15-18 years) and senior (>18 years). Sex was coded into three categories: 1) male, where studies recruited male participants only; 2) female, where studies recruited female participants only and 3) mix, where studies recruited both male and female participants. Training status classification was made based on McKay et al., (2022). Tier 1 and Tier 2 classification were not included since they are not identified as athletes. Therefore, we categorized athletes as: 1) developmental: local-level representation, identified with a specific team-sport and training with a purpose to compete; 2) national level: team-sport players competing in national and/or state leagues/tournaments, NCAA second or third division athletes; 3) International level: team-sport players on a national team and NCAA first division athletes. For those studies that did not mention competitive level, but identified to a specific sport and trained, we categorized those participants into developmental level (Arede et al., 2021; Dunton et al., 2020; Engelbrecht et al., 2016; Hassan et al., 2022; Trecroci et al., 2016; T. Zwierko et al., 2024). Training duration was categorized into short term (< 6 weeks) and long term (≥ 6 weeks) using the median-split method (Ramirez-Campillo et al., 2022). Training frequency and duration per training session were also coded using the same method into low frequency (< 3 times/week) and high frequency (≥ 3 times/ week) and short session (< 25 minutes/session) and long session (≥ 25 minutes/session). Training load was coded into training volume and training intensity respectively. For training volume, data was coded based on the total sets/repetitions performed in running-based and SSG-based exercises respectively. While training intensity was coded according to heart rate. Although most studies (approximately 69.23% of the total sample) reported sets or repetitions performed (Supplementary Table S4
), the reporting varied greatly in unit, such as number of COD maneuvers (Chaouachi et al., 2014; Young and Rogers, 2014), soccer passes (Dunton et al., 2020; Hicheur et al., 2020), or sets duration (Vogt et al., 2018; Zago et al., 2016). The lack of standardized training volume reporting making it impractical to synthesize data. Also, only three studies reported training intensity quantitively using heart rate or rating of perceived exertion (Chaouachi et al., 2014; Friebe et al., 2024; Vogt et al., 2018). Therefore, coding of training load data was not employed. Stimuli type was coded into visual, visual-auditory and human stimuli modified from (Paul et al., 2016). We used these intercepts to better conclude various stimulus types as there are some verbal or sound instructions during agility training.
Quality evaluation and risk of bias evaluationRisk of bias was assessed using a modified version of the Cochrane Collaboration tool (Higgins et al., 2011). Blinding of participants is critical when results are highly subjective, such as pain (Higgins et al., 2011). However, in current research, results were mostly assessed subjectively. Therefore, we excluded this domain and incorporated additional items on test familiarization, measurement reliability and sample size estimation, to assure a holistic assessment of risk of bias. For familiarization of tests, as recommended by Paul et al., (2016), athletes should be appropriately familiarized with tests before commencing data collection. Therefore, studies were categorized into “low risk”, if study reported that participants were well familiarized with the test protocols or achieved stable performance with trivial detectable change before data collection; “unclear” if familiarization was allowed but degree of familiarity was unknown; “high risk” if no familiarization was allowed. For measurement bias, tests for agility are often modified to reflect on-field performance. Therefore, studies were categorized into “low risk” if two or more measurement reliability were reported (e.g. Intraclass correlation coefficient (ICC) and coefficient variation (CV)) (Currell and Jeukendrup, 2008); “unclear” if only one reliability was reported and “high risk” if no reliability was reported. For sample size bias, studies were categorized into “low risk”, if they reported prior sample size estimation with appropriate parameters (e.g. G*power using accurate estimates including α probability, power and effect size); “unclear” if they didn’t report sample size estimation with appropriate parameters or software used; “high risk” if they didn’t report sample size estimation and included less than 10 participants. Study quality was evaluated using a modified PEDro tailored for strength and conditioning (Brughelli et al., 2008), to assess the completeness of research methodology process and reporting. The scale includes 10 items (total score ranging from 0-20) with each item rated as: 0 = clearly no; 1 = maybe; and 2 = clearly yes. The items included: 1) inclusion criteria were clearly stated; 2) subjects were randomly allocated to groups; 3) intervention was clearly defined; 4) groups were tested for similarity at baseline; 5) use of a control group; 6) outcome variables were clearly defined; 7) assessments were practically useful; 8) duration of intervention practically useful; 9) between-group statistical analysis appropriate; 10) point measures of variability were provided. The modified version of quality and risk of bias assessment tools were discussed and agreed by the three authors (Z.Z., Y.L. and K.X.). Certainty of evidence was graded using GRADE (Grading of Recommendations Assessment, Development, and Evaluation) framework (Guyatt et al., 2008). The system is used to rate the quality of evidence in reviews and meta-analyses, offering structured assessments of our confidence that the true effect is near the estimated effect. GRADE categorizes the quality of evidence into four tiers: high, moderate, low and very low based on risk of bias, inconsistency, indirectness, imprecision and publication bias. The assessments were then carried out by the three authors independently, discrepancies after evaluation were discussed and resolved.
Statistical analysis
Data synthesis and effect measuresIn this study, between-group comparisons were made to calculate effect sizes (ES). Given the typically small sample size in sport science, Hedge’s g ES was utilized for its bias correction using means and SDs from experimental and control group (Hedges and Olkin, 2014). Specific formulas used are as follows:
Where M stands for mean, SD stands for standard deviation and n stands for sample size, and the subscripts 1 and 2 represent experimental and control group, respectively. Additionally, when standard errors (SE) were presented in studies, they were converted into SDs using SD=SE× Magnitude of ESs were categorized into small (<0.2), medium (0.2-0.5), and large (>0.5) (Cohen, 2013). I2 and Q tests were utilized to evaluate heterogeneity, with I2 of 25, 50 and 75% indicating low, moderate and high heterogeneity, respectively. The Q tests were considered significant at p < 0.1 (Xu et al., 2025). The performance evaluated in time units was multiplied by -1 to calculate the effect size in the analysis, in which the improvement of performance ES was adjusted to reflect a positive enhancement.
Three-level meta-analysisAgility research often reported various performance outcomes (Chaouachi et al., 2014; Friebe et al., 2024), and assessments at multiple time points (Born et al., 2016; Zago et al., 2016). Different ESs within single study are correlated and including them simultaneously would violate the independence assumption of traditional meta-analysis, which leads to overlapped information and in turn potentially biased outcomes (Kadlec et al., 2023). However, including only one ES for each study may seem too conservative to reflect the true estimates (Van den Noortgate et al., 2013). Therefore, we utilized a three-level meta-analysis model following the methods of Xu et al., (2025), with analysis carried out using the open-source code provided by the same authors to account for dependence between effect sizes. It is assumed that multiple assessments and comparisons from the same study were nested within studies, therefore we decomposed the variance of the observed effect size into sampling variance, within-study variance (level 2) and between study variance (level 3) to account for within-study correlations (Cheung, 2019). The model was used to fit the hierarchical structure of the data (e.g. effect sizes are nested within studies), so that multiple ESs from each study add up to the statistical power and provide estimates closer to the true ES (Assink and Wibbelink, 2016). The following equation will be used in three-level model:

Here, represents the estimated true effect size for i-th effect within the j-th study. The term μ denotes the overall pooled effect size. The random components ζ(2)ij and ζ(3)ij stand for within-study and between study variance respectively, while ϵij represents the sampling variance associated with each effect size. Accordingly, the observed effect size can be expressed as the sum of the pooled effect size, the variance components at each level and the sampling error. Cluster-robust variance estimation method with small-sample adjustment to adjust the within-study standard errors for correlations between ESs was utilized for sensitivity analysis (Hedges et al., 2010). Within-study correlation was set at 0.6 to reflect moderate dependence, consistent with previous multilevel syntheses (Xu et al., 2025), sensitivity analyses at 0.4 and 0.8 confirmed stability of results (Supplementary Table S10). In addition, results did not change after performing small sample size adjustment. The analyses were carried out using the restricted maximum-likelihood (REML) method as recommended by Harrer et al., (2021), and cross-verified using maximum-likelihood (ML). Three-level meta-analysis model was selected based on goodness-of-fit information (e.g. Akaike information criterion). Three-level meta-analysis was conducted using the metafor package in R (version 4.5.0; R core team, Vienna, Austria) (Viechtbauer, 2010).
Subgroup analysis and meta-regressionSubgroup analyses were conducted based on participants’ characteristics (i.e. age, sex, training status and sport type) and training modalities (i.e. training period, frequency, time per session, and stimulus type) to explore the moderating effect on agility performance. Subgroups analyses were conducted if the number of ESs within a subgroup was greater than 10 (k ≥ 10) to avoid biased outcomes. Subgroups analysis results will be kept descriptive and detailed analyses of moderators will be explored using dose-response modelling (meta-regression). Linear and nonlinear relationships of different moderators and performance outcomes were explored using meta-regression models including simple linear, cubic polynomial, restricted cubic spline, natural cubic spline and thin plate spline models (Xu et al., 2025). The fitting model was than selected based on goodness-of-fit and practical considerations. Linear model reflected better fit (e.g. lower Akaike information criterion) for training frequency and RAT performance, but results can be misleading due to the complex nature of agility in team-sport. Therefore, non-linear model with a better fit was conducted to explore if potential non-linear relationship exists. Except for training frequency, non-linear models all have the better fit than linear model and can better reflect the complex nature of agility intervention. Therefore, non-linear models were used for meta-regression analyses.
Publication bias and sensitivity analysisTo assess potential publication bias or small-study effects in the presence of dependent ESs, we employed a multilevel extension of Egger’s test, which models asymmetry in a three-level framework (Fernández-Castilla et al., 2021). The model extends the traditional two-level Egger’s regression to account for sampling error (level 1), within-study variance (level 2) and between study variance (level 3), aligning with three-level structure in the current meta-analysis. And it has been proved to have adequate false-positive error control and a slight power advantage over robust variance estimation alternatives (Rodgers and Pustejovsky, 2021). Subsequently, Hat, Cook’s distance and studentized residuals were employed for leverage, outliers, and influential case diagnosis at level 2 and level 3 respectively. Studies were flagged if their Hat and Cook’s distance values were greater than three times their respective mean, which were further confirmed if they had an absolute Studentized residual value greater than 3. Then the three-level model was repeated after excluding the influential studies. Additionally, each study was excluded at level 2 and level 3 to determine the impact of individual study on overall effect size (Harrer et al., 2021). Baseline difference for all interventions were calculated using unpaired t-test to ensure stability. Subsequently, sensitivity analysis was conducted by excluding significantly different baseline data. If no significant change in results was observed, the models were retained.
Search resultsThe initial article retrieval yielded 11,840 records (PubMed = 1490, WOS = 5918, Scopus = 4432). After removal of duplicates, 8362 titles/abstracts were screened. After which, 361 records were identified as potential articles for full-text screening based on our pre-specified exclusion criteria. Reasons for full-text exclusion included:
- Isolated physical or perceptual training (n = 173);
- Absence of control group or non-randomized design (n = 28);
- Lack of reactive agility test results (n = 108);
- Population other than team-sport players (n = 23);
- Full-text not available (n = 5)
Therefore, a final 25 articles met the inclusion criteria. In addition, a snowballing method was used for secondary search, which identified 3 additional articles. Subsequently, per pre-specified criteria, two studies were excluded due to influential outlier status, leaving 26 studies contributing 53 ESs to quantitative analysis (Figure 1). Thus, 28 studies were included for systematic review, while 26 studies entered meta-analysis.
Study characteristicsA summary of included studies can be found in (Supplementary Table S3), a total of 755 participants aged between 10.31 and 35 years old were recruited. Among them, 2 studies (Lee et al., 2024; Mancini et al., 2024) recruited female subjects (43 subjects), 6 studies (Arede et al., 2021; Ehmann et al., 2022; Lucia et al., 2023; Lucia et al., 2024; Zwierko et al., 2023; M. Zwierko et al., 2024) recruited both male and female subjects (209 subjects) and 3 studies (Dunton et al., 2020; Hassan et al., 2022; Trecroci et al., 2016) didn’t report the gender of the subjects (75 subjects). Participants were all team-sport players, 14 studies recruited developmental athletes, 8 studies recruited national level athletes, and 6 studies recruited international level athletes. Training duration ranged from 1 to 22 weeks, training frequency ranged from 1 to 6 times per week, and time per session ranged from 12 to 60 minutes. Number of studies chose to use visual, visual-auditory, and human stimuli was 14, 5 and 7 respectively. Two studies (Hicheur et al., 2020; Lucia et al., 2024) utilized both visual and visual-auditory stimuli in their research. All included studies reported at least one indicator of reactive agility and/or open-skill performance.
Risk of bias and quality assessmentAll included studies were randomized controlled trials and mentioned about randomization (Figure 2). However, only 6 studies (Friebe et al., 2024; Gonzalo-Skok et al., 2023; Lucia et al., 2023; Mancini et al., 2024; Serpell et al., 2011; Trecroci et al., 2016) detailed the method of randomization, which were categorized as “low risk” for selection bias, while others were labeled as “unclear”. One study reported that assessor was blinded for condition assignment, which was rated as “low risk” for detection bias. And one study stating that assessors were not blinded was rated as “high risk”, while the rest were rated as “unclear”. Five studies that experienced attrition of subjects with no significant impact on results were rated as “unclear” for attrition bias, while others were rated as “low risk”. Nine studies clearly reported prespecified outcomes and were therefore rated as “low risk” for reporting bias, while others were rated as “unclear”. Three studies didn’t allow familiarization before data collection, which were rated as “high risk” for familiarization bias. (Born et al., 2016) was coded into “low risk” for reporting participants were well familiarized with test protocols, while the remaining studies were rated as “unclear”. Thirteen studies reporting ICC and CV for reliability were rated as “low risk” for measurement bias, and 3 studies reporting none were rated as “high risk”, while the rest were “unclear”. Six studies reported sample size calculation with appropriate parameters reported and were therefore categorized into “low risk”, while 3 studies with small sample size were categorized as “high risk”. The rest were categorized into “unclear”. The modified PEDro score of included studies ranged from 16 to 20 (Supplementary Table S5
), with an average score of 17.43. Although the overall risk of bias suggests “unclear” to “high risk” for the included studies, the PEDro score indicates that the methodological implementation was relatively complete.
Publication bias and Certainty assessmentPublication bias was evaluated using the multilevel Egger’s regression test along with contour-enhanced funnel plot, which revealed evidence of asymmetry for RAT (p < 0.01), while no evidence of asymmetry was detected for PAT and RT (p > 0.1). Reflecting on funnel plots (Figure 3), it can be inferred that PAT and RT were slightly asymmetric, while RAT was highly asymmetric. However, unlike typical risk of publication bias where studies with significant or positive results are more likely to be published. The RAT results included many non-significant or small-effect results. Therefore, the significant publication bias may stem from selective reporting of results, as most studies were unclear regarding reporting bias. The extended version of Egger’s test detects cumulative publications bias of each level, making it more sensitive to outcome reporting bias hidden in multiple outcomes than traditional model (Citkowicz and Vevea, 2017). Therefore, highly influential studies would significantly inflate the overall ESs. Further, sensitivity analyses were conducted using leave-one-out, cook’s distance and studentized residual value. The leave-one-out analysis flagged (Hassan et al., 2022) as influential study, changing the overall ES from 0.79 to 0.70. Cook’s distance identified Hassan et al., (2022) and Steff et al., (2024) as outliers with absolute cook’s distance value exceeding three times the mean value. After removing the outliers, the ES changed from 0.79 to 0.65. Additionally, outlier residual analysis flagged Hassan et al. (2022) with value greater than 4. Therefore, the two studies were excluded from further analysis to ensure stability. After excluding the influential studies, baseline difference tests were conducted with significantly different data being flagged for each outcome. The fitted models were then cross-verified excluding these flagged studies using REML and ML. After excluding the flagged studies, the pooled ES for RAT remained at the same level of 0.65 (p < 0.01) without significant variance component change. The pooled ES for RT experienced trivial change from 0.52 (p > 0.05) to 0.51 (p > 0.05), while the pooled ES for PAT change significantly from 0.55 to 0.70 (p < 0.01) after excluding flagged studies. We then further verified the results using change score from PAT data, the pooled results remained as 0.55 (p < 0.01).Therefore, after sensitivity analysis only two studies from RAT results were excluded from further analyses. Pre-test baseline differences for PAT and RT were given in Supplementary Supplementary Table S6, Supplementary Table S7 and Supplementary Table S8.
Certainty of evidence (Supplementary Table S12) was moderate for main outcome RAT, mainly owing to the inconsistent measures of RAT such as different angles, possible exit paths, and number of COD during tests. The overall certainty of evidence for secondary outcomes was low, mainly owing to inconsistent measurements and risk of publication bias.
Main effectTwenty-eight studies provided 136 ESs in total, and the number was reduced to 59 ESs after applying RAT. After excluding highly influential studies, there were 53 ESs and 26 studies in total. Comparison of traditional meta-analysis showed that three-level meta-analysis had lower AIC, BIC, AICc (p < 0.01) for RAT and PAT but not for RT (Supplementary Table S9). Therefore, a three-level meta-analysis is necessary and should be utilized for further analyses in RAT and PAT. In addition, the baseline test revealed four pairs of data with significant difference. After conducted sensitivity analysis the overall ES only changed trivially (Supplementary Table S10 and Supplementary Table S11), proving the stability of analysis results. The main effect of agility training on RAT was large (ES = 0.65, 95% CI: 0.42-0.87, Q(52) = 180.13, p < 0.01, I2-level 2 = 61.23%, I2-level 3 = 0%, total I2 = 61.23%, GRADE: moderate), indicating a significant improvement on RAT with moderate within-study heterogeneity and negligible between-study heterogeneity (Figure S4 ). However, it can be a consequence of low sample size (26 studies) and masking effect of high within-study heterogeneity due to inconsistent methodologies and participants’ characteristics (Choi and Kang, 2025). Effects on RT (ES = 0.52, p > 0.05, GRADE: very low), and PAT (ES = 0.55, p < 0.01, GRADE: low) were medium (Figure S1, Figure S2 and Figure S3). The results revealed improvements across all outcome dimensions except for RT with p value greater than 0.05. Subgroup analyses were conducted when effect sizes clustered were greater than 10 according to pre-specified criteria (Figure 4). Although the effects on RAT were large across all age groups, analysis indicated that effects on adolescent (ES = 0.73, CI: 0.49-0.97, p < 0.05) and junior players (ES = 0.77, CI: 0.44-1.09, p < 0.05) are larger than senior players (ES = 0.62, CI: 0.27-0.97, p < 0.05). Although similar improvements were observed in adolescent and junior players, the results should be interpreted with caution due to imbalance sample size across strata. Effects on RAT were larger in studies incorporating both genders (ES = 0.71, CI: 0.25-1.18, p < 0.05) than those incorporating only male (ES = 0.64, CI: 0.45-0.83, p < 0.05). However, it should be noted that female participants were underrepresented in the pooled results. Therefore, results regarding this subgroup are highly preliminary. For training status, effects on national level athletes (ES = 0.80, CI: 0.55-1.06, p < 0.05) were greater than developmental athletes (ES = 0.58, CI: 0.34-0.82, p < 0.05). And interventions that used visual stimulus produced larger ES (ES = 0.85, CI: 0.60-1.11, p < 0.05) than human or visual-auditory stimulus, but it should be noted that pooled results were obtained from a relatively small sample size and imbalance exits between different modalities. Effect sizes ranged from 0.56 (CI: 0.21-0.90, p < 0.05) to 0.82 (CI: 0.57-1.08, p < 0.05) across all subgroups. Specifically, interventions that involve visual stimulus with longer intervention (≥ 6 weeks), higher frequency (≥ 3 times/week) and short session time (< 25 minutes/session) had larger ESs.
Linear and non-linear meta-regressionLinear regression showed non-significant relationship between training duration (β = -0.009, p = 0.596, Figure 5), session time (β = - 0.009, p = 0.376, Figure 5) and RAT. Analyses revealed significant positive linear relationship between training frequency (β = 0.172, p < 0.01, Figure 5) and RAT. Subsequently, linear regression revealed non-significant relationships between all training modalities and PAT or RT with p value greater than 0.05 (Figure S5 ). Except for relationship between training frequency and RAT, the linear regression model was implausible due to the goodness of fit. Therefore, nonlinear regression models were fitted and cross-verified. Based on goodness-of-fit, cubic polynomial model fitted best for relationship between training weeks and RAT, simple linear model fitted training frequency best and restricted cubic spline model fitted session time best. We fitted each training modality with their best-fitted model separately, additionally, we fitted training frequency into cubic polynomial model to see if potential non-linear relationship exists. Non-linear regression revealed a significant wavy relationship between training duration and RAT (QM = 11.64, p < 0.01, Figure 5). Training duration peaked at 7.58 weeks, the minimum significant point occurred at 2.06 weeks and the maximum point occurred at 22 weeks. Restricted cubic spline model yielded non-significant relationship between session time and RAT performance (QM = 3.17, p = 0.37, Figure 5). But positive significant ESs were clustered around 20-30 minutes per session. The analyses detected a near linear relationship between training frequency and RAT (QM = 9.63, p < 0.05, Figure 5). The maximum significant point and the peak point of training frequency overlapped at 6 times per week, while the minimum point occurred at 1 time per week. Therefore, linear model was used for further interpretation.
The main aims of this review were to investigate the effects of agility training on agility performance among team-sport athletes, and to identify the optimal training parameters. Our findings indicate that agility training yields large overall effects on RAT performance, with moderate effects on PAT and non-significant effects on RT. Although participant characteristics and training modalities have potential moderating effects on agility training gains, the ecological validity varied among test protocols. Therefore, the overall and moderating effects should be interpreted in sport-specific context. Evidence certainty was moderate for RAT, low for PAT and very low for RT, and the results were characterized by considerable within-study heterogeneity. Non-linear analyses suggested that agility performance improvements play plateau beyond approximately 7-8 weeks of training, with higher frequencies and moderate session durations (20-25 minutes) showing favorable trends. These relationships are indicative rather than prescriptive, as the available data preclude precise model-based determination of optimal parameters.
The contribution of perception-action couplingPrevious research has demonstrated the importance of perceptual-cognitive factors in agility performance. While basic cognitive functions like simple reaction time can facilitate agility performance (Pojskic et al., 2018; Sekulic et al., 2019), higher order cognitive (executive) functions, encompassing working-memory, inhibitory control and cognitive flexibility are more crucial (Diamond, 2013). Working memory involves holding, manipulating and refreshing information in dynamic game context, such as monitoring teammates’ positions and predicting opponents’ actions. Inhibitory control is the ability to suppress automatic or irrelevant responses to focus on task-relevant information, enabling players to filter distractions like noise from spectators. While cognitive flexibility entails the ability to switch between tasks or to adapt to changing conditions, such as decision on a COD running during the transition between attack and defense. Agility differs from COD ability in that it requires continuous integration of perceptual operations with motor execution (Young et al., 2015). Team-sport players are constantly challenged by simultaneous perceptual tasks (e.g. anticipation and decision-making) and motor execution (e.g. dribbling or running) (Jordet et al., 2020), highlighting the need for perception-action coupling during training (Young et al., 2021). Evidence that knee mechanics and decision processes diverge between reactive and pre-planned tasks further underscores the need for representative practice to maintain perception-action coupling (Brown et al., 2014). The results confirmed that training programs integrating perception-action coupling, such as COD training with external stimulus (Born et al., 2016; Chaalali et al., 2016; Zouhal et al., 2019), SSGs (Neag et al., 2025; Zago et al., 2016), paired drills and mirror games (Engelbrecht et al., 2016), and skill practice with external stimulus (Hicheur et al., 2020; Vogt et al., 2018), yielded greater ESs in RAT than PAT or RT. To better interpret this, it’s necessary to investigate the underlying mechanisms that facilitates this transfer of training. The effectiveness of perception-action coupled training can be understood through several interacting cognitive and neurophysiological mechanisms. A potential pathway might be the dual-process theory, which conceptualized skill acquisition as a gradual shift from controlled, deliberative operations (Type 2 processing) to automatic, intuitive responses (Type 1 processing) (Furley et al., 2015). Repeated exposure to representative perceptual-motor contexts accelerates the transfer from Type 2 to Type 1 processing, thereby, reducing the reliance on working memory and enabling faster, more automated responses (Anderson, 1982). Consequently, the adaptation explains why training tasks that integrate visual-spatial perception and motor execution, such as SSGs and reactive COD training, are more effective in improving RAT performance. In conclusion, perception-action coupling is fundamental in agility training among team-sport players, as it ensures the seamless integration of perceptual-cognitive factors and motor processes, promoting adaptive performance in complex scenarios.
Moderators and dose-response modellingModerators of agility training
This study identified several moderators of agility training including participant characteristics and training modalities. Among these, age (maturation) is a prominent topic in agility training (Horníková et al., 2025; Thieschäfer and Büsch, 2022). Although subgroup analysis revealed that adolescent and junior players had similar training gains, there might be confounding factors underlying. Research indicated that maturation rather than chronological age, influences agility training gains with COD ability culminating before adolescence (around 14 years old) (Thieschäfer and Büsch, 2022). Results from 8 studies with a longitudinal design (Arede et al., 2021; Bekris et al., 2023; Ehmann et al., 2022; Gonzalo-Skok et al., 2023; Klatt et al., 2025; Neag et al., 2025; Trecroci et al., 2016; Vogt et al., 2018; Zago et al., 2016) in the current analysis aligned with previous research showing that younger participants experienced the largest improvement (Thieschäfer & Büsch, 2022). However, only one study reported maturity offset among youth soccer players (Trecroci et al., 2016), showing that agility training introduced pronounced effect (ES = 0.8, p < 0.05) on experimental group with small effect size (ES = 0.2) in control group regarding RAT. Rapidly developing physical abilities may contribute to observed large effect size (Di Mascio et al., 2020), which explains the pronounced ES in subgroup junior (11-14 years old). However, pronounced agility training gains are also observed after the above-mentioned culminating of COD ability (Pojskic et al., 2018). This may stem from improved perceptual-cognitive factors, which are more correlated with agility performance in middle adolescence than early adolescence (Horníková et al., 2025). Therefore, the moderator of age suggests different emphases in agility training for various maturation groups with physical factors being more important before adolescence and perceptual-cognitive factors being more important after. However, we are not suggesting training components of agility separately, as perception-action coupling is vital in agility performance (Young et al., 2021). Nevertheless, more studies with maturity status are needed to further validate the result. Sex revealed trivial moderating influence on agility training gains, but only two studies recruited female participants in their experiments (Lee et al., 2024; Mancini et al., 2024), and six studies recruited both genders in their studies. Although subgroup analysis indicated that studies with both genders experienced similar large ES with studies recruiting male participants, the results had limited guidance on real-world practice due to insufficient data. Previous research demonstrated that untrained team-sport athletes benefit more from various training methods than highly trained ones regarding COD speed (Forster et al., 2022; Nygaard Falch et al., 2019), which illustrated the contribution of physical training to COD speed and the moderating effects of training status. However, the moderating effects on RAT performance remain underexplored. The current analysis found that developmental team-sport players experienced less pronounced responses than national level players regarding RAT performance. Subgroup analysis revealed that overall ES for national level subjects (ES = 0.8, CI: 0.55-1.06, p < 0.05) was larger than that for developmental participants (ES = 0.58, CI: 0.34-0.82, p < 0.05). This was cross-verified by independent studies, showing that agility training produced significant group differences among elite participants in RAT performance (p < 0.05) (Zouhal et al., 2019), while no significant group effects (p = 0.26) were found among developmental participants. Possible mechanisms underlying this may be: physical factors dominate agility performance in developmental athletes, but their influence diminishes with advancing age and training status, while perceptual-cognitive factors become more prominent (Thieschäfer & Büsch, 2022). Therefore, elite athletes who possess greater match experience and better perceptual-cognitive abilities such as decision-making and anticipation can have superior transfer from training to RAT performance. The study also explored the effects of various stimuli on RAT performance. Subgroup analysis revealed that using visual stimuli can generate very large improvement (ES = 0.81, CI: 0.54-1.07, p < 0.05) regarding RAT performance, while the second largest ES was observed in human stimuli (ES = 0.68, CI: 0.38-0.98, p < 0.05). However, the results should be interpreted with caution owing to contradiction to previous research suggesting using human and life-sized video as stimulus in agility training (Paul et al., 2016), possibly reflecting imbalanced samples with more visual-stimulus interventions, which will be discussed in Section 4.3. Although moderators were explored using subgroup data, the moderate within-study heterogeneity suggest inconsistent internal study design, such as varied number of CODs, angles, entry and exit speed in RATs. Moreover, many studies incorporated less than 10 participants per group (Engelbrecht et al., 2016; Hassan et al., 2022; Hicheur et al., 2020; Lee et al., 2024; Serpell et al., 2011), which may cause unstable within-study and pooled ESs. Previous sensitivity analysis revealed that Hassan et al., (2022) and Steff et al., (2024) as highly influential studies, which changed the pooled estimates from 0.79 to 0.65. Therefore, studies with inconsistent internal study design and small sample size may potentially lead to inflated pooled estimates. Additionally, most samples were gained from soccer and basketball, the results remained an indicative estimate rather than a one-size-fits-all guidance. And interpretation of results should not be extrapolated to team-sport broadly.
Dose-response modelling of agility trainingThe present meta-regression analysis elucidated dose-response relationships between key training modalities (training duration, frequency and session time) and improvements in agility performance among team-sport athletes. Findings revealed a non-linear pattern between training duration and RAT as well as a linear pattern between training frequency and RAT, highlighting the potential threshold of agility training. For training duration, the wavy non-linear regression indicated a peak ES at 7.58 weeks with a minimum significant ES after 2 weeks, suggesting an initial rapid adaptation phase before 8 weeks followed by a dip and a later rising at 18 weeks. Since agility performance requires “perception-action coupling” (Young et al., 2021), where movement and stimulus influence each other, training gains can be amplified through this “coupled” training. Therefore, resulting in primary ascending phase of agility performance. Especially for initial neural adaptation. As illustrated by Klatt et al., (2025), participants experienced significant improvement in reaction time (p < 0.05) and RAT (p < 0.05) after 6 weeks of agility training. In addition, perceptual-cognitive abilities develop rapidly during adolescence (Legault et al., 2022), making maturation a confounding factor contributing to rapid RAT performance improvement. The later stagnant and diminishing phase may stem from plateau effects. It was suggested that training monotony can result in training plateau for physical ability like explosive lower limb power (Oliveira et al., 2025), which is important in agility performance (Sheppard and Young, 2006). However, this can be solved by using short intense training mesocycle, which lowers the chance of training monotony and plateau of adaptation (Forster et al., 2022). Therefore, diverse training design can mitigate the potential plateau effects after 8 weeks of agility training. And only one study with excessive long training duration appeared in the later returning phase (Zago et al., 2016). The observed agility training gains may stem from progressive loading and diverse drill formats (Zago et al., 2016), and progressive loading was proven effective in COD ability among various population (Thompson et al., 2017; Zhou et al., 2024). Therefore, preventing training monotony and establishing progressive loading can assist agility training gains in excessive long training duration. However, more studies with longer intervention duration are warranted to validate the mechanism. There was no significant relationship between session time and RAT based on linear and non-linear regression results. But reflecting on the non-linear plot for session time, it can be inferred that most large significant ESs clustered around 20-30 minutes per session. And previous subgroup analysis showed that shorter session time (< 25 minutes) yielded larger responses. Therefore, it can be concluded that a session time between 20-25 minutes is plausible in agility training. Possible mechanism for smaller ESs in longer session time may stem from mental fatigue. Mental fatigue is a psychobiological state caused by excessive long period of demanding cognitive activity (Marcora et al., 2009), which has been proved harmful in sport-specific skills in Australian Football (Weerakkody et al., 2021), soccer (Smith et al., 2016) and cricket (Veness et al., 2017). While in agility training, mental fatigue is inevitable for exercises are performed in response to a stimulus. Therefore, to avoid detrimental effects caused by prolonged training session, agility training should be conducted before fatigue. However, a session time between 20-25 minutes should be interpreted with caution due to relatively high heterogeneity and low certainty of evidence. Moreover, evidence was aggregated mostly from soccer and basketball, the results may not be universally applicable in team-sport. In contrast, the positive “linear” relationship for training frequency indicated that higher frequencies may have favorable adaptative RAT performance. Although most studies clustered at 2-3 times/week, maximum ES was achieved at 6 times/week. The result along with relationship between training duration, session time and RAT performance indicated that repeated exposure to sport-specific stimuli in agility training without excessive fatigue for a suitable duration can appropriately enhance RAT performance in team-sport players. Study also revealed similar results regarding training frequency and COD ability with a positive relation of r = 0.436 (p < 0.05) (Asadi et al., 2017). However, this study only reflected the physical aspect of agility performance, the perceptual-cognitive factors remain underexplored. It should also be noted that training frequency should be adjusted based on volume and intensity to avoid injury and over-training (Bourdon et al., 2017). Therefore, future studies should investigate the influence of training frequency on RAT performance using both short and long session time while controlling training volume and intensity to validate the results. In conclusion, 7-8 weeks interventions delivered at higher frequency with approximately 20-25 minutes session time are associated with favorable adaptations. However, monitoring volume and intensity is critical to avoid injury or over-training. Especially when the data regarding training volume and intensity was absent from many studies, scrutiny monitoring (e.g. total distance covered, hear rate, rate of perceived exertion etc.) should accompany training programs. But the results should be interpreted with caution due to the dominance of soccer and basketball samples.
Revisiting the importance of stimuliIn the current research, results indicated that using visual stimuli in agility training was associated with favorable performance outcome (ES = 0.81, p < 0.05). However, the results need further discussion due to the relatively small sample size, imbalance across modalities and contradiction to prior theory. Team-sport players are rarely required to react to light or video stimulus and most decisions are made based on teammates or opponents’ actions. But most included studies were conducting training and testing using visual stimuli (light and video), except for (Chaalali et al., 2016; Chaouachi et al., 2014; Engelbrecht et al., 2016; Lee et al., 2024; Neag et al., 2025; Young and Rogers, 2014; Zago et al., 2016; Zouhal et al., 2019). Although training with these stimuli may improve RAT performance among team-sport athletes, the transferability on on-field performance remains skeptical. Because training stimuli should match the demands of the sports being trained for (Ramirez-Campillo et al., 2023), and light and video stimulus are rare in game context. The widely used light and video stimulus in agility training and testing may stem from its high level of repeatability. For such programmed stimulus can be generated at the same time on each occasion (Paul et al., 2016), making it replicable. And the protocol was used for both training and testing, therefore, when participants become more familiar with the system, the performance in tests would potentially increases. However, in game context, athletes can anticipate opponents’ movements based on their experience (Roca et al., 2012), enabling them to make faster decisions, which is an important factor to agility performance (Sheppard and Young, 2006). While using light stimulus restricts anticipation, the stimulus is training and testing the speed of decision-making independent of game context in team-sport athletes. Thus, training agility with sport-specific context is highly recommended (Young et al., 2021). But variability of using human stimulus is also a major concern. To fix the problem, one study tried a novel agility training method in soccer using human stimulus (Chaalali et al., 2016). By touching the ball with both feet, the coach consistently provides at most four possible COD running. Similarly, one study used chasing runs and “mirror” drills in agility training (Trecroci et al., 2016), controlling the variability of using human stimulus. The methods provided players with live interaction and information for anticipation, greatly improving sport-specific stimulus in agility training. Also, agility training gains in these studies were both significant. Subsequently, human stimuli were also utilized in RAT. Attempt was made using a tester to initiate four possible scenarios based on passing movements in soccer and recorded total time, tester time, response time, decision-making time and movement time respectively (Altmann et al., 2022). The results provided valuable information on player development with good overall validity (ICC = 0.82, p < 0.01). And the ability of discriminating high-level and low-level players were addressed by a similar test with four possible scenarios in Australian Football (Veale et al., 2010), which correctly classified 75% of the participants. A systematic review also recommended the use of human stimulus in agility studies (Paul et al., 2016), highlighting repeatability of coaches and testers. Therefore, incorporating human stimuli in agility training and testing can provide better game context and discriminability. However, we’re not discouraging the utilization of visual stimuli in agility training. Since advanced technologies such as BATAK pro (Arede et al., 2021), digital screen (Serpell et al., 2011), and Speedcourt (Born et al., 2016) etc. have made using visual stimuli more systematic and reliable. It’s more accessible for coaches and practitioners to utilize visual stimuli in agility training. And live-sized video is preferable when using 2-D visual stimulus (Paul et al., 2016). A recent research suggested that decision-making is equally important as anticipation(Williams et al., 2023), where video-based techniques were used to validate the theory. Therefore, we suggest that coaches can utilize visual stimuli (preferably live-sized video) to training decision-making speed to positively influence agility performance at off-season or preparation stage of training. While human stimuli are encouraged during in-season period for better sport-specific stimulus and anticipation involved training.
Retention of agility trainingResearch suggested that agility performance plateaus around adolescence and have distinct difference between early and middle adolescence (Horníková et al., 2025). Therefore, implementing agility training before this key stage is of great importance. However, dual-career limitations such as balancing training and education for young players (Pink et al., 2018), injury and off-season period may prevent them from performing their regular habitual exercises, which significantly lower their agility training adherence. Therefore, it’s pivotal to examine the retention of agility training-induced effects to better inform training schedule. However, most included studies didn’t perform retention test except for two studies (Engelbrecht et al., 2016; Zwierko et al., 2023). Investigation on the retention effects of agility training in rugby players using human stimulus revealed that RAT performance was maintained after a 6-week period without receiving the training stimulus (Engelbrecht et al., 2016). Another study also investigated the retention effects of agility training in volleyball players using stroboscopic glasses (Zwierko et al., 2023). Anecdotally, results showed that RAT performance was maintained in control group rather than experimental group after 4-week period without training. This may suggest that retention of agility training using stroboscopic glasses is trivial. Moreover, there is a large body of evidence examining the retention of physical components of agility such as power and strength (Chaouachi et al., 2019), sprint performance (Padrón-Cabo et al., 2025), and dynamic strength (Ingle et al., 2006). While the retention of perceptual-cognitive components of agility and agility performance is under-represented. Overall, existing evidence suggest agility training gains among team-sport players may be partially maintained. But the retention effects were inconsistent and under-represented, which limits the information on proper scheduling of training sessions. More research is needed to validate the retention of agility training gains, especially the retention of perceptual-cognitive components of agility using different stimulus.
Limitations and Methodological ConsiderationsWhile this systematic review with meta-analysis provides evidence on agility training effects and prescriptions, several limitations warrant consideration. First, training volume and intensity were not coded owing to inconsistent reporting in different agility training modalities such as SSGs, speed agility and quickness (SAQ) training, and stroboscopic training etc. Future studies should seek to explore the standardized way to monitor agility training load (e.g. using rate of perceived exertion or heart rate for internal load and using global positioning systems for external load). Second, subgroup analyses for moderators like sex and training status were constrained by small study numbers and the underrepresentation of female subjects, limiting information on targeted population. Third, although the moderator “age” revealed that adolescent team-sport players benefit more from agility training, maturation was not thoroughly considered due to the insufficient data. Thus, it can only be referred that players around the pre-specified chronological age category may benefit more from agility training. Moreover, most evidence was gained from soccer and basketball, narrowing the scope of team-sport. Therefore, results should be applied cautiously, especially regarding team-sport players other than soccer and basketball or female subjects. Lastly, this meta-analysis only included interventions that incorporated exercises with perceptual-cognitive factors, partially overlooked potential research that train components of agility independently. For methodological consideration, the registration of this study took place after initial article retrieval but before data extraction. Which is allowed in PROSPERO registration but could introduce potential selective reporting. Ideally, registration should precede literature search. Additionally, although dual screening with a third reviewer resolving disagreements aligns with PRISMA 2020 guidelines, not reporting agreement metrics (e.g. Cohen’s kappa coefficient) could obscure the consistency of decisions. The heterogeneity across various RAT modalities narrows the applicability of the pooled results. To the best of our knowledge, there isn’t a consensus regarding the gold ecological standard of reactive agility testing. Therefore, we only included reactive agility tests modalities that are recognized as spatial agility or universal agility to improve the consistency of the RAT results. While our primary analysis provides evidence for large improvements in RAT performance (ES = 0.65), a key limitation involves the exclusion of two influential outliers influential (Hassan et al., 2022; Steff et al., 2024). These studies were excluded based on pre-specified criteria, including high Cook’s distance, and disproportionate contributions to heterogeneity (I2 from 61.23% to 77.43%). Methodologically, both studies exhibited risks such as small sample sizes, lack of randomization and blinding, and high test-training specificity using FITLIGHT system, which may have inflated their study ESs (1.11-3.56). Therefore, excluding them provide a more conservative but less biased pooled estimates, future studies should examine the large effects with rigorous designs to confirm their validity.
Recommendations for Future Agility StudiesTo advance agility research, future studies should address current gaps with targeted designs. Included studies had relative high risk of bias regarding familiarization of tests, reporting of reliability and sample size, which should be addressed by future studies for better practical guidance. Upon improving study design, reporting on maturation is warranted in agility studies for more specific exploration on possible “window of opportunity” for agility training among team-sport players. Subsequently, visual stimuli and human stimuli in agility training should be evaluated using on-field performance to better understand the contribution of different perceptual-cognitive factors. But proper training for tester should be provided to ensure consistent stimulus before implementing agility training. In addition, there is a call for agility research on female participants to better inform the gender difference in agility training gains. Also, future studies should compare the agility training gains between elite and non-elite team-sport players to validate current results. Given the barriers like dual-career limitations, injuries and off-season period can prevent players from training, it’s of great value to examine the retention of agility training gains using various modalities to better inform training scheduling among various competitive levels.
The results of the current meta-analysis revealed that the overall effect of agility training on RAT performance among team-sport athletes was large. The findings suggest that interventions of approximately 7-8 weeks, performed more than three times per week with sessions lasting around 20-25 minutes, were most frequently associated with favorable adaptations. However, these values reflect the central tendencies of available studies rather than model-predicted optima due to the high heterogeneity, moderate to low certainty of evidence, the dominance of soccer and basketball samples, underrepresentation of female participants, insufficient training load and retention data. Nevertheless, heart rate, rate of perceived exertion and global positioning systems should be utilized during agility training to monitor internal and external load to prevent injury and over-training. Coaches and practitioner may use visual stimuli for decision-making speed during off-season or preparation stage, which can positively influence the overall agility performance. However, human stimuli are encouraged during in-season period as they provide a more sport-specific stimulus, which is critical for anticipation. Overall, the current study is an attempt in aggregating evidence regarding agility training effects among team-sport players. But the limited and imbalanced data restricting practical guidance on team-sport training. Future studies should explore agility training in more diverse demographics.
| ACKNOWLEDGEMENTS |
Protocol available from: .The author reports no actual or potential conflicts of interest. The datasets generated and analyzed in this study are not publicly available, but are available from the corresponding author who organized the study upon reasonable request. All experimental procedures were conducted in compliance with the relevant legal and ethical standards of the country where the study was performed. |
|
| AUTHOR BIOGRAPHY |
|
 |
Zhiwei Zhao |
| Employment: School of Physical Education and Sports, Central China Normal University, Wuhan, China |
| Degree: MEd |
| Research interests: Agility training; Soccer; Strength and Conditioning |
| E-mail: zhou4210elliot@gmail.com |
| |
 |
Yuhang Liu |
| Employment: School of Physical Education, Wuhan Sports University, Wuhan 430079, China |
| Degree: PhD |
| Research interests: Public health; Strength and conditioning |
| E-mail: liuyuhangcoach@163.com |
| |
 |
Kai Xu |
| Employment: School of Athletic Performance, Shanghai University of Sport, Shanghai, China. |
| Degree: PhD |
| Research interests: High-intensity Interval Training; Energy Contribution; Strength and conditioning |
| E-mail: 2221152066@sus.edu.cn |
| |
|
| |
| REFERENCES |
 Altmann S., Neumann R., Ringhof S., Rumpf M. C., Woll A. (2022) Soccer-specific agility: Reliability of a newly developed test and correlates of performance. The Journal of Strength and Conditioning Research 36. Crossref
|
 Anderson J. R. (1982) Acquisition of cognitive skill. Psychological Review 89, 369. Crossref
|
 Arede J., Carvalho M., Esteves P., de las Heras B., Leite N. (2021) Exploring the effects of LED lighting training program on motor performance among young athletes. Creativity Research Journal 33, 63-73. Crossref
|
 Asadi A., Arazi H., Ramirez-Campillo R., Moran J., Izquierdo M. (2017) Influence of maturation stage on agility performance gains after plyometric training: A systematic review and meta-analysis. The Journal of Strength and Conditioning Research 31, 2609-2617. Crossref
|
 Assink M., Wibbelink C. J. (2016) Fitting three-level meta-analytic models in R: A step-by-step tutorial. The Quantitative Methods for Psychology 12, 154-174. Crossref
|
 Bekris E., Gioldasis A., Souglis A., Zacharakis E., Smirniotou A. (2023) Enhancing soccer-specific motor skills through visual training: A quasi-experimental study in young soccer players. Baltic Journal of Sport and Health Sciences 2, 28-38. Crossref
|
 Born D.-P., Zinner C., Düking P., Sperlich B. (2016) Multi-directional sprint training improves change-of-direction speed and reactive agility in young highly trained soccer players. Journal of Sports Science and Medicine 15, 314. Pubmed
|
 Bourdon P. C., Cardinale M., Murray A., Gastin P., Kellmann M., Varley M. C., Gabbett T. J., Coutts A. J., Burgess D. J., Gregson W. (2017) Monitoring athlete training loads: Consensus statement. International Journal of Sports Physiology and Performance 12, 161-170. Crossref
|
 Brown S. R., Brughelli M., Hume P. A. (2014) Knee mechanics during planned and unplanned sidestepping: A systematic review and meta-analysis. Sports Medicine 44, 1573-1588. Crossref
|
 Brughelli M., Cronin J., Levin G., Chaouachi A. (2008) Understanding change of direction ability in sport. Sports Medicine 38, 1045-1063. Crossref
|
 Chaalali A., Rouissi M., Chtara M., Owen A., Bragazzi N. L., Moalla W., Chaouachi A., Amri M., Chamari K. (2016) Agility training in young elite soccer players: Promising results compared to change of direction drills. Biology of Sport 33, 345-351. Crossref
|
 Chaouachi A., Ben Othman A., Makhlouf I., Young J. D., Granacher U., Behm D. G. (2019) Global training effects of trained and untrained muscles with youth can be maintained during 4 weeks of detraining. The Journal of Strength and Conditioning Research 33. Crossref
|
 Chaouachi A., Chtara M., Hammami R., Chtara H., Turki O., Castagna C. (2014) Multidirectional sprints and small-sided games training effect on agility and change of direction abilities in youth soccer. The Journal of Strength and Conditioning Research 28, 3121-3127. Crossref
|
 Cheung M. W.-L. (2019) A guide to conducting a meta-analysis with non-independent effect sizes. Neuropsychology Review 29, 387-396. Crossref
|
 Choi G. J., Kang H. (2025) Heterogeneity in meta-analyses: An unavoidable challenge worth exploring. Korean Journal of Anesthesiology 78, 301-314. Crossref
|
 Citkowicz M., Vevea J. L. (2017) A parsimonious weight function for modeling publication bias. Psychological Methods 22, 28. Crossref
|
 Cobley S., Baker J., Wattie N., McKenna J. (2009) Annual age-grouping and athlete development. Sports Medicine 39, 235-256. Crossref
|
 Cohen, J. (2013) Statistical power analysis for the behavioral sciences.
Routledge. Crossref
|
 Currell K., Jeukendrup A. E. (2008) Validity, reliability and sensitivity of measures of sporting performance. Sports Medicine 38, 297-316. Crossref
|
 Di Mascio M., Ade J., Musham C., Girard O., Bradley P. S. (2020) Soccer-specific reactive repeated-sprint ability in elite youth soccer players: Maturation trends and association with various physical performance tests. The Journal of Strength and Conditioning Research 34, 3538-3545. Crossref
|
 Diamond A. (2013) Executive functions. Annual Review of Psychology 64, 135-168. Crossref
|
 Dunton A., O’Neill C., Coughlan E. K. (2020) The impact of a spatial occlusion training intervention on pass accuracy across a continuum of representative experimental design in football. Science and Medicine in Football 4, 269-277. Crossref
|
 Ehmann P., Beavan A., Spielmann J., Mayer J., Ruf L., Altmann S., Forcher L., Klever N., Rohrmann S., Nuß C. (2022) Perceptual-cognitive performance of youth soccer players in a 360-environment - an investigation of the relationship with soccer-specific performance and the effects of systematic training. Psychology of Sport and Exercise 61, 102220. Crossref
|
 Engelbrecht L., Terblanche E., Welman K. E. (2016) Video-based perceptual training as a method to improve reactive agility performance in rugby union players. International Journal of Sports Science and Coaching 11, 799-809. Crossref
|
 Fernández-Castilla B., Declercq L., Jamshidi L., Beretvas S. N., Onghena P., Van den Noortgate W. (2021) Detecting selection bias in meta-analyses with multiple outcomes: A simulation study. The Journal of Experimental Education 89, 125-144. Crossref
|
 Forster J. W. D., Uthoff A. M., Rumpf M. C., Cronin J. B. (2022) Training to improve pro-agility performance: A systematic review. Journal of Human Kinetics 85, 35-51.
|
 Friebe D., Banzer W., Giesche F., Haser C., Hülsdünker T., Pfab F., Rußmann F., Sieland J., Spataro F., Vogt L. (2024) Effects of 6-week motor-cognitive agility training on football test performance in adult amateur players - a three-armed randomized controlled trial. Journal of Sports Science and Medicine 23, 276. Crossref
|
 Furley P., Schweizer G., Bertrams A. (2015) The two modes of an athlete: Dual-process theories in the field of sport. International Review of Sport and Exercise Psychology 8, 106-124. Crossref
|
 Gonzalo-Skok O., Tous-Fajardo J., Maroto-Izquierdo S., Raya-González J., Sánchez-Sánchez J. (2023) The inclusion of preplanned and random and unanticipated/unexpected events during strength training improves the ability to repeat high-intensity efforts under uncertainty. International Journal of Sports Physiology and Performance 19, 249-256. Crossref
|
 Guyatt G. H., Oxman A. D., Vist G. E., Kunz R., Falck-Ytter Y., Alonso-Coello P., Schünemann H. J. (2008) GRADE: An emerging consensus on rating quality of evidence and strength of recommendations. BMJ 336, 924-926. Crossref
|
 Harrer, M., Cuijpers, P., Furukawa, T. and Ebert, D. (2021) Doing metaanalysis with R: A hands-on guide. Chapman and Hall/CRC. Crossref
|
 Hassan A. K., Alhumaid M. M., Hamad B. E. (2022) The effect of using reactive agility exercises with the FITLIGHT training system on the speed of visual reaction time and dribbling skill of basketball players. Sports 10. Crossref
|
 Hedges L. V., Tipton E., Johnson M. C. (2010) Robust variance estimation in meta-regression with dependent effect size estimates. Research Synthesis Methods 1, 39-65. Crossref
|
 Hedges, L. V. and Olkin, I. (2014) Statistical methods for meta-analysis.
Academic Press.
|
 Hicheur H., Chauvin A., Cavin V., Fuchslocher J., Tschopp M., Taube W. (2020) Augmented-feedback training improves cognitive motor performance of soccer players. Medicine and Science in Sports and Exercise 52, 141. Crossref
|
 Higgins J. P. T., Altman D. G., Gøtzsche P. C., Jüni P., Moher D., Oxman A. D., Savović J., Schulz K. F., Weeks L., Sterne J. A. C. (2011) The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ 343, d5928. Crossref
|
 Horníková H., Hadža R., Zemková E. (2025) The contribution of perceptual-cognitive skills to reactive agility in early and middle adolescent soccer players. The Journal of Strength and Conditioning Research 39. Crossref
|
 Ingle L., Sleap M., Tolfrey K. (2006) The effect of a complex training and detraining programme on selected strength and power variables in early pubertal boys. Journal of Sports Sciences 24, 987-997. Crossref
|
 Jordet G., Aksum K. M., Pedersen D. N., Walvekar A., Trivedi A., McCall A., Ivarsson A., Priestley D. (2020) Scanning, contextual factors, and association with performance in English Premier League footballers: An investigation across a season. Frontiers in Psychology 11. Crossref
|
 Kadlec D., Sainani K. L., Nimphius S. (2023) With great power comes great responsibility: Common errors in meta-analyses and meta-regressions in strength and conditioning research. Sports Medicine 53, 313-325. Crossref
|
 Klatt S., Thönnißen L., Fischer T., Sandmann A. (2025) Alternative training methods in soccer: A perceptual-skill intervention in youth soccer players. German Journal of Exercise and Sport Research. Crossref
|
 Lee Y.-S., Lee D., Ahn N. Y. (2024) SAQ training on sprint, change-of-direction speed, and agility in U-20 female football players. Plos One 19, e0299204. Crossref
|
 Legault I., Sutterlin-Guindon D., Faubert J. (2022) Perceptual cognitive abilities in young athletes: A gender comparison. Plos One 17, e0273607. Crossref
|
 Lucia S., Digno M., Madinabeitia I., Di Russo F. (2023) Testing a multicomponent training designed to improve sprint, agility and decision-making in elite basketball players. Brain Sciences 13, 984. Crossref
|
 Lucia S., Mattia D., Iker M., Di Russo F. (2024) Integration of cognitive-motor dual-task training in physical sessions of highly-skilled basketball players. Journal of Sports Sciences 42, 1695-1705. Crossref
|
 Mancini N., Di Padova M., Polito R., Mancini S., Dipace A., Basta A., Colella D., Limone P., Messina G., Monda M. (2024) The impact of perception-action training devices on quickness and reaction time in female volleyball players. Journal of Functional Morphology and Kinesiology 9, 147. Crossref
|
 Marcora S. M., Staiano W., Manning V. (2009) Mental fatigue impairs physical performance in humans. Journal of Applied Physiology 106, 857-864. Crossref
|
 Martin-Niedecken A. L., Bucher V., Adcock M., de Bruin E. D., Schättin A. (2023) Impact of an exergame intervention on cognitive-motor functions and training experience in young team sports athletes: A non-randomized controlled trial. Frontiers in Sports and Active Living 5, 1170783. Crossref
|
 McKay A. K. A., Stellingwerff T., Smith E. S., Martin D. T., Mujika I., Goosey-Tolfrey V. L., Sheppard J., Burke L. M. (2022) Defining training and performance caliber: A participant classification framework. International Journal of Sports Physiology and Performance 17, 317-331. Crossref
|
 Morral-Yepes M., Moras G., Bishop C., Gonzalo-Skok O. (2022) Assessing the reliability and validity of agility testing in team sports: A systematic review. The Journal of Strength and Conditioning Research 36. Crossref
|
 Neag I., Mihaila I., Fleancu L. J., Stancu M., Potop V., Barbu D., Paun L. I., Mihai I. (2025) Agility development in youth soccer: The efficacy of fixed-role small-sided games. Frontiers in Sports and Active Living 7, 1593906. Crossref
|
 Nygaard Falch H., Guldteig Rædergård H., van den Tillaar R. (2019) Effect of different physical training forms on change of direction ability: A systematic review and meta-analysis. Sports Medicine - Open 5, 53. Crossref
|
 Oliveira J. P., Marinho D. A., Jacinto P., Sampaio T., Morais J. E. (2025) Characterization of physical performance and change of direction deficit across age groups in young female volleyball players. BMC Sports Science, Medicine and Rehabilitation 17, 209. Crossref
|
 Padrón-Cabo A., Lorenzo-Martínez M., De Dios-Álvarez V., Rey E., Solleiro-Durán D. (2025) Effects of a short-term detraining period on the physical fitness in elite youth soccer players: A comparison between chronological age groups. The Journal of Strength and Conditioning Research 39. Crossref
|
 Page M. J., McKenzie J. E., Bossuyt P. M., Boutron I., Hoffmann T. C., Mulrow C. D., Shamseer L., Tetzlaff J. M., Akl E. A., Brennan S. E. (2021) The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. International Journal of Surgery 88, 105906. Crossref
|
 Paul D. J., Gabbett T. J., Nassis G. P. (2016) Agility in team sports: Testing, training and factors affecting performance. Sports Medicine 46, 421-442. Crossref
|
 Pink M. A., Lonie B. E., Saunders J. E. (2018) The challenges of the semi-professional footballer: A case study of the management of dual career development at a Victorian Football League (VFL) club. Psychology of Sport and Exercise 35, 160-170. Crossref
|
 Pojskic H., Šslin E., Krolo A., Jukic I., Uljevic O., Spasic M., Sekulic D. (2018) Importance of reactive agility and change of direction speed in differentiating performance levels in junior soccer players: Reliability and validity of newly developed soccer-specific tests. Frontiers in Physiology 9, 506. Crossref
|
 Ramirez-Campillo R., García-Hermoso A., Moran J., Chaabene H., Negra Y., Scanlan A. T. (2022) The effects of plyometric jump training on physical fitness attributes in basketball players: A meta-analysis. Journal of Sport and Health Science 11, 656-670. Crossref
|
 Ramirez-Campillo R., Sortwell A., Moran J., Afonso J., Clemente F. M., Lloyd R. S., Oliver J. L., Pedley J., Granacher U. (2023) Plyometric-jump training effects on physical fitness and sport-specific performance according to maturity: A systematic review with meta-analysis. Sports Medicine - Open 9, 23. Crossref
|
 Roca A., Williams A. M., Ford P. R. (2012) Developmental activities and the acquisition of superior anticipation and decision making in soccer players. Journal of Sports Sciences 30, 1643-1652. Crossref
|
 Rodgers M. A., Pustejovsky J. E. (2021) Evaluating meta-analytic methods to detect selective reporting in the presence of dependent effect sizes. Psychological Methods 26, 141. Crossref
|
 Sekulic D., Foretic N., Gilic B., Esco M. R., Hammami R., Uljevic O., Versic S., Spasic M. (2019) Importance of agility performance in professional futsal players; Reliability and applicability of newly developed testing protocols. International Journal of Environmental Research and Public Health 16, 3246. Crossref
|
 Serpell B. G., Young W. B., Ford M. (2011) Are the perceptual and decision-making components of agility trainable? A preliminary investigation. The Journal of Strength and Conditioning Research 25, 1240-1248. Crossref
|
 Sheppard J. M., Young W. B. (2006) Agility literature review: Classifications, training and testing. Journal of Sports Sciences 24, 919-932. Crossref
|
 Smith M. R., Coutts A. J., Merlini M., Deprez D., Lenoir M., Marcora S. M. (2016) Mental fatigue impairs soccer-specific physical and technical performance. Medicine and Science in Sports and Exercise 48, 267-276. Crossref
|
 Steff N., Badau D., Badau A. (2024) Improving agility and reactive agility in basketball players U14 and U16 by implementing Fitlight technology in the sports training process. Applied Sciences 14, 3597. Crossref
|
 Sun M., Soh K. G., Ma S., Wang X., Zhang J., Yaacob A. B. (2025) Effects of speed, agility, and quickness (SAQ) training on soccer player performance—A systematic review and meta-analysis. Plos One 20, e0316846. Crossref
|
 Thieschäfer L., Büsch D. (2022) Development and trainability of agility in youth: A systematic scoping review. Frontiers in Sports and Active Living 4, 952779. Crossref
|
 Thompson B. J., Stock M. S., Mota J. A., Drusch A. S., DeFranco R. N., Cook T. R., Hamm M. A. (2017) Adaptations associated with an after-school strength and conditioning program in middle-school-aged boys: A quasi-experimental design. The Journal of Strength and Conditioning Research 31, 2840-2851. Crossref
|
 Trecroci A., Milanović Z., Rossi A., Broggi M., Formenti D., Alberti G. (2016) Agility profile in sub-elite under-11 soccer players: Is SAQ training adequate to improve sprint, change of direction speed and reactive agility performance?. Research in Sports Medicine 24, 331-340. Crossref
|
 Van den Noortgate W., López-López J. A., Marín-Martínez F., Sánchez-Meca J. (2013) Three-level meta-analysis of dependent effect sizes. Behavior Research Methods 45, 576-594. Crossref
|
 Veale J. P., Pearce A. J., Carlson J. S. (2010) Reliability and validity of a reactive agility test for Australian football. International Journal of Sports Physiology and Performance 5, 239-248. Crossref
|
 Veness D., Patterson S. D., Jeffries O., Waldron M. (2017) The effects of mental fatigue on cricket-relevant performance among elite players. Journal of Sports Sciences 35, 2461-2467. Crossref
|
 Viechtbauer W. (2010) Conducting meta-analyses in R with the metafor package. Journal of Statistical Software 36, 1-48. Crossref
|
 Vogt T., Gassen S., Wrede S., Spielmann J., Jedrusiak-Jung M., Härtel S., Mayer J. (2018) Football practice with youth players in the “Footbonaut”. German Journal of Exercise and Sport Research 48, 341-348. Crossref
|
 Weerakkody N., Taylor C., Bulmer C., Hamilton D., Gloury J., O’Brien N., Saunders J., Harvey S., Patterson T. (2021) The effect of mental fatigue on the performance of Australian football specific skills amongst amateur athletes. Journal of Science and Medicine in Sport 24, 592-596. Crossref
|
 Williams, A. M., Thomas, J. L., Jordet, G. and Ford, P. R. (2023)
Anticipation and decision-making. In Science and Soccer (pp.
124-140). Routledge.
Crossref
|
 Xu K., Blazevich A. J., Boullosa D., Ramirez-Campillo R., Yin M., Zhong Y., Tian Y., Finlay M., Byrne P. J., Cuenca-Fernández F. (2025) Optimizing post-activation performance enhancement in athletic tasks: A systematic review with meta-analysis for prescription variables and research methods. Sports Medicine , 1-32. Crossref
|
 Young W. B., Dawson B., Henry G. J. (2015) Agility and change-of-direction speed are independent skills: Implications for training for agility in invasion sports. International Journal of Sports Science and Coaching 10, 159-169. Crossref
|
 Young W., Rayner R., Talpey S. (2021) It’s time to change direction on agility research: A call to action. Sports Medicine - Open 7, 12. Crossref
|
 Young W., Rogers N. (2014) Effects of small-sided game and change-of-direction training on reactive agility and change-of-direction speed. Journal of Sports Sciences 32, 307-314. Crossref
|
 Zago M., Giuriola M., Sforza C. (2016) Effects of a combined technique and agility program on youth soccer players’ skills. International Journal of Sports Science and Coaching 11, 710-720. Crossref
|
 Zhang M., Li F., Jiao J., Liang W., Gomez M.-A., Scanlan A. T. (2025) Effects of different training methods on open-skill and closed-skill agility in basketball players: A systematic review and meta-analysis. Sports Medicine - Open 11, 50. Crossref
|
 Zhou Z., Xin C., Zhao Y., Wu H. (2024) The effect of multi-directional sprint training on change-of-direction speed and reactive agility of collegiate tennis players. PeerJ 12, e18263. Crossref
|
 Zouhal H., Abderrahman A. B., Dupont G., Truptin P., Le Bris R., Le Postec E., Sghaeir Z., Brughelli M., Granacher U., Bideau B. (2019) Effects of neuromuscular training on agility performance in elite soccer players. Frontiers in Physiology 10, 947. Crossref
|
 Zwierko M., Jedziniak W., Popowczak M., Rokita A. (2023) Effects of in-situ stroboscopic training on visual, visuomotor and reactive agility in youth volleyball players. PeerJ 11, e15213. Crossref
|
 Zwierko M., Jedziniak W., Popowczak M., Rokita A. (2024) Effects of six-week stroboscopic training program on visuomotor reaction speed in goal-directed movements in young volleyball players: A study focusing on agility performance. BMC Sports Science, Medicine and Rehabilitation 16, 59. Crossref
|
 Zwierko T., Tapia V., Vera J., Redondo B., Morenas-Aguilar M. D., García-Ramos A. (2024) Enhancing reactive agility in soccer: The impact of stroboscopic eyewear during warm-up across fatigued and non-fatigued conditions. European Journal of Sport Science 24, 1798-1808. Crossref
|
|
| |
|
|
|
|