Research article - (2025)24, 764 - 778
DOI:
https://doi.org/10.52082/jssm.2025.764
Classifying Soccer Players Based on Physical Capacities and Match-Specific Running Performance Using Machine Learning
Michel de Haan1, Stephan van der Zwaard1,2, Jurrit Sanders3, Peter J. Beek1, Richard T. Jaspers1,
1Department of Human Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
2Department of Cardiology, Amsterdam University Medical Center, Amsterdam, Netherlands
3PSV Eindhoven, Eindhoven, Netherlands

Richard T. Jaspers
✉ Vrije Universiteit Amsterdam, De Boelelaan 1108, 1081 HZ Amsterdam, Netherlands
Email: r.t.jaspers@vu.nl
Received: 21-01-2025 -- Accepted: 11-09-2025
Published (online): 01-12-2025
Narrated in English

ABSTRACT

Sprint and endurance capacities seem to be mutually exclusive or at least at odds with each other. However, this relationship has not been investigated in soccer, which appeals to both well-developed sprint and endurance capacities. This study explores the potential of machine learning to identify soccer players based on their unique combinations of sprint and endurance capacities and sprint and endurance match-specific running performance. In this context, the relationships between sprint and endurance capacities and between physical capacities and match-specific running performance are examined in detail. Match-specific running data were collected from 31 young elite male soccer players over two consecutive seasons. Additionally, these participants underwent exercise testing, consisting of a 20-meter sprint test and an incremental treadmill test to measure maximal oxygen uptake (V̇O2max). Subgroups were identified using k-means clustering and subgroup discovery, based on players’ sprint and endurance capacities, sprint and endurance match-specific running performance, and playing position. Three distinct subgroups were identified using machine learning: players with high sprint capacity and sprinted meters (n = 4), players with high endurance capacity and meters ran at moderate and high intensities (n = 6), and players without high physical capacities or matching match-specific running performance (n = 14). Across all players, there was no significant relationship between 20-meter sprint speed and normalized V̇O2max (R2 = 0.085, P = 0.17), although 20-meter sprint speed was positively related to average match sprint distance (R2 = 0.168, P = 0.03) and normalized V̇O2max to average match distance at moderate and high intensities (R2 = 0.151, P = 0.04). In young elite soccer players, sprint and endurance capacities show positive, moderate, relationships with corresponding match-specific running performance, but those capacities do not appear to be mutually exclusive or opposing. Clustering allows for identification of players who may benefit from alternative strategic roles during matches, are at risk of overuse, or could benefit from individualized training. This method can assist coaches in designing tailored training programs and optimizing overall match strategy.

Key words: Clustering, football, sprint speed, V̇O, MAS, MSS

Key Points
  • Sprint and endurance capacities showed positive, moderate, relationships with corresponding match-specific running performance but did not seem to be mutually exclusive or opposing.
  • Clustering revealed distinct subgroups of soccer players with unique combinations of sprint and endurance capacities, and sprint and endurance match-specific running performance.
  • Clustering allows for identification of players who may benefit from alternative strategic roles during matches. are at risk of overuse, or could benefit from individualized training.








Back
|
Full Text
|
PDF
|
Share