 The 8th Australasian Conference on Mathematics and Computers in Sport, 3-5 July 2006, Queensland, Australia

 AN ANALYSIS OF TEN YEARS OF THE FOUR GRAND SLAM MEN'S SINGLES DATA FOR LACK OF INDEPENDENCE OF SET OUTCOMES
 Graham Pollard1, Rod Cross2 and Denny Meyer3

1Faculty of Information Sciences and Engineering, University of Canberra, Australia.
2Physics Department, Faculty of Science, University of Sydney, Australia.
3Faculty of Life and Social Sciences, Swinburne University of Technology, Australia.

 Published 15 December 2006

 ABSTRACT The objective of this paper is to use data from the highest level of men's tennis to assess whether there is any evidence to reject the hypothesis that each of the two players in a match has a constant probability of winning each set in the match. The data consists of all 4883 matches of grand slam men's singles over the 10-year period from 1995 to 2004. Each match is categorised by its sequence of wins (W) and losses (L) (in set 1, set 2, set 3, ...) for the eventual winner. Thus, there are ten categories of matches, from WWW to LLWWW. The methodology involves fitting several probabilistic models to the frequencies of these ten categories. One four-set category is observed to occur significantly more often than the other two. Correspondingly, a couple of the five-set categories occur more frequently than the others. This pattern is consistent when the data is split into two five-year subsets. The data provides significant statistical evidence that the probability of winning a set within a match varies from set to set. The data supports the conclusion that, at the highest level of men's singles tennis, the better player (not necessarily the winner) lifts his play in certain situations at least some of the time.

 KEY WORDS: Data analysis, independence in tennis, constant probabilities, psychological development.
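
 As an illustration of the null hypothesis described in the abstract, the following minimal sketch enumerates the win/loss categories of a best-of-five match from the eventual winner's point of view and computes the probability of each category when sets are independent and one player wins every set with the same probability p. The function names and the value p = 0.6 are assumptions made for illustration only; they are not taken from the paper, and this is not the authors' fitting procedure.

```python
from itertools import product


def winner_sequences(best_of=5):
    """List W/L set sequences as seen by the eventual match winner."""
    need = best_of // 2 + 1  # sets required to win the match (3 of 5)
    seqs = []
    for n_sets in range(need, best_of + 1):
        for combo in product("WL", repeat=n_sets):
            s = "".join(combo)
            # The winner takes exactly `need` sets and must win the last one,
            # so the match cannot have finished any earlier.
            if s.endswith("W") and s.count("W") == need:
                seqs.append(s)
    return seqs


def category_probs(p):
    """Category probabilities under independent sets with constant p.

    p is the (hypothetical) probability that one fixed player wins any set.
    Either player may be the eventual winner, hence the two terms.
    """
    q = 1.0 - p
    probs = {}
    for s in winner_sequences():
        k = s.count("L")  # sets lost by the eventual winner
        probs[s] = p**3 * q**k + q**3 * p**k
    return probs


if __name__ == "__main__":
    # p = 0.6 is an arbitrary illustrative value, not an estimate from the data.
    for seq, prob in category_probs(0.6).items():
        print(f"{seq:6s} {prob:.4f}")
```

 Under this constant-probability model the probability of a category depends only on the number of sets played, not on the order of wins and losses, so the three four-set categories are equally likely and so are the six five-set categories. This remains true when p differs from match to match, which is why unequal frequencies within the four-set or five-set categories count as evidence against the hypothesis.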

