FormalPara Key Points

Clubhead speed (CHS) is one of the most important factors for golfers when aiming to optimise distance from tee shots, as it helps to offer an advantage over their competitors.

From empirical investigations, numerous physical characteristics have been shown to be associated with CHS.

Specifically, explosive force production in the lower body (as measured by jumping) elicits stronger associations than maximal lower body strength.

Similarly, upper body explosive strength appears to be of greater importance than upper body strength, most likely because it better replicates the time constraints that the upper body has to produce force during the swing.

Contrary to popular historical belief, flexibility was one of two physical characteristics not significantly associated with CHS. This is likely because a wide range of movement strategies are available for golfers during the swing, enabling a variety of movement solutions to be used, to achieve the desired swing technique.

1 Introduction

Golf is a globally played sport with an estimated 66 million people playing worldwide at the turn of the twenty-first century [1]. With the rise in participation rates, there is a growing interest in performance science for the sport, which has been highlighted by the recent increase in studies focusing on performance, medicine and health [2,3,4,5]. Given the wide array of abilities across these millions of golfers, authors have often looked to categorise players based on skill. For example, the International Golf Federation consensus statement on reporting and recording of injuries in golf suggested three descriptive classifications: (1) elite (professional players competing on tour or amateurs competing in international or national amateur championships); (2) sub-elite (Professional Golfers’ Association (PGA) teaching professionals, amateurs competing in regional, county and state tournaments, or with a handicap ≤ 5); and (3) recreational (handicap > 5) [6]. These classifications are determined by skill-related parameters (i.e. performance metrics that combine all aspects of a golfer’s performance). For example, handicap index is a score given during recreational golf, which provides golfers with a shot allowance relative to their skill level. In addition, handicap index also considers factors relating to the difficulty of the course, such as length of each hole, course rating and slope. In contrast, a ‘gross’ score (no adjustment on the final scoring) is given to determine performance within professional and high-level amateur competitions. Even though skill-related parameters differentiate across competitive abilities, the aim of golf remains the same: to complete each hole in as few shots as possible and ultimately achieve the lowest score attainable.

From a physical preparation standpoint, research has increased significantly in recent years, with training programmes now frequently used to enhance swing performance [7]. The ability to achieve maximal golf shot distance (largely on par 4 and 5 holes) is of great importance, as it is associated with lower scores, assuming that accuracy can be maintained [8]. An essential underpinning factor for achieving maximal shot distance is clubhead speed (CHS), which, if increased, will result in subsequent increases in other critical parameters such as ball speed, carry distance and total distance, when all other variables are held constant [9]. Consequently, CHS is a metric consistently utilised in the field of golf research as it reflects positive performance in the sport [10]. However, both distance and accuracy determine the outcome of any given shot, with a plethora of launch characteristics (e.g. ball speed, spin rate, launch angle) and impact factors (e.g. impact location, club path, face angle) concurrently responsible for where a shot ends up [11]. Despite the importance of these additional metrics, almost all research to date linking golf performance to physical characteristics has been conducted using CHS as the key performance indicator, with substantially less focus on other shot metrics or skill-related parameters. Therefore, understanding the association between physical capacities and CHS provides vital information on which physical qualities need prioritising for the testing and preparation of players.

Various physical characteristics have been suggested to contribute to golf performance and CHS [12, 13]. For example, previous research has reported that the one repetition maximum (1RM) back squat had a large correlation with CHS (r = 0.54; p < 0.001) in thirty-three elite male golfers [14]. Further, the same study also reported a large correlation between countermovement jump (CMJ) height and CHS (r = 0.61; p < 0.001) [14]. Further to this, additional characteristics, such as flexibility, have also been shown to be potentially important in golf, with large correlations between CHS and seated trunk rotation in the clockwise and anti-clockwise directions (r = 0.52; p < 0.001; r = 0.711; p < 0.001, respectively) [15]. Thus, the identification of key physical attributes associated with CHS allows practitioners to test and monitor the strengths and weaknesses of golfers, providing more appropriate suggestions on how to individualise their physical training [12, 16], and was the primary aim of this meta-analysis. Whilst this has already been reported in previous golf research [13], the associated r values were lifted directly from each study. In contrast, the present investigation undertook more advanced statistical analyses, enabling pooled effect sizes to be conducted and the same level of analysis provided for each empirical study that met our inclusion criteria.

2 Methods

2.1 Study Design and Literature Search Methodology

The present study was undertaken in line with the recommendations of the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) guidelines [17]. Four databases (Medline, SPORTDiscus, CINAHL and PubMed) were electronically searched to gather relevant literature, and Fig. 1 provides a schematic overview of the search methodology, conducted in January 2023, with no restrictions on the publication date. A search strategy was used within Boolean operators in order to identify specific articles relevant to the research question, with a summary provided in Table 1.

Fig. 1
figure 1

Flow diagram showing the identification and selection of studies. Included studies often reported more than one physical characteristic, which is why the bottom row of numbers (n = 50) is greater than the total number of studies (n = 20). Anthro. Anthropometry

Table 1 An overview of the specific words and phrases used for search terms

2.2 Inclusion/Exclusion Criteria

Inclusion criteria required studies to have (1) determined the association between physical characteristics assessed in at least one physical test and CHS; (2) included golfers of any skill level, but they needed to be free from injury; and (3) been peer-reviewed and published in the English language. Studies were excluded if they were (1) reviews or conference articles, (2) intervention studies and (3) if they did not contain original data. After completing all relevant searches, an additional search was completed in Google Scholar for any articles that may have been relevant or not fully available in the aforementioned databases. Finally, reference lists of included studies, alongside forward citations, were also searched for relevant articles.

2.3 Screening Strategy

The articles extracted during the search strategy were then screened through a three-stage process: (1) duplicates of articles from previous search terms and databases were removed, (2) articles that were considered potentially appropriate were passed through for a full review and (3) articles were reviewed in full in line with the inclusion and exclusion criteria by two independent reviewers (AB and CB). If any disagreement occurred between reviewers one and two, a third reviewer (AE) was consulted to resolve this issue.

2.4 Grading Article Quality

To assess the quality of the study’s methodology, a modified version of the Downs and Black Quality Index tool [18] was used, in accordance with previous studies [19, 20]. From the original version, questions were removed for this current review if they were deemed irrelevant (not applicable to the research question). Specifically, questions associated with patient treatment, training interventions and group randomisation processes were removed. Following this, ten questions in the checklist were deemed relevant (Table 2). Each question was given a score of 1 ('+') or 0 ('−'), or ('?') if it was unable to be determined, by two reviewers (AB and CB), with a total score out of 10 possible for each study. If a consensus on the score could not be reached, a third reviewer (AE) was consulted to resolve this issue.

Table 2 Questions chosen from the Downs and Black [18] checklist used to evaluate the methodological quality of included studies

2.5 Statistical Analysis

Initially, key data were directly extracted from studies that met the inclusion criteria and transferred into Microsoft Excel. For all studies, key information extracted included the following: (1) sample population, (2) physical assessment(s), (3) CHS data and (4) correlation value(s) between physical assessment(s) and CHS. To provide summary effects for each of the physical characteristics, a random effects model was used, which accounts for the magnitude of the standard error associated with each of the included studies (due to different methodologies, skill level of samples, etc.). To achieve this, z-transformed r values (i.e. zr values) were computed to enable effect size pooling within the meta-analysis [21]. Where studies reported multiple eligible effect sizes for any one physical attribute (of which inclusion would violate the assumption of independence within the meta-analysis model), these were pooled by first transforming to Fisher’s Z (zr) values, aggregating an average, and back transforming to a Pearson’s r value for input within the meta-analysis. In instances where multiple clubs were reported within a single study, we opted to use driver CHS as the metric to carry forwards, for consistency. Lastly, where directional differences were found in the pooled data of a single study (e.g. where included metrics comprised both values where a larger value is considered more favourable and values where a smaller value is considered more favourable), the negatively aligned data were inverted by multiplying by − 1, to directionally align all metrics prior to pooling. The meta-analysis was performed using the ‘metafor’ package (version 4.4-0) in R (version 4.2.2; R core team), and effect size values were interpreted in line with suggestions by Cohen [22], whereby < 0.2 = trivial, 0.2–0.49 = small, 0.5–0.79 = moderate and ≥ 0.8 = large.

2.6 Stability and Validity of Changes in Effect Sizes

To assess for the presence and degree of heterogeneity in the data, both Q statistics and I2 were used. Statistical significance for Q was acknowledged at an alpha level of < 0.10, and I2 was interpreted as per the work of Higgins and Thompson [23], with an I2 threshold of 0–25% = trivial, 25–50% = low, 50–75% = moderate and 75–100% = high. Small study bias (including publication bias) was assessed firstly by the visualisation of funnel plots, and accompanied by the Egger’s regression test to quantify any asymmetries in the spread of data, and thus risk of small study bias. The occurrence of small study bias was considered present where p < 0.05, and in these circumstances, the trim and fill method was used to artificially impute potentially missing studies due to asymmetry in the funnel plot, and provide an adjusted pooled effect size to account for this [24].

3 Results

3.1 Literature Search Results

The search strategy produced a total of 3039 studies, of which 2048 were removed due to being duplicates. In total, 16 studies met our inclusion criteria (Fig. 1) with four additional studies included following reference list checks and forward citation tracking, resulting in a total of 20 included studies. Once full texts were assessed for eligibility, the most common reasons for exclusion were (1) no relationship determined between physical characteristics and CHS (or the r value was not reported), (2) articles were not peer-reviewed, (3) full texts were unavailable and (4) the sample included golfers who were recovering from injury.

3.2 Methodological Quality and Risk of Bias Assessment

Study methodological quality is presented in Table 3. There was no evidence of internal validity bias. We were unable to confirm external validity in all the studies analysed, with all failing to report the proportion of individuals recruited relative to the sample population. Scores ranged from 7 to 9 out of 10 for study methodological quality and risk of bias, with no studies excluded due to their chosen methodological approach.

Table 3 Results of study methodological quality for all included studies, using the checklist from Downs and Black [18]

3.3 Study Characteristics

Information on each study included in the final analysis is presented in Table 4. Sample populations included the following: (1) Category 1 golfers (n = 93) (15,25,26); (2) elite youth golfers (n = 82) [27, 28]; (3) National Collegiate Athletics Association (NCAA) Division I and II golfers (n = 61) [29,30,31,32]; (4) single- and double-digit handicap golfers (n = 282) [14, 33,34,35, 37,38,39,40]; (5) European Tour players (n = 31) [41]; and (6) PGA tour players (n = 20) [10]. From the final 20 studies, associations between CHS and physical characteristics included strength (n = 13) [14, 15, 26, 28, 29, 31,32,33,34,35,36, 38, 40], explosive strength (n = 14) [10, 14, 25,26,27, 31,32,33, 35, 37,38,39,40,41], anthropometrics [14, 27, 29, 34, 37, 38], flexibility [15, 30, 32, 33, 36, 38], muscle capacity (endurance) [14, 27, 34, 36] and balance [30, 36].

Table 4 Overview of study methodologies and raw data for physical characteristics

3.4 Meta-Analysis

A breakdown of each physical characteristic and its summary statistics are provided, with Figs. 2, 3, 4, 5, 6, 7, 8, 9, 10 and 11 visualising the data via forest plots, which represent the summary effect estimate relative to the inputed samples.

Fig. 2
figure 2

Forest plot showing the association (Fisher’s zr) between lower body strength and clubhead speed. n = sample size. CI confidence interval, F females, ISOS isometric squat, M males, PF peak force, RE random effects, RM repetition maximum

Fig. 3
figure 3

Forest plot showing the association (Fisher’s zr) between upper body strength and clubhead speed. n = sample size. CI confidence interval, F females, M males, Pec Dec resistance machine primarily involving use of the pectoralis major muscles, RE random effects, RM repetition maximum

Fig. 4
figure 4

Forest plot showing the association (Fisher’s zr) between jump displacement and clubhead speed. n = sample size. CI confidence interval, CMJ countermovement jump, F females, M males, RE random effects, SJ squat jump

Fig. 5
figure 5

Forest plot showing the association (Fisher’s zr) between jump impulse and clubhead speed. n = sample size. CI confidence interval, CMJ countermovement jump, RE random effects

Fig. 6
figure 6

Forest plot showing the association (Fisher’s zr) between jumping peak power and clubhead speed. n = sample size. CI confidence interval, CMJ countermovement jump, F females, M males, RE random effects

Fig. 7
figure 7

Forest plot showing the association (Fisher’s zr) between upper body explosive strength and clubhead speed. n = sample size. CI confidence interval, F females, M males, RE random effects

Fig. 8
figure 8

Forest plot showing the association (Fisher’s zr) between anthropometric measures and clubhead speed. n = sample size; *Denotes that directional differences existed in individual correlations, resulting in negatively aligned data being multiplied by minus 1, to directionally align all data prior to pooling. CI confidence interval, F females, M males, RE random effects

Fig. 9
figure 9

Forest plot showing the association (Fisher’s zr) between flexibility measures and clubhead speed. n = sample size. CI confidence interval, F females, M males, RE random effects

Fig. 10
figure 10

Forest plot showing the association (Fisher’s zr) between balance and clubhead speed. n = sample size. BESS Balance Error Scoring System, CI confidence interval, F females, M males, RE random effects

Fig. 11
figure 11

Forest plot showing the association (Fisher’s zr) between measures of muscle capacity and clubhead speed. n = sample size. CI confidence interval, F females, M males, RE random effects

3.4.1 Lower Body Strength

Collectively, results showed that lower body strength had small, significant associations with CHS (zr = 0.47 [95% confidence interval {CI} 0.24–0.69], r = 0.44 [95% CI 0.24–0.60], Z = 4.03, p < 0.001). Tests for heterogeneity were identified as significant and moderate (I2 = 51.09%, Q = 18.20, p = 0.03), and there was no significant evidence of small study bias (z = − 0.758, p = 0.45). Lower body strength assessments included 1RM back squat [14, 31, 32], isometric mid-thigh pull (IMTP) peak force [26, 28, 41], isometric squat peak force [38], hack squat estimated 1RM [34], 1RM deadlift [31], 1RM clean [31] and hip extension and flexion strength [40].

3.4.2 Upper Body Strength

Collectively, results showed that upper body strength had small, significant associations with CHS (zr = 0.48 [95% CI 0.28–0.68], r = 0.45 [95% CI 0.27–0.59], Z = 4.67, p < 0.001). Tests for heterogeneity were identified as low (I2 = 44.89%, Q = 14.96, p = 0.06), and there was no evidence of small study bias (z =  − 0.51, p = 0.61). Assessments of upper body strength included grip strength [14, 15, 29, 38], lower back strength [36], torso rotational strength [40], concentric-only push-up peak force [38], 1RM bench press [34], pec-dec (resistance machine primarily involving use of the pectoralis major muscles) 8RM [33], bar dips (repetitions × body weight) [14], pull-ups (repetitions × body weight) [14] and golf swing cable chop [34].

3.4.3 Lower Body Explosive Strength: Jump Displacement

Collectively, results showed that jump displacement had moderate, significant associations with CHS (zr = 0.53 [95% CI 0.28–0.78], r = 0.49 [95% CI 0.27–0.65], Z = 4.17, p < 0.001); however, significant and high heterogeneity was noted (I2 = 80.09%, Q = 56.99, p < 0.001), along with significance regarding small study bias (z = 2.22, p = 0.03); however, the model utilised estimated no missing studies based on the trim and fill method. Jump displacement was assessed in three different tests, which were the CMJ [14, 27, 31, 32, 37, 38, 40], squat jump [10, 14, 37] and broad jump [27].

3.4.4 Lower Body Explosive Strength: Jump Impulse

Collectively, results showed that jump impulse (which was recorded across the CMJ, drop jump and squat jump) had large, significant associations with CHS (zr = 0.82 [95% CI 0.63–1.02], r = 0.68 [95% CI 0.56–0.77], Z = 8.18, p < 0.001). Tests for heterogeneity were identified as trivial (I2 = 0.00%, Q = 0.38, p = 0.83), and there was no evidence of small study bias (z = − 0.32, p = 0.75). Positive impulse was measured in three different jump assessments: the CMJ [25, 26, 41], squat jump [26] and drop jump [26].

3.4.5 Lower Body Explosive Strength: Jump Peak Power

Collectively, results showed that peak power during jumping had moderate, significant associations with CHS (zr = 0.66 [95% CI 0.53–0.79], r = 0.58 [95% CI 0.49–0.66], Z = 9.70, p < 0.001). Tests for heterogeneity were identified as trivial (I2 = 0.00%, Q = 4.92, p = 0.67), and there was no evidence of small study bias (z = − 0.03, p = 0.97). Peak power was measured in two jump assessments, which were the CMJ [14, 25, 27, 35, 37, 40] and squat jump (n = 1) [37].

3.4.6 Upper Body Explosive Strength

Collectively, results showed that upper body explosive strength had moderate, significant associations with CHS (zr = 0.67 [95% CI 0.53–0.80], r = 0.58 [95% CI 0.49–0.66], Z = 9.39, p < 0.001). Tests for heterogeneity were identified as trivial (I2 = 0.00%, Q = 3.91, p = 0.92), and there was no evidence of small study bias (z = − 0.63, p = 0.53). Explosive strength was assessed in a number of assessments, including seated medicine ball throws [10, 27, 35], rotational medicine ball throws [10, 27, 33, 37], medicine ball throw velocity [38], ballistic bench press [39] and the backward overhead medicine ball (BOMB) throw [40].

3.4.7 Anthropometrics

Collectively, results showed that measures of anthropometry had small, significant associations with CHS (zr = 0.43 [95% CI 0.29–0.58], r = 0.41 [95% CI 0.28–0.52], Z = 5.86, p < 0.001). Tests for heterogeneity were identified as trivial (I2 = 0.00%, Q = 4.26, p = 0.64), and there was no evidence of small study bias (z = − 0.44, p = 0.66). A number of anthropometric measures were utilised, including body mass [14, 27, 34, 37, 38], height [27, 29, 37,38,39], body mass index [34, 38], body fat % [34], limb length [34, 37], sum of skinfolds [34], fat mass [34] and fat-free mass [34].

3.4.8 Flexibility

Collectively, results showed that measures of flexibility had trivial, non-significant associations with CHS (zr = − 0.04 [95% CI − 0.33 to 0.26], r = − 0.04 [95% CI − 0.32 to 0.25], Z = − 0.23, p = 0.82). Tests for heterogeneity were identified as high (I2 = 67.53%, Q = 21.72, p = 0.006), and there was no evidence of small study bias (z = − 0.13, p = 0.89). Flexibility was measured through several individual assessments, including the functional movement screen tests [32], sit-and-reach test [30, 36, 38], seated trunk rotation [15, 33, 38], standing trunk flexibility [15] and hip internal and external rotation [38].

3.4.9 Balance

Collectively, results showed that measures of balance had trivial, non-significant associations with CHS (zr = − 0.06 [95% CI − 0.46 to 0.34], r = − 0.06 [95% CI − 0.43 to 0.33], Z = − 0.29, p = 0.77). Tests for heterogeneity were identified as trivial (I2 = 12.95%, Q = 1.79, p = 0.41), and there was no evidence of small study bias (z = − 1.28, p = 0.20). Balance was measured through two distinct methods, the Balance Error Scoring System (BESS) test [30] and the Biodex System Score [36].

3.4.10 Muscle Capacity

Collectively, results showed that measures of muscle capacity had trivial, significant associations with CHS (zr = 0.17 [95% CI 0.04–0.31], r = 0.17 [95% CI 0.04–0.30], Z = 2.51, p = 0.01). Tests for heterogeneity were identified as trivial (I2 = 0.00%, Q = 2.34, p = 0.67), and there was no evidence of small study bias (z = 0.52, p = 0.60). Capacity was measured via maximum repetitions during push-ups [27, 36], pull-ups [14, 27], dips [14], sit-ups [14, 36], isometric prone hold [34] and wall squats [36].

4 Discussion

This systematic review with meta-analysis set out to determine the associations between physical characteristics and CHS in golfers. Significant small to large pooled associations were evident between CHS and lower body strength, upper body strength, a range of jump variables, upper body explosive strength, and measures of anthropometry. Interestingly, trivial non-significant associations were evident between CHS and measures of flexibility and balance. The forthcoming sub-sections will address these associations in detail, for each physical characteristic. However, it should be noted that significant heterogeneity was evident for (1) lower body strength, (2) jump displacement and (3) flexibility, indicating that some level of caution should be applied when generalising our findings for these physical characteristics to a wider golf population.

4.1 Lower Body Strength

Collectively, small correlations were reported between lower body strength and CHS (Fig. 2). These findings are in partial agreement with a review by Ehlert [13], who reported weak to strong correlations with CHS (r = 0.27–0.66). Despite this relationship being small, we maintain that strength is a key physical attribute for golfers to develop [12]. First, a golfer who possesses strength in the lower extremities is likely to have a more ‘stable base’, which probably allows for a more effective transfer of force both through the kinetic chain [2, 38] and between limbs during the inevitable changes in centre of pressure [42]. Previous research has suggested a golfer’s ability to efficiently produce and transfer force up through their body is advantageous for producing high CHS [43]. Second, the duration of the downswing typically lasts < 0.30 s [44, 45]. However, an essential component for golfers is the ability to separate the hips and the thoracic spine during the backswing (commonly termed the X-factor stretch), which results in large spikes in vertical ground reaction force before the downswing is initiated, coupled with changes in centre of pressure towards the lead leg [46]. Consequently, this conscious increase in force production is likely to be closer to 0.4–0.5 s before impact occurs, which is not dissimilar to the time required to produce maximal force [47]. This theory is partly supported by the current findings of this review and previous cross-sectional studies, where highly skilled golfers possessed greater levels of strength than their less skilled counterparts [34, 48].

Most commonly, squat variations were utilised as assessments of lower body strength [14, 29, 31, 32]. Despite 1RM free-weight testing being a commonly used assessment method for assessing lower body maximal strength, the hack squat demonstrated similar associations with CHS, and therefore may be a useful tool for practitioners who have access to the required equipment and are working with golfers who have limited resistance training experience. Further studies utilised the use of force plates to perform IMTP assessments presenting associations between peak force and CHS [26, 28, 41]. When force plates are a viable option, IMTP assessments can provide an accurate assessment of maximal force production in the lower body, which has been highlighted as having positive associations with CHS in European Tour golfers [41]. Ideally, when assessing lower body strength, multi-joint protocols (e.g. IMTP, 1–3RM back squat) should be prioritised over single-joint methods (e.g. isokinetic dynamometry), because they better (although not perfectly) represent the movement demands of a golfer [2, 38].

4.2 Upper Body Strength

Upper body strength showed small associations with CHS (Fig. 3). Similar to the aforementioned narrative for lower body strength, upper body strength should not be ignored. First, and potentially most importantly, if upper body strength is improved, it will increase the ability for relevant tissues to withstand load and may logically reduce the likelihood of injury. In turn, this may assist in maximising availability for practice and competition, ensuring golfers are able to continually swing at high speeds [49]. This suggestion is part of the Probability of Performance Model created by Brearley et al. [49], which indicates the most influential impact of physical training is its ability to maximise a player’s availability to practice and compete. Second, strength is the foundation for optimising speed and power development [50], and pooled associations between CHS and upper body explosive strength were noticeably larger than for upper body strength (discussed in Sect. 4.4).

Several assessments were employed to measure upper body strength, with grip strength being the most prominent [14, 15, 29, 38]. This finding is consistent with previous research, showing an association between lead side grip strength and golf performance. Komi et al. [51] reported that the lead hand exerts the most pressure during the downswing phase to control the swing, helping to transfer to greater CHS. Three assessments utilised multi-joint exercises for upper body strength (bench press, pec-dec machine and golf swing cable chop), also reporting positive associations with CHS. However, the pec-dec machine operates in a single plane of motion (which is not representative of the movements in golf), and the cable chop, whilst a useful supplementary exercise for golfers, is not an appropriate measure of true upper body strength. Thus, multi-joint assessments such as the bench press that assess a percentage of RM or velocity loss (if using appropriate technology) are recommended.

4.3 Lower Body Explosive Strength: Jump Displacement, Impulse and Peak Power

Despite these appearing as separate metrics in the results, we have chosen to discuss these together, given that impulse is the key determinant for how high or far someone can jump [52]. In addition, one study utilised the standing long jump as an assessment method [27], resulting in the terminology ‘jump displacement’ rather than ‘jump height’ being used. Several studies assessed the association between jump height and CHS, with moderate associations evident (Fig. 4). Despite jump height being the most typical outcome measure reported from jump testing (and the moderate associations reported in this review), we argue that it may not be the most appropriate metric to report for golfers. First, with CHS being a critical performance indicator in golf, this can be positively affected by an increase in body mass. However, increases in body mass may also be counter-productive to concurrently trying to improve jump height. Second, with additional body mass likely being of greater importance to golf than the ability to jump high, a better jump variable is likely to be ‘jump momentum’ (mass × velocity). This is because CHS is driven by angular momentum, which body mass contributes to [25, 26]; thus, the ability to generate momentum is likely to be a better proxy measurement of CHS than jump height, which is limited by body mass. Furthermore, in an ideal scenario, practitioners should evaluate jump performance using force plates, enabling a more detailed insight into both outcome measures and jump strategy [53]. However, even if practitioners only have access to a jump mat or a smartphone application, Wells et al. [25] showed how inverse dynamics can be used to quantify net impulse, when the only metrics available are jump height and a player’s body mass. Thus, with momentum being a key metric in golf, and the change in momentum being directly proportional to the impulse generated during a jump, monitoring impulse (or momentum) should be prioritised over the commonly reported outcome measure of jump height, which is supported by the initial large associations noted with CHS (Fig. 5).

Similar to jump height, pooled associations for peak power showed moderate associations with CHS (Fig. 6). Practically speaking, with power being a representation of force × velocity, and with the importance of force already discussed, it stands to reason that peak power is a useful metric to monitor as well. Put simply, the more force that can be applied during the high-velocity nature of a swing will result in increases in power production. This is also supported by Nesbit and Serrano [54], who noted that a corresponding decrease in both peak power and CHS were present with an increase in handicap (i.e. lesser skilled golfers). However, it should be acknowledged that lesser skilled golfers may possess similar force capabilities but may be less able to transfer this to their swing. Furthermore, when viewing Fig. 6, only one study [35] exhibited a CI that crossed zero, highlighting the consistent nature of peak power during jumping and its association with CHS.

4.4 Upper Body Explosive Strength

Our data showed moderate associations between measures of upper body explosive strength and CHS (Fig. 7). As previously discussed, the upper body is likely to have less time to produce force than the lower body. Consequently, ballistic force production or explosive strength may be more important for the upper body (compared to strength), and our data support this suggestion, given the considerably higher pooled associations with CHS. These findings are in agreement with Hume et al. [2], who suggested that muscles of the upper extremities are highly active during the swing to produce a powerful stretch–shortening cycle activity [2]. Specifically, the pectoralis major has been reported as the most active muscle of the upper body (when reported via electromyography as a percentage of maximal voluntary isometric contraction) during the early (64% on the trail side) and late downswing (93% trail and lead side) phases [55]. With this in mind, it seems prudent to suggest that upper body ballistic exercises, which require rapid activation of the chest muscles, would form appropriate assessments for physical profiling golfers. The most common methods utilised in the research to date are various medicine ball throws, most likely because they are easy to implement and provide a quick outcome measure of distance thrown. Whilst this method is not being down played, some of these assessments were done standing and would include major contributions from the lower body (which cannot be fully quantified). Thus, if medicine ball throws are considered the most viable option, it is suggested that these are done in a seated position, to isolate the assessment to the upper body. In addition, further research should consider alternative methods of assessment such as the bench press throw, given that it is also isolated to the upper body, is ballistic in nature and has strong involvement of the pectoral muscles.

4.5 Anthropometry

Pooled associations showed that measures of anthropometry (e.g. height, body mass, limb length) had small associations with CHS (Fig. 8). However, the nature of how these data are reported needs careful consideration. For example, six out of seven studies had pooled data, indicating that more than one measurement was utilised during the Fisher’s z transformation. As such, it becomes challenging to truly determine which measure of anthropometry (e.g. height, body mass, limb length) has the greatest influence on CHS. Theoretically, it seems plausible to suggest that golfers with greater body mass will possess more muscle mass and, therefore, likely be able to produce more force during the golf swing. Aside from mass-related measures, all additional assessments of anthropometrics consisted of non-modifiable factors such as height and limb length. Although little can be done to change these non-modifiable factors, they should not be ignored. First, it should be noted that longer limbs may enable a longer and wider path length when swinging the club, which may have positive effects on rotational torque and power [2]. For example, MacKenzie et al. [56] reported that if hand path length was increased by 0.12 m, then CHS would likely increase by approximately 2.7 mph. Second, where physical training is concerned, longer limbs may have an impact on which exercises are selected for players. For example, achieving full-depth squats may be harder for some taller players, and in such cases, additional options such as box, pin or partial squats may be a viable alternative enabling the desired physical adaptation to still be achieved.

4.6 Flexibility

Pooled correlations showed trivial findings between flexibility and CHS (Fig. 9), which is in agreement with previous literature [13]. These findings are perhaps some of the most interesting in this meta-analysis, given the commonly held belief amongst golfers and practitioners that stretching is an essential component of training. This is supported in a recent survey by Wells and Langdown [57], who reported that ‘flexibility and stretching’ was the most common training modality employed by highly skilled golfers (handicap = 0.42 ± 2.81) during the in-season. It is important to reiterate that our findings show there was a wide variety of assessments used, some of which seem highly irrelevant for golfers (e.g. sit-and-reach test). Therefore, the lack of important findings may be a consequence of poor ecological validity of some of the selected assessments, rather than a complete lack of importance of flexibility altogether. Furthermore, given the previously mentioned importance of separating the hips and the thoracic spine, it should be acknowledged that if a golfer is unable to do that sufficiently, it is likely that they will simply find an alternative movement strategy to achieve the outcome they need during the swing [58]. Such examples may include greater internal rotation of the hip or knee bend on the lead leg during the back swing, enabling the desired level of rotation prior to the start of the downswing. Consequently, it is suggested that more appropriate measures of flexibility are chosen, such as seated thoracic rotation, enabling the upper body (i.e. thoracic spine region) to be isolated, which has been suggested in recent golf literature [12]. Additionally, and as a final point of consideration, it seems plausible to suggest that flexibility may be more bespoke in its application for golfers than other physical characteristics, given it is likely to be based on a player’s preferred movement approach to a given task – namely, the swing. Put simply, increasing flexibility may open up a wider variety of shot options for some players who need improvements, whilst for others it may not be necessary. In contrast though, and given the results in Figs. 5, 6 and 7, it is hard to see how any golfer would not benefit from the ability to enhance force production ballistically.

4.7 Balance

Pooled correlations showed trivial findings between balance and CHS (Fig. 10), inclusive of only two studies [30, 36], which showed noticeably different results. First, Marshall and Llewelyn [30] conducted an associative analysis with sample sizes of five (for both males and females), which is undoubtedly too small to provide any meaningful conclusions as to the link between balance and CHS. In contrast, and although employing a different balance assessment, Loock et al. [36] evaluated a sample of 101 golfers, accounting for nearly 85% of the weighting in our meta-analytical statistics for this physical characteristic. This provides considerably stronger evidence concerning the link between balance and CHS, which appears to be negligible. Intuitively as well, despite shifts in centre of pressure during the swing [42], golfers are very rarely off-balance and never take a shot standing on one leg. Thus, it becomes challenging to suggest that balance exercises or assessment protocols should form a staple part of a golfer's training or testing practices.

4.8 Muscle Capacity

Pooled correlations showed trivial, but significant, associations between measures of muscle capacity and CHS (Fig. 11). First, it is important to outline that measures of muscle capacity were completed in bodyweight assessments, consisted of both upper and lower body protocols, and utilized outcome measures of either maximal repetitions or maximal time. Despite these data being significant, they highlight a very important distinction when compared with data for lower body or upper body strength. As the results show, associations were noticeably lower than for lower and upper body strength, which we believe to be an accurate representation of the physical requirements for golf. For example, despite the swing being a highly repetitive action, long periods of rest are provided between shots during competition, enabling sufficient recovery before the next shot is taken. Furthermore, some elements of the game are underpinned almost exclusively by skill (e.g. putting and chipping), with limited requirements for physical capacity. Thus, it stands to reason that maximal and explosive force production are of greater importance than muscle capacity or endurance, especially when considering maximal effort shots (e.g. using a driver, wood or long iron). With this in mind, practitioners should not only consider whether muscle capacity assessments offer any real benefits that support decision-making in golf, but also whether training methods that develop this muscular adaptation are truly needed. Although somewhat anecdotal, we believe that training programmes in the gym should be more focused on maximal strength and ballistic force production, with typically lower repetitions (i.e. ≤ 6) than are often achieved in these muscle capacity assessments.

4.9 Summary

Given the magnitude of data reported and discussed in this review, we have provided a summary plot reporting the mean summary effect estimates for all physical characteristics and their association with CHS (Fig. 12).

Fig. 12
figure 12

Summary forest plot showing the association (Fisher’s zr) between different physical characteristics and golf clubhead speed. CI confidence interval

4.10 Limitations and Directions for Future Research

The findings of this current review should be analysed with the acknowledgement of a few limitations, which in turn provide useful guidance for future research suggestions. First, several protocols were often selected for any given physical characteristic. For example, lower body strength was measured using a 1RM back squat, hack squat, IMTP and even isokinetic dynamometry. Although all provide some measure of lower body strength, they do not measure the same thing, and where possible, greater consistency should be utilised between assessment protocols and studies, with our aforementioned suggestions throughout the discussion, for each physical characteristic, considered for the sport of golf. Specifically, dynamic assessments of lower body strength (e.g. 1RM back squat) appear to show stronger associations with CHS than isometric assessments (e.g. IMTP). Thus, RM-based protocols maybe favourable when assessing golfers, although it should be acknowledged that such methods require substantial time investment, which may not always be possible. For upper body strength, there was a distinct lack of consistent protocols between the included studies. However, in line with suggestions for lower body strength, RM-based protocols for the bench press may be useful given the dynamic nature of the assessment and primary focus on the pectoralis muscle group. Assessments that measure explosive force production should be prioritised given the noticeably larger associations with CHS reported in this review, and can be achieved using relatively simple methods. For example, jump testing on force plates is now common in sport science, which will enable the quantification of net impulse (noting that this is the metric with the strongest association with CHS). However, even if practitioners only have access to a jump mat or smartphone application, inverse dynamics can be used to convert jump height into a net impulse value [41]. In addition, medicine ball throws serve as a simple and useful assessment of explosive force production capability in the upper body. Finally, and importantly for those working in golf, many studies using measures of flexibility to date were questionable. If it is deemed that this physical characteristic should be assessed, it is suggested that practitioners consider using the seated thoracic rotation assessment as a means of quantifying ‘flexibility’ [12]. This method provides an assessment of rotational mobility (as required in golf), which also serves to separate movement between the hips and thoracic spine, a concept known to be critical for the development of CHS [46].

Second, very few studies included have been conducted in female golfers, and this huge disparity between sexes is evident in a recent meta-analysis by Robinson et al. [58], which reported only three associative studies and six training interventions assessing the associations between and effects of physical training on CHS. Thus, future research should aim to address this imbalance. Third, it is very rare to see individual data analysis in golf studies [59], which is a surprise given the individual nature of the sport. Moving forward, we suggest that practitioners consider quantifying ‘true change’ in physical performance individually, by establishing whether differences are greater or less than the player’s associated test variability score. Specifically, when multiple trials of a given test are conducted, the standard deviation can be used to determine the natural bandwidth of variability for that athlete. Thus, and assuming the test in question exhibits acceptable levels of reliability, the standard deviation becomes the ‘target score’ by which practitioners can assess whether change is real [12]. Finally, future studies may wish to consider utilising some shot-related metrics beyond CHS and branch into additional skill parameters such as strokes gained. Brennan et al. [11] proposed a framework for monitoring golf performance measures that link to strokes gained off the tee. More specifically, it is common for players and coaches to use launch monitor technology (e.g. Trackman, Flightscope) to quantify shot performance during practice. Many of these use radar technology, which is what is used when tracking missiles. Put simply, the technology is looking for an object flying through the air, so it seems more logical that metrics such as ball speed and carry distance are monitored, potentially over CHS, which has been previously suggested [60].

5 Conclusion

In summary, our findings showed trivial to large associations between physical characteristics and CHS. More specifically, measures of explosive force production appear to be more important than maximal strength measures in both the lower and upper body. Metrics such as impulse and peak power are useful proxy measures from jump testing, and medicine ball throws serve as a practical method of assessing explosive force production in the upper body. In contrast, the association between flexibility/balance and CHS appears to be minimal (noting that the quality of evidence for these two physical characteristics is questionable). Practitioners can use these findings to better establish more appropriate physical tests that relate to CHS, which in turn can be used to provide appropriate benchmark data when profiling a golfer’s physical capacities.