Review Article | Open Access
Nicholas M. Beltz, Ann L. Gibson, Jeffrey M. Janot, Len Kravitz, Christine M. Mermier, Lance C. Dalleck, "Graded Exercise Testing Protocols for the Determination of VO2max: Historical Perspectives, Progress, and Future Considerations", Journal of Sports Medicine, vol. 2016, Article ID 3968393, 12 pages, 2016. https://doi.org/10.1155/2016/3968393
Graded Exercise Testing Protocols for the Determination of VO2max: Historical Perspectives, Progress, and Future Considerations
Graded exercise testing (GXT) is the most widely used assessment to examine the dynamic relationship between exercise and integrated physiological systems. The information from GXT can be applied across the spectrum of sport performance, occupational safety screening, research, and clinical diagnostics. The suitability of GXT to determine a valid maximal oxygen consumption (VO2max) has been under investigation for decades. Although a set of recommended criteria exists to verify attainment of VO2max, the methods that originally established these criteria have been scrutinized. Many studies do not apply identical criteria or fail to consider individual variability in physiological responses. As an alternative to using traditional criteria, recent research efforts have been directed toward using a supramaximal verification protocol performed after a GXT to confirm attainment of VO2max. Furthermore, the emergence of self-paced protocols has provided a simple, yet reliable approach to designing and administering GXT. In order to develop a standardized GXT protocol, additional research should further examine the utility of self-paced protocols used in conjunction with verification protocols to elicit and confirm attainment of VO2max.
1. Brief History of Graded Exercise Testing
The examination of the dynamic human physiological responses during incremental exercise has been an ever-evolving task for nearly 200 years. Beginning as early as the 18th century and continuing through the 19th century, pioneering physiologists such as Antoine Lavoisier and Nathan Zuntz have been credited with the first scientific examinations involving exercising humans under normal and hypoxic conditions. In 1918, Lambert described the use of a series of exercise tests to examine the impact on blood pressure to establish a reliable index of myocardial efficiency . Inspired by Lambert and the foundational works of Francis Benedict, Goran Liljestrand, and August Krogh, British physiologist Archibald Vivian (A. V.) Hill conducted a fundamental series of experiments that remain the genesis of exercise physiology as an academic discipline . Using Douglas bags to collect expired air samples, Haldane gas analyzers to determine fractional concentrations of oxygen and carbon dioxide, and a Tissot gasometer to measure air volumes, Hill and colleagues [3–6] repeated running trials of increasing speeds to plot the relationship between intensity and oxygen uptake (VO2). Interestingly, it was concluded that a “ceiling” or upper limit in the maximal uptake of oxygen (VO2max) existed . It must be appreciated that a difference exists between VO2peak and VO2max and that these terms are often used interchangeably in the literature. That is, VO2peak is the highest value attained during exercise and represents an individual’s exercise tolerance while VO2max represents the highest physiologically attainable value . Interestingly, a VO2max is always a peak but a VO2peak is not always maximal. The difference between VO2peak and VO2max is often determined by the presence of a VO2 “plateau,” although the plateau depends on many protocol variables. This review will further expound on this point. Hill’s experiments and a century of observations before him led to the discovery of graded exercise testing (GXT), the gold standard for quantifying cardiorespiratory fitness (CRF) and measuring VO2 during incremental to maximal exercise.
2. Applications of GXT
Graded exercise testing is used to observe the dynamic relationship between exercise workload and the integrated cardiovascular, pulmonary, musculoskeletal, and neuropsychological systems . Protocols require a systematic and linear increase in exercise intensity over time until the individual is unable to maintain or tolerate the workload. Selected cardiovascular, pulmonary, and metabolic variables are collected during the test to evaluate exercise tolerance and represent the efficiency in which the cardiovascular system is able to deliver oxygenated blood to working skeletal muscle and the ability of muscle to utilize oxygen. Due to the widespread use of GXT in healthy populations, normative criteria have been established to help practitioners identify metabolic and ventilatory patterns. Moreover, these metabolic and ventilatory patterns may even assist in categorizing cardiovascular disease (CVD) states and prognoses .
The assessment of exercise tolerance has been used to establish the relationship between CRF, CVD, and all-cause mortality . An early investigation by Blair et al.  examined the relationship between fitness and mortality in 32,421 men and women (20–80 years old) encompassing 264,978 living-years and 690 deaths. Their results were in agreement with a previous investigation  that found substantial strength and independence of low CRF as a predictor for all-cause mortality and future CVD. Similarly, a meta-analysis by Kodama et al.  showed that a 1-MET (3.5 mL·kg−1·min−1) increase in VO2max was associated with a 13% and 15% reduction in risk of all-cause mortality and CVD, respectively. Furthermore, a threshold to classify a substantially higher risk for all-cause mortality and CVD was established as low CRF (<7.9 METS) . The evidence clearly demonstrates the influence of low CRF as an independent precursor to mortality and underpins the application of standardized protocols to monitor VO2max in specific populations. These studies highlight the importance of accurate and reliable GXT results for minimal exercise tolerance and clinical evaluation of health status [10–12]. Furthermore, valid GXT results are relevant when interpreting studies using repeated measurements of VO2max (and not VO2peak) to establish a training effect or design exercise prescription.
Accurately capturing the dynamic physiological responses during GXT is essential to establish a valid VO2max and to quantify CRF responses throughout various training interventions. Many independent factors contribute to the varying opinions surrounding the appropriateness of current standardized GXT guidelines, thereby limiting the ability to compare results between tests and population or apply them. The modes for administering GXT are traditionally limited to cycle and treadmill, each resulting in unique physiological responses. Protocol design variables such as stage length, workload increment per stage, and total test duration may individually limit the accuracy of GXT. Furthermore, the criteria used to confirm attainment of VO2 plateau are not consistent or universally applied among studies. This review will examine the limitations within the current recommendations for GXT and highlight the importance of the continuous search for identifying an optimal protocol.
3. Applications of the Fick Equation
The importance of seminal works by Hill et al. [4–6] is apparent considering that the fundamental basis for quantifying oxygen transport, utilization, and mitochondrial energy production remain the same today as they did nearly 100 years ago. The Fick equation states that VO2 is equal to the product of cardiac output () and the difference between arterial and venous oxygen content at the level of the capillary ().
The equation can be expanded to represent as the product of heart rate (HR) and left ventricular (LV) stroke volume (SV), with SV being parsed into the difference between LV end-diastolic volume (EDV) and end-systolic volume (ESV).
Altogether, the components of Fick represent individual central () and peripheral (a-) factors. The central component consists of factors that impact the diffusion of O2 from the external environment into the arterial blood supply and transport of oxygenated blood to working skeletal muscle tissue. The peripheral component comprises various cellular and molecular mechanisms at the skeletal muscle level to diffuse O2 from arterial blood to the mitochondria for consumption in the process of ATP regeneration .
3.1. Implications of the VO2max Protocol
It has been well established that increases at a linear rate similar to VO2 upon the initiation of incremental to maximal exercise [14–18]. In response to metabolically induced peripheral vasodilation in working skeletal muscle, blood pressure is maintained by increases in HR and SV . Central medullary control of baroreceptors, chemoreceptors, and vascular tone contributes to the withdrawal of parasympathetic activity coupled with an increase in sympathetic drive. The result is an overall increase in chronotropic and inotropic characteristics of the heart.
Generally, HR increases linearly during incremental to maximal exercise; however, a breaking point (i.e., HR threshold) is eventually obtained after which the slope may increase or decrease until maximal heart rate (HRmax). The HR threshold is an individual phenomenon that may indicate chronotropic insufficiency . Moreover, the significance of the flattened HR response following HR threshold, in particular, may be associated with a downregulation in beta-1 adrenergic receptor activation during greater exercise intensities. For instance, a study by Knight-Maloney et al.  examined HR responses during incremental to maximal exercise in 14 healthy individuals and found that eight subjects demonstrated a decelerated post-HR threshold response while six subjects showed an accelerated post-HR threshold response. These findings are in agreement with previous research that demonstrated the intersubject variability in HR responses [20–24].
Alongside the increasing inotropic response, shifts in sympathetic nervous system dominance raise chronotropic activity and influence central mechanical changes during incremental to maximal exercise. Neural drive enhances myocardial contractility, reducing ESV. Additionally, the intramuscular oscillations during exercise promote an increase in venous return to the heart. The improved blood flow to the LV enhances preload and promotes LV myocardial stretch, increasing elastic potential energy for additional contractile force. This is known as the Frank-Starling mechanism . The net effect is an increase in SV, contributing to an increase in during incremental to maximal exercise. Contrary to traditional thought, SV within a healthy population may exhibit individual linear or plateau responses that are dependent on many factors.
Pioneering work by Astrand et al.  evaluating the SV response to incremental to maximal exercise established the widely accepted observation that SV plateaus are at approximately 40–50% of VO2max [27, 28]. This finding was primarily attributed to tachycardiac limitations on diastolic filling time, therefore reducing EDV and blunting SV response . More recently, a review by Vella and Robergs  underscores that the potential determinants of the intersubject variability in SV are more complex than originally established. Factors such as age, fitness level, and sex contribute to four main SV responses: the classic plateau, plateau with subsequent drop, plateau with subsequent rise, and gradual increase. While some studies have reported linear SV responses during incremental to maximal exercise in older individuals [31, 32] other studies showed that SV exhibited either a plateau or a subsequent drop at nonspecific points during incremental to maximal exercise [18, 33–35]. Although it may be logical to suggest that age has a negative impact on the maintenance of SV near maximal exercise due to reductions in myocardial compliance, the overall relationship between age and SV response remains unclear. Similarly, individual fitness level does not reliably predict the trend in SV during exercise. Some investigations showed a constant increase in SV up to maximal [36–39] and near maximal intensities  in trained individuals while others reported a plateau in both trained and untrained subjects [32, 41] and progressive increases until maximal exercise in untrained subjects [42–44]. One could postulate that adaptations consequent to aerobic training enhance SV through combinations of the following: increased blood volume leading to greater EDV, increased LV chamber size, improved LV compliance, greater myocardial contractility, and reduced afterload may explain the individual ability to increase SV progressively through incremental to maximal exercise .
3.2. A-vO2diff Implications of the VO2max Protocol
It has also been accepted that an increase in was the sole component of maintaining VO2 during the onset of exercise due to a potential lag between oxygen demand and venous return [45–47]. This was tested by Casaburi et al. , who found that pulmonary artery desaturation occurred as soon as four seconds after the onset of 150-Watt cycle exercise. The results reported by the Casaburi et al.  were questioned by De Cort et al.  and attributed to immobilized vena caval blood. Compared to Casaburi et al. , who measured a- upon exercise from rest, De Cort et al.  began measurements starting at the first increase in VO2 after the abrupt increase in cycling intensity from a submaximal level. Although a- may improve with aerobic training [49, 50], the cellular mechanisms contributing to oxygen extraction within skeletal muscle also increase at a predictable rate during incremental to maximal exercise. Recent meta-analyses by Montero et al.  and Montero and Diaz-Canestro  examined the effects of aerobic training on a- in untrained or moderately trained healthy young (<40 years old), middle-aged, and/or older (≥40 years old) individuals. These studies concluded that the improvements in VO2max from 5 to 52 weeks of endurance training were due to linear improvements in but not a-. Therefore, the peripheral mechanisms are viewed as a complement to the central mechanisms contributing to VO2max.
4. VO2max Protocol Critical Considerations
The foundational study by Taylor et al.  demonstrated that the sensitivity and reliability of physiological responses during GXT were limited by subject characteristics and GXT protocol design. In an attempt to investigate this issue, 115 healthy males (18–35 years) completed a wide range of GXT protocols under various conditions of physical stressors (caloric restriction, bed rest, temperature, and illness). It was the first comprehensive study to examine the sensitivity and reliability of VO2max based on modality, fitness status, illness, environment, gas sampling rate, test duration, and speed/grade increments.
Since then, there has been a search for an optimal standardized protocol suitable for the entire spectrum of fitness abilities and testing goals. The two modalities commonly used in GXT are treadmill and cycle ergometry. While the treadmill appears to be the most widely used modality due to familiarity with upright locomotion and greater muscle mass utilization, cycling protocols present an opportunity to test individuals with coordination or orthopedic limitations. Furthermore, opting to use a cycle ergometer over treadmill may result in a more quantifiable workload (Watts) and provides an opportunity to use a progressive ramp protocol allowing for more reproducible outcomes . However, VO2max attained using treadmill protocols tend to produce up to 20% greater VO2max values when compared to cycle protocols [54, 55]. This difference is attributed to a larger recruitment of exercising skeletal muscle mass, and a-, vascular conductance, and a lower rate of carbohydrate oxidation leading to a less severe development of metabolic acidosis at submaximal intensities [15, 56–60].
Realizing the need to investigate physiological responses to the earliest standardized GXT protocols, Pollock et al.  compared cardiopulmonary responses between four widely used treadmill testing protocols in 51 men (22 active, 29 sedentary): Balke , Bruce , Ellestad , and modified Astrand . Each test differed in the method of increasing work rate in a step fashion (either speed or grade). The Balke protocol maintains a constant speed (3.3 mph) but increases grade by 1% each minute. The Bruce protocol increases speed and grade every 3 min. The Ellestad protocol increases speed each stage until the 10th minute upon introduction of a single increase in grade (to 5%) followed by subsequent increases in speed. Finally, the Astrand protocol maintains a constant running speed with increase in grade (2.5%) every 2 min. Pollock et al.  observed a similar VO2max achieved between Balke, Bruce, Ellestad, and Astrand protocols (39.4, 40.0, 40.7, and 41.8 mL·kg−1·min−1, resp.) despite the difference in VO2 plateau attainment (69%, 69%, 59%, and 80% of participants, resp.). Interestingly, this shows that the individual characteristics of similar protocols do not impact VO2max but show inconsistencies in plateau. Their finding was one of the first to demonstrate the impact of protocol design characteristics on the attainment of VO2max and a VO2 plateau.
Early investigations by Whipp et al.  and Davis et al.  popularized the use of ramp protocols on electronically braked cycle ergometers. It was proposed that ramp cycle protocols would improve an individual’s ability to reach VO2max because the ramp increased work in a much more continuous fashion when compared to step increases in work rate used in traditional treadmill protocols . Since workload was deliverable in a linear fashion, attention turned to examining the slope of the VO2-work rate (ΔVO2/ΔWR) relationship. Buchfuhrer et al.  compared cycle tests of various changes in work rates (15 W·min−1, 30 W·min−1, and 60 W·min−1) and noted that ramping at an intermediate rate (30 W·min−1) produced the greatest VO2max values; however, the work rate was dependent on fitness status. In a similar study, Zhang et al.  compared multiple work rates (15 W·min−1, 20 W·min−1, and 30 W·min−1) applied in continuous ramp versus step (1 min, 2 min, and 3 min) fashion. Interestingly, no differences were found in aerobic parameters (VO2max, anaerobic threshold (AT), AT/VO2max, and ΔVO2/ΔWR) between any of the protocols. Furthermore, this study emphasized the importance of work rate increments independent of the stage length used. In contrast, Myers et al.  showed that ramp protocols (treadmill and cycle) represented a higher correlation between VO2 and workload compared to step, thus reducing the error in predicting the metabolic cost at individual workloads. Muscat et al.  completed the most comprehensive investigation to date, comparing physiological responses (cardiometabolic function, gas exchange, breathing patterns, pulmonary function, and leg discomfort) between ramp treadmill and ramp cycle protocol matched for work increase (25 W/2 min) in 15 healthy young men. It was concluded that VO2, VCO2, respiratory exchange ratio (RER), HR, O2 pulse, ventilation, and respiratory muscle effort (diaphragm) responses were greater at maximal and submaximal workloads for treadmill compared to cycle exercise; however, the responses in ventilatory equivalents and ventilatory thresholds were similar. Their results suggested that either mode may be applied for purposes of evaluating mode-specific fitness and determining optimal training prescriptions when work rate increases are applied in a ramp fashion.
Similar to protocol mode (cycle versus treadmill), stage length, and work rate increments (ramp versus step), GXT protocol duration should be considered when comparing results. Buchfuhrer et al.  utilized 1-minute stage protocols to examine the impact of protocol duration on the achievement of VO2max. Since VO2max values were greater in protocols lasting between 8 and 17 min compared to tests outside these limits, the current duration recommendation of 8–12 min was established . More recently, Yoon et al.  suggested that the Buchfuhrer et al.  study lacked appropriate statistical power and commented that overall test duration (5–14 min) may depend on age and training status [70–72]. Yoon et al.  compared VO2max and incidence of VO2 plateau across four protocols of different durations (5, 8, 12, and 16 min) using a cycle ramp protocol in moderate-to-highly trained individuals. They found that VO2max was higher in men for the 8-minute protocol compared to 5-, 12-, and 16-minute protocols, while there was no difference in VO2max in women. Further statistical analysis attributed this sex difference to lower fitness levels and perhaps more importantly the lower muscle mass (~45 kg in women versus ~67 kg in men) in women participating in the study. The impact of fitness level on test duration suggests that the steeper ΔVO2/ΔWR slope in shorter protocols may be disadvantageous to individuals of lower fitness. This could be due to increased reliance on nonmitochondrial energy systems, thus causing premature fatigue, as well as eliciting central cardiovascular limitations.
5. VO2max Protocol Paradigm Shift
For the past 60 years the push to standardize GXT procedures has been essential to progress understanding of the complex and sensitive interaction between exercise and the integrated human physiological responses. In spite of the advancements that have given test administrators the ability to control fixed increments of intensities in an open-loop fashion (constant administrator testing variable manipulation without a fixed termination time), recent research has introduced an alternative approach to exercise protocols that allow the subject to self-pace the protocol in an incremental format [73–75]. In effect, this type of protocol would not negate past research that emphasized the role of the heart, lungs, circulatory, and other integrated systems in a limiting capacity but rather challenge the role of the brain as a potential simultaneous regulator. Although not entirely self-paced, studies by Pollock et al. [61, 76] used a model where speed was adjusted to accommodate individual movement efficiency and workload was increased by adding treadmill grade. In an effort to appreciate the evolution of self-paced protocols, the Pollock et al. [61, 76] investigations underscore the importance of identifying movement speeds that engage the greatest amount of muscle mass. Hagerman  was the first to report the ability of an individual to reach a greater VO2 by self-pacing during a simulated competitive time trial compared to laboratory-based testing methods. The finding suggested that an individual’s ability to self-regulate muscular power output may serve as the ultimate variable in maximizing physiologic responses. Intrigued by this finding, Foster et al.  compared the physiological responses between a self-paced laboratory 5-km cycle time trial and a GXT using a cycle ergometer. They found that maximal VO2, HR, ventilation, and blood lactate (BLa−) levels were significantly greater in the 5-km time trial compared to the cycle ergometer GXT. These initial findings raised an important fundamental question regarding exercise testing protocols: if an individual possesses the ability to achieve a greater physiological ceiling when self-paced, does the search for achieving standardized protocol procedures serve a pragmatic purpose?
Eston and Thompson  used a closed-loop (no constant administrator manipulation and a fixed termination time) perceptually-regulated protocol guided by the 6–20 Rating of Perceived Exertion Scale (RPE)  to estimate maximal work rate in patients receiving β-blocker treatment. Using 4 × 3 min stages at an RPE of 9 (“very light”), 13 (“somewhat hard”), 15 (“hard”), and 17 (“very hard”), they reported that RPE could be used to predict maximal functional capacity. Similar RPE protocols using various stage lengths (2, 3, and 4 min) have been validated to accurately predict VO2max [74, 80–83]. Using a similar perceptually regulated paradigm, Mauger and Sculthorpe  investigated a self-paced cycle exercise protocol in 16 untrained university students. The test design consisted of 5 × 2-min stages, totaling 10 min, at incremental intensities utilizing Borg’s Rating of Perceived Exertion () Scale . The protocol was designed as follows: stage one was clamped at an RPE of 11 (“fairly light”), stage two clamped at an RPE of 13 (“somewhat hard”), stage three clamped at an RPE of 15 (“hard”), stage four clamped at an RPE of 17 (“very hard”), and the final stage clamped at an RPE of 20 (“maximal exertion”). The unique aspect of the protocol was the additional stage of “maximal exertion” (RPE 20) to the established series of 2 min RPE-clamped stages as applied previously  in order to directly measure VO2max. The participants achieved a significantly greater VO2max ( versus mL·kg−1·min−1) and peak power output ( versus Watts) in the self-paced protocol compared to a traditional GXT despite the absence of significant differences in HRmax, RERmax, VEmax, and mean power output. It is important to note that the results of Mauger and Sculthorpe  have received considerable criticism over methodology. Their results have been attributed to discrepancies in test duration between self-paced ( min) and traditional ( min) protocols and that direct GXT protocol comparison must require a match in total test duration [84, 85]. Interestingly, the closed-loop nature of the test elicited a motivation or “final push” during the final stage of the test similar to that expected toward the end of an athletic competition. Moreover, the results support the role of the brain during a closed-loop setting when the individual is able to vary work rate constantly, balancing discomfort with a maintainable power output and willingness to complete the test. The simplicity of the protocol design has produced many speculative explanations for the results. Mauger et al.  showed that a speed-based self-paced treadmill test elicited significantly greater VO2max and HRmax values compared to those relative to a traditional test. This study also received criticisms over flawed methods and lack of control, attributing findings to using different modes (motorized versus nonmotorized treadmill) and neglect of measurement error to test their hypothesis [83, 87–89]. Follow-up studies have shown a higher VO2max attainment during self-pacing using a cycle ramp protocol , similar VO2max attainment using motorized treadmill [83, 91–93] and cycle protocols [84, 94], and a lower VO2max attainment using an automated treadmill . It is important to note that findings showing no difference between self-paced and traditional protocols demonstrate the potential utility for self-paced GXT protocols, particularly when considering protocol duration. While test duration is tightly controlled during self-paced testing and the incremental steps in oxygen cost between stages have been shown to fall within recommended guidelines (1-2 METS) , physiological measurements that may distinguish self-paced from traditional protocols have yet to be adequately examined. It is purported that underlying variables that comprise the Fick equation, namely, and a-, and the role of blood flow redistribution may underpin differences between self-paced and traditional protocols [92, 93]. More recently, Astorino et al.  showed that a self-paced cycle protocol elicited higher VO2 compared to a ramp protocol ( versus mL·kg−1·min−1). Additionally, they were the first to compare central cardiovascular responses between protocols and showed a higher during self-paced compared to ramp ( versus L·min−1). It should be noted that their average test duration was not tightly controlled to 10 min ( min); therefore pacing was not restricted throughout the final 2-minute stage. Although initial results are intriguing, the investigation into the efficacy and suitability of self-paced protocols is in its infancy. Therefore, future researchers could choose to design studies to expound on the intertrial reliability using self-pacing protocols. Additionally, studies should examine the interaction between central, peripheral, and central regulating responses during self-paced exercise.
6. VO2max Attainment Criteria
In order to increase the reliability and validity of a test, an undefined combination of standardized criteria must be met during the GXT including the following: VO2 plateau, estimated HRmax, RER, BLa−, and RPE. This widely accepted set of characteristics, or VO2max criteria, has become a controversial topic of debate in recent years due to the high intersubject variability in attaining the criteria [96–99]. Furthermore, the number and type of criteria used to determine VO2max are often contingent on the preference of the researcher or clinician administering the test . Along with protocol design, other factors such as metabolic data processing methods and participant effort make comparing the results for clinical or research purposes difficult [53, 99, 100].
7. Detection of a VO2 Plateau
As with many of the principles used today in exercise physiology, the original reports of a slowing or “plateau” of oxygen consumption despite increasing muscular work can be attributed to Bassett Jr. and Howley . Taylor et al.  later confirmed the existence of a VO2 plateau in 9 of 13 men during a treadmill test of incremental speed (increasing 1 mph) and 108 of 115 (94%) men during treadmill tests of incremental grade (2.5%). Studies have since demonstrated that a VO2 plateau can be detected in 17% to 100% of subjects tested, suggesting that its existence represents an inconsistent “phenomenon” . Taylor et al.  were the first to apply the VO2 plateau criterion of ≤150 mL·min−1, defined by a change in VO2 ≤150 mL·min−1 despite a continuous increase in workload. This value, alongside the wide range of other values used as plateau criteria (≤50 mL·min−1, ≤100 mL·min−1, ≤200 mL·min−1, and ≤280 mL·min−1), should not be applied universally since the criteria must reflect the expected rate of VO2 increase per unit time relative to the protocol design . Among the most important factors impacting the incidence of a VO2 plateau are age , testing modality , and data analyses methodology [99, 105]. Astorino et al.  showed that the strongest predictor of VO2 plateau among 30 men and women consisting of groups of endurance-trained (), recreationally active (), and strength trained () individuals was age, not training status, body composition, or training history. Gordon et al.  demonstrated that treadmill testing (58%) was superior to cycle ergometer testing (8%) at eliciting a plateau. The diminished plateau in cycling was attributed to the increased metabolic cost of the eccentric skeletal muscle activity in treadmill running compared to the concentrically dominant cycle exercise. Furthermore, Astorino  confirmed that gas sampling rate impacted the incidence of a plateau. In this study, 13 sedentary, 48 recreationally active, and 47 competitive athletes completed a GXT using treadmill and cycle protocols. The group found that the incidence of plateau was greater using breath-by-breath (81%), 15-sec (91%), and 30-sec (89%) averaging compared to a longer sampling rate of 60-sec (59%). Collectively, these findings suggest that sampling rate in conjunction with the plateau threshold criteria may explain much of the discrepancy in incidence of VO2 plateau across studies.
7.1. Heart Rate Response to the VO2max Protocol
Due to its noninvasive nature, simplicity, and fairly predictable response to incremental to maximal exercise, HR is often used as a secondary criterion to VO2 plateau. Much like plateau and other secondary criteria, the criteria for HR are highly variable. Typically, the threshold is established at a specific percentage using an age-predicted HRmax equation. Surprisingly, Fox et al.  created the 220-age equation by drawing an arbitrary best fit line from the observation of 10 studies . Robergs and Landwehr  evaluated the Fox equation  and highlighted the fact that there were no statistical methods used to establish the regression equation from their data set. Instead, Fox and colleagues  summarize their methods by stating that “no single line will adequately represent the data on the apparent decline of HRmax with age. The formula, 220-age, defines a line not far from many data points.” Curious to investigate the mystery of 220-age, Robergs and Landwehr  replicated the data set presented by Fox et al. , applied linear regression analysis, and found the equation to be 215.4–0.9147 (age) with a ±21 bpm error. Tanaka et al.  cross-validated an age-predicted HRmax equation by combining 351 studies (18,712 subjects) and a laboratory investigation using 514 subjects (18–81 years old). Their regression analysis established a new equation (208 − 0.7 × age) with a less, yet still substantial (±7–11 bpm), error range. It was concluded that the 220-age equation underestimates age-predicted HRmax in individuals over the age of 40 years. For these reasons, equations with less inherent error should be applied when using “±10 bpm of age-predicted HRmax” as a secondary VO2max criterion.
7.2. RER Implication of the VO2max Protocol
The RER is another secondary VO2max criterion that is used to reflect the balance between bicarbonate buffering and hydrogen ion accumulation in the face of incremental exercise. Upon increasing metabolic acidosis, bicarbonate buffering leads to an increase in CO2 production, increased ventilation, and a subsequent increase in RER. A series of studies by Issekutz et al. [109, 110] was the first to examine the relationship between RER and incremental exercise, VO2max, and the metabolic state of the exercising human. Using a series of 4-5 min intermittent maximal exercise tests, they calculated the difference between CO2 and the product of a metabolic respiratory quotient constant and VO2 (CO2 − VO2 × 0.75). This value was termed “excess CO2” and used to reflect the change in substrate utilization and exercise intensity with the increase in VO2 [109, 110]. Ultimately, they established the most widely used threshold for RER criteria (≥1.15) today. Much like HR and VO2plateau criteria, a standard RER threshold value is not consistently applied, using 1.0, 1.05, 1.08, 1.10, 1.12, 1.13, or 1.20 to verify VO2max attainment [96, 97]. The high intersubject variability in RER responses due to inconsistent effort levels and training status makes higher RER values difficult to achieve for many individuals .
7.3. VO2max and Blood Lactate Accumulation
Analysis of postexercise BLa− compared to preexercise BLa− has been used as a reliable marker to quantify exercise intensity. Hill et al. [4–6] were the first to establish a relationship between BLa− and exercise intensity during vigorous to maximal exercise. Regardless of the debate for practical application or metabolic circumstances, it is agreed that BLa− accumulation is related to an individual’s ability to tolerate and sustain exercise; therefore, it is commonly used as a surrogate measure of the metabolic perturbations during maximal exercise and a secondary VO2max criterion [111, 112]. The origins of the criterion date back to an investigation by Astrand  who used postexercise BLa− concentrations to verify VO2max in young (14–18 years) boys and girls. Despite the fact that only half of the subjects demonstrated a VO2 plateau, it was noted that the individuals who exhibited a plateau had postexercise BLa− concentrations between 7.9 and 8.4 mM (average ≥ 8 mM). It has since been determined that factors such as age, training status, sex, and overall effort may impact the level of BLa−. This has led to the use of a ≥10 mM threshold or even a complete disregard for maximal BLa− concentration due to the high intersubject variability (anywhere from 1.2 to 18 mM) in postexercise lactate [96, 98, 114].
7.4. RPE Assessment in a VO2max Protocol
The simplest and most controversial measurement traditionally used as a VO2max criterion was developed by Borg , known as Borg’s Rating of Perceived Exertion (RPE) . While RPE is not a direct measurement of physiological responses, the behavioral, motivational, and physical factors that an individual perceives during GXT contribute greatly to the overall validity of the test. Many studies have shown a strong relationship between RPE, HR, and VO2 [82, 115–117]; however, others have demonstrated RPE to be less related to these variables in less active or sedentary individuals [118, 119]. Interestingly, Noakes  suggested that thresholds may not be as simple as limitations in central and peripheral components but rather controlled by a “central governor” that regulates self-pacing and overall effort throughout various points of a maximal exercise bout. Despite the relationship among RPE, HR, and BLa−, inconsistencies in recent studies call to question the validity of RPE. Edvardsen et al.  examined the attainment of the commonly used RPE criterion of ≥17 in 840 individuals (20–85 years) and found that 84% of the subjects were able to achieve the criterion. Due to the variability and subjective nature of the criterion, Magnan et al.  examined the attainment of an RPE ≥18 in 240 inactive individuals (18–45 years) and found that 93.7% of the individuals reached the desired RPE threshold despite the fact that only 59% were demonstrating a VO2 plateau. Overall, the assumptions in employing an RPE criterion depend on the subject’s understanding of the scale and associated verbal descriptors, ability to differentiate between discomfort and physiological fatigue, and motivation.
8. Verification Protocols of VO2max
Many of the current criteria used to determine VO2max were established with technology that is no longer used today. Douglas bags and Tissot gasometers have been replaced by sophisticated metabolic analyzing systems and pneumotach and turbine flow measurement devices. Furthermore, the VO2max criteria were developed using certain modalities (treadmill versus cycle) on relatively small samples of homogenous populations. These reasons and the overall variability of the criteria have resulted in some researchers rejecting secondary criteria altogether. Recent attempts have been made to establish new VO2max attainment criteria through the introduction of a verification protocol [97, 122–126]. The origin of the verification protocol is believed to exist in a text by Thoden et al. , in which the first recommendation was made to include a bout of supramaximal exhaustive exercise (higher workload than achieved during GXT) following the completion of GXT. Since the inception of the verification protocol, there has been much effort to establish verification protocol intensity, duration, rest period after completing initial GXT, and the criteria used to verify VO2max attainment. In follow-up guidelines, Thoden  recommended that the recovery phase between GXT and verification protocol should be between 5 and 15 minutes and a workload of one stage higher than the final completed stage during GXT should be used. Unfortunately, the recommendations by Thoden et al. [127, 128] were theoretical rather than research-based guidelines. The first study to directly examine the efficacy of a verification protocol was done by NiemelÄ et al. , who performed a verification protocol at a workload equal to the greatest workload achieved during initial GXT within a week of completing initial GXT. Using a ±5% repeatability range for VO2max , they were able to confirm attainment in 8 of the 16 subjects. Day et al.  sought to observe the differences between cycle GXT and a subsequent verification test in 38 healthy individuals (19–61 years) and reported no difference in VO2max using a workload of ~90% . Follow-up studies commonly used verification workloads between 95 and 130% of peak workload achieved during initial cycle GXT or 0.5–1.6 km·hr−1 higher than the peak velocity achieved during initial treadmill GXT. The recovery time between tests ranged from the same day [122, 126, 131–134], 1–10 min active rest [122, 132, 133, 135], and 5–60 min passive rest [125, 135], to separate day testing [8, 123, 126]. Furthermore, tests were done on endurance-trained runners [122, 123], male athletes , recreationally active men and women [122, 125, 126], sedentary men and women , and middle-aged men and women . Studies commonly revealed a nonsignificant mean difference in VO2max between GXT and verification protocols when analyzed on an individual, rather than group mean, basis within the ±5% measurement error (accuracy of metabolic system when properly calibrated according to manufacturer) associated with VO2 measurement [136–138]. Most recently, a study by Nolan et al.  investigated the impact of two verification intensities and rest periods. After an initial treadmill test, 12 active males and females completed each of the following four verification conditions: 105% maximal GXT workload, 20 min rest; 105% maximal GXT workload, 60 min rest; 115% maximal GXT workload, 20 min rest; and 115% maximal GXT workload, 60 min rest. Their results demonstrated a 100% success rate in verifying attainment of VO2max when using 105% maximal GXT workload, regardless of the rest period between tests, and highlighted the current recommendation for intensity and rest period optimization during verification tests .
9. Error in VO2max Measurement
The sophisticated metabolic systems used to collect and analyze data during exercise testing represent the most sensitive and reliable means for laboratory-based research. In order to compare results between studies, the data must be validated. Even if metabolic systems are appropriately maintained and calibrated, measurement error of ±5% is commonly accepted [136–138]. A study by Yule et al.  reported a 15% difference in VO2max between three identical systems in the same laboratory. Furthermore, differences attributed to measurement error between 10 and 22% have been reported when comparing identical testing protocols using different metabolic systems . A meta-analysis by Vickers  examined the test-retest reliability in maximal exercise testing and found that the average standard measurement error was 2.58 mL·kg−1·min−1. To account for total error, however, a source of biovariation reflecting the inherent biological fluctuations within an individual must also be considered . Knowing that the underlying assumption for comparing VO2max between and within studies is that biological variability must account for a portion of total error, Katch et al.  designed a study in which five participants completed an average of 16 maximal exercise tests over the course of a 2–4-week period. They found that, within the total error of ±5.6%, biological variability accounted for ~93% of the error while measurement error accounted for only ~7%. Meanwhile, recommendations by Balady et al.  report that the biological component of variability intrinsic to GXT is commonly accepted within 3-4%. Due to the numerous factors that contribute to biological variability and subsequent total error, an argument can be made against studies that consider only manufacturers’ guidelines of measurement error when comparing tests . This can be represented by a scenario comparing VO2 responses to various protocol designs, whereas an individual who falls within the accepted measurement error range between tests may have only done so based on the small contribution in variance from the system rather than the large contribution from biological variability due to heredity, homeostatic stress, training status, psychological stress, sleep, or nutrition . For this reason, comparing attainment of VO2max between protocols may be more reliable when considering total error (biological variability + measurement) rather than measurement error alone.
Due to the valuable information gathered and the wide spectrum of applications for the use of GXT, it has become an increasingly important objective to derive an optimal set of standardized procedures for the determination of VO2max. Many years of observations examining the potential sources for individual variability in GXT responses are cited in the literature. Despite the pitfalls in physiological variability, standardized tests using traditional methods for VO2max verification remain the most commonly employed. Furthermore, the methods in which these criteria have been previously established, as well as the comparison between studies evaluating the appropriateness of universal protocols, do not consider combined sources of inherent measurement and biological error. These reasons underpin the current suitability of more standardized GXT guidelines and subsequent methods for determining test validity. More recent approaches have highlighted alternative methods for measuring exercise capacity using a closed-loop, self-paced testing model. Future research directions should seek investigating perceptually regulated (RPE-clamped) protocols with verification protocols for the overall suitability and individualization of GXT.
The authors declare that they have no competing interests.
- G. Lambert, “The exercise blood pressure test of myocardial efficiency,” BMJ, vol. 2, no. 3014, pp. 366–368, 1918.
- C. M. Tipton, History of Exercise Physiology, Human Kinetics, 2014.
- A. V. Hill and H. Lupton, “The oxygen consumption during running,” The Journal of Physiology, vol. 56, pp. 32–33, 1922.
- A. V. Hill, C. N. H. Long, and H. Lupton, “Muscular exercise, lactic acid, and the supply and utilisation of oxygen-parts i-iii,” Proceedings of the Royal Society of London B: Biological, vol. 96, pp. 438–475, 1924.
- A. V. Hill, C. Long, and H. Lupton, “Muscular exercise, lactic acid and the supply and utilisation of oxygen—Parts VII-VIII,” Proceedings of the Royal Society of London B: Biological, vol. 97, pp. 155–176, 1924.
- A. V. Hill, C. N. H. Long, and H. Lupton, “Muscular exercise, lactic acid, and the supply and utilisation of oxygen- parts iv-vi,” Proceedings of the Royal Society of London B: Biological, vol. 97, pp. 84–138, 1924.
- D. R. Bassett Jr., “Scientific contributions of A. V. Hill: exercise physiology pioneer,” Journal of Applied Physiology, vol. 93, no. 5, pp. 1567–1582, 2002.
- J. R. Day, H. B. Rossiter, E. M. Coats, A. Skasick, and B. J. Whipp, “The maximally attainable during exercise in humans: the peak vs. maximum issue,” Journal of Applied Physiology, vol. 95, no. 5, pp. 1901–1907, 2003.
- K. Albouaini, M. Egred, A. Alahmar, and D. J. Wright, “Cardiopulmonary exercise testing and its application,” Postgraduate Medical Journal, vol. 83, no. 985, pp. 675–682, 2007.
- S. N. Blair, J. B. Kampert, H. W. Kohl III et al., “Influences of cardiorespiratory fitness and other precursors on cardiovascular disease and all-cause mortality in men and women,” Journal of the American Medical Association, vol. 276, no. 3, pp. 205–210, 1996.
- S. N. Blair, H. W. Kohl III, C. E. Barlow, R. S. Paffenbarger Jr., L. W. Gibbons, and C. A. Macera, “Changes in physical fitness and all-cause mortality: a prospective study of healthy and unhealthy men,” The Journal of the American Medical Association, vol. 273, no. 14, pp. 1093–1098, 1995.
- S. Kodama, K. Saito, S. Tanaka et al., “Cardiorespiratory fitness as a quantitative predictor of all-cause mortality and cardiovascular events in healthy men and women: a meta-analysis,” Journal of the American Medical Association, vol. 301, no. 19, pp. 2024–2035, 2009.
- B. D. Levine, “VO2max: what do we know, and what do we still need to know?” The Journal of Physiology, vol. 586, no. 1, pp. 25–34, 2008.
- A. Crisafulli, F. Melis, F. Tocco et al., “Anaerobic threshold and the oxygen consumption-cardiac output relationship during exercise,” Sport Sciences for Health, vol. 1, no. 2, pp. 75–80, 2005.
- J. A. Faulkner, D. E. Roberts, R. L. Elk, and J. Conway, “Cardiovascular responses to submaximum and maximum effort cycling and running,” Journal of applied physiology, vol. 30, no. 4, pp. 457–461, 1971.
- S. F. Lewis, W. F. Taylor, R. M. Graham, W. A. Pettinger, J. E. Schutte, and C. G. Blomqvist, “Cardiovascular responses to exercise as functions of absolute and relative work load,” Journal of Applied Physiology: Respiratory, Environmental and Exercise Physiology, vol. 54, no. 5, pp. 1314–1323, 1983.
- W. W. Stringer, J. E. Hansen, and K. Wasserman, “Cardiac output estimated noninvasively from oxygen uptake during exercise,” Journal of Applied Physiology, vol. 82, no. 3, pp. 908–912, 1997.
- D. N. Proctor, K. C. Beck, P. H. Shen, T. J. Eickhoff, and M. J. Halliwill Joyner Jr., “Influence of age and gender on cardiac output-VO2 relationships during submaximal cycle ergometry,” Journal of Applied Physiology, vol. 84, no. 2, pp. 599–605, 1998.
- M. Knight-Maloney, R. A. Robergs, A. Gibson, and F. Ghiasvand, “Threshold changes in blood lactate, beat-to-cardiovascular function, and breath-by-breath VO2 during incremental exercise,” Journal of Exercise Physiology Online, vol. 5, no. 3, pp. 39–53, 2002.
- V. Bunc, P. Hofmann, H. Leitner, and G. Gaisl, “Verification of the heart rate threshold,” European Journal of Applied Physiology and Occupational Physiology, vol. 70, no. 3, pp. 263–269, 1995.
- F. Conconi, M. Ferrari, P. G. Ziglio, P. Droghetti, and L. Codeca, “Determination of the anaerobic threshold by a noninvasive field test in runners,” Journal of Applied Physiology Respiratory Environmental and Exercise Physiology, vol. 52, no. 4, pp. 869–873, 1982.
- P. Hofmann, R. Pokan, K. Preidler et al., “Relationship between heart rate threshold, lactate turn point and myocardial function,” International Journal of Sports Medicine, vol. 15, no. 5, pp. 232–237, 1994.
- P. Hofmann, R. Pokan, S. P. Von Duvillard, F. J. Seibert, R. Zweiker, and P. Schmid, “Heart rate performance curve during incremental cycle ergometer exercise in healthy young male subjects,” Medicine and Science in Sports and Exercise, vol. 29, no. 6, pp. 762–768, 1997.
- A. M. Jones and J. H. Doust, “Lack of reliability in Conconi's heart rate deflection point,” International Journal of Sports Medicine, vol. 16, no. 8, pp. 541–544, 1995.
- D. G. Allen and J. C. Kentish, “The cellular basis of the length-tension relation in cardiac muscle,” Journal of Molecular and Cellular Cardiology, vol. 17, no. 9, pp. 821–840, 1985.
- P. O. Astrand, T. E. Cuddy, B. Saltin, and J. Stenberg, “Cardiac output during submaximal and maximal work,” Journal of Applied Physiology, vol. 19, pp. 268–274, 1964.
- S. Bevegard, A. Holmgren, and B. Jonsson, “The effect of body position on the circulation at rest and during exercise, with special reference to the influence on the stroke volume,” Acta Physiologica Scandinavica, vol. 49, pp. 279–298, 1960.
- G. Grimby, N. J. Nilsson, and B. Saltin, “Cardiac output during submaximal and maximal exercise in active middle-aged athletes,” Journal of Applied Physiology, vol. 21, no. 4, pp. 1150–1156, 1966.
- D. E. R. Warburton, M. J. Haykowsky, H. A. Quinney, D. Blackmore, K. K. Teo, and D. P. Humen, “Myocardial response to incremental exercise in endurance-trained athletes: influence of heart rate, contractility and the Frank-Starling effect,” Experimental Physiology, vol. 87, no. 5, pp. 613–622, 2002.
- C. A. Vella and R. A. Robergs, “A review of the stroke volume response to upright exercise in healthy subjects,” British Journal of Sports Medicine, vol. 39, no. 4, pp. 190–195, 2005.
- C. G. Wiebe, N. Gledhill, V. K. Jamnik, and S. Ferguson, “Exercise cardiac function in young through elderly endurance trained women,” Medicine and Science in Sports and Exercise, vol. 31, no. 5, pp. 684–691, 1999.
- A. M. Rivera, A. E. Pels III, S. P. Sady, M. A. Sady, E. M. Cullinane, and P. D. Thompson, “Physiological factors associated with the lower maximal oxygen consumption of master runners,” Journal of Applied Physiology, vol. 66, no. 2, pp. 949–954, 1989.
- P. F. McLaren, Y. Nurhayati, and S. H. Boutcher, “Stroke volume response to cycle ergometry in trained and untrained older men,” European Journal of Applied Physiology and Occupational Physiology, vol. 75, no. 6, pp. 537–542, 1997.
- T. Ogawa, R. J. Spina, W. H. Martin III et al., “Effects of aging, sex, and physical training on cardiovascular responses to exercise,” Circulation, vol. 86, no. 2, pp. 494–503, 1992.
- S. D. McCole, M. D. Brown, G. E. Moore, J. M. Zmuda, J. D. Cwynar, and J. M. Hagberg, “Cardiovascular hemodynamics with increasing exercise intensities in postmenopausal women,” Journal of Applied Physiology, vol. 87, no. 6, pp. 2334–2340, 1999.
- N. Gledhill, D. Cox, and R. Jamnik, “Endurance athletes' stroke volume does not plateau: major advantage is diastolic function,” Medicine and Science in Sports and Exercise, vol. 26, no. 9, pp. 1116–1121, 1994.
- B. Zhou, R. K. Conlee, R. Jensen, G. W. Fellingham, J. D. George, and A. G. Fisher, “Stroke volume does not plateau during graded exercise in elite male distance runners,” Medicine and Science in Sports and Exercise, vol. 33, no. 11, pp. 1849–1854, 2001.
- M. H. Crawford, M. A. Petru, and C. Rabinowitz, “Effect of isotonic exercise training on left ventricular volume during upright exercise,” Circulation, vol. 72, no. 6, pp. 1237–1243, 1985.
- S. Ferguson, N. Gledhill, V. K. Jamnik, C. Wiebe, and N. Payne, “Cardiac performance in endurance-trained and moderately active young women,” Medicine and Science in Sports and Exercise, vol. 33, no. 7, pp. 1114–1119, 2001.
- J. H. P. Vanfraechem, “Stroke volume and systolic time interval adjustments during bicycle exercise,” Journal of Applied Physiology Respiratory Environmental and Exercise Physiology, vol. 46, no. 3, pp. 588–592, 1979.
- R. J. Spina, T. Ogawa, W. M. Kohrt, W. H. Martin III, J. O. Holloszy, and A. A. Ehsani, “Differences in cardiovascular adaptations to endurance exercise training between older men and women,” Journal of Applied Physiology, vol. 75, no. 2, pp. 849–855, 1993.
- C. B. Chapman, J. N. Fisher, and B. J. Sproule, “Behavior of stroke volume at rest and during exercise in human beings,” The Journal of Clinical Investigation, vol. 39, pp. 1208–1213, 1960.
- B. Krip, N. Gledhill, V. Jamnik, and D. Warburton, “Effect of alterations in blood volume on cardiac function during maximal exercise,” Medicine and Science in Sports and Exercise, vol. 29, no. 11, pp. 1469–1476, 1997.
- M. Martino, N. Gledhill, and V. Jamnik, “High VO2max with no history of training is primarily due to high blood volume,” Medicine and Science in Sports and Exercise, vol. 34, no. 6, pp. 966–971, 2002.
- S. C. De Cort, J. A. Innes, T. J. Barstow, and A. Guz, “Cardiac output, oxygen consumption and arteriovenous oxygen difference following a sudden rise in exercise level in humans,” The Journal of Physiology, vol. 441, pp. 501–512, 1991.
- K. Wasserman, B. J. Whipp, and J. A. Davis, “Respiratory physiology of exercise: metabolism, gas exchange, and ventilatory control,” International Review of Physiology, vol. 23, pp. 149–211, 1981.
- B. J. Whipp, J. A. Davis, F. Torres, and K. Wasserman, “A test to determine parameters of aerobic function during exercise,” Journal of Applied Physiology Respiratory Environmental and Exercise Physiology, vol. 50, no. 1, pp. 217–221, 1981.
- R. Casaburi, J. Daly, J. E. Hansen, and R. M. Effros, “Abrupt changes in mixed venous blood gas composition after the onset of exercise,” Journal of Applied Physiology, vol. 67, no. 3, pp. 1106–1112, 1989.
- C. G. Blomqvist and B. Saltin, “Cardiovascular adaptations to physical training,” Annual Review of Physiology, vol. 45, pp. 169–189, 1983.
- B. Ekblom, P. O. Astrand, B. Saltin, J. Stenberg, and B. Wallström, “Effect of training on circulatory response to exercise,” Journal of Applied Physiology, vol. 24, no. 4, pp. 518–528, 1968.
- D. Montero, C. Diaz-Canestro, and C. Lundby, “Endurance training and VO2max,” Medicine & Science in Sports & Exercise, vol. 47, pp. 2024–2033, 2015.
- D. Montero and C. Diaz-Canestro, “Endurance training and maximal oxygen consumption with ageing: role of maximal cardiac output and oxygen extraction,” European Journal of Preventive Cardiology, vol. 23, no. 7, pp. 733–743, 2016.
- H. L. Taylor, E. Buskirk, and A. Henschel, “Maximal oxygen intake as an objective measure of cardio-respiratory performance,” Journal of Applied Physiology, vol. 8, pp. 73–80, 1955.
- J. Myers, N. Buchanan, D. Walsh et al., “Comparison of the ramp versus standard exercise protocols,” Journal of the American College of Cardiology, vol. 17, no. 6, pp. 1334–1342, 1991.
- K. M. Muscat, H. G. Kotrach, C. A. Wilkinson-Maitland, M. R. Schaeffer, C. T. Mendonca, and D. Jensen, “Physiological and perceptual responses to incremental exercise testing in healthy men: effect of exercise test modality,” Applied Physiology, Nutrition and Metabolism, vol. 40, no. 11, pp. 1199–1209, 2015.
- L. Hermansen, B. Ekblom, and B. Saltin, “Cardiac output during submaximal and maximal treadmill and bicycle exercise,” Journal of Applied Physiology, vol. 29, no. 1, pp. 82–86, 1970.
- I. Jacobs and B. Sjödin, “Relationship of ergometer-specific VO2 max and muscle enzymes to blood lactate during submaximal exercise,” British Journal of Sports Medicine, vol. 19, no. 2, pp. 77–80, 1985.
- K. Okita, H. Nishijima, K. Yonezawa et al., “Skeletal muscle metabolism in maximal bicycle and treadmill exercise distinguished by using in vivo metabolic freeze method and phosphorus-31 magnetic resonance spectroscopy in normal men,” American Journal of Cardiology, vol. 81, no. 1, pp. 106–109, 1998.
- J. Porszasz, R. Casaburi, A. Somfay, L. J. Woodhouse, and B. J. Whipp, “A treadmill ramp protocol using simultaneous changes in speed and grade,” Medicine & Science in Sports & Exercise, vol. 35, no. 9, pp. 1596–1603, 2003.
- D. A. Tanner, J. W. Duke, and J. M. Stager, “Ventilatory patterns differ between maximal running and cycling,” Respiratory Physiology and Neurobiology, vol. 191, no. 1, pp. 9–16, 2014.
- M. L. Pollock, R. L. Bohannon, K. H. Cooper et al., “A comparative analysis of four protocols for maximal treadmill stress testing,” American Heart Journal, vol. 92, no. 1, pp. 39–46, 1976.
- B. Balke and R. W. Ware, “An experimental study of physical fitness of Air Force personnel,” United States Armed Forces Medical Journal, vol. 10, no. 6, pp. 675–688, 1959.
- R. A. Bruce, J. R. Blackmon, J. W. Jones, and G. Strait, “Exercising testing in adult normal subjects and cardiac patients,” Pediatrics, vol. 32, pp. 742–756, 1963.
- M. H. Ellestad, W. Allen, M. C. Wan, and G. L. Kemp, “Maximal treadmill stress testing for cardiovascular evaluation,” Circulation, vol. 39, no. 4, pp. 517–522, 1969.
- N. C. Dey, A. Samanta, and R. Saha, “The pulse rate and energy expenditure profile of underground coal miners in India,” Mining Technology, vol. 113, no. 3, pp. 137–141, 2004.
- J. A. Davis, B. J. Whipp, N. Lamarra, D. J. Huntsman, M. H. Frank, and K. Wasserman, “Effect of ramp slope on determination of aerobic parameters from the ramp exercise test,” Medicine & Science in Sports & Exercise, vol. 14, no. 5, pp. 339–343, 1982.
- M. J. Buchfuhrer, J. E. Hansen, T. E. Robinson, D. Y. Sue, K. Wasserman, and B. J. Whipp, “Optimizing the exercise protocol for cardiopulmonary assessment,” Journal of Applied Physiology Respiratory Environmental and Exercise Physiology, vol. 55, no. 5, pp. 1558–1564, 1983.
- Y.-Y. Zhang, M. C. Johnson, N. Chow, and K. Wasserman, “Effect of exercise testing protocol on parameters of aerobic function,” Medicine & Science in Sports & Exercise, vol. 23, no. 5, pp. 625–630, 1991.
- B. Yoon, L. Kravitz, and R. Robergs, “VO2max, protocol duration, and the VO2 plateau,” Medicine & Science in Sports & Exercise, vol. 39, no. 7, pp. 1186–1192, 2007.
- T. A. Astorino, J. C. Rietschel, P. A. Tam et al., “Reinvestigation of optimal duration of VO2max testing,” Journal of Exercise Physiology Online, vol. 7, no. 6, pp. 1–8, 2004.
- P. M. Lepretre, J. P. Koralsztein, and V. L. Billat, “Effect of exercise intensity on relationship between and cardiac output,” Medicine & Science in Sports & Exercise, vol. 36, no. 8, pp. 1357–1363, 2004.
- S. D. McCole, A. M. Davis, and P. T. Fueger, “Is there a disassociation of maximal oxygen consumption and maximal cardiac output?” Medicine & Science in Sports & Exercise, vol. 33, no. 8, pp. 1265–1269, 2001.
- R. G. Eston and M. Thompson, “Use of ratings of perceived exertion for predicting maximal work rate and prescribing exercise intensity in patients taking atenolol,” British Journal of Sports Medicine, vol. 31, no. 2, pp. 114–119, 1997.
- R. G. Eston, K. L. Lamb, G. Parfitt, and N. King, “The validity of predicting maximal oxygen uptake from a perceptually-regulated graded exercise test,” European Journal of Applied Physiology, vol. 94, no. 3, pp. 221–227, 2005.
- A. R. Mauger and N. Sculthorpe, “A new VO2max protocol allowing self-pacing in maximal incremental exercise,” British Journal of Sports Medicine, vol. 46, no. 1, pp. 59–63, 2012.
- M. L. Pollock, C. Foster, D. Schmidt, C. Hellman, A. C. Linnerud, and A. Ward, “Comparative analysis of physiologic responses to three different maximal graded exercise test protocols in healthy women,” American Heart Journal, vol. 103, no. 3, pp. 363–373, 1982.
- F. C. Hagerman, “Applied physiology of rowing,” Sports Medicine, vol. 1, no. 4, pp. 303–326, 1984.
- C. Foster, M. A. Green, A. C. Snyder, and N. N. Thompson, “Physiological responses during simulated competition,” Medicine & Science in Sports & Exercise, vol. 25, no. 7, pp. 877–882, 1993.
- G. A. V. Borg, “Psychophysical bases of perceived exertion,” Medicine and Science in Sports and Exercise, vol. 14, no. 5, pp. 377–381, 1982.
- R. G. Eston, J. A. Faulkner, E. A. Mason, and G. Parfitt, “The validity of predicting maximal oxygen uptake from perceptually regulated graded exercise tests of different durations,” European Journal of Applied Physiology, vol. 97, no. 5, pp. 535–541, 2006.
- M. Morris, K. Lamb, D. Cotterrell, and J. Buckley, “Predicting maximal oxygen uptake via a Perceptually Regulated Exercise Test (PRET),” Journal of Exercise Science and Fitness, vol. 7, no. 2, pp. 122–128, 2009.
- J. B. Coquart, M. Garcin, G. Parfitt, C. Tourny-Chollet, and R. G. Eston, “Prediction of maximal or peak oxygen uptake from ratings of perceived exertion,” Sports Medicine, vol. 44, no. 5, pp. 563–578, 2014.
- N. J. Hanson, C. M. Scheadler, T. L. Lee, N. C. Neuenfeldt, T. J. Michael, and M. G. Miller, “Modality determines achieved in self-paced exercise tests: validation with the Bruce protocol,” European Journal of Applied Physiology, vol. 116, no. 7, pp. 1313–1319, 2016.
- W. Chidnok, F. J. DiMenna, S. J. Bailey et al., “VO2max is not altered by self-pacing during incremental exercise,” European Journal of Applied Physiology, vol. 113, no. 2, pp. 529–539, 2013.
- R. Eston, “Use of ratings of perceived exertion in sports,” International Journal of Sports Physiology and Performance, vol. 7, no. 2, pp. 175–182, 2012.
- A. R. Mauger, A. J. Metcalfe, L. Taylor, and P. C. Castle, “The efficacy of the self-paced O2max test to measure maximal oxygen uptake in treadmill running,” Applied Physiology, Nutrition, and Metabolism, vol. 38, no. 12, pp. 1211–1216, 2013.
- R. G. Eston, A. Crockett, and A. M. Jones, “Discussion of ‘the efficacy of the self-paced VO2 max test to measure maximal oxygen uptake in treadmill running’,” Applied Physiology, Nutrition and Metabolism, vol. 39, no. 5, pp. 581–582, 2014.
- D. C. Poole, “Discussion: ‘the efficacy of the self-paced test to measure maximal oxygen uptake in treadmill running’,” Applied Physiology Nutrition and Metabolism, vol. 39, no. 5, pp. 581–582, 2014.
- C. M. Scheadler and S. T. Devor, “VO2max measured with a self-selected work rate protocol on an automated treadmill,” Medicine & Science in Sports & Exercise, vol. 47, no. 10, pp. 2158–2165, 2015.
- T. A. Astorino, D. W. McMillan, R. M. Edmunds, and E. Sanchez, “Increased cardiac output elicits higher VO2max in response to self-paced exercise,” Applied Physiology, Nutrition and Metabolism, vol. 40, no. 3, pp. 223–229, 2015.
- P. F. Sperlich, H.-C. Holmberg, J. L. Reed, C. Zinner, J. Mester, and B. Sperlich, “Individual versus standardized running protocols in the determination of VO2max,” Journal of Sports Science and Medicine, vol. 14, no. 2, pp. 386–393, 2015.
- J. Faulkner, A. R. Mauger, B. Woolley, and D. Lambrick, “The efficacy of a self-paced VO2max test during motorized treadmill exercise,” International Journal of Sports Physiology and Performance, vol. 10, no. 1, pp. 99–105, 2015.
- J. S. Hogg, J. G. Hopker, and A. R. Mauger, “The self-paced VO2max test to assess maximal oxygen uptake in highly trained runners,” International Journal of Sports Physiology and Performance, vol. 10, no. 2, pp. 172–177, 2015.
- A. M. Straub, A. W. Midgley, G. S. Zavorsky, and A. R. Hillman, “Ramp-incremented and RPE-clamped test protocols elicit similar values in trained cyclists,” European Journal of Applied Physiology, vol. 114, no. 8, pp. 1581–1590, 2014.
- J. Faulkner, G. Parfitt, and R. Eston, “Prediction of maximal oxygen uptake from the ratings of perceived exertion and heart rate during a perceptually-regulated sub-maximal exercise test in active and sedentary participants,” European Journal of Applied Physiology, vol. 101, no. 3, pp. 397–407, 2007.
- E. T. Howley, D. R. Bassett, and H. G. Welch, “Criteria for maximal oxygen uptake: review and commentary,” Medicine and Science in Sports and Exercise, vol. 27, no. 9, pp. 1292–1301, 1995.
- A. W. Midgley, L. R. McNaughton, R. Polman, and D. Marchant, “Criteria for determination of maximal oxygen uptake: a brief critique and recommendations for future research,” Sports Medicine, vol. 37, no. 12, pp. 1019–1028, 2007.
- A. W. Midgley, S. Carroll, D. Marchant, L. R. McNaughton, and J. Siegler, “Evaluation of true maximal oxygen uptake based on a novel set of standardized criteria,” Applied Physiology, Nutrition and Metabolism, vol. 34, no. 2, pp. 115–123, 2009.
- R. A. Robergs, “An exercise physiologist's ‘contemporary’ interpretations of the ‘ugly and creaking edifices’ of the VO2max concept,” Journal of Exercise Physiology Online, vol. 4, no. 1, pp. 1–44, 2001.
- T. A. Astorino, R. A. Robergs, F. Ghiasvand, D. Marks, and S. Burns, “Incidence of the oxygen plateau at VO2max during exercise testing to volitional fatigue,” Journal of Exercise Physiology Online, vol. 3, no. 4, pp. 1–12, 2000.
- D. R. Bassett Jr. and E. T. Howley, “Limiting factors for maximum oxygen uptake and determinants of endurance performance,” Medicine & Science in Sports & Exercise, vol. 32, no. 1, pp. 70–84, 2000.
- G. R. Cumming and W. Friesen, “Bicycle ergometer measurement of maximal oxygen uptake in children,” Canadian Journal of Physiology and Pharmacology, vol. 45, no. 6, pp. 937–946, 1967.
- T. A. Astorino, J. Willey, J. Kinnahan, S. M. Larsson, H. Welch, and L. C. Dalleck, “Elucidating determinants of the plateau in oxygen consumption at VO2max,” British Journal of Sports Medicine, vol. 39, no. 9, pp. 655–660, 2005.
- D. Gordon, M. Mehter, M. Gernigon, O. Caddy, D. Keiller, and R. Barnes, “The effects of exercise modality on the incidence of plateau at VO2max,” Clinical Physiology and Functional Imaging, vol. 32, pp. 394–399, 2012.
- T. A. Astorino, “Alterations in VO2max and the VO2 plateau with manipulation of sampling interval,” Clinical Physiology and Functional Imaging, vol. 29, no. 1, pp. 60–67, 2009.
- S. M. Fox, J. P. Naughton, and W. L. Haskell, “Physical activity and the prevention of coronary heart disease,” Annals of Clinical Research, vol. 3, no. 6, pp. 404–432, 1971.
- R. A. Robergs and R. Landwehr, “The surprising history of the “HRmax=220-age” equation,” Journal of Exercise Physiology Online, vol. 5, no. 2, pp. 1–10, 2002.
- H. Tanaka, K. D. Monahan, and D. R. Seals, “Age-predicted maximal heart rate revisited,” Journal of the American College of Cardiology, vol. 37, no. 1, pp. 153–156, 2001.
- B. Issekutz and K. Rodahl, “Respiratory quotient during exercise,” Journal of Applied Physiology, vol. 16, pp. 606–610, 1961.
- B. Issekutz, N. C. Birkhead, and K. Rodahl, “Use of respiratory quotients in assessment of aerobic work capacity,” Journal of Applied Physiology, vol. 17, no. 1, pp. 47–50, 1962.
- G. A. Brooks, “Anaerobic threshold: review of the concept and directions for future research,” Medicine and Science in Sports and Exercise, vol. 17, no. 1, pp. 22–31, 1985.
- P. J. Maud and C. Foster, Physiological Assessment of Human Fitness, Human Kinetics, 2006.
- P. O. Astrand, Experimental Studies of Physical Working Capacity in Ventilation to Sex and Age, Einar Munkagaard, Copenhagen, Denmark, 1952.
- E. Edvardsen, E. Hem, and S. A. Anderssen, “End criteria for reaching maximal oxygen uptake must be strict and adjusted to sex and age: a cross-sectional study,” PLoS ONE, vol. 9, no. 1, Article ID e85276, 2014.
- B. E. Ainsworth, R. G. McMurray, and S. K. Veazey, “Prediction of peak oxygen uptake from submaximal exercise tests in older men and women,” Journal of Aging and Physical Activity, vol. 5, no. 1, pp. 27–38, 1997.
- M. J. Chen, X. Fan, and S. T. Moe, “Criterion-related validity of the Borg ratings of perceived exertion scale in healthy individuals: a meta-analysis,” Journal of Sports Sciences, vol. 20, no. 11, pp. 873–899, 2002.
- R. G. Eston, “Perceived exertion: recent advances and novel applications in children and adults,” Journal of Exercise Science and Fitness, vol. 7, pp. S11–S17, 2009.
- A. R. Hoogeveen, C. Schep, and J. Hoogsteen, “The ventilatory threshold, heart rate, and endurance performance: relationships in elite cyclists,” International Journal of Sports Medicine, vol. 20, no. 2, pp. 114–117, 1999.
- D. V. B. James and J. H. Doust, “Oxygen uptake during moderate intensity running: response following a single bout of interval training,” European Journal of Applied Physiology and Occupational Physiology, vol. 77, no. 6, pp. 551–555, 1998.
- T. D. Noakes, “Testing for maximum oxygen consumption has produced a brainless model of human exercise performance,” British Journal of Sports Medicine, vol. 42, no. 7, pp. 551–555, 2008.
- R. E. Magnan, B. M. Kwan, J. T. Ciccolo, B. Gurney, C. M. Mermier, and A. D. Bryan, “Aerobic capacity testing with inactive individuals: the role of subjective experience,” Journal of Physical Activity and Health, vol. 10, no. 2, pp. 271–279, 2013.
- C. Foster, E. Kuffel, N. Bradley et al., “VO2max during successive maximal efforts,” European Journal of Applied Physiology, vol. 102, no. 1, pp. 67–72, 2007.
- M. N. Hawkins, P. B. Raven, P. G. Snell, J. Stray-Gundersen, and B. D. Levine, “Maximal oxygen uptake as a parametric measure of cardiorespiratory capacity,” Medicine & Science in Sports & Exercise, vol. 39, pp. 103–107, 2007.
- C. M. Mier, R. P. Alexander, and A. L. Mageean, “Achievement of VO2max criteria during a continuous graded exercise test and a verification stage performed by college athletes,” Journal of Strength and Conditioning Research, vol. 26, no. 10, pp. 2648–2654, 2012.
- P. B. Nolan, M. L. Beaven, and L. Dalleck, “Comparison of intensities and rest periods for VO2max verification testing procedures,” International Journal of Sports Medicine, vol. 35, no. 12, pp. 1024–1029, 2014.
- D. C. Poole, D. P. Wilkerson, and A. M. Jones, “Validity of criteria for establishing maximal O2 uptake during ramp exercise tests,” European Journal of Applied Physiology, vol. 102, no. 4, pp. 403–410, 2008.
- J. S. Thoden, J. D. MacDougall, and B. A. Wilson, Testing Aerobic Power, Mouvement Publications Inc, 1982.
- J. S. Thoden, Testing Aerobic Power, Human Kinetics, 1991.
- K. NiemelÄ, I. Palatsi, M. Linnaluoto, and J. Takkunen, “Criteria for maximum oxygen uptake in progressive bicycle tests,” European Journal of Applied Physiology and Occupational Physiology, vol. 44, no. 1, pp. 51–59, 1980.
- C. H. Wyndham, “Submaximal tests for estimating maximum oxygen intake,” Canadian Medical Association Journal, vol. 96, no. 12, pp. 736–745, 1967.
- L. C. Dalleck, T. A. Astorino, R. M. Erickson, C. M. McCarthy, A. A. Beadell, and B. H. Botten, “Suitability of verification testing to confirm attainment of VO2max in middle-aged and older adults,” Research in Sports Medicine, vol. 20, no. 2, pp. 118–128, 2012.
- A. W. Midgley, L. R. McNaughton, and S. Carroll, “Verification phase as a useful tool in the determination of the maximal oxygen uptake of distance runners,” Applied Physiology, Nutrition and Metabolism, vol. 31, no. 5, pp. 541–548, 2006.
- H. B. Rossiter, J. M. Kowalchuk, and B. J. Whipp, “A test to establish maximum O2 uptake despite no plateau in the O2 uptake response to ramp incremental exercise,” Journal of Applied Physiology, vol. 100, no. 3, pp. 764–770, 2006.
- R. M. Weatherwax, T. B. Richardson, N. M. Beltz, P. B. Nolan, and L. Dalleck, “Verification testing to confirm VO2max in altitude-residing, endurance-trained runners,” International Journal of Sports Medicine, vol. 37, no. 7, pp. 525–530, 2016.
- A. W. Midgley, L. R. McNaughton, and S. Carroll, “Time at V∙ during intermittent treadmill running: test protocol dependent or methodological artefact?” International Journal of Sports Medicine, vol. 28, no. 11, pp. 934–939, 2007.
- American Thoracic Society and American College of Chest Physicians, “ATS/ACCP statement on cardiopulmonary exercise testing,” American Journal of Respiratory and Critical Care Medicine, vol. 167, no. 2, pp. 211–277, 2003.
- C. Babineau, L. Léger, A. Long, and L. Bosquet, “Variability of maximum oxygen consumption measurement in various metabolic systems,” Journal of Strength and Conditioning Research, vol. 13, no. 4, pp. 318–324, 1999.
- G. J. Balady, R. Arena, K. Sietsema et al., “Clinician's guide to cardiopulmonary exercise testing in adults: a scientific statement from the American heart association,” Circulation, vol. 122, no. 2, pp. 191–225, 2010.
- E. Yule, L. A. Kaminsky, D. A. Sedlock, B. A. King, and M. H. Whaley, “Inter-laboratory reliability of VO2max and submaximal measurements,” Medicine & Science in Sports & Exercise, vol. 28, no. 5, p. S15, 1996.
- R. R. Vickers, Measurement Error in Maximal Oxygen Uptake Tests, Naval Health Research Center, 2003.
- V. L. Katch, S. S. Sady, and P. Freedson, “Biological variability in maximum aerobic power,” Medicine and Science in Sports and Exercise, vol. 14, no. 1, pp. 21–25, 1982.
- A. W. Midgley and S. Carroll, “Emergence of the verification phase procedure for confirming ‘true’ VO2max,” Scandinavian Journal of Medicine and Science in Sports, vol. 19, no. 3, pp. 313–322, 2009.
- T. N. Mann, R. P. Lamberts, and M. I. Lambert, “High responders and low responders: factors associated with individual variation in response to standardized training,” Sports Medicine, vol. 44, no. 8, pp. 1113–1124, 2014.
Copyright © 2016 Nicholas M. Beltz et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.