
ISSN: 2638-6003
Helen Razmjou1,2,3*, Monique Christakis4,5, Tim Dwyer6,7, Varda van Osnabrugge1, Patrick Henry6,8, Diane Nam6,8 and Richard Holtby 6,8
Received: December 13, 2018 Published: January 16, 2019
Corresponding author: Helen Razmjou, Holland Orthopedic and Arthritic Centre, 43 Wellesley Street East, Toronto, Ontario, Canada
DOI: 10.32474/OSMOAJ.2019.02.000141
Purpose: There is minimal information on predictive value of strength-related clinical tests in detecting rotator cuff (RC) tear size and tendon reparability of large and massive tears. The purpose of this diagnostic study was to examine the validity of four strength-related clinical sign/tests in relation to RC tear size and reparability.
Methods: This was a prospective blinded study of consecutive patients with a full thickness RC tear who underwent a repair. The magnetic resonance imaging (MRI) and arthroscopic surgery were used as the gold standards.
Results: Eighty-five patients, 50 males (59%), age 65, SD=10 completed the study. There were 60 (71%) minor tears (small/ moderate) and 25 (29%) major tears (large/massive) with 70 (82%) patients achieving a full repair. The Jobe test had a sensitivity of 93% and 88% and a negative likelihood ratio (LR) of 0.16 and 0.27 for tendon reparability and tear size respectively. The dropping sign, hornblower sign and lift-off test had poor sensitivity (<60%) and high specificity (>98%) values with large positive LRs for tear size detection and tendon reparability. The validity indices in relation to MRI findings were similar to surgical findings.
Conclusion: A negative Jobe test accurately ruled out the presence of a major tear, significant supraspinatus fatty infiltration and a need for partial repair. The dropping and hornblower signs and lift off test were highly specific and when positive, they confirmed the presence of a major tear, fatty infiltration in the corresponding muscle and difficulty achieving a full repair.
Keywords:Accuracy; Shoulder, Rotator Cuff Tear; Tendon Reparability; Validity
Level of Evidence:Level I, Prospective diagnostic study
Rotator cuff (RC) pathology is a significant cause of visits to general practitioners [1]. The direct cost of imaging, conservative treatment, surgical repair [2,3] and indirect costs of impact on employment, income loss, missed workdays, and disability payments [4-6] have made this pathology an expensive musculoskeletal condition. Many of these costs could be avoided, in particular imaging costs, through early clinical detection and appropriate early management. This highlights the important role of the clinical examination in achieving the initial diagnosis. Unfortunately, difference in testing positions, diverse criteria used for positive test results and variability in reference standards impact the interpretation of the clinical tests’ measurement properties.
While pain provocation tests have shown poor performance characteristics in confirming pathology due to low specificity, clinical tests or signs that are based on weakness and represent the integrity of specific muscles tend to have better specificity [7-12]. The most commonly used strength-related clinical examination tests or signs for pathology in supraspinatus, infraspinatus, teres minor and subscapularis muscles are the Jobe test [13], dropping sign [14], hornblower sign [15] and the lift-off test [16] respectively.
The majority of previous studies related to these clinical tests have examined their ability to detect the presence of a RC tear with minimal research existing on tear size detection [17-19]. To date, we are not aware of studies that have examined the value of these clinical tests in relation to tendon reparability which is affected by tear size, fatty infiltration and tendon quality. The initial concept of partial versus full repair was first introduced by Burkhart and colleagues approximately two decades ago [20-22]. The goal of the partial repair in patients with large/massive tears is to bring back the torn tendon to the tuberosities without excessive tension and to restore the humeral head force couple and fulcrum and improve the overall shoulder kinematics [20-22].
The increasing body of literature on large and massive tears and partial repairs in recent years [21,24-26] warrants examining strength-related clinical tests in relation to tear size and tendon reparability. Establishing the relationship between clinical examination and these factors will expedite care pathways, reduce unnecessary health care visits and improve the clinical decision-making process. The primary objective of this study was therefore to examine the validity of four clinical examination sign/tests in estimating rotator cuff tear size and determining tendon reparability. The secondary objective was to examine the value of these clinical tests in relation to pathological changes in the corresponding muscles (associated tear, atrophy and fatty infiltration). The Magnetic Resonance Imaging (MRI) and arthroscopic surgery were used as the gold standards.
This prospective blinded diagnostic study was conducted at a tertiary shoulder center where consecutive surgical candidates for rotator cuff repair were examined. Inclusion criteria included pain and functional disability for more than 6 months which had failed non-operative treatment and presence of a full-thickness rotator cuff tear diagnosed on MRI and later confirmed by surgery. Exclusion criteria included previous shoulder surgery on the affected side, presence of an active work-related shoulder injury, infection, avascular necrosis or frozen shoulder. Informed consent was obtained from all individual participants included in the study.
The clinical examination was conducted 2-3 weeks prior to surgery by a physical therapist to maintain the blindness of the orthopedic surgeon and independence of the surgical findings. The Jobe test was based on weakness using the Medical Research Council (MRC) Scale for manual muscle strength testing and was considered positive if the strength of elevation in scapular plane was less than 5 on a 5-point scale. The MRC has established reliability and validity in asymptomatic subjects and patients with musculoskeletal conditions [27]. Other tests/signs were based on observing a faulty posture (hornblower sign) or inability to initiate (lift-off test) or maintain a position (dropping sign) without examiner’s interpretation of the degree of weakness. The details on testing positions and criteria for positive and negative clinical findings are presented in Figures 1-4.
Figure 1: Operational definition of the Jobe Test.
This test was conducted at 90° of the scapular plane elevations with the thumb down. The outcomes were documented as negative when the patient reported no pain or pain without weakness and positive when weakness was detected (<5/5 manual muscle testing) with or without pain.
Figure 2: Operational definition of the Dropping sign.
The patient was asked to push against the examiner’s hand while maintaining the elbow at 90° and shoulder at 45° of external rotation position. The outcomes were documented as positive when the forearm was dropped back to neutral position.
Figure 3: Operational definition of the Hornblower sign.
This sign involved observing the patient while bringing both hands to the mouth. The outcomes were documented as negative when the patient was able to externally rotate the arm in abduction and positive when patient was not able to reach the mouth without abducting the affected arm.
Figure 4: Operational definition of Lift off Test.
The test was conducted with the patient in standing. The ability to internally rotate to lumbar spine (waist line) was first examined. While placing the dorsum of the hand against mid-lumbar spine, the patient lifted the affected side hand away from the back. The outcomes were documented as negative when the hand was lifted away from the back. An inability to perform this task was considered a positive lift-off test.
Tear size and tendon reparability were based on surgical findings. The level of fatty infiltration in rotator cuff muscles and the number of tears extending to infraspinatus or subscapularis were based on MRI imaging findings.
Surgical procedures were conducted by a senior surgeon with subspecialty in shoulder reconstruction and over 30 years of experience. With the patient in the lateral decubitus position, arthroscopic examination of the glenohumeral joint and subacromial regions were completed. The rotator cuff was repaired using marginal convergence or side to side techniques with lateral suture anchors.
The overall tear size was measured arthroscopically using a calibrated probe and classified as small (<1cm), medium (1-3cm), large (3-5cm) and massive (>5cm) based on the largest dimension [28]. For the purpose of calculating sensitivity and specificity that require binominal variables, the small and medium tears were collapsed together as minor tears and large and massive tears were collapsed as major tears.
In terms of reparability, patients were classified into two categories of full and partial repair. Full repair was either an anatomical repair or a repair to the articular margin with less than 1cm residual defect. Partial repair referred to a residual defect of more than 1cm [24] and was done when a full repair was not feasible.
The majority of the MRI studies were performed internally on a 1.5-T system (General Electric Medical Systems, Milwaukee, Wis.) using a 15 platform and dedicated GE shoulder surface coil. However, all MRI images examined in this study were I.5T and the measurements were made on a PACS workstation using Agfa IMPAX software technology. All images were interpreted by a senior musculoskeletal trained radiologist with 21years of clinical experience. We examined the inter-examiner reliability of the MRI findings between the radiologist and an orthopedic surgeon (not involved in surgery of the participants) with shoulder subspecialty training and 10 years of clinical experience on a subsample of patients.
Presence of a full thickness tear of the supraspinatus tendon was examined on the sagittal T2 fat-suppressed images. Full thickness tears of the infraspinatus tendon were diagnosed by defining the infraspinatus muscle and musculotendinous junction on the sagittal images and following the tendon laterally to the attachment on the greater tuberosity. Teres minor and subscapularis tendons were evaluated on sagittal and axial proton-density fat-saturated images.
Fatty infiltration was documented for all muscles on the T1 sagittal image on the most lateral oblique image in which the spine is seen in contact with the scapular body as defined by Goutallier [29]. Accordingly, stage 0 corresponds to no fat, stage 1 corresponds to the muscle containing some fatty streaks, stage 2 referring to more muscle than fat, stage 3 corresponding to as much fat as muscle and stage 4 is fatter than muscle. The imaging results were specific to the muscle related to the clinical test as described in the original studies test [13-16]. For example, presence of fatty infiltration (stages 3-4 vs. stages 0-2) in supraspinatus muscle was examined against results of the Jobe test while presence of fatty infiltration in the infraspinatus muscle (stages 3-4 vs. stages 0-2) was examined in relation to the dropping and the hornblower signs as both tests are affected by the integrity of this muscle. The lift-off test was used to examine the presence of the subscapularis pathology.
The sample size calculation was based on the estimation of the positive likelihood ratio (LRs). To detect a LR+ of 2-5, considered the least acceptable LR, a minimum sample of 70 to 79 patients was considered necessary [30].
Descriptive statistics were provided for all relevant data. The Kappa coefficient and percentage of agreement examined interexaminer reliability on MRI findings. Strength of agreement was interpreted as suggested by Landis [31]. Surgical and imaging findings were recorded for true and false positive and negatives and 2x2 tables were constructed to calculate sensitivity (Se), specificity (Sp) and likelihood ratios (LRs). The LRs which are based on both sensitivity and specificity of the test were used to determine whether a test result changed the probability of having a condition. A test with a higher LR+ has a greater value of ruling in the disease while a test with a lower value of LR− has a better predictive value of ruling out the disease. Guidelines suggested by Jaeschke et al. [32] were used for interpretation of LRs and Fagan’s nomogram [33] was used to compute the approximate post-test probability based on the pre-test probability and LRs.
Ninety patients consented to participate in the study. Of these patients, two were excluded at the time of surgery due to having a partial thickness rotator cuff tear, and another three patients cancelled their surgery due to personal or other medical reasons. Hence, data of 85 patients with full-thickness tear of supraspinatus, 35 females (41%), 50 males (59%), age 65, SD=10 were used for analysis
CI: Confidence interval
PLR: Positive Likelihood Ratio,
NLR: Negative Likelihood Ratio
Exact CI were reported for sensitivity/specificity tables that had cell<5.
*The lift-off test was performed on 63 patients who had a full range of internal rotation.
N/A: Likelihood ratios could not be calculated when sensitivity or specificity were 100%
Range of LRs: LRs >10 or < 0.1 (large); LRs of 5-10 and 0.2-0.1 (moderate); LRs of 2-5 and 0.5-0.2 (small); LRs of 1-2 and 0.5-1 (insignificant).
Four patients had a small tear (<1cm.), 56 had a moderatesized tear (1-3cm.), 17 had a large tear (3 to 5cm) (7 partial and 10 full repair) and eight patients had a massive tear (>5cm). Therefore, there were 60 minor tears and 25 major tears. Seven of the large tears and all eight of the massive tears could not be fully repaired. Therefore, 15 patients (18%) had a partial repair and seventy (82%) patients had a full repair. There were 26(31%) and 15 (18%) associated full-thickness tears of the infraspinatus and subscapularis, respectively. The inter-examiner reliability was assessed between the radiologist and an orthopedic surgeon not involved in surgery. The Kappa values were 0.77(0.34-1.00) and 0.82(0.49-1.00) ranging from substantial to almost perfect agreement for fatty infiltration of supraspinatus and infraspinatus respectively. There was no tear or significant fatty infiltration in the teres minor. Table 1 shows the relationship between surgical findings (tear size and reparability) and each clinical sign/test. Table 2 shows the relationship between imaging findings (fatty infiltration, associated tears) and clinical signs/tests.1cm.),>
CI: Confidence interval
PLR: Positive Likelihood Ratio
NLR: Negative Likelihood Ratio
Range of LRs: LRs >10 or < 0.1 (large); LRs of 5-10 and 0.2-0.1 (moderate); LRs of 2-5 and 0.5-0.2 (small); LRs of 1-2 and 0.5-1 (insignificant)
Exact CI were reported for sensitivity/specificity tables that had cell<5.
*The lift-off test was performed on 63 patients who had a full range of internal rotation and due to lack of significant number of subscapularis fatty infiltrations, this component was not analyzed.
N/A: Likelihood ratios could not be calculated when sensitivity or specificity were 100%.
The sensitivity of the Jobe test [13] for detecting a major tear (large or massive) was relatively high at 88%, LR− =0.27). However, the test was not very specific for ruling in a major tear (Sp= 45%). This indicates that the Jobe test is helpful in ruling out a major tear when it is negative. In terms of reparability, only one false negative was seen leading to high sensitivity of 93% (LR− = 0.16) which indicates a high ability of a negative Jobe test in ruling out a need for partial repair. In terms of LRs, a negative Jobe test generates a moderate change (0.16) in pretest probability of having a partial repair. Using a nomogram proposed by Fagan [33], a negative likelihood ratio of 0.16 makes a significant change in the pre-test probability of 50%, lowering that to a probability of 14%. All patients with stage 3/4 infiltration had a positive Jobe test, producing a sensitivity of 100%.
The dropping sign [14] had low sensitivity for the overall tear size detection, reparability, associated infraspinatus tear and advanced fatty infiltration. However, the sign was highly specific indicating a low false positive rate for all the above findings (Tables 1 & 2). As an example, a LR+= of 19 for tear size detection changes the pre-test probability of 50% to approximately 96%, meaning that if the clinician gives a 50% chance of having a major tear in a patient and observes a positive dropping sign, the 50% chance increases to 96%, a very significant chance of there being a large/ massive tear.
A positive hornblower sign [15] was highly related to the overall tear size, reparability, infraspinatus associated tear and advanced fatty infiltration which shows that only a very small percentage of patients with a positive hornblower sign had a minor tear size (2%), achieved a full repair (1%) and had stage1/2 fatty infiltration (5%). A LR+= of 22 for presence of a major tear changes the pre-test probability of 50% to approximately 96%, meaning that observing a positive hornblower sign increases a moderate pre-test probability of 50% to high post-test probability of 96%. In summary, having a negative hornblower sign does not accurately rule out a major tear, but having a positive hornblower sign accurately predicts a major pathology.
The lift-off test [16] had a perfect specificity of 100% for both tear size detection and reparability and a high specificity of 98% for subscapularis full-thickness tear. The sensitivity was low for all surgical and imaging outcomes for all outcomes (e.g. tear size, reparability and subscapularis full-thickness tear, Tables 1&2). These findings indicate that this test is helpful in clinical decision making only when it is positive.
The primary objective of this study was to examine the ability of commonly used strength-related clinical tests in predicting reparability and tear size of the RC tendons. The concept of reparability has been gaining importance since its conception in early 1900’s as the surgeons continue to face challenges in managing large and massive tears in patients with high physical demands [21,23-26]. To our knowledge, despite the significant number of validity studies and reviews [11,12,17-19,34-36], the relationship between clinical tests and tear size and particularly reparability of the tendons has not been systematically examined. The significance of the present study is providing further evidence on clinicians’ ability to differentiate between minor and major tears. Costly imaging investigations that would not alter management can be avoided by simple clinical tests and signs and most importantly, achievable and realistic post-operative patient expectations can be facilitated in the presence of large and massive tears.
The results of the present study indicate that a negative Jobe test helps with a successful ruling out of a major tear that may not be fully repairable. However, when positive, this test does not guide clinical management due to its low specificity. The findings of the present study are consistent with higher sensitivity and poorer specificity of this test reported by other investigators being 86% and 50% by Leroux et al. [19] 84% and 58% by Hartel et al. [17] 87% and 43% by Ioti et al. [11] 88% and 40% by Holtby and Razmjou [18] 88% and 62% by Jain et al. [37], 89% and 55% by Yuen et al. [12], and 99% and 43% by Kim et al. [35]. The LR− of 0.17 for detecting major tears from minor tears found in our study is identical with another study [18], showing the consistency of the test in ruling out large and massive rotator cuff tears.
The Jobe test was initially described to assess the supraspinatus muscle in isolation in early 1980s [13]. However, there is minimal anatomical basis for this position to select out the supraspinatus muscle. It has been shown that in addition to the supraspinatus muscle, nine other muscles including the infraspinatus, upper subscapularis, trapezius, and serratus anterior are activated during this test [38]. In addition, the Jobe test is conducted with the shoulder internally rotated which would typically aggravate any inflamed structures located in the subacromial area between the greater tuberosity and coracoacromial arc (e.g. bursitis, tendonitis, osteophytes impingement, partial and full-thickness tears) causing pain and weakness regardless of severity of supraspinatus pathology which makes the Jobe test act more like a pain provocation test rather than a strength test. The limitation of the Jobe test in activating the supraspinatus muscle in isolation lowers the value of this test in guiding management in the presence of a minor RC pathology.
The dropping sign is highly specific in predicting a major tear, advanced fatty infiltration and associated infraspinatus tear and inability to achieve a full repair. The large positive likelihood ratios direct the clinicians’ pre-test probability in a significant way, assisting with the diagnosis and overall management. There is very limited published literature on this clinical sign [12,39]. In a study conducted by Walch et al. [39] in late 1998 [39], the CT arthrogram was used as the gold standard at over one year following a cuff tear. The investigators reported sensitivity and specificity of 100% for detecting stage 3 and 4 fatty infiltration. In their study, all 12 patients of the sample of 54 who had stage 3/4 fatty infiltration had a positive dropping sign and all patients (N=29) with stage1/2 fatty infiltration had a negative dropping test. We found a negative dropping sign in 63% (12/19) of our sample with stage 3/4 fatty infiltrations. The co-contraction of a healthy teres minor and a larger and more diverse sample may explain some of these inconsistencies. Generally, it is unlikely that a test would have a sensitivity and specificity of 100%. In such cases, distinguishing between a diseased and non-diseased is so obvious that application of a test is not necessary. More consistent with our results, Yuen et al. [12] reported a low sensitivity and high specificity for presence of individual tear (50% and 88% respectively) or combined tears (44% and 97%) following an acute anterior dislocation. High specificity of dropping test has been noted in a recent study that has examined the strength of external rotation in a similar position as the dropping sign (45 degrees of external rotation) [40]. The authors reported that weakness of infraspinatus muscle was more significant at 45 degrees of external rotation compared to the neutral position, confirming a high specificity of the dropping sign position.
The hornblower sign was also highly specific and when positive it was associated with larger tears, infraspinatus advanced fatty infiltration and tear and inability to achieve a full repair. There is limited published literature on the accuracy of the hornblower sign. In a retrospective study by Walch et al. [39], in which the CT arthrogram was used at over one year following a cuff tear, sensitivity and specificity values of 100% and 93% were reported for detecting stage 3 and 4 fatty infiltration or complete absence of teres minor. In their study, all patients with stage 3/4 fatty infiltration in the infraspinatus had a negative hornblower sign. The authors concluded that the hornblower test was specific to teres minor only and not affected by the infraspinatus pathology [39]. However, the infraspinatus involvement in shoulder elevation and external rotation has been confirmed in the literature [41- 44]. McMahon et al. [42] suggested that infraspinatus activity progressively increased from 0 to 90 degrees and reported that the highest level of infraspinatus muscle activity occurred during shoulder forward flexion at 90 degrees. In a study by Ha et al. [41] the infraspinatus was highly activated at external rotation with the shoulder flexed to 90 degrees. Similarly, the activity of the infraspinatus at 90 degrees of abduction has been noted by Reinold et al. [44]. Studies that have examined the pathophysiology of rotator cuff tendons have suggested that overhead shoulder injuries in young athletes are shown to affect the infraspinatus tendon more often than other rotator cuff tendons, highlighting the activity and strain of this tendon in the external rotation/abduction position [43]. More specifically, in a recent prospective study by Jain et al. [37], the sensitivity and specificity values of the hornblower test for infraspinatus tear were 17% and 96% which is consistent with our results.
The lift-off test was highly specific in predicting a major tear, subscapularis tear and inability to achieve a full repair. The subscapularis muscle is one of the internal rotators of the shoulder joint. The hand behind back is reported to be superior to internal rotation in neutral position. Using the data of electromyography, [45] moment arms [46] and physiologic cross-sectional areas [47] the contribution of the subscapularis muscle to internal rotation strength is calculated to be approximately 50% with the arm at the side. With the arm in full internal rotation (lift off position), the contribution of this muscle increases up to almost 90%.
The original study by Gerber and Krushell [16] used a sample of 162 with 16 patients having an isolated subscapularis tear. Twelve out of 16 patients had a true positive lift-off test. The other 4 patients did not complete the test. Consistent with our results, Yuen et al. [12] reported sensitivity and specificity values of 39% and 74% respectively in 49 patients following an acute anterior dislocation for presence of a full-thickness tear using ultrasonography as the gold standard. In a study by Itoi et al. [11] where the strength of the lift off test was graded, inability to lift the hand off the back against gravity had a sensitivity of 14% and specificity of 100% for subscapularis tear [11]. In a study by Leroux et al. [19] sensitivity of 0% and specificity of 61 % was reported. The authors used 55 patients of whom 27 patients had a full-thickness tear (4 small, 7 medium sized and 16 large tears). We could not verify the binominal variable of surgical criteria for the lift off test in that study and it is not clear whether the authors examined the lift-off test against presence or absence of a full-thickness tear or against different tear sizes. Therefore, we cannot comment on their sensitivity of 0%. Similarly, Hertel et al reported a sensitivity of 62% and specificity of 100 [17]. In a study by Naredo et al. [48] who used ultrasonography as the gold standard, a sensitivity and specificity of 50% and 96% were reported respectively for subscapularis tear [48]. All of these studies have similar results, reporting low sensitivity and high specificity for the lift-off test.
Normally, clinical tests that are based on pain are more sensitive and less specific, while clinical tests that are based on weakness and represent the integrity of specific RC muscles tend to have better specificity and lower sensitivity [7-12]. The reason for this finding is that a sensitive test corresponds with a negative test results which helps to rule out pathology. Absence of pain with pain provocation tests rules out a cluster of pathologies under umbrella of impingement syndrome such as inflammation, tendonitis, bursitis, partial or full thickness tear. However, presence of pain does not confirm any specific pathology which affects the utility of these tests in clinical decision making as different shoulder conditions have different managements. Strength-related clinical tests have different performance characteristics. They are not very sensitive which means they are not good screening tools to rule out a specific pathology but once they are positive, they confirm the presence of pathology. This general principle is more obvious when the severity of pathology is more significant. For example, in the present study we chose large/massive tears as the positive criterion. A number of patients with a major tear presented with a negative sign/test which led to poor sensitivity for the dropping/ hornblower signs and liftoff test. However, due to low proportion of false positives, specificity and LR+ were quite significant indicating that in clinical settings similar to ours (specialty orthopedic clinics where the prevalence of a major tear is high) a positive sign or test directs clinicians in the right direction and guide management more effectively by confirming pathology
A negative Jobe test accurately ruled out the presence of a major tear, significant supraspinatus fatty infiltration and a need for partial repair. The dropping and hornblower signs and lift off test were highly specific and when positive, they confirmed the presence of a major tear, fatty infiltration in the corresponding muscle and difficulty in achieving a full repair.
Bio chemistry
University of Texas Medical Branch, USADepartment of Criminal Justice
Liberty University, USADepartment of Psychiatry
University of Kentucky, USADepartment of Medicine
Gally International Biomedical Research & Consulting LLC, USADepartment of Urbanisation and Agricultural
Montreal university, USAOral & Maxillofacial Pathology
New York University, USAGastroenterology and Hepatology
University of Alabama, UKDepartment of Medicine
Universities of Bradford, UKOncology
Circulogene Theranostics, EnglandRadiation Chemistry
National University of Mexico, USAAnalytical Chemistry
Wentworth Institute of Technology, USAMinimally Invasive Surgery
Mercer University school of Medicine, USAPediatric Dentistry
University of Athens , GreeceThe annual scholar awards from Lupine Publishers honor a selected number Read More...
The annual scholar awards from Lupine Publishers honor a selected number read more...