Time as a factor during endoscopic assessment of swallowing: relevance in defining the score and severity of swallowing disorders
Time is a parameter of great interest in swallowing and can be considered in different ways to express severity during endoscopic evaluation. Theobjectives of this study are to evaluate how the severity of this score changes at different times of scoring and the interaction between residue persistenceand airway invasion. Two experienced raters blindly evaluated 35 short clips of bolus transit that were recorded during endoscopic evaluationsof 16 patients with dysphagia of differing aetiologies. The pooling score (p-score) and the Penetration Aspiration Scale (PAS) were detected afterthe first swallowing (T1) and after the fifth dry swallow (T5). For each task, the time needed to complete the clearing of the bolus (total time: TT)was blindly determined by the two raters and compared with the Functional Oral Intake Scale (FOIS) and Dysphagia Outcome and Severity Scale(DOSS) scales, previously detected.The inter-rater agreement between the 2 raters in scoring the p-score and PAS was good (ICC > 0.800) for T1vs T1 and T1 vs T5, and in determining TT for each consistency (ICC > 0.9), with a Spearman’s Rho > 0.70 and > 0.90 respectively. A statisticalcorrelation of the p-score total, TT and consistency with FOIS and DOSS was found. The p-score showed a good trade-off between sensitivity andspecificity compared with the PAS aspiration and penetration scores. The time of scoring (T1 vs T5) is relevant in detecting severity of dysphagiaduring endoscopic evaluation. The time spent to clear residue is a useful parameter and is correlated with severity of dysphagia expressed by thep-score and with functional swallowing status in dysphagic patients. The p-score is correlated with the PAS score in detecting airway invasion.
In normal conditions, the progression of the bolus through the upper digestive tract must meet criteria of efficiency and effectiveness: any condition that deviates from these criteria generates an unsafe swallowing act with nasal regurgitation, penetration, aspiration, or an inefficient swallowing act, with residue of bolus 1. These two conditions can coexist and influence each other: a bolus that leaves residue in the pharynx can be cleared or can invade the upper airway during a subsequent swallowing act 2, while a bolus invading the upper airway can indefinitely remain in the larynx or cervical trachea. Instrumental investigation must be able to document these conditions, with the aim of defining criteria of severity that better indicate, in a clinical context, the risk of respiratory or nutritional complications 3 4. This same information, in the short term, is useful in planning therapeutic strategies and in reducing long-term complications 5. Today, several tools are available to evaluate and quantify ineffective or inefficient swallowing. Some tools designed for the radiological setting have been adapted and applied to endoscopic examination. An example is the penetration aspiration scale (PAS) 6, originally designed for the modified barium swallow (MBS) and subsequently replicated for endoscopy 7-10. The PAS is widely used to define the entity of airway invasion and possible ejection attempts. The PAS score is applied after the first/second swallow, without a standard reported in the literature or in the original paper 6. However, in clinical practice, more varied conditions can be found. A bolus can be propelled by more swallowing acts (multiple swallowing) or coexist with residue, imposing several dry swallows before clearing. In these conditions, airway invasion may occur after the swallow 11 or require defensive strategies (spontaneous or requested) carried out over a longer period of time (more than one or two dry swallows).
In the literature, some rating scales are applied after the first-second swallowing acts, such as the aforementioned PAS, and others after the second one, such as the pooling score (p-score) 12 and Boston Residue and Clearance Scale 13. Some scales do not mention the time of scoring 8 and others leave the decision to the clinician as to when to score 14-18. The first tool proposed, to be applied after the second swallow, was the p-score, a validated scale 12 19 that considers 5 dry swallows before scoring residue in the pharyngeal or laryngeal cavities. The total score derives from the sum of partial scores attributed to the anatomical site where material is pooling, its amount (semi-quantitatively compared to the volume’s bidimensional viewing of the cavities) and to the number and effectiveness of dry swallowing, or other strategies, performed in the attempt to clear residue (“management” of residue, see Table I). The score expresses a continuum of severity which, in clinical practice, can be spread over four levels, describing different levels of clinical severity (Table I). The consistency of the bolus does not seem to affect the score 19. The p-score, like the other scores, does not consider the parameter ’time’, i.e. the time necessary to clear the bolus or to complete the 5 dry swallows requested, but focuses on the residue and the correlates that define it. Swallowing times during endoscopic evaluation of swallowing have been assessed in several studies 20-25 and time association with swallowing abnormalities was explored in some of them 22-25. Nevertheless, the time considered in those investigations does not include the time needed to clear residue, a parameter which has previously been considered in our preliminary report 26. Cinical observations, indeed, show conditions where the material pooling is cleared in a few seconds and other conditions where a longer time is needed to match this goal, provided that it is possible. In this context, the time of ‘management’ of the residue recalls the attempts to clear saliva with spontaneous swallowing acts performed by elderly patients 15 or stroke patients 27, although different pressure and awareness are required to clear residue. This perspective leads to a parameter of severity where the clearing time can become a marker of effectiveness of the swallowing act in pathological conditions or where its increase, under stress, can become a marker of fatigue 28 29. If this were true, the time of ‘management’ could vary similarly to the variation of other scales 26 such as the Functional Oral Intake Scale (FOIS) 30 and the Dysphagia Outcome and Severity Scale (DOSS) 31.
With these considerations in mind, the aims of this study are to evaluate: 1) if the p-score changes when applied before the fifth swallowing act and whether the PAS changes when applied after the first; 2) the reliability of 2 raters in scoring residue at different times and their reliability in determining these times (time of ’management’ of the p-score); 3) possible correlation of PAS to p-score.
Materials and methods
In a prospective way, 16 consecutive outpatients (11M/5F, mean age 63.94 years ± 15.46, range 25-88) were submitted to a fibreoptic endoscopic evaluation of swallowing (FEES) 32. The patients were complaining of swallowing disorders due to different aetiologies (Table II). Inclusion criteria were: over 18 years old, an instrumentally documented impaired swallow (residue, false routes), compliance to the endoscopic procedure; exclusion criteria were: less than 18 years old and non-compliance to the endoscopic procedure. The patients with low dysphagia were considered because they respected the criteria of inclusion. FEES was performed with a Storz endoscope (model 11101RP2, 30 cm long, 3.5 mm in diameter) and recorded with a workstation (Xion medical products GmbH, Berlin Buchholz). During FEES and with the endoscope in place, one bolus of each consistency was given to each patient: 5 cc pureed (P), ¼ of a cracker (regular-R) and 5 cc liquid (L) 33. The patients prepared the bolus and swallowed without any command. Some patients were not able to test all three consistencies, owing to the severity of their complaint. For each patient, short videos were obtained for each swallowing trial so that a total of 35 clips were collected and reviewed by two expert raters (with more than 15 years’ experience in performing FEES) in a blind manner. The raters were requested to score each bolus trial with the p-score and the PAS. Both the p-score and the PAS score were applied after the first (time 1 – T1) and fifth swallow (time 5 – T5). In this way, the parameter ‘management’ of the p-score was always the minimum provided by the score. The raters also blindly determined the time necessary to perform the 5 dry swallowing acts (total time: TT). TT was timed with a stopwatch in iOS 9.0, 4+ (Tim O’s Studios, LLC) at the beginning of the first white-out and at the conclusion of the fifth white-out 34. In accordance with the p-score, spontaneous and cued dry swallows were considered. TT was compared with the patients’ ability for oral intake of food and liquid, measured against the Functional Oral Intake Scale (FOIS), even if only validated for stroke patients 30 and the functional severity of dysphagia measured against the Dysphagia Outcome and Severity Scale (DOSS) 31. The scales were previously determined by rater 1. Because of the small sample, monovariate analysis was previously performed among TT and consistencies (explanatory variables) and FOIS and DOSS, respectively. Subsequently, multiple linear regression, considering FOIS and DOSS as dependent variables, was performed taking into account the TT, p-score total and consistency.
The intra-class correlation coefficient was performed to evaluate the inter-rater reliability of the two raters for FEES at T1 and T5 (ICC) and determine TT. In accordance with the literature, the following were considered for ICC values: 0-0.2 poor; 0.3-0.4 fair; 0.5-0.6 moderate; 0.7-0.8 strong; and > 0.8 almost perfect. For each rater, the Rho Spearman’s coefficient (r > 0.70-sing < 0.05) was performed to evaluate the correlation between PAS and p-score and TT. Furthermore, in order to determine optimal thresholds for the p-score when compared to PAS diagnoses, the Receiver Operator Characteristic (ROC) curve analysis was performed after dichotomising PAS between penetration scores 2 to 5 and aspiration scores 6 to 8. To determine the best balance between sensitivity and specificity, the Youden Index (Y = sensitivity+specificity−1), was chosen as the criterion for cut-off value selection. All statistical analyses were performed using SPSS v.21.0 (IBM Corp., Armonk, NY, USA) and STATA version 13 (STATA Corp., TX, USA).
All patients gave their written consent to the procedures, in accordance with the Declaration of Helsinki. The study was approved by the local Ethical Research Committee.
The mean times necessary to clear residue for P, R and L were 22.5 secs (range 4-42), 30.7 secs (range 11-44) and 16.6 secs (range 8-33), respectively, with a mean TT of 53.8 sec. The inter-rater agreement between the 2 raters in scoring the p-score and PAS was good (ICC > 0.800) for each consistency and time of scoring (T1 vs T1 and T1 vs T5) with the exception of the liquid bolus. The inter-rater agreement between the 2 raters in detecting TT was good (ICC > 0.9) for each consistency (Spearman’s Rho > 0.90-sing < 0.001). A correlation between the PAS and the p-score at T1 vs T1 and T1 vs T5 was observed only for the pureed consistency for rater 1 and for pureed and liquid ones for rater 2 (Spearman’s Rho > 0.70-sing < 0.05). The linear regression model documented a significant correlation of the p-score total, TT and consistency with FOIS and DOSS. In particular, increasing the time spent in clearing residue corresponded to an increase in the p-score and decreased the FOIS for all consistencies; increasing the p-score decreased the DOSS score for R and L (Table III, IV). A good correlation between PAS score and p-score was found (Spearman’s rho 0.924-P < 0.05). The screening properties of the p-score when compared to the PAS cut-off diagnosis of penetration (scores 2 to 5) and aspiration (scores 6 to 8) showed a good trade-off between sensitivity and specificity compared with the PAS aspiration scores (area under the ROC curve = 0,958; 95% CI = 0.784-0.994) and with the PAS penetration scores (area under the ROC curve = 0.622; 95% CI = 0.352-0.792), with a p-score cut-off of 3 for penetration and of 4 for aspiration, respectively.
Our experience shows that when applying the p-score at T1 and the PAS at T5, with FEES, they correlate only for the P consistency for rater 1 and P and L for rater 2. This leads us to consider that the lack of correlation between T1 and T5 suggests a real different value of the score applied, i.e. applying the PAS over the first swallowing act changes the score itself. It is also worth mentioning that for the liquid bolus there was no concordance between the two raters, contrary to the other consistencies.
The parameter ‘time’ also shows its importance under the quantitative perspective, as the time spent in completing a sequence of dry swallows. In our sample, the mean time necessary to clear residue for P, R and L was far longer than the time physiologically reported in the literature for clearing boluses of the same consistency 20-25. The detection of this parameter, in our experience, seems to be a reliable parameter worth including in endoscopic evaluation of swallowing 26. The p-score, which indirectly considers this parameter (residue ‘management’ in TT) enriched in that sense, may express a further criterion of severity (see Appendix). In our sample, the increase in TT is related to the increase in the p-score, and both are related to the decrease in the FOIS score for all consistencies tested and to the increase of the DOSS scale for R and L 26.
Even the consistency, which does not affect the p-score 19, when related to the TT, seems to be a parameter able to influence the outcome of the swallowing act (presence of residue) performed spontaneously or upon request by the patients in our sample. Increasing TT, a reduction in the efficiency of the swallowing act could be hypothesised: the fate of the residue during the TT is not predictable, but it is plausible that it may be related to the aetiology/comorbidities 35 36. In this sense, a possible correlation between the p-score compared with PAS, in detecting penetration and aspiration, is expressed by the area under the ROC curves: these values indicate a good predictability of the p-score for the three consistencies in terms of sensitivity and specificity. The cut-off for aspiration is 4 and the cut-off for penetration is 3, coinciding with the sub-parameter ‘site’ of the p-score, identifying residues below and above the vocal cords, respectively.
The main limitations of this work are the small sample and the different numerical representations of the bolus swallowed in different consistencies. Bearing this in mind, the work is intended to have a preliminary character, and to test the value of the ‘time’ parameter in defining the clinical severity of a swallowing disorder. Further research is in progress to correlate the tp-score with the fatigability of patients with swallowing disorders due to specific aetiologies.
The parameter ‘time’ was evaluated applying scores that consider directly (p-score) or indirectly (PAS) the bolus and its fate after subsequent swallowing acts (multiple swallows or cued swallows). The evaluation of FEES clips of swallowing tasks suggests how, by applying the p-score at T1 and PAS at T5, although for different consistencies, produced different scores between two expert raters. We conclude that the time of detection of a score modifies the score, so that the time of scoring (the first or subsequent swallowing act) should be previously defined and considered.
The time needed by the patient to clear the residue is a reliable parameter that correlates with the severity of the p-score and other scales that relate to the patients’ functional status or with their deglutition skills, suggesting the possibility of a clinical use of the tp-score (Appendix) in the follow-up of patients with swallowing disorders due to specific aetiologies or after stressful swallowing activities (fatigue detection).
Figures and tables
|Site||Vallecule – marginal zone||1|
|Vestibule – vocal cords||3|
|Below the vocal cords||4|
|2 > < 5||3|
|Pts n.||Main pathology||Gender||Age|
|10||Parkinson’s and ictus||M||81|
|Beta||95% confidence interval||P value|
|Fees - T - P tot||-1.321||-4.649||-0.635||0.029|
|Fees - T - P sec||1.128||-0.022||1.216||0.053|
|Fees - T - R tot||-1.927||-0.801||-1.494||0.004|
|Fees - T - R sec||-2.437||-0.370||-0.171||0.007|
|Fees - T - L tot||-0.439||-0.505||-0.080||0.027|
|Fees - T - L sec||-0.903||-0.113||-0.050||0.007|
|Beta||95% confidence interval||P value|
|Fees - T - P tot||-,294||-,519||,208||,208|
|Fees - T - P sec||,025||-,034||,039||,826|
|Fees - T - R tot||-1,338||-,378||-1,215||,015|
|Fees - T - R sec||-,931||-,173||-,033||,024|
|Fees - T - L tot||-,488||-,697||-,047||,046|
|Fees - T - L sec||-,634||-,101||-,014||,030|
|Site||Vallecule – marginal zone||1|
|Vestibule – vocal cords||3|
|Below the vocal cords||4|
|2 > < 5||3|