Various questionnaires are used to assess the impact of tinnitus on the quality of life. The Tinnitus Functional Index (TFI) has excellent properties for scaling the severity of tinnitus and treatment-related changes in both clinical and research settings. The aim of this study was to evaluate the psychometric properties of the Italian version of the TFI with particular emphasis on factor analysis, internal consistency, reliability and validity. The original English version of the TFI was translated into Italian using the translation/back - translation process; 137 participants who were recruited at the Tinnitus Clinic in Milan and had suffered from tinnitus for at least three months (39.4% females, age: 18-80 years, mean age: 48.26, SD: 14.08) completed the Italian version of the TFI, the Tinnitus Handicap Inventory (THI), the Beck Depression Inventory - Primary Care Version (BDI-PC) and the Numeric Rating Scale of annoyance (NRS-A). Of these patients, 57 completed the TFI again at a second visit 7-14 days later, before undergoing any intervention, in order to provide data for reproducibility assessment. The psychometric properties were investigated using exploratory factor analysis and internal consistency and test-retest reliability instruments. The convergent validity of the TFI was evaluated using correlation coefficients obtained from the remaining measurements. The Italian TFI has a four-factor structure that was somewhat different from the original. The internal consistency proved to be good (0.92 ≤ α ≤ 0.96) as did the test-retest reliability (0.79 ≤ α ≤ 0.85). In terms of convergent validity, the TFI showed high correlations with the THI (r = 0.77) and the NRS-A (r = 0.70) scores, and moderate correlations with the BDI-PC scores (r = 0.46). The difficulties encountered when attempting to reproduce the original eight-factor structure were consistent with other studies in which the TFI was translated into European languages. In spite of this, the factorial structure of the Italian version of the TFI was characterised by high levels of reliability and validity. Overall, the Italian adaptation of the TFI was shown to be suitable to measure the impact of tinnitus on the daily lives of individuals.


Tinnitus is generally defined as the perception of sound in the absence of a corresponding external stimulus. It is a common auditory complaint with prevalence rates ranging from 2.4% to 20.1% 1. Some people can ignore this phantom sound, while others suffer considerably and consequently, with negative impact on the quality of life. Tinnitus is a purely subjective phenomenon and lacks any objectively identifiable markers; moreover, tinnitus may affect different domains of well-being (i.e. sleep, concentration, hearing, emotions). For these reasons, quantifying the severity of this symptom and how it changes over time and with treatment is challenging.

Several questionnaires have now been developed and validated for assessment of tinnitus complaints: Tinnitus Handicap Questionnaire 2, Tinnitus Reaction Questionnaire 3, Tinnitus Handicap Inventory (THI) 4, Tinnitus Questionnaire (TQ) 5 and its short form, Mini-TQ 6 and the Tinnitus Primary Function Questionnaire 7. The THI and Mini-TQ have also been validated in their Italian language versions 8,9. More recently Moschen et al. validated the Italian version of the Chronic Tinnitus Acceptance Questionnaire (CTAQ-I) 10.

In 2012, Meickle et al. proposed the Tinnitus Functional Index (TFI) 11. The TFI has been shown to be a sensitive tool to rate the severity and negative impact of tinnitus and is especially effective in evaluating the effects of treatment. Since its publication, the TFI has received considerable international attention in both clinical and research settings and has become the new “gold standard”. It has been translated into and validated in a number of languages including Dutch 12, Polish 13, Swedish 14, German 15,16, Hebrew 17 and Chinese 18,19. To our knowledge, there has been no validated Italian version to date.

The TFI consists of 25 items, each rated on a 11-point Likert scale. Patients rate each item according to how they have felt over the past week. The total score varies from 0 to 100 and is classified according to five levels of clinical severity 20: not a problem (0-17); small problem (18-31); moderate problem (32-53); big problem (54-72); very big problem (73-100). The TFI incorporates eight subscales named: Intrusive (items 1-3), Sense of Control (items 4-6), Cognitive (items 7-9), Sleep (items 10-12), Auditory (items 13-15), Relaxation (items 16-18), Quality of Life (items 19-22) and Emotional (items 23-25). In addition to the total score, the subscale scores can be calculated. A reduction of 13 in the TFI score is considered a significant reduction in discomfort 20.

The aim of this study was to evaluate the psychometric properties of the Italian TFI (I-TFI). More specifically, we investigated the factor structure of the instrument together with its internal consistency, test-retest reliability, convergent and discriminant validity.

Materials and methods


The study was carried out in accordance with the Declaration of Helsinki. For this trial, participants were recruited at the Tinnitus Clinic in Milan where they had referred a complaint of tinnitus. Patients were included in this study if they had had tinnitus for a minimum of three months, were fluent in the Italian language, were at least 18 years old and had normally cognitive function and preserved reading skills. All of the subjects enrolled in the study gave written informed consent. None of the patients was undergoing any type of treatment for tinnitus at the time of evaluation.

During their first examination at the Tinnitus Clinic (T0), all participants underwent complete audiological evaluation including full medical anamnesis, otomicroscopy and pure-tone audiometry in a sound attenuated room. Hearing levels were measured in each ear separately at 0.25-8 kHz at half octave steps for air conduction and at 0.5-4 kHz for bone conduction. The Pure Tone Average (PTA) was measured at 0.5, 1 and 2 kHz. After audiological assessment, the participants completed the I-TFI, THI, Beck Depression Inventory - Primary Care Version (BDI-PC) 21 and a Numeric Rating Scale of annoyance (NRS-A). Of the patients who took part in the first survey, a group was selected for test-retest reliability analysis. For this purpose, these patients were asked to complete the I-TFI a second time within a 1-2-week interval (T1) and before undergoing any intervention, in order to provide data for reproducibility assessment. A variation of two days before or after the requested one/two-week long interval between trials was considered acceptable in compliance with the patients’ needs. No access to the responses given in the first questionnaire was allowed to patients when filling out the second I-TFI. All of the participants included in the study completed the I-TFI without assistance. The time required to fill out the questionnaire was approximately 10 minutes.

The study consisted of two different phases: a translation and cross-cultural adaptation process (phase 1) and analysis of psychometric properties (phase 2).


One hundred and fifty subjects (40% females) attending the Tinnitus Clinic of Milan were asked to participate in this study; 13 patients refused to participate and therefore 137 adults aged 18-80 years (39.4% females Mage = 48.26, SDage = 14.08) suffering from tinnitus were included. All were examined by an ENT physician, underwent the audiometric tests described above and completed the I-TFI, I-THI, BDI-PC and NRS-A. Of the 137 patients who took part in the first survey, 70 were selected for the second survey based on the I-TFI questionnaire alone. 57 participants completed the second questionnaire within the accepted period and their scores were therefore included in the test-retest reliability analysis (10 patients did not complete the I-TFI within the accepted period of time and three did not complete the second questionnaire).


  • The TFI 11 consists of 25 items: the patient answers each question by giving a score ranging from 0 to 10 (except for items 1 and 3, which are expressed as a percentage ranging from 0% to 100%). The overall TFI score is calculated by multiplying the mean of all questions by 10 (questions 1 and 3 must be converted into a 0-10 scale) or dividing the sum of all of the scores by 2.5, to give a global score out of 100. Subscales are calculated as the mean of the answered questions multiplied by 10.
  • The THI 4 is one of the most commonly used questionnaires for assessing tinnitus distress and reporting treatment outcomes. The THI consists of 25 items that can be divided into three subscales: Functional, Emotional and Catastrophic. Patients score their symptoms on a three-point scale. The total score ranges from 0 to 100; a higher score is indicative of greater tinnitus distress. Good reliability (0.94) and validity were reported for the total score of the Italian THI 8,22.
  • The BDI-PC 21 is a screening instrument for depression that minimises the possibility of yielding spuriously high estimates of depression for patients with medical problems by focusing on symptoms of sadness, pessimism, past failure, loss of pleasure, self-dislike, self-criticalness and suicidal thoughts or wishes. Its seven items are taken from the Beck Depression Inventory-II 23. Each item is rated on a 4-point scale ranging from 0 to 3, and the BDI-PC score is calculated by summing up all of the highest ratings for each of its seven items. The respondents are asked to describe their feelings “over the past 2 weeks, including today.”
  • NRS-A: the patient is asked to rate tinnitus-related annoyance from 0 (minimal annoyance) to 10 (maximum annoyance). Of the 137 participants, only 125 answered this scale at T0.
  • Translation and cultural adaptation process (phase 1)

    A cross-cultural adaptation process of translation and back-translation was followed in accordance with the “Principles of Good Practice for the Translation and Cultural Adaptation Process for Patient-Reported Outcome Measures” 24. Two professional translators carried out independent forward translations of the original TFI into Italian (forward translation). Discussion of the translated manuscripts with an otorhinolaryngologist and an audiologist with extensive experience in tinnitus resolved discrepancies between the original independent translations and ensured a single forward translation (reconciliation). This new and final version of the I-TFI was then translated back into English by a qualified professional translator (backward translation). The same professional translator compared this back-translation to the original manuscript; no signs of incongruent translation were noted as every item was semantically identical to the original English manuscript (back translation review). The final I-TFI was tested for cognitive equivalence by a group of tinnitus patients who discussed the wording and the meaning of each item of the I-TFI with the clinicians. Subsequently, the wording of the questionnaire was modified on the basis of the suggestions given by patients (cognitive debriefing and finalisation).

    Analysis of psychometric properties (phase 2)


    An exploratory factor analysis was carried out to investigate the I-TFI structure with the aim of replicating the questionnaire’s original eight-factor configuration.

    The final number of factors to be retained was identified by two selection methods: Kaiser-Guttman criterion 25 and Cattell’s Scree test; as a consequence, only factors with eigenvalues greater than one or positioned to the left of the inflexion point on the eigenvalue plot were inspected further. Parallel analysis was also carried out in order to corroborate the identified factor solution.

    Since Mardia’s multivariate omnibus test showed violation of the assumption of multivariate normality, a Principal Axis Factoring (PAF) analysis was carried out 26. The direct oblimin rotation method made it possible to correlate the factors.


    Internal consistency and test-retest reliability were investigated using inspection of Cronbach’s alphas and test-retest α coefficients respectively. In line with literature, values between 0.70 and 0.79 were considered acceptable, scores between 0.80 and 0.89 good, and values higher than 0.90 were deemed excellent.


    As a final step, the instrument’s convergent and discriminant validity were investigated using Pearson bivariate correlation coefficients. In order to evaluate convergent validity, the I-TFI total and subscale scores were correlated with the global scores from the THI and the NRS-A scale. Similarly, in order to evaluate discriminant validity, the total and subscale scores of the I-TFI were correlated with the global BDI-PC score. In line with literature, values below |0.30| were interpreted as indices of absence of meaningful association between the involved constructs; values between |0.30| and |0.49| were deemed as indices of low correlation, values between |0.50| and |0.69| as indices of moderate correlation and values above |0.70| as indices of high correlation between constructs 27.


    A total of 54 females and 83 males completed the I-TFI questionnaire at T0. The mean PTA for the right ear was 16.35 dB HL (SD: 12.45) and for the left ear 18.23 dB HL (SD: 13.34). Descriptive statistics for all of the I-TFI items at T0 were subsequently calculated (Tab. I). No univariate outlier emerged in item distribution, although the results of Mardia’s multivariate omnibus test indicated a violation of the assumption of multivariate normality. For this reason, exploratory factor analysis (EFA) with Principal Axis Factoring rather than Principal Component Analysis was preferred.

    Factor analysis

    All of the I-TFI items were initially checked to make sure that the EFA assumptions were met. Sampling adequacy was verified using the Kaiser-Meyer-Olkin measure (KMO). The total KMO value was 0.92 and all of the KMO values for the individual items were higher than 0.77, and therefore above the threshold value of 0.60 28. Bartlett’s test of sphericity (c2(300 = 4060.38 p < 0.001) indicated that between-item correlations were sufficiently large to perform EFA.

    Both Kaiser-Guttman’s criterion and Scree plot inspection suggested a four-factor solution, together explaining 75.8% of the variance. The four-factor solution also showed eigenvalues higher than the cut-off value suggested by Parallel Analysis 29. These results show that the I-TFI factor structure differs from the original eight-factor solution observed by Meikle et al. 11.

    Table II shows the I-TFI factor loading matrix.

    Overall, all items showed primary loadings higher than 0.60 on one of the four factors, with the exception of items 18 (0.43), 21 (0.57), and 24 (0.42); at the same time, items 18 and 24 were the only items, (together with item 17) that had cross-loading values higher than 0.30 with a second factor.

    As Table II shows, the first extracted factor of the I-TFI (Factor 1) combined items from the Intrusive (I) and Sense of Control (S) subscales of the original instrument’s structure, together with item 24 whose primary loading in the eight-factor structure was on the Emotional subscale (E). With the exclusion of item 24, our results showed that items belonging to the Cognitive (C), Quality of life (Q) and Emotional (E) factors of the original structure collapsed into the second I-TFI factor (Factor 2). Auditory (A) was the only factor that completely mirrored the solution proposed by Meikle et al. 20. Finally, the fourth I-TFI factor (Factor 4) combined the Sleep (SL) and Relaxation (R) items from the original factor structure.

    As for the level of association among the four I-TFI factors, Table III shows the correlation values together with descriptive statistics.

    In general, negligible or low correlations were found among factors, with the exception of factor 2 which showed moderate levels of association with both factor 1 and factor 4.

    As a last step, an additional EFA with a fixed eight-factor structure was performed in order to test the original structure suggested by Meickle and colleagues. All of the items showed primary loadings on the expected items with the exception of item 16 that loaded on the Sleep factor (data are available from the corresponding author upon request).

    Internal consistency and test-retest reliability

    Internal consistency was evaluated using Cronbach’s alpha indices calculated for each of the four factors and for the total I-TFI score at T0 (N = 137) and T1 (N = 57); the test-retest alpha coefficient 30 was employed to evaluate test-retest reliability (T0-T1, N = 57).

    The results are reported in Table IV. All of the Cronbach α indices showed very high levels of internal consistency (> 0.90). As for test-retest reliability, it ranged from 0.79 (Factor 1), to 0.85 (Factor 3) indicating acceptable to good levels of reliability.

    Convergent and discriminant validity of the I-TFI

    Both the convergent and discriminant validity of the instrument were assessed at T0 using Pearson’s correlation coefficients calculated between the total I-TFI score and: a) the THI; b) the BDI-PC, c) the NRS-A. The descriptive statistics and correlation matrix of all of the instruments employed are reported in Table V.

    The total I-TFI score showed a high positive correlation with both the THI and the NRS-A, demonstrating acceptable levels of convergent validity with measures related to tinnitus construct. Moreover, acceptable levels of discriminant validity are suggested by the low association between the total I-TFI score and that of the BDI-PC. Even though both depression and tinnitus can be considered weakly related, as both are an expression of the health-illness continua, each dimension is characterised by distinctive features.


    This is the first study aimed at examining the psychometric properties (i.e. the factor structure, internal consistency and test-retest reliability, and validity) of the Italian TFI in a clinical sample of Italian adults experiencing tinnitus. It is important to carry out thorough evaluation of the psychometric properties of a questionnaire in order to assess whether this instrument is a reliable and valid form of measurement.

    Unlike the original structure, analysis of the I-TFI demonstrates that the best factorial structure is four factors. Factor 1 includes items that originally loaded on Intrusiveness (1-3) and Sense of Control (4-6), and item 24 (“how bothered or upset have you been…”) which originally loaded on Emotional distress. Factor 2 comprises the original Cognitive (7-9), Quality of Life (19-22) and Emotional (23, 25) items. Factor 3 includes the three Auditory items (13-15). Factor 4 contains the Sleep (10-12) and Relaxation items (16-18).

    Similarly, the evaluations of TFI versions translated into other languages show some uncertainty regarding its factor structure: using the eigenvalues > 1 criterion, the Dutch version found 7 factors, the Swedish 6 factors and the Polish version identified 5 factors; the German TFI extracted 5 factors adopting the less restrictive Jolliffe’s criterion (eigenvalues > 0.7).

    Similar to the German validation, our study groups the original Intrusive and Sense of Control items in Factor 1, but with the addition of item 24 (Intrusive/Sense of Control/item 24). A possible explanation of the attribution of item 24 to the same factor loading Intrusiveness and Sense of Control items could be the Italian translation: in fact, the Italian translation for “bothered” (item 24) is similar (but not the same) as “annoyed” (item 3). Furthermore, in our study item 24 showed comparable loadings with factor 2, which included Emotional items.

    Factor 2 of our extracted factors included three subscales of the original TFI: Cognitive, Quality of Life and Emotional. This result is supported by a similar finding during the validation of the German version for Switzerland; the Dutch version also combined the two subscales, Cognitive and Quality of Life, in one factor. This result can be justified by examining the content of the items involved in the Emotional and Cognitive scales. The former presents the topics of activation (two items) and depressed mood (one item), while the latter includes items concerning attention capacity. A link could be hypothesised if we were to consider the influence of hyperactivation on attention skills. The item concerning mood in the Emotional scale, together with items about activation, could have some concept points in common with the scale of Quality of Life, where items are about the pleasure of being involved in social activities or other commitments, linking anxiety and depressive moods with a decreased hedonia.

    Factor 3 is the only factor to be defined individually and corresponds to the Auditory subscale of the questionnaire; moreover, it has high factor loadings (0.84, 0.98, 0.91). Validations in other languages, even those that did not confirm the eight-factor structure, also found the Auditory factor to be separate. Recently, Fackrell et al. suggested that the Auditory subscale does not contribute to the level of global functional impact of tinnitus and proposed a modified seven-factor model, without the Auditory items, which fits the variance in the patients’ scores better 31.

    Factor 4 incorporates the Sleep and Relaxation items. Herein, features of tinnitus often disturb the falling asleep phase, during which the transition from the vigil state of consciousness to the state of sleep requires relaxing parasympathetic processes.

    An EFA with the superimposed eight-factor structure was also performed to investigate whether the original structure suggested by Meickle et al. could be confirmed. In this case, primary loadings of the items loaded on the original factors with the exception of item 16 that loaded on Sleep. Furthermore, the definition “quiet resting activities” (item 16) in the Relaxation subscale, is translated as “sleep” in the Italian version. This result suggests that further data must be collected to combine the results of a confirmatory analysis with those of the exploratory analysis.

    The Italian version of the TFI shows very high levels of internal consistency with Cronbach’s alpha: above 0.90 for the overall TFI and for each of the four factors.

    The test-retest reliability analysis was carried out on a subgroup of patients and the results showed acceptable to good levels of test-retest reliability (0.79 ≤ α ≤ 0.85).

    In order to assess convergent and discriminant validity, the I-TFI was compared with the original THI, the most widely used tinnitus questionnaire in Italy, which has already been validated in the Italian language and used in other validation studies. The results show a strong positive correlation with the THI, thus confirming that the I-TFI and the THI measure a similar construct.

    The I-TFI also shows a high convergent validity with the scores for tinnitus annoyance; this was predictable as the TFI includes aspects related to the annoyance experienced by patients with tinnitus. Instead, the global I-THI score showed marginal correlations compared with the BDI-PC scores, thus indicating acceptable levels of discriminant validity.

    There are some limitations in this study. First of all, the number of patients recruited was limited and insufficient to enable us to perform a Confirmatory factor analysis; further studies on a larger number of subjects suffering from tinnitus are needed. Secondly, we did not evaluate the I-TFI responsiveness to treatment-related changes and therefore further research is needed to address this point.


    The difficulties in reproducing the original eight-factor structure are consistent with other TFI validation studies on European language versions of the questionnaire, which also failed to reproduce the original eight-factor structure. In spite of this, the factorial structure of the Italian version of the TFI is characterised by high levels of reliability and validity. Overall, the Italian adaptation of the TFI is suitable for measuring the impact of tinnitus on the daily lives of individuals.

    Figures and tables

    Table I..

    Item Min-Max M (SD) Item Min-Max M (SD)
    1. 1-10 * 6.42 (2.82) 14. 1-10 2.51 (2.95)
    2. 1-10 5.88 (2.20) 15. 1-10 2.69 (3.13)
    3. 1-10 * 5.56 (3.06) 16. 1-10 3.79 (3.43)
    4. 1-10 6.68 (3.18) 17. 1-10 4.62 (3.15)
    5. 1-10 6.38 (2.93) 18. 1-10 5.45 (3.38)
    6. 1-10 6.34 (2.83) 19. 1-10 3.30 (3.24)
    7. 1-10 4.15 (3.06) 20. 1-10 3.64 (3.58)
    8. 1-10 3.66 (3.21) 21. 1-10 3.28 (3.25)
    9. 1-10 3.72 (2.94) 22. 1-10 3.35 (3.18)
    10. 1-10 4.23 (3.52) 23. 1-10 4.69 (3.39)
    11. 1-10 3.72 (3.45) 24. 1-10 5.69 (3.12)
    12. 1-10 3.73 (3.57) 25. 1-10 3.93 (3.44)
    13. 1-10 2.80 (3.02)
    Descriptive statistics of the I-TFI.

    Table II..

    Item Factor 1 (I)ntrusive/ (S)ense of Control/(item 24) Factor 2 (C)ognitive/(Q)uality of life/ (E)motional Factor 3 (A)uditory Factor 4 (SL)eep/ (R)elaxation
    1 (I) 0.71
    2 (I) 0.78
    3 (I) 0.77
    4 (S) 0.78
    5 (S) 0.75
    6 (S) 0.78
    7 (C) 0.90
    8 (C) 0.96
    9 (C) 0.83
    10 (SL) 0.87
    11 (SL) 0.96
    12 (SL) 0.92
    13 (A) 0.84
    14 (A) 0.98
    15 (A) 0.91
    16 (R) 0.93
    17 (R) 0.32 0.51
    18 (R) 0.35 0.43
    19 (Q) 0.64
    20 (Q) 0.66
    21 (Q) 0.57
    22 (Q) 0.68
    23 (E) 0.64
    24 (E) 0.42 0.37
    25 (E) 0.79
    Exploratory factor analysis (principal axis factoring): oblimin-rotated factor loadings.

    Table III..

    N = 137 1. 2. 3. 4.
    1. (I)ntrusive/(S)ense of Control/(item 24) -
    2. (C)ognitive/(Q)uality of Life/(E)motional 0.54 -
    3. (A)uditory 0.35 0.39 -
    4. (SL)eep/(R)elaxation 0.49 0.61 0.14 -
    Min-Max 6-70 0-88 0-30 0-60
    M 42.95 33.71 8 25.52
    SD 16.47 25.21 8.75 18.58
    I-TFI factor correlation and descriptive statistics.

    Table IV..

    αT0 (N = 137) αT1 (N = 57) α(T0-T1) (N = 57)
    1. (I)ntrusive/(S)ense of Control/(item 24) 0.91 0.91 0.79
    2. (C)ognitive/(Q)uality of Life/(E)motional 0.96 0.95 0.81
    3. (A)uditory 0.96 0.97 0.85
    4. (SL)eep/(R)elaxation 0.96 0.94 0.84
    I-TFI total score 0.96 0.95 0.83
    Internal consistency and test-retest reliability.

    Table V..

    1. 2. 3. 4.
    TFI (total score) -
    THI 0.77 ** -
    NRS-A 0.70 ** 0.64 ** -
    BDI-PC 0.46 ** 0.63 ** 0.43 ** a -
    Min-Max 3-95 0-94 0-10a 0-17
    M 44.08 37.64 5.92a 2.31
    SD 23 21.75 2.09a 2.98
    I-TFI convergent and discriminant validity: Pearson’s correlation indices and descriptive statistics.