Development and psychometric validation of Skindex-29 and Skindex-16

Dermatological disease significantly affects patient's health-related quality of life (HrQoL). Skindex is one of the most frequently used dermatology-specific HrQoL measures. Currently no Chinese version of Skindex is available. The aim of this study was to translate and culturally adapt Skindex-29 and Skindex-16 into Chinese, and to evaluate their reliability and validity.

Results

There were both seven items of Skindex-29 and Skindex-16 requiring a second forward- and backward- translation to achieve the final satisfactory Chinese version. The internal consistency reliability was high (range of Cronbach's alpha for the scales of Skindex-29 0. 85-0. 97, Skindex-16 0. 86-0. 96). Known- group validity was demonstrated by higher scores from patients with inflammatory dermatosis than from patients with isolated skin lesions (P < 0. 05). Evidence of factor structure validity of the Skindex-29 and Skindex-16 was demonstrated by both exploratory factor analysis that accounted for 68. 66% and 77. 78% of the total variance, respectively, and confirmatory factor analysis with acceptable fitness into the expected three-factor structure.

Conclusion

This study has developed semantically equivalent translations of Skindex-29 and Skindex-16 into Chinese. The evaluation of the instruments' psychometric properties shows they have substantial evidence of reliability and validity for use as HrQoL instruments in Chinese patients with dermatological disease.

Background

Health-related quality of life (HrQoL) in dermatological patients has received growing recognition as measuring the burden of skin disease on patients in the fields of clinical practice, clinical trial, and healthcare management. In China, however, less attention has been paid to HrQoL of dermatological patients, and one of the main reasons is a lack of suitable instruments in Chinese that have been developed or adapted according to established scientific attributes and criteria.

Skindex is one of the best dermatological instruments to measure dermatology- specific HrQoL. Skindex was comprised of 61 items initially and was then modified to two brief versions known as Skindex-29 and Skindex-16. The two versions have been extensively studied and refined in different languages and population samples. Skindex-29 was recommended as one of the instruments of choice in dermatology, and Skindex-16 was recognized as an accurate and sensitive measurement of bother of patients' experience and used in many studies of dermatological diseases. However, there is no Chinese version of Skindex available.

In this study, we translated into Chinese and cross-culturally adapted Skindex-29 and Skindex-16. Furthermore, assessment of psychometric properties was carried out.

Methods

The study was carried out in two phases: translation, and evaluation of psychometric properties. The study was approved by the ethics committee of the Guangdong Provincial Hospital of Chinese Medicine.

Translation

The translation methodology conformed to the guidelines for cross-cultural adaption of health-related quality of life measures. There were 5 steps. Forward-translation: Two bilingual translators whose native language was Chinese independently translated Skindex-29 and Skindex-16 into Chinese. Then the two translators discussed their translations and agreed on a reconciliation version defined as the first Chinese version.

Pre-testing: The second Chinese version was tested on a pilot group of patients with skin disease. The patients were all native speakers of Chinese. Each item was performed face to face interviews to determine whether it was acceptable and comprehensible.

Production of final version: The final Chinese version was produced after refining problematic items encountered in pre-testing and proof-reading.

Evaluation of psychometric properties

Instruments

The measurement instruments included sociodemographic items, the validated Chinese version of Dermatology Life Quality Index (DLQI) and the final Chinese versions of Skindex-29 and Skindex-16. The sociodemographic items included age, gender, marital status, education level, smoking, drinking alcohol, co-existing chronic disease, and duration of skin disease.

The DLQI is a 10-item brief questionnaire to assess the quality of life impact of skin disease. Its Chinese version has been evaluated to have good psychometric properties. It is designed as 6 aspects of quality of life: symptoms and feelings, daily activities, leisure, work and school, personal relationships and bother with treatment, while its unidimensionality has been proved for several language versions and a total score of DLQI is generally recommended to be used. Items are answered on a 4-point ordinal scale ranging from 0 to 3, and item scores are summed to yield a total score (030).

Skindex-29 contains 30 items, while item 18 about side-effects of treatment is not scored. It covers 3 scales: symptoms, emotions and functioning. Each item is rated on a 5-point Likert scale (never, rarely, sometimes, often, all the time). All responses are transformed to a linear scale of 100 ranging from 0 to 100, and the scale score is regarded as the mean of a patient's responses to the items in a given scale. Skindex-16 contains 16 items with each rated on a 7- point Likert scale. Scores for the emotions, symptoms and functioning scales are also expressed in a linear scale from 0 to 100, and the scoring method is the same with those of Skindex-29.

For all the three instruments used, higher scale scores reflect greater impairment.

Statistical analysis

For each scale of the Skindex-29 and Skindex-16, the floor and ceiling effects were assessed. If more than 20% of the participants reported lowest or highest possible score, the floor or ceiling effects exist. For reliability analysis, Cronbach's alpha coefficient was determined to assess internal consistency reliability. Based on the hypotheses that patients with inflammatory dermatosis would have higher scale scores than patients with isolated skin lesions, known-groups validity was assessed by comparison between the two groups using Wilcoxon rank sum test. For Skindex-29, known-groups validity was also assessed by comparison among different frequency groups of bother from side-effects of treatment (measured by item 18), and the Kruskal-Wallis H test was used. Convergent validity assessed the degree to which the Skindex-29 and Skindex-16 were similar to (converged on) other measures that they should theoretically be related to. Spearman's rank correlation coefficient was used to assess correlations between the Skindex-29, Skindex-16 and the DLQI.

Factor structure validity was assessed by extracting factors using exploratory factor analysis (EFA) with principal components method. As substantial correlation between factors was expected, oblique rotation was performed. Factors were identified based on eigenvalues greater than 1. A criterion of highest factor loading, above 0. 4 and at least 0. 1 stronger than the next was used to identify items that were salient in defining a given factor. Each factor was labeled by the heavily loaded items according to the above criterion. Confirmatory factor analysis (CFA) was also used to evaluate goodness of fit of the supposed three-domain structure of the instruments. The goodness of fit indices valued between 0 and 1 included goodness of fit index (GFI), adjusted goodness of fit index (AGFI), non-normed fit index (NNFI) and comparative fit index (CFI), standardized root mean square residual (SRMR), and root mean square error of approximation (RMSEA). The first four indices were expected to be larger than 0. 90, the SRMR was expected to be less than 0.08, and the RMSEA was expected to be less than 0.10.

All the analysis was performed using SPSS 17. 0 (SPSS Inc. , Chicago, IL, USA) and LISREL 8. 8 (Scientific Software International, Chicago, IL, USA).

Results

Translation

There were 23 items of Skindex-29 and 9 items of Skindex-16 classified as (a). The items that were classified as (b) or (c) were presented in Table 1 respectively for Skindex-29 and Skindex-16. The translation of some feelings and emotions such as being frustrated, being ashamed, being annoyed and showing affection were particularly problematic.

Response distribution and reliability

The means, standard deviations, percentiles, corrected item-total correlations, and Cronbach's alpha coefficients are presented in Table 4 for Skindex-29 and Skindex-16 and their scales. All scales of Skindex-29 had small floor and ceiling effects (5%), while emotions and functioning scales of Skindex-16 had mild floor and ceiling effects (>10%). The Cronbach's alpha coefficients of Skindex-29 and Skindex-16 and their scales were high (>0. 80), and the item- total correlation ranged from 0. 5 to 0. 88, which signified very good internal- consistency reliability of the two instruments.

Known-group validity

Patients with inflammatory dermatosis had significantly higher scale and total scores of Skindex-29 and Skindex-16 compared with patients with isolated lesions (Table 5). Box plot of Skindex-29 total scorein patients with various grades of bothered by the side-effects of treatment.

Convergent validity

It was demonstrated that moderate to good correlations between Skindex-29 and DLQI, Skindex-16 and DLQI (Skindex-29, r from 0. 43 to 0. 84; Skindex-16, r from 0. 39 to 0. 83) (Table 6).

Factor structure validity

Exploratory factor analysis using eigenvalues greater than 1 as the criterion resulted in 4 factors for Skindex-29 and 3 factors for Skindex-16, which accounted for 68. 66% and 77. 78% of the total variances, respectively. With oblique rotation method, the factor loadings of Skindex-29 and Skindex-16 are presented in Tables 7 and 8. For Skindex-29, the first, second and fourth factors matched almost the three original scales: Functioning, Emotion and Symptoms. The third factor, however, seemed as the combination of items originally belonging to the Function and Symptoms. For Skindex-16, the extracted three factors reflected almost perfectly the original scales of functioning, emotion and symptoms. And the factor loadings ranged from 0. 799 to 0. 969 for factor 1 (functioning), 0. 459 to 0. 998 for factor 2 (emotion) and 0. 637 to 0. 882 for factor 3 (symptoms) respectively.

Confirmatory factor analysis was also performed to evaluate the factor structure validity of Skindex-29 and Skindex-16 according to their original structure. The goodness-of-fit indices are shown in Table 9, and the structure graphs are shown in Figures 2 and 3 respectively for Skindex-29 and Skindex-16. The two instruments got similar results. The indices GFI, AGFI and RMSEA were not as good as possible, but the NNFI and CFI were large enough to meet the cut-off criterion of goodness-of-fit indices (lager than 0. 90). Meanwhile, the SRMR, which was regarded as the most important index, was also satisfying (less than 0. 08).

The three-factor model for Skindex-29 obtained from confirmatory factor analysis.

The three-factor model for Skindex-16 obtained from confirmatory factor analysis.

Discussion

In this study, we translated Skindex-29 and Skindex-16 into Chinese according to accepted steps in the generation of conceptually and linguistically equivalent translations of quality of life measures. Our evaluation of the psychometric properties of the resulting Chinese versions of Skindex-29 and Skindex-16 suggested that they are reliable and valid measures of HrQoL in Chinese patients with dermatological disease.

With respect to the reliability, the Cronbach's alpha coefficients for the global and three scales were all greater than 0. 80, which indicated satisfactory internal consistency of the Chinese version of Skindex-29 and Skindex-16. And the results were similar to those of other language versions of Skindex-29 and Skindex-16. High item-total correlations further confirmed the good homogeneity of the questionnaires. Mild floor and ceiling effects were noted for the scales of functioning and emotions of Skindex-16, while no floor or ceiling effect was observed for Skindex-29. It might suggest that Skindex-16 provide limited information on bothering by dermatological disease for those with low level of functioning and high level of emotions.

Our results showed that the Chinese version of Skindex-29 and Skindex-16 had good capacity to discriminate patients with inflammatory dermatosis and patients with isolated lesions, which was consistent with the original version and other language versions. Furthermore, we found a difference in the total score of Skindex-29 among patients with different levels of being bothered by the side-effects of treatment which was measured by the item 18 of Skindex-29. The more bothered by the side-effects of treatment, the worse quality of life patients had.

The pattern of correlation between Skindex-29 and Skindex-16 scale scores and DLQI subscale scores provides available evidence of convergent validity. For all three scales, they exhibited moderate to excellent correlations with the six subscales of DLQI. The reasons of choosing DLQI as the measures that Skindex should theoretically be related to are that the DLQI is the most commonly used HrQoL instruments in dermatology and its Chinese version has been proved to be reliable and valid.

The exploratory factor analysis yielded four factors for Skindex-29 and three factors for Skindex-16. The result of Skindex-29 was consistent with that of the Polish version, but in contrast to the three-factor structure extracted by the original version and Italian version. The differences between present results and the results in the US and Italy samples might be partly due to differences in sample characteristics. The results of Skindex-16 were in accordance with the original version and provided further confirmation of the three factors of the instrument.

There has not been confirmatory factor analysis of Skindex, and our results indicated that the three-factor structure of Skindex-29 and Skindex-16 was acceptable. The reason of low values of GFI and AGFI and high value of RMSEA might be that they could be impacted by the sample size which in this study was relatively small.

There were some limitations in this study. The study sample was from the outpatients of the department of dermatology and the time consuming of completing the questionnaire might lead to selection bias. To decrease burden of the study on patients, test-retest reliability was not evaluated. And responsiveness, i. e. the ability of the instrument to detect small but meaningful changes over time was not evaluated either because it was hard to investigate clinical significant changes for many dermatologic conditions enrolled in this study within a reasonable time frame.

Additional studies would include further validation with patients from multi- center clinics, evaluation of test-retest reproducibility and responsiveness of the instruments. And international cooperative research could be expected to use the Skindex to measure HrQoL in patients with dermatological disease. In addition, more information on patients from different socio-cultural areas and environments would be desirable. In particular, using the original English version of scale could have introduced response bias because of cultural differences between the US and China. How the Skindex might exhibit Differential Item Functioning (DIF) among patients with different cultural background needs further research.

Conclusion

The Chinese versions of Skindex-29 and Skindex-16 have been developed and demonstrated to be reliable and valid instruments for use as HrQoL instruments in Chinese patients with dermatological disease. The Skindex is expected to be adopted into clinical practice and clinical trials to allow physicians to increase the attention paid to HRQoL of patients not only the symptoms of dermatological disease.

Abbreviations

HrQoL: Health-related quality of life
DLQI: Dermatology life quality index
EFA: Exploratory factor analysis
CFA: Confirmatory factor analysis
GFI: Goodness of fit index
AGFI: Adjusted goodness of fitindex
NFI: Normed fit index
CFI: Comparative fit index
SRMR: Standardized root mean square residual
RMSEA: Rootmean square error of approximation