A RASCH-BASED COMPARISON OF THE FUNCTIONAL INDEPENDENCE MEASURE AND SPINAL CORD INDEPENDENCE MEASURE FOR OUTCOME AND QUALITY IN THE REHABILITATION OF PERSONS WITH SPINAL CORD INJURY

Roxanne Maritz, PhD1,2#, Carolina Fellinghauer, PhD1#, Mirjam Brach, MBA, MPH1, Armin Curt, MD3, Hans Peter Gmünder, MD4, Maren Hopfe, PhD1, Margret Hund-Georgiadis, MD, PhD5, Xavier Jordan, MD6, Anke Scheel-Sailer, MD2,4 and Gerold Stucki MD, MS1,2

From the 1Swiss Paraplegic Research, Nottwil, 2Center for Rehabilitation in Global Health Systems, Department of Health Sciences and Medicine, University of Lucerne, Lucerne, 3Spinal Cord Injury Center, Balgrist University Hospital, Zurich, 4Swiss Paraplegic Centre, Nottwil, 5Clinic for Neurorehabilitation and Spinal Cord Injury, REHAB Basel, Basel and 6Clinique Romande de Réadaptation, Sion, Switzerland

#Shared first authorship.

Objective: The Functional Independence Measure (FIM™) and spinal cord injury (SCI)-specific Spinal Cord Independence Measure (SCIM) are commonly used tools for outcome measurement and quality reporting in rehabilitation. The objective of this study was to investigate the psychometric properties of FIM™ and SCIM and to equate the 2 scales.

Methods: First, content equivalence of FIM™ and SCIM was established through qualitative linking with the International Classification for Functioning, Disability and Health (ICF). Secondly, a Rasch analysis of overlapping contents determined the metric properties of the scales and provided the empirical basis for scale equating. Furthermore, a transformation table for FIM™ and SCIM was created and evaluated.

Subjects: Patients with SCI in Swiss inpatient rehabilitation in 2017–18.

Results: The ICF linking and a separate Rasch analysis of FIM™ restricted the analysis to the motor scales of FIM™ and SCIM. The Rasch analysis of these scales showed good metric properties. The co-calibration of FIM™ and SCIM motor scores was supported with good fit to the Rasch model. The operational range of SCIM is larger than for FIM™ motor scale.

Discussion: This study supports the advantage of using SCIM compared with FIM™ for assessing the functional independence of patients with SCI in rehabilitation.

Key words: spinal cord injury; Functional Independence Measure; Spinal Cord Independence Measure; activities of daily living; Rasch measurement model; psychometrics; outcome assessment (healthcare); quality in healthcare; rehabilitation.

J Rehabil Med 2022; 54: jrm00262

DOI: 10.2340/jrm.v54.82

Accepted Dec 16, 2021; Epub ahead of print Jan 25, 2022

Correspondence address: Roxanne Maritz, Department of Health Sciences and Medicine, University Lucerne, Frohburgstrasse 3, 6002 Lucerne, Switzerland. E-mail: roxanne.maritz@unilu.ch

lay abstract

In our study we compared two rehabilitation outcome assessment tools commonly used to measure functional independence in Spinal Cord Injury: the Functional Independence Measure (FIM™), a general tool, and the Spinal Cord Independence Measure (SCIM), a tool specifically developed for Spinal Cord Injury rehabilitation. We first compared the content of the two tools using the International Classification of Functioning Disability and Health (ICF). Then we tested their measurement properties and put them on a common measurement scale, which allows to directly compare scores of the two tools. The common measurement scale was obtained by mean of a so-called Rasch analysis. The results showed that the FIM™ motor items can be compared to the SCIM items from a content but also from a metric point of view. The study showed an advantage in using the SCIM compared to the FIM™ for assessing the functional independence of patients in Spinal Cord Injury rehabilitation.

INTRODUCTION

Over the last decade, health systems have established national quality monitoring systems to strengthen quality assurance and quality management by providing a strategy for performance comparison across healthcare providers (1). To measure and ensure an adequate depiction of quality, these systems need to rely on important determinants to describe case complexity, such as diagnosis, treatment, and functioning information (2).

In the context of rehabilitation, the Functional Independence Measure (FIM™) has been established in several countries as a standard for measuring change at patient level and outcome quality at institutional or national level (e.g. in Australia or Canada) (3–6).

In Switzerland, the National Association for Quality Development (ANQ) (7), an organization mandated by the Swiss cantons, health insurances, and clinics, has also established a quality monitoring system including FIM™ for musculoskeletal and neurological rehabilitation.

Quality monitoring of rehabilitation services for rehabilitation outcomes has its merits, including the possibility to learn from best practices. Nevertheless, the imposition of an instrument such as the FIM™ as a standard patient assessment instrument can pose challenges for the adequate depiction of case complexity of relevant subgroups and subsequent reporting of outcome quality.

Patients with spinal cord injury (SCI) differ from those with neurological and musculoskeletal disorders in terms of their rehabilitation needs (8). They exhibit higher medical complexity, which requires treatment in specialized rehabilitation centres with the corresponding infrastructure, personnel, and processes (9). In order to consider the medical and rehabilitative characteristics of this group of patients, specific survey instruments are required to ensure a complete and meaningful illustration of the patients’ functioning.

For rehabilitation patients with SCI, the FIM™ was found to be unsatisfactory for assessment and follow-up of functional independence (10, 11). Consequently, a SCI-specific assessment instrument, the Spinal Cord Independence Measure (SCIM) was developed (12). Compared with the FIM™, the SCIM does not include cognitive items, but instead it addresses several issues typical for SCI, such as respiration problems, bladder and bowel management, and transfer and walking ability, including usage of SCI-related mobility aids.

In Switzerland, the specialized rehabilitation clinics for SCI have conducted a project to examine whether it would be possible to calculate functioning scores equally to FIM™ scores based on SCIM. As the SCIM can be expected to suit the SCI population better than the FIM™, using the SCIM in SCI rehabilitation would enable collecting more clinically meaningful data on the one hand, and, through an equating of the 2 metrics, provide data comparable to the FIM™ for the national quality management on the other hand. The Swiss SCI centres also hypothesized that the daily independence of patients with a lower level of functioning would be better assessed with the SCIM than with the FIM™ and, consequently, reduce bias against clinics with highly dependent patients for which the FIM™ can be expected to show a floor effect.

The objective of this study was to investigate the psychometric properties of FIM™ and SCIM and to equate the 2 scales for outcome measurement and quality management in SCI. This would help to examine whether SCIM can replace FIM™ for outcome measurement and quality management purposes.

METHODS

Design

Rasch-based score equating based on a common person design.

Sample

From May 2017 to March 2018, all specialized SCI clinics in Switzerland collected FIM™ and SCIM III data in parallel for the same patients with SCI. The data collection was part of a larger investigation for the development of a new reimbursement system in rehabilitation, which aims to include the same assessment tools already used for national quality reporting (13). A total of 663 patients with SCI were assessed 1–6 times for 6 weeks, resulting in a total of 985 observations. Approximately 66% of the patients participated in 1 wave of data collection, 22.3% in 2 waves, 8.5% in 3 waves, and 2.5% in 4 waves of data collection. Only 2 participants underwent 5 and 6 measurements, respectively. To avoid repeated person measures in the sample, only data from 1 assessment time-point was randomly selected from each individual (14).

Assessment tools

The FIM™ was developed in the 1980s (3) and is currently one of the most commonly used functioning assessment instruments in neurological and musculoskeletal rehabilitation (10). The instrument consists of 18 items, with 13 items on motor and 5 items on cognitive abilities. The motor domain is further divided into 4 subscales: “self-care”, “continence”, “transfer”, and “locomotion” (mobility). The cognitive domain consists of 2 subscales: “communication” and “cognition”. For each item, 7 response options, from “total assistance” to “complete independence” evaluate the degree of dependence in everyday autonomy.

The psychometric properties of FIM™ have been extensively studied (15). Studies comparing FIM™ with similar instruments designed to address functional independence in SCI, showed that, despite FIM™ sharing many similarities with these instruments, they often do not cover all areas equally well (10, 16–20).

The SCIM was developed in the mid-1990s as the first outcome measure for everyday independence in SCI. Since then, the SCIM has been widely used in SCI rehabilitation (12). The current version of the SCIM is SCIM III (21). The SCIM comprises a total of 18 items, divided into 4 subscales: “self-care”, “respiration and sphincter management”, “mobility in rooms”, and “mobility indoors and outdoors” (mobility). The response scales vary between 0–2 and 0–15 points. The total score, which covers only motor functioning, ranges from 0 “total assistance” to 100 “complete independence”.

SCIM and FIM™ are both reliable and valid assessment tools for measuring everyday independence (10). Studies support the good correlation of the FIM™ with the SCIM III. Still, there is also some evidence that the SCIM III has a higher sensitivity and responsiveness for patients with SCI (22, 23).

Data analysis

Qualitative linking: ICF-based content comparison. Scale equating refers to methods establishing equivalence of total scores from different assessment tools so that the scores are interchangeable (24). Qualitative linking was applied to determine the conceptual overlap and similarities, of FIM™ and SCIM. Equality of constructs is an essential requirement for scale equating (24). The International Classification of Functioning, Disability and Health (ICF) (25) is a WHO classification that provides a standardized and international language to describe health and functioning. The ICF was used for qualitative linking, as both FIM™ and SCIM assess functioning information. Concepts identified in both assessment tools were linked by 2 researchers to the most precise ICF category, using the current version of the standardized ICF linking rules (26).

Furthermore, selections of relevant ICF categories for different settings and health conditions, so-called ICF Core Sets, can be consulted to discuss the relevance of assessed functioning information. In a second step, the content of the FIM™ and the SCIM identified through the ICF linking was contrasted with the content of the ICF Generic-30 Set, representing relevant functioning categories for neurological, musculoskeletal, and cardiovascular rehabilitation (27).

Quantitative equating: metric equivalence. Metric properties of total scores of assessment instruments, such as SCIM and FIM™, can be evaluated with a modern probabilistic measurement approach. Rasch analysis, applying the partial credit model (PCM), was used in the current study. The PCM is a model developed for the psychometric analysis of ordinal response scales (28). Previous studies have shown that PCM is suitable to calibrate the FIM™ (15).

A Rasch analysis examines critical assumptions for reliable measurement with ordinal scales, such as the underlying dimensionality, the monotonicity of the response options, or the conditional independence of items when having conditioned out the total score (29). If the outcome of a Rasch analysis fulfils the model’s assumptions, it can be concluded that the total scores are interval scaled and applicable for quantifying change and making comparisons (30).

It has been shown that FIM™ (15), as well as SCIM III (31), are multidimensional assessment instruments, exhibiting local dependencies within subscales. Thus, the redundant information within domains often enters the analysis as testlets, i.e. as a sum score aggregating the respective subscale items. For this study, 3 testlets were formed for the motor items of the FIM™ and SCIM: “self-care”, “continence”, and “mobility”. Furthermore, 1 testlet was formed with all the cognitive items of the FIM™, incorporating the two areas: “communication” and “cognition”.

Some response options of SCIM III items are coded in a non-uniform and non-equidistant manner, such as the SCIM’s breathing item with 6 non-uniform options (0-2-4-6-8-10) or the SCIM’s bowel management item with 4 non-equidistant options (0-5-8-10). For the metric analysis presented in this study, the non-uniform non-equidistant response options of the SCIM have been recoded to represent incremented values starting from 0 with directly consecutive integers. Without this modification, the Rasch model would estimate more thresholds than available by considering the response gaps as missing information. As the PCM does allow thresholds to be non-equidistant, the increase in difficulty of response thresholds will be estimated by the model directly.

Only if the qualitative linking confirms the construct equality and the metric properties of the scales show fit to the Rasch model, can scale equating be conducted.

In the current analysis, FIM™ and SCIM data were collected for all participants (common person design). In a common person equating procedure, given that the scales to equate fit the Rasch model, a co-calibration of the row scores of the scales can be undertaken. The fit of the respective scales to the Rasch model is given by the total-Item χ2 test. When co-calibrating two scales or subscales by their total scores, the quality of the model fit, i.e. a good adaptation of the data to the model, is given with a conditional χ2 fit statistic (32).

In a Rasch analysis, the reliability of the model can be measured with the Cronbach α and the Person Separation Index (PSI). A PSI > 0.85 indicates high reliability for measurements at individual levels. A PSI > 0.8 still indicates good reliability of the instrument for measurements at the population level (29). Cronbach α is interpreted similarly. A scale equating, based on total scores of two scales, is comparable, technically speaking, to an analysis with testlets, but represents a bi-factor equivalent solution. In a Rasch analysis that uses testlets instead of items, the change in reliability, given by the PSI or the Cronbach α, can be described by mean of the A-score. The A-score is understood as the percentage of the remaining variance observed after aggregation of the items. A-scores above 90% indicate that more than 90% of the unique variance of the items is retained by a testlet formation. The interpretation of the A-score is similar to the Explained Common Variance used in bi-factor analysis, where values of 80% would indicate a strong first factor (33). According to Quinn (34), A-values above 90% are a minimum threshold and A-values < 70% indicate that a multidimensional model is needed.

A dimensionality analysis was used to evaluate whether testlets measure the same latent construct. Ability estimates derived from oppositely loading testlets on the first principal component are compared individually with a t-test for each participant, and the number of significant deviations is determined. In the presence of unidimensionality, the proportion of significant t-tests should not exceed 5%, meaning that the lower boundary of the 95% confidence interval (95% CI) should not be above 5% (35).

If the metric properties of FIM™ and SCIM allowed for it, a co-calibration of the scales would deliver a transformation table, indicating how a total score from one assessment tool can be translated into the total score of the other tool. Information from the entire population (n = 985) was used to analyse the quality of a SCIM to FIM™ motor total score transformations. The equivalence of the transformed FIM™ motor scores, as derived from SCIM with the observed FIM™ motor scores, was compared with the Spearman’s rank correlation, the Wilcoxon signed-rank test (36), and Cohen’s D effect size (37). The validity of the transformation table was then determined by comparing the FIM™ motor scores resulting from the transformation table, as derived from SCIM with the truly observed FIM™ motor scores, as assessed with the scale. Likewise, the accuracy, in terms of correctly predicted FIM™ scores, can be described.

In addition, the Leunbach’s model for direct equating has been applied (38), to test whether the two scales measure the same latent construct by relating the respective total scores to a common metric. This analysis provides, among other key values, a Cohen’s kappa and a mean weighted standard error of equating (SEQ) (39). A Cohen’s kappa of zero indicates that the agreement is equivalent to chance. Values of 0.4–0.6 represent moderate agreement, 0.6–0.8 substantial agreement, 0.8–0.99 near-perfect agreement, and 1 perfect agreement. An SEQ mean below 0.91 is acceptable (40).

Due to a very small percentage of missing values in a few cognitive items of the FIM™ and the ability of the Rasch model to handle missing values, no data were imputed. The Rasch analyses were performed with the software RUMM2030 (34). Leunbach’s model analyses for scale equating were conducted with DIGRAM (41), and all further quantitative analyses with R (42).

RESULTS

Descriptive statistics of FIM™ and SCIM scores, including frequencies and proportions for the complete dataset and the analysis sample, are shown in Appendices 1 and 2. Further sample characteristics, such as age, sex, lesion level, etc., were not provided in the dataset for the current study.

Qualitative linking: ICF-based content comparison

The ICF linking of SCIM and FIM™ (Table I) showed that the items of both assessment tools contain ICF categories of the chapters d4 – Mobility and d5 – Self-care, b5 – Functions of the digestive, metabolic and endocrine systems, and b6 – Functions of the urogenital and reproductive systems. Except for the categories b440 Respiration and d410 Changing basic body position, the same ICF categories are addressed through the motor items of FIM™ and SCIM. ICF categories from chapters b1 – Mental functions, d1 – Learning and application of knowledge, d3 – Communication and d7 – Interpersonal interactions and relationships occur only for the FIM™ and correspond to the cognitive items of the FIM™ (items N–R). The SCIM does not capture cognitive limitations. However, with the exception of 3 SCIM items (items1, 7 and 8), which are linked to b525 Defecation functions, b440 Respiration functions, and d560 Drinking, all other items can be mapped to the ICF Generic-30, in comparison with the FIM™, where 5 (H, N, O, Q and R) of 18 items cannot be linked to the Generic-30, including mostly its cognitive items.

Quantitative equation: metric equivalence

The results of the Rasch analyses are shown in Table II. Rasch analysis of the FIM™ scale, including motor and cognitive domains, clearly supported its multidimensionality with 13.28% (11.5%–15%) significant t-tests. The multidimensionality is further confirmed with an A-score below 90% (87.65%). Together with the fact that SCIM does not assess cognitive domains, the decision was made to create a transformation metric that only includes the motor domains of FIM™.

The individual analyses of FIM™ motor and SCIM scale with a testlet-based approach, i.e. by aggregating the items by subscales of the motor domain, resulted in good model fit and targeting, without floor and ceiling effects (Table II). The assessment instruments’ motor scales are one-dimensional and show good reliability (PSI > 0.85). Based on these findings, an equating of the motor total scores was justified.

The equating of FIM™ motor and SCIM by mean of a co-calibration of their total scores, resulted in good model fit with good targeting and high reliability (PSI > 0.9). Also, in the co-calibration context, the one-dimensionality assumption was fulfilled. The A-value of the co-calibration was 98.82%, indicating that less than 2% of the variance had to be discarded to achieve a unidimensional latent estimate. An equivalence of the constructs and the equivalence of the latent trait being assessed by FIM™ motor and SCIM scale were supported statistically.

A person-item map visualizing the distribution of the ability estimates and of the difficulty thersholds from the co-calibration of the SCIM and the FIM™ is shown in Appendix 3. The person-item map shows that the SCIM has additional difficulty thresholds for lower levels of functional independence.

The co-calibrations enabled total scores to be transformed from one scale to another. The transformed values can then be compared with the observed total score. The tables with the SCIM scores and the corresponding Rasch transformed FIM™ motor scores are shown in Appendix 4. The score transformation is based on the sum of re-scored items, as specified in Appendices 2a–c. The derived and observed total scores correlated very highly (r >0.9). A direct comparison with the Wilcoxon signed-rank test showed no significant mean deviation between the total motor scores. Cohen’s D values also supported that the observed deviations of the means are negligible (Table III).

Furthermore, the direct raw score equating under the Leunbach’s model also supports that SCIM and FIM™ motor items measure a common latent construct. The Cohens’ kappa statistic under Leunbach indicated substantial agreement among the raw scores (κ=0.79). The mean weighted SEQ of 0.61 indicated that the scale equating results are acceptable.

The confusion matrix regarding the precision of the total score quantile transformation from SCIM to FIM™ motor score showed the correctly equated total scores on the diagonal. The derivation of the FIM™ motor scores from the SCIM scores had a precision of 68.12% (Table IV). Notably, a substantial part of the unprecise transformations is adjacent to the diagonal in the next quantile, indicating that the observed departures from the correct transformation quantile are not excessive.

DISCUSSION

Qualitative linking with the ICF as a reference system enabled the content overlap between SCIM and FIM™ to be determined. Further separate quantitative testing of the FIM™ motor and SCIM scale with Rasch analysis revealed good metric properties and supported their co-calibration and creation of a transformation table. Co-calibration with Rasch showed that FIM™ motor scores could be derived from observed SCIM scores with sufficient reliability. The Leunbach’s approach of direct test equating supported that the 2 scales measure a common latent construct. Statistical comparison of the observed and transformed FIM™ motor scores showed negligible differences. However, the precision of a transformation into FIM™ quantile scores was 68.12%, indicating that 68.12% of the FIM™ scores transformed from the observed SCIM scores were correctly found in the observed FIM™ quantiles. The direction of the equating error, meaning systematic over- or under-estimation of the observed FIM™ score when transforming from SCIM is not found in this analysis.

This study further showed that the SCIM scores have a broader measurement range than the FIM™ motor scores, being more able to determine limitations in functioning, especially in presence of low functional independence, regarding the mobility, self-care, and sphincter control independence experienced by more severely injured patients with SCI. This advantage can be explained by the differences of the content addressed by the SCIM subscales. For example, the continence subscale of the SCIM includes an item about respiration in addition to bowel and bladder management. “Respiration” is an item, which becomes particularly relevant with high levels of SCI where pulmonary functions are impaired due to the injury. Having more thresholds to differentiate lower levels of functioning means, concretely, that clinics treating a more severely affected patient population will be enabled to better report improvements achieved at the lower end of the continuum.

As all Swiss specialized SCI centres contributed to the collection of information on functional independence with SCIM and FIMTM, the results are expected to be representative for the Swiss setting and can therefore be used to inform further developments of the Swiss ANQ quality reviews in SCI rehabilitation.

Study limitations

This study is based on secondary data analysis of a sample of persons with SCI. Information on sex, age, time since injury, lesion type and lesion level were not provided in the dataset. Therefore, a more refined analysis that could also determine the invariance of the scales across sample subgroups was not possible. Evidence from another Rasch analysis of the FIM™, with data from the Swiss national quality reports in a neurological and musculoskeletal population, supported the absence of differential item functioning for sex, age, nationality, insurance status of patients, rehabilitation group, clinic language, and time-point of measurement (4). More advanced research is needed in order to improve the precision of the score transformation. Availability of clinical cut-points for the scale score comparisons instead of using data driven cut-points (quantiles) may be of the highest benefit in that regard (43). Ultimately, the Rasch analyses of this report are based on a random sample of 663 participants with SCI.

Applicability of results

The study results support some advantages in using the SCIM III to assess independence in patients with SCI. When functional independence data is required for national quality reports or reimbursement purposes, SCIM scores could be transformed to FIM™ motor scores using a metrically sound transformation table. Furthermore, the Rasch analysis can be used to transform the ordinal-level scaled data to interval-scale level, which allows for sound comparisons and calculation of change scores in everyday independence.

Future research could focus on transformations at the subscale level, which could be more beneficial for monitoring in clinical practice. Furthermore, analysis of the invariance of the total scores for subgroups of the SCI sample, such as lesion level or age groups, may further improve the accuracy of the transformation. Considering injury and person characteristics would allow deriving group-specific reference values. These could be used in clinical practice to discuss the everyday independency of individuals in the light of normative scores.

Conclusion

This study allows us to equate and compare FIM™ motor and SCIM III scores. The results showed that SCIM III has a more comprehensive operational range, supporting the use of the SCIM III for SCI rehabilitation, as it provides more information for patients with lower functional abilities.

With respect to the motor domains, evaluations based uniquely on SCIM can be expected to result in a comparable benchmarking in comparison with other rehabilitation populations, in which the FIM™ has been used as outcome measure. In this regard, the assessment of patients with SCI based on SCIM instead of FIM™ may have a positive impact and result in more precise description of rehabilitation outcomes of SCI-specialized centres for quality reports.

ACKNOWLEDGEMENTS

The authors thank Melissa Selb from the Swiss Paraplegic Research for support with ICF-related questions, Nuria Adroher for technical advice with DIGRAM, Stefan Metzger from the Swiss Paraplegic Clinic, and Renato Mattli from the Zürcher Hochschule für Angewandte Wissenschaften for their support in obtaining and understanding the dataset for secondary use.

REFERENCES

  1. Busse R. Diagnosis-related groups in Europe moving towars transparency,efficiency and quality in hospitals. Maidenhead: Open University Press; 2011.
  2. Prodinger B, Tennant A, Stucki G, Cieza A, Ustun TB. Harmonizing routinely collected health information for strengthening quality management in health systems: requirements and practice. J Health Serv Res Policy 2016; 21: 223–228.
  3. Keith RA, Granger CV, Hamilton BB, Sherwin FS. The functional independence measure: a new tool for rehabilitation. Adv Clin Rehabil 1987; 1: 6–18.
  4. Maritz R, Tennant A, Fellinghauer C, Stucki G, Prodinger B. The Functional Independence Measure 18-item version can be reported as a unidimensional interval-scaled metric: internal construct validity revisited. J Rehabil Med 2019; 51: 193–200.
  5. Australasian Rehabilitation Outcomes Centre – University of Wollongong, Wollongong, Australia (2021 UOW). [Accessed January 10, 2022] Available from https: //ahsri.uow.edu.au/aroc/index.html.
  6. Canadian Institute for Health Information: Better data. Better decisions. Healthier Canadians. Canadian Institute for Health Information, Ottawa, Canada (CIHI2022). [Retrieved January 10, 2022] Available from https://www.cihi.ca/en/about-cihi.
  7. Swiss National Association for Quality Development in Hospitals and Clinics, Bern Switzerland (2022 ANQ). Review Information: Rehabilitation. [Accessed January 10, 2022] Available from https://www.anq.ch/en/departments/rehabilitation/review-information-rehabilitation/.
  8. World Health Organization (WHO), ISCOS: International Perspectives on Spinal Cord Injury. Malta: World Health Organization; 2013.
  9. Curt A, Hund-Georgiadis M, Jordan X, Baumberger M. Qualitäts- und Leistungskriterien für die Behandlung von Patienten mit einer Para- oder Tetraplegie sowie mit querschnittähnlicher Symptomatik. Nottwil: Swiss Society of Paraplegia; 2018.
  10. Anderson K, Aito S, Atkins M, Biering-Sorensen F, Charlifue S, Curt A et al. Functional recovery measures for spinal cord injury: an evidence-based review for clinical practice and research. Spinal Cord Med 2008; 31: 133–144.
  11. Middleton JW, Harvey LA, Batty J, Cameron I, Quirk R, Winstanley J. Five additional mobility and locomotor items to improve responsiveness of the FIM in wheelchair-dependent individuals with spinal cord injury. Spinal Cord 2006; 44: 495–504.
  12. Catz A, Itzkovich M, Agranov E, Ring H, Tamir A. SCIM – Spinal Cord Independence Measure: a new disability scale for patients with spinal cord lesions. Spinal Cord 1997; 35: 850–856.
  13. Dettling M, Wirz M, Mattli R. ST Reha – Zusatzdatenerhebung: Parallelerhebung SCIM/FIM® in den Para-Kliniken im Jahr 2017/2018 : Finaler Schlussbericht. In. Zürcher Hochschule für Angewandte Wissenschaften: Winterthurer Institut für Gesundheitsökonomie; 2018.
  14. Mallinson T. Rasch analysis of repeated measures. Rasch Meas Trans 2011; 25: 1317–1318.
  15. Lundgren Nilsson A, Tennant A. Past and present issues in Rasch analysis: the functional independence measure (FIM) revisited. J Rehabil Med 2011; 43: 884–891.
  16. Donnelly C, Eng JJ, Hall J, Alford L, Giachino R, Norton K, et al. Client-centred assessment and the identification of meaningful treatment goals for individuals with a spinal cord injury. Spinal Cord 2004; 42: 302–307.
  17. Marino RJ, Shea JA, Stineman MG. The Capabilities of Upper Extremity instrument: reliability and validity of a measure of functional limitation in tetraplegia. Arch Phys Med Rehabil 1998; 79: 1512–1521.
  18. Yavuz N, Tezyurek M, Akyuz M. A comparison of two functional tests in quadriplegia: the quadriplegia index of function and the functional independence measure. Spinal Cord 1998; 36: 832–837.
  19. Marino RJ, Huang M, Knight P, Herbison GJ, Ditunno JF, Segal M. Assessing selfcare status in quadriplegia: comparison of the quadriplegia index of function (QIF) and the functional independence measure (FIM). Paraplegia 1993, 31: 225–233.
  20. Morganti B, Scivoletto G, Ditunno P, Ditunno JF, Molinari M. Walking index for spinal cord injury (WISCI): criterion validation. Spinal Cord 2005; 43: 27–33.
  21. Itzkovich M, Gelernter I, Biering-Sorensen F, Weeks C, Laramee MT, Craven BC et al. The Spinal Cord Independence Measure (SCIM) version III: reliability and validity in a multi-center international study. Disabil Rehabil 2007; 29: 1926–1933.
  22. Stumm C, Hug K, Ballert C, Hund-Georgiadis M. Responsivität des »Spinal Cord Independence Measure« (SCIM) und des »Functional Independence Measure« (FIM) bei Personen mit Rückenmarksverletzung. Neurol Rehabil 2017; 23: 227–232.
  23. Jones LAT, Li CY, Weitzenkamp D, Steeves J, Charlifue S, Whiteneck G. Development and validation of crosswalks Between FIM(R) and SCIM III for voluntary musculoskeletal movement functions. Neurorehabil Neural Repair 2021; 35: 880–889.
  24. Kolen MJ, Brennan RL, Kolen MJT. Test equating, scaling, and linking : methods and practices, 2nd edn. New York: Springer; 2004.
  25. World Health Organization: International Classification of Functioning, Disability and Health (ICF). Geneva: World Health Organization; 2001.
  26. Cieza A, Fayed N, Bickenbach J, Prodinger B. Refinements of the ICF Linking Rules to strengthen their potential for establishing comparability of health information. Disabil Rehabil 2016; 41: 574–583.
  27. Prodinger B, Reinhardt JD, Selb M, Stucki G, Yan T, Zhang X et al. Towards system-wide implementation of the International Classification of Functioning, Disability and Health (ICF) in routine practice: developing simple, intuitive descriptions of ICF categories in the ICF Generic and Rehabilitation Set. J Rehabil Med 2016; 48: 508–514.
  28. Masters GN. A Rasch model for partial credit scoring. Psychometrika 1982; 47: 149–174.
  29. Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Rheumatol 2007; 57: 1358–1362.
  30. Petrillo J, Cano SJ, McLeod LD, Coon CD. Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples. Value in Health 2015; 18: 25–34.
  31. Prodinger B, Ballert CS, Brinkhof MW, Tennant A, Post MW. Metric properties of the Spinal Cord Independence Measure – self report in a community survey. J Rehabil Med 2016; 48: 149–164.
  32. Andrich D, Sheridan B, Luo G. Rasch models for measurement: RUMM2030. Perth: RUMM Laboratory Pty Ltd; 2010.
  33. Rodriguez A, Reise SP, Haviland MG. Evaluating bifactor models: Calculating and interpreting statistical indices. Psychol Methods 2016; 21: 137–150.
  34. Quinn HO. Bifactor models, explained common variance (ECV), and the usefulness of scores from unidimensional item response theory analyses. Chapel Hill: University of North Carolina; 2014.
  35. Hagell P. Testing Rating Scale unidimensionality using the principal component analysis (PCA) /t- test protocol with the Rasch model: the primacy of theory over statistics. Open J Stat 2014; 4: 456–465.
  36. Wilcoxon F. Individual comparisons of grouped data by ranking methods. J Econ Entomol 1946; 39: 269.
  37. Cohen J. The statistical power of abnormal-social psychological research: a review. J Abnormal Social Psychol 1962; 65: 145–153.
  38. Leunbach G. A probabilistic measurement model for assessing whether two tests measure the same personal factor. Copenhagen: Danish Institute for Educational Research; 1976.
  39. Kolen MJ, Brennan RL. Test equating, scaling, and linking. Methods and practices (2nd edition). New York: Springer; 2004.
  40. Adroher ND, Kreiner S, Young C, Mills R, Tennant A. Test equating sleep scales: applying the Leunbach’s model. BMC Med Res Methodol 2019; 19: 141.
  41. Kreiner S, Nielsen T. Item analysis in DIGRAM 3.04. Part I: Guided tours. Research report 2013/06.Copenhagen: University of Copenhagen, Department of Public Health; 2013.
  42. R Core Team: R. A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2016.
  43. Bennette C, Vickers A. Against quantiles: categorization of continuous variables in epidemiologic research, and its discontents. BMC Med Res Methodol 2012; 12: 21.