1 / 64

Generic measures: limitations of use within specific settings ?

Generic measures: limitations of use within specific settings ? . J Freeman Institute of Health Studies Plymouth University. Properties. Clinical feasibility Psychometric reliability validity appropriateness responsiveness. Validity. Does it measure what it says it measures?

lovette
Download Presentation

Generic measures: limitations of use within specific settings ?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Generic measures:limitations of use within specific settings ? J Freeman Institute of Health Studies Plymouth University

  2. Properties • Clinical • feasibility • Psychometric • reliability • validity • appropriateness • responsiveness

  3. Validity Does it measure what it says it measures? • Content validity • Criterion validity • Construct (convergent and discriminant) Bowling 1997

  4. Construct validity • The extent to which empirical data supports hypotheses concerning the attributes being measured • detective work • jigsaw puzzle

  5. Appropriateness • Is the range of the construct measured within the sample similar to the range covered by the instrument? Van der putten et al 1999

  6. The 36-item Short Form Health Survey (SF-36): • Gold-standard generic self-report measure of health status • Adopted & disseminated world-wide • Standardised UK and US version

  7. SF-36 dimensions

  8. The SF-36 • Relatively few studies have evaluated its use as an outcome measure for clinical practice or clinical trials in MS

  9. Aim of study • To explore the reliability, validity, clinical appropriateness, and responsiveness of the SF-36 in MS patients within a health care setting

  10. Methods • 150 patients with clinically definite MS • Broad spectrum of disease severity • Assessments completed in 106 patients once, twice in 44 rehabilitation inpatients

  11. Assessments • Disease severity: EDSS • Health Status: SF-36 • Disability: FIM • Handicap:LHS • Emotional well-being:GHQ

  12. Assessment of construct validity... • Convergent validity Correlation's between SF-36 dimensions & instruments measuring similar & different constructs • Group differences validity • ANOVA to differentiate between different groups

  13. ...Assessment of construct validity • Hypothesis testing T-tests to investigate whether results in line with theoretical expectation

  14. Assessment of appropriateness • Examination of the scale score distributions of the 8 dimensions and the 2 summary components of the SF-36 and all other measures range, mean, sd, floor, ceiling

  15. Sample characteristics • Mean age 45 (24 - 78yrs) • Female 68% • Disease pattern • SP 50% RR 33% • PP 11% Benign 6% • Mean yr’s since diagnosis 11 (0.1 - 38) • Mean EDSS 5.7 (1 -9)

  16. Results: convergent & discriminat validity • Convergent & discriminant validity supported Substantial correlation’s with related scales, e.g. FIM with SF-36 physical function (r = 0.68), EDSS (r = 0.82) Weak correlation's with unrelated scales e.g. GHQ with SF-36 physical function (r =0.26)

  17. Results: group differences validity • Group differences validity supported Significant differences demonstrated in health status at different level of disease severity (p<0.05)

  18. Results: hypothesis testing As hypothesised: • Patients requiring carer assistance reported lower physical scores(p<0.0001) • Patients scoring > 5 GHQ points reported lower SF-36 emotional scores (p<0.0001)

  19. Results: appropriateness • Scores span the entire spectrum of available range • Significant floor and ceiling effects (>20%) in - physical function - physical role limitations - emotional role limitations - bodily pain

  20. Results: appropriateness • Floor & ceiling effects particularly marked when patient selection restricted to narrow range - physical dimensions 52% floor in severe group - physical role limitations 84% floor in severe group - role limitations 45% ceiling in mild group

  21. Implications Score range floor ceiling

  22. Implications Spectrum of SF-36 scale too limited to detect changes which may occur in pwMS  likely to limit its potential responsiveness  limited usefulness within specific MS populations /settings

  23. Recommendations • Generic measures should be tested for specific populations and for specific purposes • When evaluating health status in MS the SF-36 should be supplemented with other relevant & validated measures to ensure comprehensive & valid measurement

  24. Recommendations Clinicians & researchers should understand the properties of an outcome measure when choosing an instrument and interpreting the information it generates ...the measure you choose is key in determining effectiveness

  25. Properties of Outcome Measures • Clinical • feasibility • Psychometric • reliability • validity • appropriateness • responsiveness

  26. Reliability of gait measurements using CODAmpx30 motion analysis system Veronica Maynard Institute of Health Studies University of Plymouth

  27. Reliability • Reliability refers to the • consistency or repeatability of • a measurement taken under the • same conditions

  28. Factors affecting reliability • instrumental reliability - reliability of measurement device • rater reliability - reliability of rater administering measurement device • response reliability - reliability/stability of variable being measured

  29. Sources of error • Measurement error • difference between a measurement & its true value • Systematic error • bias resulting from one or more processes • Random error

  30. Reliability • 3 broad categories of reliability: • equivalence (reproducibility) • stability (repeatability) • internal consistency (homogeneity)

  31. Types of reliability & how they are determined Reliability Equivalence or Reproducibility Stability or consistency Internal consistency Inter-rater reliability Intra-rater or test-retest reliability Split half reliability & item analysis (Adapted from: Sim & Wright 2000, p.132)

  32. Aim of study • To determine intra-rater • and inter-rater reliability • of gait measurements using • CODA mpx30 motion • analysis system

  33. Reliability studies (I) • Intra-rater reliability study: • 10 healthy subjects • mean age 39.2 (29-52) yrs • 3 recordings • single trained observer

  34. Reliability studies (II) • Inter-rater reliability study: • 19 healthy subjects • mean age 34.4 (20-49) yrs • 3 trained observers

  35. Procedure • self-selected speed • Investigators blind • Points for analysis: • i) initial contact (IC) • ii) mid-stance and (MSt) • iii) mid swing (MSw)

  36. 1) 2) 3) Stick figure illustrations of position of right leg (red) at 1) IC 2) MSt and 3) MSw. Joint angles, moments and powers were determined at these points in the gait cycle.

  37. Procedure (cont) • Spatiotemporal parameters: • walking velocity • duration of stance • duration of swing • Kinematic variables: • hip, knee & ankle angles at IC, MSt & MSw • Kinetic variables: • moments & power at hip, knee, ankle at IC and MSt

  38. Analysis • Sagittal plane data • Bland & Altman methods • Intraclass correlation coefficient (ICC) to determine consistency and agreement among ratings

  39. IC MSt TO MSw Graphical illustration of sagittal plane joint movement of the ankle during a single gait cycle (dorsiflexion positive, plantarflexion negative). IC= Initial contact; MSt = Mid stance; TO = Toe off; MSw = Mid swing

  40. Results (I) • Intra-rater study: • Good agreement for spatio-temporal • Generally low ICC values (ICC < 0.75) for all parameters • Bland & Altman plots reasonable agreement for kinematic data at ankle and knee

  41. Summary of key findings (II) • Inter-rater study: • Generally good agreement for spatio-temporal parameters (ICC > 0.70) • Lower ICC values & wide limits of agreement for kinematic data (especially hip)

  42. 1) 2) Examples of distribution plots from Bland & Altman test for am-pm repeatability showing mean measurements against differences between measurements for ankle range of motion (ºs) at 1) initial contact 2) mid stance.

  43. Factors affecting reliabilty • Errors associated with marker placement • Soft tissue motion • Natural variation in individual gait cycle • Sampling rate

  44. Recommendations • Standard protocol for marker placement • Training of observers • Averaging of min 3 gait cycles (Winter 1984) • Interpret with caution data from single cycle

  45. General Recommendations • Standard protocol • Training • Averaging may be required • Determine level of error • Assess reliability before use in research/clinically • Assess reliability in population under study

  46. Responsiveness S.K. Spooner PhD BSc SRCh Scheme Co-ordinator Podiatry

  47. Properties • Clinical • feasibility • Psychometric • reliability • validity • appropriateness • responsiveness

  48. Responsiveness to Change • HRQOL measures should be responsive to interventions that change HRQOL • Evaluating responsiveness requires assessing HRQOL relative to an external indicator of change

  49. Testing for Responsiveness • Measurement tools should be tested on patients receiving treatment of known efficacy • Capable of detecting treatment effects?

More Related