
CCSSO National Conference on Student Assessment New Orleans, LA June 27, 2014 Strand 11A

Accountability and Students with Disabilities: Assuring Valid Inferences about Teachers and Schools. CCSSO National Conference on Student Assessment New Orleans, LA June 27, 2014 Strand 11A. IES-funded 5-state longitudinal study on student growth  Research on teacher evaluation.


Presentation Transcript


  1. Accountability and Students with Disabilities: Assuring Valid Inferences about Teachers and Schools CCSSO National Conference on Student Assessment New Orleans, LA June 27, 2014 Strand 11A

  2. IES-funded 5-state longitudinal study on student growth  • Research on teacher evaluation • National advisory panel member for NCAASE • NCEO research • Research supported by IES-funded National Research and Development Center on Assessment and Accountability for Special Education • Observations from Massachusetts and discussant remarks

  3. Assuring valid inferences about teachers and schools…

  4. Describing and Using Growth from Students with Disabilities on Summative Assessments Heather Buzick Educational Testing Service Princeton, NJ A portion of this research was funded by the Institute for Education Sciences (Award #R324A120224)

  5. Current research

  6. Motivation and importance • Approximately 14% of students have a diagnosed disability • The majority spend most of their instructional time in the general education classroom* • Approximately 80% of teachers have at least one student with a disability in their classroom** • At least 75% of students with disabilities take the general assessment* • Students’ disabilities can affect access to test content, students may require testing accommodations, and teaching and learning may differ from those of other students • How should students’ test scores be included in accountability systems? *From Historical State-Level IDEA Data Files (http://tadnet.public.tadnet.org/pages/712) **Estimate; sources available from the author

  7. Study 1 Research Questions

  8. Some definitions of growth within individual students • Differences in vertically scaled scores (gains) • E.g., a scaled score of 300 in grade 3 and 320 in grade 4 • Transitions across proficiency levels • E.g., “basic” in grade 3, “proficient” in grade 4 • Student growth percentiles • E.g., from grade 3 to grade 4, the student grew as much as or more than 70 percent of peers in the state who had similar grade 3 test scores
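The three definitions on this slide can be illustrated with a minimal Python sketch. The function names, the four-level proficiency ordering, and the peer scores are hypothetical and chosen only for illustration. The percentile function is a crude stand-in: it ranks the student's current score among peers with similar prior scores, whereas operational student growth percentiles are typically estimated with quantile regression.

```python
# Hypothetical ordering of proficiency levels (labels are assumptions).
LEVELS = ["below basic", "basic", "proficient", "advanced"]


def scale_score_gain(prior_score, current_score):
    """Gain on a vertical scale: current score minus prior score."""
    return current_score - prior_score


def level_transition(prior_level, current_level):
    """Number of proficiency levels moved (positive means upward)."""
    return LEVELS.index(current_level) - LEVELS.index(prior_level)


def simple_growth_percentile(student_current, peer_currents):
    """Percent of academic peers (students with similar prior scores)
    whose current score is at or below this student's current score.

    This is a simplified illustration, not an operational SGP model.
    """
    if not peer_currents:
        raise ValueError("need at least one peer score")
    at_or_below = sum(1 for s in peer_currents if s <= student_current)
    return 100.0 * at_or_below / len(peer_currents)


# Toy usage with made-up numbers.
gain = scale_score_gain(300, 320)
move = level_transition("basic", "proficient")
peers = [300, 305, 310, 312, 315, 318, 319, 325, 330, 340]
pct = simple_growth_percentile(320, peers)
```

With these toy inputs the same student shows a 20-point gain, a one-level upward transition, and a percentile of 70 among the made-up peer group, which shows how the three metrics describe the same change in different terms.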

  9. Conclusions • The model matters for the inferences we make about schools, teachers, and individual students (sometimes in subtle but important ways) • Policymakers: identify the claims they wish to make • Measurement experts: help policymakers understand the meaning derived from a particular model • How much growth is enough? • Norms based on accumulated growth data from multiple sources • Prediction associated with college- and career-ready standards

  10. Accountability Dilemmas for Students with Disabilities and Policy Alternatives Ann Schulte Arizona State University Natalie Murr North Carolina State University Joseph Stevens University of Oregon

  11. National Center on Assessment & Accountability for Special Education • NCAASE www.ncaase.com • Institute of Education Sciences, 2011-2016 • Co-PIs • Stephen Elliott & Ann Schulte, Arizona State Univ • Joseph Stevens & Gerald Tindal (Project Director), Univ of Oregon This work is supported by the Institute of Education Sciences, U.S. Department of Education, through grant R32C110004 awarded to the University of Oregon. The opinions expressed are those of the authors and do not necessarily represent views of the Institute or the U.S. Department of Education.

  12. NCAASE 2011-2016: Our Key Research Questions • What is the natural developmental progress in achievement for students with disabilities? • What models best characterize achievement growth for students with disabilities who are participating in general achievement tests? • How do various growth models represent school effects for students with and without disabilities, and how do results compare to those derived from the status models now in use? • How do results from different types of interim assessments of students’ achievement meaningfully contribute to a model of academic growth for students with disabilities? • How can information about opportunity to learn and achievement growth be used to enhance academic outcomes for students with disabilities?

  13. Persistent Accountability Dilemmas • Bias introduced by including only current students with disabilities in students with disabilities (SWD) subgroup (Ysseldyke & Bielinski, 2002) • “One shot” model of assessing proficiency and SWD performance variability—retests to assure assessment fairness (Wei, 2012) • Students start at differing levels, status measures do not consider student progress relative to starting point—importance of looking at growth to assess school and teacher effects (Dunn & Allen, 2009; Stevens, 2005)

  14. Data Sources for Presentation • North Carolina test data (NCAASE is also looking at AZ, OR, PA) • Cross-sectional: 2010 • Allowed retests for non-proficient students and inclusion of students who had exited special education for two years or less • State-level growth metric: residual gain score using two prior years’ test scores, expressed as a z-score based on the mean gain and SD in the standard-setting year • Longitudinal: Math 2001-2005 cohort, Reading 2003-2007 cohort
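One way to read the growth metric on this slide is sketched below in plain Python: predict the current score from the two prior years' scores with ordinary least squares, take observed minus predicted as the residual gain, and then standardize against a reference mean and SD. This is an assumption-laden illustration, not the state's operational procedure. The function names, the toy scores, and the reference values passed to `standardize` are all hypothetical.

```python
def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    n = 3
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_two_prior_years(prior1, prior2, current):
    """OLS fit of current = b0 + b1*prior1 + b2*prior2 via the normal equations."""
    rows = [(1.0, p1, p2) for p1, p2 in zip(prior1, prior2)]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, current)) for i in range(3)]
    return solve3(xtx, xty)


def residual_gains(prior1, prior2, current, coefs):
    """Observed current score minus the score predicted from two prior years."""
    b0, b1, b2 = coefs
    return [y - (b0 + b1 * p1 + b2 * p2)
            for p1, p2, y in zip(prior1, prior2, current)]


def standardize(values, ref_mean, ref_sd):
    """Express values as z-scores against reference-year mean and SD (assumed inputs)."""
    return [(v - ref_mean) / ref_sd for v in values]


# Toy usage: a student predicted to score 280 who actually scores 285.
toy_residual = residual_gains([300], [300], [285.0], (10.0, 0.5, 0.4))
toy_z = standardize(toy_residual, ref_mean=0.0, ref_sd=10.0)
```

In this toy case the student scores 5 points above prediction, which is half a reference SD. A positive z-score means the student scored higher than expected given the two prior scores, and a negative one means lower than expected.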

  15. Impact of Two Specific Policies • Including students who have exited special education • Allowing retesting for students who do not reach proficiency
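How the two policies change a subgroup's percent proficient can be seen in a small Python sketch. The `Student` fields, the function name, and the roster are hypothetical toy values, not data or results from the study. The two-year exit window mirrors the rule described on the data sources slide.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Student:
    current_swd: bool                 # currently receiving special education
    years_since_exit: Optional[int]   # None if the student never exited
    proficient_first: bool            # proficient on the first administration
    proficient_retest: bool = False   # proficient on a retest, if one was taken


def swd_percent_proficient(students, include_exiters=False,
                           allow_retest=False, max_exit_years=2):
    """Percent proficient in the SWD subgroup under the chosen policies."""
    def in_subgroup(s):
        if s.current_swd:
            return True
        return (include_exiters
                and s.years_since_exit is not None
                and s.years_since_exit <= max_exit_years)

    group = [s for s in students if in_subgroup(s)]
    if not group:
        return None
    n_prof = sum(1 for s in group
                 if s.proficient_first or (allow_retest and s.proficient_retest))
    return 100.0 * n_prof / len(group)


# Made-up roster for illustration only.
roster = [
    Student(True, None, True),
    Student(True, None, False, proficient_retest=True),
    Student(True, None, False),
    Student(True, None, False),
    Student(False, 1, True),      # exited one year ago
    Student(False, 3, True),      # exited three years ago, outside the window
    Student(False, None, True),   # never in special education
]

baseline = swd_percent_proficient(roster)
with_exiters = swd_percent_proficient(roster, include_exiters=True)
with_retest = swd_percent_proficient(roster, allow_retest=True)
with_both = swd_percent_proficient(roster, include_exiters=True, allow_retest=True)
```

On this made-up roster the subgroup rate is 25% at baseline, 40% with recent exiters included, 50% with retests counted, and 60% with both. No student's achievement changes between the four figures; only the membership and scoring rules do.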

  16. Stable Subgroup Membership Matters Mathematics Achievement Gap

  17. Change in Mean Number of Students Reaching Proficiency

  18. Change in School-level Percent Proficient for SWD w/ Exiters Included

  19. SWDs Reaching Math Proficiency With and Without Retest

  20. SWDs Reaching Reading Proficiency With and Without Retest

  21. Growth vs. Proficiency • What does growth across grades look like for specific exceptionalities? • Relationship between status and growth for students with and without disabilities

  22. Mathematics Growth by Exceptionality

  23. Mathematics Growth by Exceptionality

  24. Mathematics Growth by Exceptionality

  25. Mathematics Growth by Exceptionality

  26. Reading Growth by Exceptionality

  27. Reading Growth by Exceptionality

  28. Reading Growth by Exceptionality

  29. Reading Growth by Exceptionality

  30. Growth by Starting Proficiency Level-Math General Ed SWD

  31. Growth by Starting Proficiency Level-Rdg General Ed SWD

  32. Conclusions • The SWD subgroup is not stable, and policy changes allowing a longer time to “count” in the subgroup improve school SWD outcomes • Retesting benefits SWDs and is likely to benefit other groups characterized by large achievement gaps • SWDs show growth in mathematics and reading achievement across grades, although improvement may not be reflected in changes in status (non-proficient/proficient) • Large differences in starting-point achievement skills within the SWD group, smaller differences in growth

  33. Accountability and Students with Disabilities: Assuring valid inferences about teachers and schools Jim Ysseldyke

  34. Purposes of Monitoring Student Growth • District/State Accountability • Individual Progress Monitoring/instructional planning • Teacher evaluation (value added)

  35. Typical Accountability Models for SWD • Cross-sectional • Cohort Static • Cohort Dynamic

  36. Typical Scores • Scaled Scores • Proficiency Levels • Effect Sizes • More recently, Student Growth Percentiles (à la Betebenner) or Student Deciles in some of our work

  37. Main Issues • Reducing achievement gap (GE v SE) • Nobody wants SWD in their accountability profile • How long should SWD count? • What model should be used? • What scores should be used?

  38. Major Points I Heard • Students start at differing levels, so status measures do not consider student progress relative to starting point • Use of cross-sectional models is dangerous • SWD are growing, but many may not meet proficiency standards

  39. Major Points I Heard • We have limited data on growth norms for SWD (small Ns) • Much concern about how long to count SWD (at district or state level).

  40. Achievement Gap Using 3 Analytic Methods over 6 Years

  41. Spring-Fall SGP Growth by Category v National Norms

  42. STAR Reading Growth for Grade 10 SED Students

  43. Spring to Fall SGP for Students with SED in Differing Programs
