1 / 37

What Are We Measuring?  The Use of Formative and Summative Assessments

What Are We Measuring?  The Use of Formative and Summative Assessments. Laura Goe, Ph.D. Research Scientist, ETS Principal Investigator for Research and Dissemination, National Comprehensive Center for Teacher Quality. 2 nd Teacher Leader Institute

quito
Download Presentation

What Are We Measuring?  The Use of Formative and Summative Assessments

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. What Are We Measuring?  The Use of Formative and Summative Assessments Laura Goe, Ph.D. Research Scientist, ETS Principal Investigator for Research and Dissemination, National Comprehensive Center for Teacher Quality 2nd Teacher Leader Institute Sponsored by the Delaware Academy for School Leadership College of Education and Human Development at the University of Delaware Camden, DE  June 19, 2012

  2. Laura Goe, Ph.D. • Former teacher in rural & urban schools • Special education (7th & 8th grade, Tunica, MS) • Language arts (7th grade, Memphis, TN) • Graduate of UC Berkeley’s Policy, Organizations, Measurement & Evaluation doctoral program • Principal Investigator for the National Comprehensive Center for Teacher Quality • Research Scientist in the Performance Research Group at ETS

  3. The goal of teacher evaluation

  4. An aligned teacher evaluation system: Part I

  5. An aligned teacher evaluation system: Part II

  6. A well-aligned evaluation system (Goe et al., 2012) • In a well-aligned system, evidence of practice as it relates to high-quality teaching standards will: • Form the basis for a professional growth plan • Give structure and consistency to coaching and mentoring by providing the basis for shared expectations and a common language, and possibly suggesting a direction for development • Provide a diagnostic approach to understanding inadequate student learning growth (i.e., determining which standards are not being met and considering how they might relate to student outcomes) • Offer a set of criteria to help principals, consulting teachers, mentors, and others identify areas in which teachers are successful and areas for improvement

  7. Six components in an aligned teacher evaluation/professional growth system • High-quality standards for instruction • Multiple standards-based measures of teacher effectiveness • High-quality training on standards, tools, and measures • Trained individuals to interpret results and make professional development recommendations • High-quality professional growth opportunities for individuals and groups of teachers • High-quality standards for professional learning

  8. Chicago Study (Sartain et al., 2011) • “The work in Chicago and across the country to improve evaluation was motivated by two main factors. First, evaluation systems were failing to give teachers either meaningful feedback on their instructional practices or guidance about what is expected of them in the classroom. Second, traditional teacher evaluation systems were not differentiating among the best teachers, good teachers, and poor teachers.” (p. 1, italics added)

  9. Summative vs. Formative Assessment: Students • Summative assessment measures students’ knowledge and skills at a particular point in time • Frequently used for accountability purposes • Formative assessment takes place before or during the instruction • Used for eliciting evidence about students’ knowledge and skills to develop a better understanding of current learning progress and to make adjustments (Black et al, 2003) • Minute-to-minute (a class poll), after a lesson to check understanding (exit tickets), or after a unit to determine what students know prior to moving to next unit (unit test)

  10. Summative vs. Formative Assessment: Teachers • Summative assessment combines evidence from multiple measures according to the state or district weighting formula and gives teachers a score at the end of the year • Evidence may be collected throughout the year, and may be used for formative purposes as well • Formative assessment takes place throughout teachers’ careers, focusing on classroom practice and/or artifacts of teaching • Considers evidence about teachers’ knowledge and skills that can be used to improve current practice

  11. Can we use both formative and summative assessment to evaluate teachers? • Popham (1998): “fixing” (formative) the teacher vs. “firing” (summative) the teacher • Milanowski (2005): Study with new teachers found “…no major differences between the groups in terms of openness to discussion of difficulties, reception and acceptance of performance feedback, stress, turnover intentions, actual turnover, or performance improvement.” (p. 153)

  12. Multiple measures of teacher effectiveness • Evidence of growth in student learning and competency • Standardized tests, pre/post tests in untested subjects • Student performance (art, music, etc.) • Curriculum-based tests given in a standardized manner • Classroom-based tests such as DIBELS • Evidence of instructional quality • Classroom observations • Lesson plans, assignments, and student work • Student surveys such as Harvard’s Tripod • Evidence binder (next generation of portfolio) • Evidence of professional responsibility • Administrator/supervisor reports, parent surveys • Teacher reflection and self-reports, records of contributions

  13. Questions to ask about each measure used • How will using this measure in the teacher evaluation system impact teaching and learning in classrooms and schools? • How will the use of this measure look different in low-capacity vs. high-capacity schools? • How will reporting on results from this measure be done (to provide actionable information to teachers, principals, schools, districts, teacher preparation programs, and the state)? • How will we know if this measure is working as we intended?

  14. Measures that may contribute to teachers’ professional growth • Measures which include protocols and processes that teachers can examine and comprehend • Measures that are directly and explicitly aligned with teaching standards • Measures that motivate teachers to examine their own practice against specific standards • Measures that allow teachers to participate in or co-construct the evaluation (such as portfolios) • Measures that give teachers opportunities to discuss the results for formative purposes with evaluators, administrators, teacher learning communities, mentors, coaches, etc. • Measures that are aligned with and used to inform professional growth and development offerings

  15. Why teachers generally value observations • Observations are the traditional measure of teacher performance • Teachers feel they have some control over the process and outcomes • They report that having a conversation with the observation and receiving constructive feedback after the observation is greatly beneficial • Evidence-centered discussions can help teachers improve instruction • Peer evaluators often report that they learn new teaching techniques

  16. When teachers don’t value observations, it’s because… • They do not receive feedback at all • The feedback they receive is not specific and actionable • The observer suggests actions but is unable to offer the means and resources to carry out those actions • Mentors/coaches, other support personnel • Time for individual growth planning/activities • Protected time for collaboration with others

  17. Value-added models Many variations on value-added models TVAAS (Sander’s original model) typically uses 3+ years of prior test scores to predict the next score for a student Used since the 1990’s for teachers in Tennessee, but not for high-stakes evaluation purposes There are other models that use less student data to make predictions Considerable variation in “controls” used (demographics, school effects)

  18. Achievement Proficient Teacher A: “Success” on Ach. Levels Teacher B: “Failure” on Ach. Levels Start of School Year End of Year Growth vs. Proficiency Models In terms of growth, Teachers A and B areperforming equally Slide courtesy of Doug Harris, Ph.D, University of Wisconsin-Madison

  19. Achievement Proficient Teacher A Teacher B Start of School Year End of Year Growth vs. Proficiency Models (2) A teacher with low-proficiency students can still be high in terms of GROWTH (and vice versa) Slide courtesy of Doug Harris, Ph.D, University of Wisconsin-Madison

  20. What value-added and growth models cannot tell you • Value-added and growth models are really measuring classroom, not teacher, effects • Value-added models can’t tell you why a particular teacher’s students are scoring higher than expected • Maybe the teacher is focusing instruction narrowly on test content • Or maybe the teacher is offering a rich, engaging curriculum that fosters deep student learning. • How the teacher is achieving results matters!

  21. Recommendation from NBPTS Task Force (Linn et al., 2011) • Recommendation 2: Employ measures of student learning explicitly aligned with the elements of curriculum for which the teachers are responsible. This recommendation emphasizes the importance of ensuring that teachers are evaluated for what they are teaching.

  22. Measuring teachers’ contributions to student learning growth: A summary of current models

  23. School-wide VAM illustration for middle-school Tested subject teachers receive their own value-added score while non-tested subject teachers receive a school-wide average for their value-added score

  24. Differentiating among teachers • “It is nearly impossible to discover and act on performance differences among teachers when documented records show them all to be the same.” (Glazerman et al., 2011, pg 1)

  25. Using student learning outcomes to inform teacher professional growth

  26. Computerized adaptive assessments • Unlike end-of-year paper-and pencil standardized tests, computerized adaptive assessments (CAAs) can be used at the beginning and end of the year to measure growth (and in the middle, too, as a way to establish progress) • If used in this way, the results can be useful to teachers in adjusting their instruction • The process of analyzing data from this type of assessment requires connecting student results to specific instructional content and practices

  27. Professional growth through examining student learning results • Throughout the year teachers have an opportunity to collect evidence on student learning, analyze it, and adjust content or teaching practice • Test results are most informative when combined with other evidence on student knowledge and skills • Tests may cover only half the standards • Multiple choice doesn’t allow students to demonstrate other types of skills (writing, synthesizing, constructing an argument, presenting)

  28. The 4 Ps (Projects, Performances, Products, Portfolios) • Yes, they can be used to demonstrate teachers’ contributions to student learning growth • Here’s the basic approach • Use a high-quality rubric to judge initial knowledge and skills required for mastery of the standard(s) • Use the same rubric to judge knowledge and skills at the end of a specific time period (unit, grading period, semester, year, etc.)

  29. Interpreting results for alignment with teacher professional learning options • Different approach; not looking at “absolute gains” • Requires ability to determine and/or link student outcomes to what likely happened instructionally • Requires ability to “diagnose” instruction and recommend/and or provide appropriate professional growth opportunities • Individual coaching/feedback on instruction • Observing “master teachers” • Group professional development (when several teachers have similar needs)

  30. Feedback (Coggshall et al., 2012) • “Feedback on whether or not instructional practices are working can come in the form of student learning data, the teachers’ own observations of student engagement, observations from a peer or a coach, a video-taped record of the practice, discussion within a professional learning community, or the results of a formal evaluation.” (p. 6)

  31. Memphis professional development system • Teaching and Learning Academy began April ‘96 • Nationally commended program intended to • “…provide a collegial place for teachers, teacher leaders and administrators to meet, study, and discuss application and implementation of learning…to impact student growth and development” • Practitioners propose and develop courses • Responsive to school/district evaluation results • Offerings must be aligned with NSDC standards • ~336 On-line and in-person courses, many topics

  32. Questions about implementation • What steps will you take to ensure that evaluation feedback given to teachers will impact teaching and learning in classrooms and schools? • How will the evaluation/feedback cycle look different in isolated rural schools? Hard-to-staff urban schools? High-performing schools? • How will we know if evaluation system (including the feedback cycle) is working as we intended?

  33. Final thoughts • The limitations: • There are no perfect measures • There are no perfect models • Changing the culture of evaluation is hard work • The opportunities: • Evidence can be used to trigger support for struggling teachers and acknowledge effective ones • Multiple sources of evidence can provide powerful information to improve teaching and learning • Evidence is more valid than “judgment” and provides better information for teachers to improve practice

  34. Resources • Memphis Professional Development System • Main site: http://www.mcsk12.net/admin/tlapages/academyhome.asp • PD Catalog: http://www.mcsk12.net/aoti/pd/docs/PD%20Catalog%20Spring%202011lr.pdf • Individualized Professional Development Resource Book: http://www.mcsk12.net/aoti/pd/docs/Individualized%20Growth%20Resource%20Book.pdf

  35. References • Black, P., Harrison, C., Lee, C., Marshall, B., & Wiliam, D. (2003). Assessment for Learning: Putting it into Practice. Buckingham, UK: Open University Press. • http://www.amazon.com/Assessment-Learning-Putting-into-Practice/dp/0335212972 • Coggshall, J. G., Rasmussen, C., Colton, A., Milton, J., & Jacques, C. (2012). Generating teaching effectiveness: The role of job-embedded professional learning in teacher evaluation. Washington, DC: National Comprehensive Center for Teacher Quality. • http://www.tqsource.org/publications/GeneratingTeachingEffectiveness.pdf • Glazerman, S., Goldhaber, D., Loeb, S., Raudenbush, S., Staiger, D. O., & Whitehurst, G. J. (2011). Passing muster: Evaluating evaluation systems. Washington, DC: Brown Center on Education Policy at Brookings. • http://www.brookings.edu/reports/2011/0426_evaluating_teachers.aspx# • Goe, L., Biggers, K., & Croft, A. (2012). Linking teacher evaluation to professional development: Focusing on improving teaching and learning. Washington, DC: National Comprehensive Center for Teacher Quality. • http://www.tqsource.org/publications/LinkingTeacherEval.pdf • Linn, R., Bond, L., Darling-Hammond, L., Harris, D., Hess, F., & Shulman, L. (2011). Student learning, student achievement: How do teachers measure up? Arlington, VA: National Board for Professional Teaching Standards. • http://www.nbpts.org/index.cfm?t=downloader.cfm&id=1305

  36. References (cont’d) • Milanowski, A. (2005). Split roles in performance evaluation: A field study involving new teachers. Journal of Personnel Evaluation in Education, 18(3), 153-169. • http://www.springerlink.com/content/65817gu562k25482/fulltext.pdf • Polikoff, M. S. (2011). How well aligned are state assessments of student achievement with state content standards? American Educational Research Journal, 48(4), 965-995. • http://aer.sagepub.com/content/48/4/965.abstract?rss=1 • Popham, W. J. (1998). The dysfunctional marriage of formative and summative teacher evaluation. Journal of Personnel Evaluation in Education(1), 269-273. • https://springerlink3.metapress.com/content/pr2171452m314w21/ • Sartain, L., Stoelinga, S. R., & Brown, E. R. (2011). Rethinking teacher evaluation in Chicago: Lessons learned from classroom observations, principal-teacher conferences, and district implementation. Chicago: Consortium on Chicago School Research at the University of Chicago. • http://ccsr.uchicago.edu/sites/default/files/publications/Teacher%20Eval%20Report%20FINAL.pdf

  37. Laura Goe, Ph.D. 609-619-1648 lgoe@ets.org www.lauragoe.com https://twitter.com/GoeLaura National Comprehensive Center for Teacher Quality 1000 Thomas Jefferson Street, NWWashington, D.C. 20007 www.tqsource.org

More Related