Monitoring and program evaluation

Monitoring and program evaluation Hossein Naraghi CE 590 Special Topics Safety June 2003 Time spent:9 hrs

The need fro monitoring • Monitoring is systematic collection of data about the performance of road safety treatments after their implementation • The effectiveness of treatment can be assessed • Post-implementation monitoring is essential to ascertain the effects of a treatment • Improve the accuracy and confidence of predictions of that treatment’s effectiveness in subsequent applications

The need fro monitoring (continued) • Monitoring is important to ensure that a particular treatment has not led to a significant increase in accidents • The road safety engineer has a duty to ensure that the public does not experiencing additional hazard as a result of treatments • The Institution of Highways and Transportation defines the purpose of monitoring as follow • Assess the effects of crash occurrence in relation to safety objectives

The need fro monitoring (continued) • Assess the effects on distribution of traffic and speeds of motor vehicle • Call attention to any unintended effects on traffic movement or accident occurrence • Assess the effect of treatment on the local environment • Find out about the public response to the treatment in terms of its acceptability in general and people’s concern about safety in particular

The need fro monitoring (continued) • The County Surveyors’ Society 1991 suggests three ways for monitoring a site • Pay careful attention to a site immediately after treatment • In case things go badly wrong • Assess the effect over a longer time period • About three years to determine the influence of treatment on crashes or other performance measures • This needs careful statistical analysis to correct for external factors

The need fro monitoring (continued) • Focus on the accident types which the treatment was intended to correct • Assess whether these have in fact declined • Monitoring and evaluation is meaningful if • there has been a clear statement of objectives of the treatment • A prediction of its effects • A logical link between treatment and its effects • Monitoring should apply to all accident investigation and prevention work

The need fro monitoring (continued) • Road safety treatments potentially affect the following parameters which need to be monitored • The number and type of crashes • The severity of crashes • The distribution of crashes over the road network • Traffic flows and travel times • Turning movements and delays at intersections • Access times and distances within residential areas • Route taken by motorists, cyclists and pedestrians • Operation of buses

The need fro monitoring (continued) • A comprehensive monitoring practice should involve all these effects • Since crashes are relatively rare event, it may take a very long time to acquire a statistically reliable sample, which makes monitoring difficult • This can be partially overcome by the use of proxy measures such as • Traffic conflict measures • Indirect measures • Insurance company claim record • Emergency service records • Ambulance, hospital admission • Tow truck records

The need fro monitoring (continued) • Resources devoted to monitoring in most agencies are limited • Resources should be devoted directly to development and implementation of treatments which have been prioritized and shown to have potential for crash reduction rather than monitoring exercises • We should admit that our understanding of safety effectiveness of road safety engineering treatments is limited and in many cases rests on shaky foundation

The need fro monitoring (continued) • This point comprehensively argued by Hauer 1988, who says that • “the level of safety built into roads is largely unpremeditated. Standards and practices have evolved without a foundation of knowledge. At times the safety consequences of engineering decisions are not known, at others some knowledge exists but not used.”

Monitoring techniques • The essence of monitoring is to measure what is happening in the real world. There are several experimental challenges in doing this • There may be changes in road environment • Change in speed limit • Change in traffic flow • Change in abutting land uses • Change in traffic control • Since crashes are rare and randomly occurring events, there will be year by year fluctuations which have nothing to do with the treatment being analyzed

Monitoring techniques (continued) • It is necessary to monitor all significant factors which might affect the outcome • If two variables are systematically related and both are measured, it will not be possible to reliably isolate their independent effects • Statistical correlation does not necessary imply logical correlation • Ensure a linkage between the treatment being monitored and the change in performance measure • Seasonal factors must be taken into account

Monitoring techniques (continued) • It would be incorrect to compare the summer (before) accident record with the winter (after) accident record if one was trying to assess the effect of skid-resistance • Accident reporting levels may change over time, and there may be inconsistencies in accident data which need to be considered • There may be a long term trend in crash occurrence, therefore changes over time in the rate of crashes at a site may merely reflect global trends • It is usually necessary to use some form of control group and compare crashes at the site with those at the control site

Monitoring techniques (continued) • There are four ways that the evaluation of the effect of a road safety treatment can be done • Controlled experimentation • All other factors held constant except the factor whose effect is being investigated • This approach is rarely applicable in road safety engineering, because in the real world it is not possible to hold everything constant • Before and after studies • Comparisons using control sites • Time trend comparisons

Before and after studies • Is the simplest and least satisfactory method because of the lack of control of extraneous factors and essentially involves • Determining in advance the relevant objectives (e.g. accident types intended to be affected) and the corresponding evaluation criteria (e.g. accident frequency, accident rate) • Monitoring the site to obtain numerical values for these criteria before and after treatment • Comparing the before and after results • Considering whether there are other plausible explanations for the change and correct them

Before and after studies (continued) • Any before and after study, usually relies on pre-existing data • This underlies the need for systematic on-going data collection, so the effect of changes can be monitored routinely • Statistical analysis of data should be carefully undertaken with regard to accuracy of data • It will be helpful to consider more than just changes in accident frequency of the particular accident type • It may also be useful to check the changes in the 85 percentile values, variance, skew, etc

Before and after studies (continued) • For any formal before and after study to be statistically valid, a reasonable period of time must be considered to obtain a sufficient sample • While one year may be considered a minimum analysis period, three years is generally regarded as a reasonable period for trends to be established and a sufficiently large sample obtained • Nicholson (1987) recommended 5 years for statistical confidence

Comparisons using control sites • The problem of before and after study is that it takes no account for changes across the network as a whole • This can be overcome by using the control sites • There are two variations using control sites • Using control group which are randomly selected • A controlled experiment by selecting several candidate sites foe a particular treatment in advance

Comparisons using control sites (continued) • Then they are randomly split into two groups • All sites in the first group and no sites in the second group are treated • The objective here is to make control and treatment groups equal in all factors except for the execution of treatment • Two groups do not need to be of equal size, but they must satisfy sample size requirements • This method is powerful as an investigation tool • It is of limited validity for most applications in roads safety engineering • There will barely be the opportunity to conduct a controlled experiment of this nature

Comparisons using control sites (continued) • Using selected comparison groups • Involves a before and after study • The result of before and after study will be compared with the result of the control site • The process involves: • Specifying the objectives in advance • Identifying a set of control sites • No remedial work have been or intended to be introduced • Monitoring both the treated and control sites before and after treatment to obtain numerical values

Comparisons using control sites (continued) • Comparing the before and after results at both the treated and control sites • Considering if there other plausible explanations for the changes and correct them if possible • Selection of control sites is very important in this process and should satisfy the following criteria: • Be similar to treated sites in general characteristics • Network configuration • Geometric standards • Land use

Comparisons using control sites (continued) • Socio-economic characteristics • Enforcement practices, etc • Be geographically close • Have similar traffic flows • Not affected by treatment at the test site • Not be treated in any way in the period of before and after study • Have crash record which are consistent in both collection criteria and coding covering the period of study • A useful device is simply graph the number of crash after against the number crash before treatment at both test & control sites (Figure 17.1 page 444)

Time trend comparison • This method involves the development of a model to estimate the trend of accident over time which involves the following process: • Identifying the objectives in advance and the corresponding analysis criteria • Obtaining data on each criteria for an extended period of time • Developing a model based on the before period • Comparing projections based upon the model for after period with the measured criteria for that period • Identifying if there are other plausible explanations for the changes and correct them

Time trend comparison (continued) • This method is useful where substantial countermeasure has been produced at a given point in time • Limited application in road safety engineering, since it is very difficult to control for all variables in real world analysis • Analytical power of this approach has been much extended by the development of log-linear models

Analysis of accident statistics • Three main applications of statistical testing in the area of road safety engineering are as follow: • Comparison of crash frequencies • A chi-squared test is suitable • Or a paired t-test if the distribution of crashes can be assumed to be normal • Comparison of crash rates • A paired t-test is appropriate • Comparison of proportions • A z-test is suitable (See table 17.1 page 447)

Analysis of accident statistics (continued) • Poisson distribution is a very simple test for calculating probabilities • Can be used in determining whether a specific crash frequency is within the bounds of normal year by year fluctuations • Methodological issues • There are four important methodological issues • Regression to the mean • Accident migration • Risk compensation • Sample size determination

Regression to mean • Over a period of time, if there are no changes in physical or traffic characteristics at a site, annual crashes at that site tend to fluctuate about the mean value due to the random nature of crash occurrence • Since the selected sites for treatment are based on their ranking in number of crashes compare to all other sites, there is a high possibility that sites will be chosen when their crash count is higher than long term average • The crash rates at these sites is likely to experience a lower rate in the following year even without implementing any treatment

Regression to mean (continued) • This aspect of crash experience need to be concerned in after implementation analysis of a safety treatment • Since this phenomenon is present, the impact of treatment will be exaggerated • Regression to mean may exaggerate the effect of a treatment from 5-30 percent • Since our knowledge of safety effects of treatment built up from the results of this kind of studies, therefore there is a tendency to over-state the effectiveness of road and traffic engineering treatment

Regression to mean (continued) • This sometimes called “bias by selection” (Hauer 1980) • Analysts are responsible to separate the real benefits from a particular treatment from the changes due to regression to mean phenomenon • The problem can be substantially minimized by increasing the number of years of data used in the site selection process • This does not solve the problem entirely • It is not always expedient to wait for several years before conducting an evaluation exercise

Regression to mean (continued) • To correct for regression to mean phenomenon, the true underlying accident rate should be estimated • There are two common approaches • Model the accident situation in order to estimate the true underlying accident rate and then based the evaluation on the model not the raw accident data • Multi-variate modeling approach developed by Hauer (1983, 1992) • Adjust the data to correct for biases using assumptions about the statistical distribution of accidents year by year

Regression to mean (continued) • Multi-variate modeling approach which extends the Empirical Bayes model to allow “unsafety” to be estimated when a large reference population does not exist • The model can be described as follows: • If XB and XA are respectively the accident frequencies observed before and after treatment at a site which prior to treatment had an underlying mean accident frequency m, then the treatment effect, t is given by: t = XA / m And the regression to the mean effect, r by: r = m / XB

Regression to mean (continued) • If regression to the mean effects are ignored it is assumed that m = XB however, rather than using data for study site itself to estimate m, the Empirical Bayes approach uses the following expression m = a + bxB

Accident migration • The assumption here is that accidents may increase at sites surrounding the treated site due to changes in trip pattern or driver’s assessment of risk • A study in a sample of sites in London show that accident at the treated sites fell by 22%, but accident in the surrounding street increased by 10% • The effect of remedial measure for this phenomenon is to relocate the accidents not to reduce them • A statistical explanation for this phenomenon showing that there is a spatial correlation between accident frequencies at adjacent or nearby sites, so use of neighboring sites as control sites leads to bias

Risk compensation • Road users adjust their behavior as they perceive the road system • One factor that affect the behavior is perception of risk • If the the road is perceived as more hazardous then drivers may respond accordingly • Reducing speed in icy condition • Some of the additional road safety provided as a result of road safety treatment is used up by drivers behaving in a more risky manner • To make sense of the risk notion in the context of road safety engineering, it is important to distinguish between objective and subjective risks

Risk compensation (continued) • Objective risk • Perceived risk • Subjective risk • What affects behavior • A road safety treatment may • Reduce the objective risk and increase the subjective risk • e.g. set of traffic signals both alerts drivers to the hazard presented by the intersection and moderating the hazard by separating conflicting streams of traffic • Increase subjective risk alone • e.g. warning sign effectiveness depends entirely on driver response

Risk compensation (continued) • Reduce the objective risk alone • e.g. skid resistance pavements are not usually discernible to the driver • Reduce both objective risk and subjective risk • e.g. improved road geometry, improved sight distance, grade separation, etc • It is only in this category that risk compensation could be a factor, in other categories there is either no change in subjective risk or an increase in it • Any road design change which reduces the subjective risk should also reduce the objective risk to at least the same extent, otherwise the road user will have a tendency to respond inappropriately

Sample size determination • The smaller the change in accident at any site, the larger is the sample necessary to determine the statistical significance • In evaluating a countermeasure, the analyst must use either a longer time period or a larger number of sites • The sample size required depends on • The effect that analyst seek to detect • e.g. whether the treatment is expected to decrease accidents by 10%, 20%, 50% or so on • The probability of detecting a real effect • The level of significance

Sample size determination (continued) • All feasible combinations of these three factors will produce a multitude of outcomes (see table 17.2 page 463)

Monitoring and program evaluation

Monitoring and program evaluation

Presentation Transcript

Population Policy and Program Monitoring and Evaluation

Monitoring and Evaluation

Monitoring and Evaluation

Monitoring and Evaluation

Monitoring and Evaluation

Monitoring and Evaluation

Population Policy and Program Monitoring and Evaluation

Monitoring and Evaluation

Population Policy and Program Monitoring and Evaluation

Population Policy and Program Monitoring and Evaluation

Monitoring Evaluation and Research program

Monitoring and Evaluation

Monitoring and Evaluation

MONITORING AND EVALUATION TOOLS Monitoring and Evaluation

Monitoring and evaluation

INTRODUCTION TO MONITORING AND EVALUATION Monitoring and Evaluation

Monitoring and Evaluation

Monitoring and Evaluation

MONITORING AND EVALUATION TOOLS Monitoring and Evaluation

Monitoring and Evaluation

Monitoring and Evaluation in SIYB Program

INTRODUCTION TO MONITORING AND EVALUATION Monitoring and Evaluation