Department of Health and Human Services   National Strategy for Suicide Prevention  
       
 
Contemplating Suicide? 1-800-273-TALK

NSSP Features

Suicide Facts

National Strategy for Suicide Prevention

Suicide: Cost to the Nation

Federal, State, and Private Activities

Around the World

The Newsroom/Media

Policy & Legislation

Resources for Researchers and Program Evaluators

Data Collection Systems & Interactive Databases

Upcoming Events

Funding Opportunities

National Strategy for Suicide Prevention Indicators
Want to Help the NSSP?

Site Map

NSSP Homepage

Disclaimer

E-mail to a Friend E-mail icon

 
RESOURCES FOR RESEARCHERS AND PROGRAM EVALUATORS

Six Steps to Program Evaluation

There are six basic steps to program evaluation, all of which are related. The first steps provide a foundation for the later ones. The steps (Box 3) are: engage stakeholders, describe the program, focus the evaluation design, gather credible evidence, justify conclusions, and ensure use and share lessons learned.

Summary of the Six Steps to Program Evaluation

Engage stakeholders

Most suicide prevention efforts involve partners, all of whom are stakeholders in the effort. Stakeholders must be involved in the evaluation; without their involvement, an evaluation might miss important parts of a program. Three groups of stakeholders are important:

  • Those involved in program operations (e.g., sponsors, collaborators, coalition partners, funding officials, administrators, managers, and staff)
  • Those served or affected by the program (e.g., clients, family members, neighborhood organizations, academic institutions, elected officials, advocacy groups, professional associations, skeptics, opponents, and staff of related or competing organizations)
  • Those who will be the primary consumers of information produced by the evaluation
  • Stakeholder involvement is different for each program evaluation. The objective of the evaluation will help define which stakeholders to involve. Without agreement from stakeholders, the evaluation may be of limited use.

Describe the program

The program description details the mission and objectives of the program. Descriptions should help evaluators understand the program goals and strategies. The description should discuss how the program works to effect change, describe the program’s stage of development, and show how the program fits into the larger organization and community. Stakeholders should review and agree with the program description. Program descriptions will vary for each evaluation.

Aspects to include in a program description are a statement of need, expected effects, activities, resources, stage of development, context, and a logic model. A statement of need describes the problem that the program addresses. Expected effects are what the program must do to be successful. Program activities are what the program does to effect change. Resources include the time, talent, technology, equipment, information, money, and other assets available to conduct program activities. The program’s stage of development reflects its maturity. The context should describe the setting within which the program operates. Some programs have used a logic model as a planning tool to outline resources and to describe the order of the program activities and associated outcomes. Logic models require a planned, sequenced thought process to design a program.

Focus the evaluation design

The evaluation must focus on issues of greatest concern to stakeholders. A funder may be concerned about efficient use of resources. The designer of an intervention may be concerned about fidelity of delivery. Most evaluations have limited time and resources. A good evaluation design is directly related to knowing its intended uses, that is, whether it is designed to demonstrate effectiveness to policymakers, acquire resources, or some other type of use. Significant items to consider in focusing an evaluation are purpose, users, uses, questions, methods, and agreements.

Describing the purpose of the evaluation (i.e., its intent) will guide how it should be conducted. The stage of a program’s development will define the purpose of the evaluation. For example, a program just starting may focus its evaluation on perceptions of the use of the materials being developed. A more developed program may attempt to conduct an outcome evaluation to see if the program is effective. The context of the program also affects the purpose. Evaluations of prevention programs have four general purposes. The first is to gain insight: We try to learn if doing something makes sense. A second purpose for program evaluation is to change practice. A third purpose is to assess the effects of the program. The fourth purpose is to effect change in the persons participating in the management and delivery of the program. Participating in an evaluation can provide insight for program staff that can be a catalyst for self-directed change.

Users are the persons who will receive evaluation findings, such as the program manager, funders, and other stakeholders. Intended users should participate in choosing the evaluation focus.

Uses are the ways in which information from the evaluation will be applied, as in deciding to continue funding or expand an effort or determining if program procedures should be modified for better delivery. Stakeholders should help identify, plan, and prioritize uses in advance of implementing the evaluation process. The program’s stage of development and current context should also be considered when identifying uses.

Evaluation questions clarify what aspects of the program will be addressed. Creating evaluation questions identifies what the evaluation should answer. The questions will suggest how and what information should be gathered to enable adequate answers.

The methods for an evaluation refer to scientific research options. Methods for suicide prevention evaluations are generally developed in the social, behavioral, and health sciences and include experimental, quasi-experimental, and observational designs. Choosing the best design depends on the circumstances. The method should enable the evaluation to address stakeholders’ questions. Experimental designs randomly assign persons to intervention and non-intervention groups. By using random assignment, the two groups are usually very similar, with the exception that one receives the intervention and the other receives something else. When the intervention is complete, the differences we see in the two groups in the outcome of interest is likely caused by the difference made by the intervention. Quasi-experimental methods are similar to experimental, except that the groups are not similar because they are not randomly assigned. This type of method is used when randomization is not feasible. Observational methods tend to be descriptive and attempt to understand differences, similarities, and processes within a group. Monitoring a sample of telephone calls on a crisis telephone line and describing how well procedures are being followed is an example of an application of an observational method.

The choice of design has implications for what will be used as evidence. The design will determine how evidence will be gathered and what kind of claims can be made. The design also determines how data sources will be selected, what data collection instruments will be used, who will collect the data, and what data management systems will be needed.

Each evaluation method has its own limitations. Evaluations that mix methods are generally more effective. Methods might need to be revised or modified over the course of an evaluation: The intended use of an evaluation might shift, or changing conditions might require redesign of methods to keep the evaluation on track.

Agreements are explicit written statements that summarize the procedures and clarify roles and responsibilities among those who will execute the evaluation plan. Written agreements ensure understanding among stakeholders and evaluators in terms of their expectations and help to avoid numerous problems that can develop from misunderstandings.

To focus an evaluation design, stakeholders should review the evaluation questions and determine if they will address concerns of interest groups. A list of evaluation uses could be circulated among stakeholders to determine which is most important. Intended users could also be interviewed to understand what they need or want to know and to focus the evaluation so it responds to those needs.

Gather credible evidence

The information collected from a suicide prevention evaluation should provide a useful understanding of the program that addresses stakeholders’ needs. Stakeholders must find the information believable and relevant. Having credible evidence strengthens the recommendations. Credibility can be improved by using multiple procedures and by involving stakeholders in defining and gathering data. The following factors affect people’s perceptions of the credibility of your evaluation evidence: indicators, sources, quality, quantity, and logistics.

  • Indicators are aspects of the program that can be examined to address the questions of the evaluation. Examples of indicators that can be defined and tracked include the program’s capacity to deliver services, the participation rate, levels of client satisfaction, the efficiency of resource use, and the amount of intervention exposure. Other measures of program effects, such as changes in participant behavior, community norms, policies or practices, health status, quality of life, and the settings or environment around the program, can also be tracked.

  • Sources of evidence are persons, documents, or observations. More than one source might be used to gather evidence. Use of multiple sources provides different perspectives. In an evaluation where the question is, "Does a training of trainers result in a desired number of new gatekeepers being trained?", the indicator might be a "number of persons trained by each trainer over the course of a year."

  • Quality refers to the correctness and integrity of the information. Quality data are representative of what they intend to measure and are informative for their intended use. Good indicators make it easier to collect quality data. Instrument design, data-collection procedures, training of data collectors, source selection, coding, data management, and routine error-checking all influence the quality of your data collectors, source selection, coding, data management, and routine error-checking all influence the quality of your data. For example, a data collection effort where data collectors are trained on procedures and where there are checks to monitor adherence to procedures will produce higher-quality data than one with inconsistent or minimal training of data collectors.

  • Quantity refers to the amount of evidence gathered. The amount of information needed should be estimated in advance. All evidence collected should have a clear and anticipated use, with only minimal burden placed on respondents. Information gathered from participants in an intervention should be kept to a minimum. Each indicator should be clearly justified in terms of addressing an evaluation question. Another example of quantity would involve determining how many persons must provide information to adequately address the evaluation question. The burden on persons to provide information should always approach the minimum needed.

  • Logistics encompass the methods, timing, and physical infrastructure for gathering and handling evidence. Each technique selected for gathering evidence must be suited to the source(s), analysis plan, and strategy for communicating findings. Cultural issues should influence decisions about acceptable ways of asking questions and collecting information. Procedures for gathering evidence should be sensitive to cultural conditions in each setting and must ensure that the privacy and confidentiality of the information and sources are protected.

Justify conclusions

Evaluation conclusions must be drawn from the evidence gathered and then compared to the standards set by the stakeholders. Stakeholders must agree that conclusions are valid: otherwise, the evaluation results will be of limited use. Justifying conclusions on the basis of evidence requires the following: standards, analysis and synthesis, interpretation, judgment, and recommendations.

  • Standards reflect what stakeholders think is important and are the basis for forming judgments concerning program performance. Using standards distinguishes evaluation from other approaches, in which priorities are set without reference to any sort of specific statement about what is important.
  • Analysis and synthesis of an evaluation’s findings might detect patterns in evidence. Analysis involves isolating important findings, while synthesis involves combining sources of information to reach a larger understanding.
  • Interpretation is the effort of figuring out what the findings mean. It increases understanding of the evidence gathered in an evaluation. Merely uncovering facts regarding a program’s performance is not sufficient to draw evaluative conclusions; one must interpret the results based on criteria set before the evaluation begins.
  • Judgments are statements concerning the merit, worth, or significance of the program. They are formed by comparing the findings of and interpretations regarding the program against one or more selected standards. Because multiple standards can be applied to a given program, stakeholders might reach different or even conflicting judgments. For example, a program that increases referrals to a mental health clinician by 10 percent from the previous year might be judged positively by program managers who are using the standard of improved performance over time. However, community members might feel that, despite improvements, a minimum threshold of access to services has not been reached.
  • Recommendations are actions for consideration resulting from the evaluation. Recommendations that lack sufficient evidence or that deviate from stakeholders’ values can undermine an evaluation’s credibility. Sharing early drafts of recommendations and asking for feedback from multiple stakeholders during the process can increase the likelihood that the recommendations will be accepted. Additionally, when possible, presenting options instead of directive advice will make recommendations more acceptable.

Adequate justification of the conclusions and recommendations of an evaluation is an essential part of creating information that is useful. However, having useful information does not ensure that it is actually used.

Ensure use and share lessons learned

Unfortunately, despite the best intentions of evaluators and practitioners, lessons learned during an evaluation are not always used. For findings to be used effectively, they must be disseminated appropriately. This requires strategic thinking and should begin in the earliest stages of planning an evaluation and engaging your stakeholders. The goal of dissemination is to achieve full disclosure and impartial reporting. Additional uses for evaluation flow from the process of conducting it. Persons who participate in an evaluation can experience changes in thinking and behavior. Evaluation sometimes increases staff understanding of program goals.

Five critical elements for ensuring that an evaluation is used are as follows: design, preparation, feedback, follow-up, and dissemination.

  • Design refers to the construction of evaluation questions, methods, and overall processes. The design should be organized to achieve intended uses by stakeholders.
  • Preparation refers to sound evaluation planning and following the steps described in this booklet to ensure effective use of evaluation findings.
  • Feedback is the communication that occurs among all parties to the evaluation. Giving and receiving feedback creates an atmosphere of trust among stakeholders.
  • Follow-up refers to the technical and emotional support that users need both during the evaluation and after they receive evaluation findings.
  • Dissemination is the process of communicating to others the lessons learned. The reporting strategy should be discussed with intended users and other stakeholders.

Sharing information learned from implementing and evaluating suicide prevention programs is one of the most important responsibilities we have in our effort to prevent suicide. Without sharing information, we are isolated in our efforts, and adoption of innovative and promising practices is limited. Effective diffusion of prevention information can have revolutionary effects on the practice of prevention efforts.

The most important aspect of applying lessons learned from evaluation is ensuring that the program is improved as a result. Recommendations should be specific regarding suggested shifts in program emphasis or in specific procedures. A specific written plan should be developed for implementing accepted recommendations. Subsequent evaluation cycles should examine the implementation of the changes and their impact.

Table of Contents | Previous | Next