Guidelines for Informing Policy via Data

CHAPTER 1 - INTRODUCTION


Why does a policy-oriented professional need to think about data collection and analysis as a tool for policy creation? What does a survey-oriented professional or statistician need to understand about the policy-creation process in order to effectively collect or analyse data for the policy-maker's purposes? And how do both types of professionals ensure that accurate, high-quality data is interpreted well and used to inform effective policy? This chapter will introduce the concepts behind evidence-based policy creation and evaluation.

1.1 THE CONTEXT

The development of sound policy, particularly policy related to democratic governance and human rights, is crucial to the effective operation of governmental bodies. Given that it is the ethical and moral responsibility of governments to operate to serve their citizens, policy should, in part, work to remedy problems and issues related to those citizens' lives. In spite of rapid technological, scientific, and, arguably, governance development over the past century, a host of serious and overwhelming issues still affect all States: poverty, discrimination, mass human rights violations including genocide, a lack of universal primary education, a lack of basic health care, and global warming, to name just a few. The only method available to combat those universal issues is through sound policy enacted and monitored at the governmental, intra-governmental, and international level; only through combined effort can we hope to reduce and eliminate such problems and the suffering that they cause.

How is effective policy created? The Metagora community believes that effective policy is evidence-based policy - that is, policy informed by and based on valid empirical data that address the needs and preoccupations of both the population affected by the policy and also relevant stakeholders. We have therefore created these training materials as an introduction to the world of data, for policy-makers at all levels of government and in non-governmental organisations, and to the world of policy, for data-oriented professionals such as survey methodologists and statisticians.

Very often, policy-makers and statisticians do not see the need to communicate directly with each other. Policy-makers either do not see the relevance of empirical data for their work, preferring to rely on expert opinion, their personal or anecdotal experiences, or communication received from their constituency, or, since they are not experts on data-collection and analysis themselves, prefer to consider data only as it is presented in a final report. Conversely, data-oriented professionals may feel that as long as they are given a particular set of research questions, their role ends at the generation of a report responding to those questions, and understanding the policy process is not a requirement. Within the Metagora community, our experience has been that it is important for each type of professional, the policy-oriented and the data-oriented, to understand each other's field in order to create the most effective policy process. We therefore discuss below the advantages for each type of professional in learning the work of the other.

1.1.1 Why does a policy-oriented professional need to think about data-collection and analysis as a tool for policy creation?

Data are not a uniform product, collected via identical methods in different contexts. Rather, the practice of data-collection, and the subsequent quality of the data produced, varies significantly. Even when excellent methodology is used in the collection of data, different techniques of data-collection produce statistics and indicators that are subject to different interpretations.

An example will help to illustrate this concept. Policy-makers are often interested in the causal relationship between two variables - in order words, whether one phenomenon causes another. A classic historical example is the question of whether cigarette smoking causes cancer. Determining causality, however, is very difficult, if not impossible, unless data are collected via a randomised experiment.

In the most basic type of randomised experiment, a set of people is randomly split into two groups. One group receives the treatment, such as a pack of cigarettes to smoke each day, and the other group receives a placebo or no treatment. At the end of the experiment, the two groups are studied to see if the second variable of interest, such as cancer, is found in different rates in each of the two groups. If a statistically significant difference is found between the two groups, then a causal relationship has been established.

In most cases, the data available to policy-makers is non-experimental; no randomization has occurred, for example, to determine who will smoke and who will not. Very often such an experiment would be considered unethical and is simply not done. The issue is then one of confounding. In the historical example of cigarettes and cancer, the argument was that even if smokers as a group have higher cancer rates than non-smokers, there could be some other effect - a confounding variable - that caused both the higher rate of cancer and the tendency to smoke.

The policy-maker could not be sure, then, that policies intended to reduce the rate of smoking would indeed reduce the rate of cancer as well. The policy-maker would have to decide whether to implement a policy based on the evidence at hand about cigarette smoking and cancer, and implement that policy in a way that allowed the results of the policy to be monitored and evaluated. In the case of policies related to democratic governance and human rights, methods for monitoring and evaluating policies are discussed in Chapter 13 of this manual.

Data derived from sources other than a randomised experiment can be quite useful for informing policy creation, but not as direct measurements of causal relationships. One of the most basic distinctions between different types of non-experimental data is randomly collected data versus non-randomly collected data. When data are gathered via a random sample of a population, the resulting statistics represent the entire population, albeit with a margin of error. Clouding this distinction, however, is the case where data collected via a random sample are used to infer characteristics about a bigger population than the one from which the data were collected.

For example, if the sample frame for a survey is comprised of names of people who live in households in the federal district, then the homeless population has not been sampled and is not included in the resulting statistics. As a result, measurements of particular issues, such as poverty (and most certainly homelessness), will be biased. In other words, when interpreting the statistics created from these data, the policy-maker must be aware of the exclusion of the homeless from the sample and the potential impact on the data due to that exclusion.

1.1.2 What does a survey-oriented professional or statistician need to understand about the policy-creation process in order to effectively collect or analyse data for the policy-maker's purposes?

The policy process is complex and nuanced. If a data-oriented professional is interested in having an impact on the policy process, s/he must be aware of the role that politics can play in decision-making, and the need for transparent and well-explained methods. Situations such as upcoming elections, world events, and the relative bargaining positions of various politicians can all affect how data and/or statistics will be received. In such a situation, the role of the data-oriented professional may extend past simply collecting and/or analysing data of interest. S/he may be required to defend methods from political attacks, and even fight for the release of information that a superior within his/her organisation is withholding for political reasons.

As such, the most important reason that a data-oriented professional needs to be aware of the policy process is to maintain his/her own professional integrity and code of ethics. Understanding the potential barriers to collecting and releasing data that have policy or political impact, the political sensitivities that may influence general acceptance of the results of a data-collection process, and the potential misuses of data that are being collected all fall under the ethical responsibility of the data-oriented professional.

1.1.3 How do both types of professionals ensure that accurate, high-quality data is interpreted well and used to inform effective policy?

Data are analysed and interpreted correctly when:

  • The method of data-collection is transparent and well-documented;
  • The models and assumptions underlying the analysis of the data are clearly articulated;
  • The professionals performing the analysis and interpretation of the data are aware of their own biases and analyse the data in as impartial a method as possible; and
  • The public discussion about the data is focused on the best policy response to the results given by the data, rather than a discussion of the validity of the data and their subsequent interpretation.
In order for a policy-oriented professional to interpret data well, s/he must be able to review and understand the methodology used to collect and analyse the data. Only when all of the benefits and limitations inherent in the data and the assumptions underlying the analysis of the data are understood by all members of a debate can spurious attacks on the validity of the data be mitigated. Additionally data must be intepreted accurately, and unsupported assumptions of causality should not be made. In other words, both the policy-maker and the statistician must do their jobs effectively. When each understands the work of the other, they each serve as a "check" against bad practice, either bad interpretation of data or bad policy decision-making based on those data.

1.2 THE REMAINDER OF THESE GUIDELINES

The remaining thirteen chapters of these guidelines will encompass the following topics.

  • Chapter 2: The policy process: Creating policy within governments and other types of organisations

    This chapter will review the policy process, exploring two different models: the stages model and the streams model.

  • Chapter 3: The value of data and indicators

    Indicators are an often-used but seldom understood tool. This chapter discusses what data are and how they are collected, what indicators are and how they are developed, and how the different methods for data-collection, indicator-creation and statistical analysis can be used in the general context of the policy process, particularly in relation to policy-making in the field of democratic governance and human rights. Emphasis is placed on the complementary roles of the policy-oriented professional and the statistician, and the types of difficulties inherent in data-collection that must be considered.

  • Chapter 4: Developing indicators and other statistics from pre-existing data

    Data are already out there -- on the United Nations web site, in the national statistical agencies, and housed within NGOs. This chapter discusses the advantages and dangers of using pre-existing data to develop indicators and other statistics.

  • Chapter 5: Collecting data: Getting started

    If no pre-existing data exists, then data must be collected. In this case, several questions arise. What are the topics to be covered during the data collection? What research goals are part of the project? Are you measuring subjective or objective phenomena? Do you need quantitative or more qualitative data? Will you use a random sample survey or a more purposive method of data collection, such as focus group discussions? Are you best served by a survey instrument that contains open-ended questions or a highly structured questionnaire? Is there an existing data collection process to which you can add a module? What are the ethical implications and guidelines of your data-collection process? What confidentiality guarantees can you make? This chapter will explore all of these themes.

  • Chapter 6: Collecting data: Finding the people to interview

    This chapter contains a discussion of sampling, highlighting the reasons for using a random sample or more purposive sample, determining a target population for data collection, finding a sampling frame, types of random samples, and other types of samples. Some of the pitfalls encountered in the sampling process will also be discussed, including incomplete sampling frames, under-coverage, biased samples, household-based populations, homeless and institutionalised populations, and the potential for non-response and incomplete responses. The errors associated with different sampling procedures and the errors introduced through non-response will also be explored.

  • Chapter 7: Collecting data: Creating the statement/questionnaire

    Creating an instrument for the collection of data is a complex task within a single culture, and a phenomenal task when the instrument in question is to be used in multiple languages or cultures. This chapter reviews the questionnaire design process, with an emphasis on multi-language and multicultural surveys. After reading this section, the policy-oriented professional will gain a better understanding of the necessity for thorough pre-testing of questionnaires, and the survey-oriented professional will understand the basic steps required for a high-quality questionnaire-development process.

  • Chapter 8: Collecting data: Logistics

    Once the data-collection instrument is designed and the people to be interviewed are picked, the real work of training interviewers and collecting the data begins. This chapter discusses all aspects of the logistics of data collection, including supervising field workers, maintaining high quality, budgeting, and the use of incentives.

  • Chapter 9: Storing and processing data

    This chapter looks at what happens after the data are collected. Whether the data are qualitative or quantitative, they must be stored in a manner that facilitates analysis. In almost all modern data-collection processes, a computer is used to store the data. Quantitative data must be entered into a database, which involves a separate staff and the need to understand the underlying data structure. Qualitative data may be stored as-is or "coded" via a controlled vocabulary in order to extract quantitative elements. All of these approaches will be discussed.

  • Chapter 10: Analysing data

    Whether data are qualitative or quantitative, they will need to be analysed before they will become useful for policy creation and monitoring. This chapter will discuss good practices of qualitative analysis, types of quantitative analysis, principles for the creation of statistics and statistical graphics, and how a combined qualitative/quantitative analysis can be approached.

  • Chapter 11: Data presentation and the ethical use of data

    Presenting data well is not just a good idea, it is a matter of ethics. This chapter discusses best practices for data-presentation, gives examples of bad data-presentation, and discusses the issue of the use of data for unethical purposes.

  • Chapter 12: Evaluating the data-collection and analysis process

    After the data collection is over and the analysis is done, it is important to evaluate the quality of these efforts. It is rare that a data-collection process is completed without unexpected setbacks. This chapter will discuss how a good assessment of a data-collection and analysis process can be achieved. Concepts of quality control during the process will also be explored.

  • Chapter 13: Policy decisions and programmes based on data

    Now that the process by which indicators and data become available to the policy-oriented professional has been thoroughly discussed, this chapter will return to a more thorough review of the role that indicators and data play during the creation of policy and policy programmes.

  • Chapter 14: Monitoring and evaluating policy programmes and decisions

    Sometimes the purpose of a data-collection project will be to allow the monitoring and/or evaluation of an existing policy. This chapter will discuss the issues specific to this type of data-collection and analysis process.

It is our hope that, after reading these materials, the policy-oriented professional will be better informed as to why an empirical approach to policy is an effective one and what pitfalls to avoid in using data to inform policy, and the data-oriented professional will better understand the needs and sensitivities of the policy process and tailor his/her field/analysis methods appropriately. Any questions about these materials are welcome and should be directed to us at metagora@oecd.org.


Return to Guidelines for Informing Policy via Data
top