Synthetic Control and Spatial Analysis: Insights for Causality and Policy
X: @ljmaldon | Personal website: www.leonardojmaldonado.com

The Synthetic Control Method (SCM) and spatial analysis are two sophisticated techniques used in various fields, such as applied economics, epidemiology, and environmental science, to analyze complex datasets and infer causal relationships and patterns.
In particular, the SCM is a statistical approach proposed by Abadie and Gardeazabal (2003). It is a data-driven technique used in causal inference to construct a control measure, allowing us to identify and estimate the impact of an event (i.e., treatment => shock event or intervention) in comparative case studies. The idea involves creating a "synthetic" version of a control group that may serve as a counterfactual by using a weighted average of pre-event outcomes of potential control units. The resulting synthetic closely matches the characteristics of the treated group's outcome before the event and is a control for the treated group following the event. Therefore, it enables us to estimate the event's causal effect by comparing the treated group's post-event observed outcomes to those of the synthetic version. This method is beneficial for assessing the impact of policy changes, economic interventions, or large-scale social programs.
On the other hand, spatial analysis involves examining the patterns, processes, and relationships that occur in geographic locations. It integrates geographical data (i.e., maps, satellite images, etc.) with statistical methods to analyze spatial distributions, identify clusters, and explore locations, attributes, and relationships of features in spatial data by applying multiple techniques or geoprocessing tools. In this case, you may solve complex problems modeling spatial relationships using statistical analyses, identify patterns and trends, or make maps and visualizations using geo-referenced data.
The intersection of spatial analysis and the SCM is built upon integrating spatial data with counterfactual frameworks to assess the effects of specific events or policies within a geographic context. In this sense, it provides a powerful framework for causal inference in studies involving spatial factors or situations where conventional experimental designs are impractical or highly difficult.
Some key aspects of their relationship
- Enhanced Causal Inference in Spatial Contexts => SCM provides a methodological framework for conducting causal inference, but when combined with spatial analysis, it allows for assessing interventions across different geographic regions, accounting for the spatial heterogeneity that might influence the outcomes.
- Creation of Geographically Matched Controls => SCM can be applied to create a synthetic control unit that closely matches the treated geographic area in terms of pre-treatment features. This is particularly valuable in spatial analysis, as it accounts for spatial autocorrelation.
- Policy Evaluation and Planning => It enables evaluating the spatial effects of policies or interventions, such as urban development projects, environmental regulations, or public health initiatives, providing evidence-based insights for future planning and decision-making.
- Addressing Spatial Data Challenges => It may help address challenges related to data scarcity or lack of quality in certain locations (e.g., researchers could improve the robustness of their causal estimates by carefully selecting and weighting control units from areas with better data quality).

Evaluating policy impacts? Yes!
Let's assume we want to analyze a city's economic growth before and after hosting a major international event (e.g., the World Cup, the Olympic Games, United Nations Climate Change Conferences, etc.). We can do this analysis by comparing the city against a synthetic composite of similar cities that did not host the event, evaluating the impact of infrastructure projects, zoning changes, or associated economic development programs on local economies or populations.
We can also find applications regarding environmental conservation efforts. For example, it is possible to evaluate the effectiveness of conservation policies by comparing deforestation rates in protected areas against a synthetic control of similar, unprotected areas.
This method can also be used to understand the impact of public health interventions, such as introducing smoke-free legislation in a city or policies to reduce disease incidence rates across different geographic areas, by creating a synthetic control representing a scenario where the legislation or where policies were not enacted.
In summary, mixing spatial analysis with synthetic control methods is a game-changer. Think of spatial analysis as your GPS, helping you navigate the complex geography of data, telling you where things are happening, and helping identify patterns. If synthetic control methods are included, you will now have a "private" tour guide who can tell you what would happen if you took one path over another. It's not just about knowing where you are; it's about understanding what moves to make and why they matter. Cool, right? This mix sharpens the accuracy of drawing causal connections in spatial contexts and paves the way for deriving practical insights critical for policy formulation and decision-making processes.
If you found the time to read this post and consider it useful, please share it with your peers. You can leave comments and/or reach me to let me know your thoughts. Do not hesitate to contact me.
