Summary

The website content provides an overview of Interrupted Time Series (ITS) as a quasi-experimental method for causal inference, discussing its background, strengths, limitations, and applications, particularly in scenarios where Randomized Controlled Trials (RCTs) are not feasible.

Abstract

Interrupted Time Series (ITS) is presented as a valuable tool for evaluating the effects of interventions when Randomized Controlled Trials (RCTs) are not possible due to practical constraints. The method involves analyzing a single time series data set before and after an intervention, relying on the assumption that the outcome variable would remain unchanged without the intervention. ITS is noted for its strong inferential power, especially in fields like epidemiology and program evaluation, and is considered the strongest quasi-experimental method in causal inference. The article outlines the key aspects of ITS, including the creation of counterfactuals, control for time-varying confounders, and the use of segmented regression analysis. While acknowledging its merits, such as accounting for long-term trends and evaluating population-level data, the article also discusses limitations such as the need for multiple data entries and the challenge of time lag in intervention effects. The author provides a simulated data example to illustrate ITS analysis and concludes by inviting readers to explore further applications of ITS in causal inference.

Opinions

The author considers ITS to be a robust method for causal inference, particularly when RCTs are not an option.
ITS is seen as superior to other quasi-experimental methods due to its ability to control for long-term trends and evaluate both intended and unintended consequences of interventions.
The author emphasizes the importance of having a sufficient number of data entries (at least 8 before and after the intervention) for a robust causal analysis, although they acknowledge that this may not always be feasible.
There is a recognition that ITS deals with population-level data, which can be both an advantage (reducing individual-level bias) and a disadvantage (limiting inferences about individual behavior).
The article suggests that ITS can overcome the limitations of having only one case by using segmented regression, which allows for the examination of changes in the direction and extent of effects attributed to the intervention.
The author critiques prior research, such as studies linking the Great Recession to increased suicides, for failing to account for confounders like seasonality and social groupings.
The author advocates for the use of ITS in stratified analyses to understand how different sub-groups within a population may respond to interventions differently.
Visual inspections of ITS results are encouraged as a valid method for interpreting findings.
The author promotes their other content on Artificial Intelligence and Machine Learning, indicating a broader interest in these fields.

Experimentation and Causal Inference

A Practitioner’s Guide To Interrupted Time Series

Basics, Assumptions, Merits, Limitations, and Applications

Background

In the world of causal inference, Randomized Controlled Trials, RCTs, are considered the gold standard as it rules out any covariate differences before the intervention. However, running a RCT isn’t an option for multiple reasons (e.g., too expensive, invalid assumptions, too long, not ethical, etc.).

Under these circumstances, Interrupted Time Series (ITS) design comes in handy (see Netflix). Aa a quasi-experimental method, ITS contains a strong inferential power and has wide applications in epidemiology, medication research, and program evaluations in general.

Arguably, ITS is the strongest quasi-experimental method in causal inference (Penfold and Zhang, 2013).

In this post, we will learn the basics of the method and how to apply it in real life.

What is an ITS?

As a quasi-experimental design, ITS is an analysis of a single time-series data before and after the intervention (Bernal, et al. 2017). From the perspective of research design, ITS builds upon a rather straightforward design idea: the outcome variable would not be altered if there were no intervention.

However, the tricky part is:

how we can derive causal argumentation from a single time series data?

how can we eliminate confounders?

In other words, it is crucial to create “counterfactuals” that serve as the baseline point. We can attribute the “altered” trajectory to the presence of the intervention.

Fortunately, there is a time component with ITS, as the name suggests, that allows us to assume the outcome variable would not change if the intervention were not present.

Besides, we could examine whether the outcome variable returns to the baseline after taking away the treatment condition, if there are multiple data entries (see Netflix for examples).

Furthermore, we must control for time-varying confounders, including seasonable trends and concurrent events that may interfere with the results.

For example, researchers challenge and repudiate the prior findings that the Great Recession in 2008 leads to more suicides in the U.S., arguing the previous studies fail to consider seasonality and social groupings (Harper and Bruckner).

Strengths and Limitations of ITS

Penfold and Zhang (2013) have provided a complete list of the strengths and limitations, and I’m going to summarize the key points in the following.

Strengths

To control for long-term time trends in the data. ITS presents a long-term analytical framework with more extended periods, which better explain any data trends.
To account for individual-level bias and to evaluate the outcome variable at the population level. Individual-level data may introduce bias, but not with population data. Honestly, this is both a blessing and a curse. We will elaborate more on the latter aspect in the following part.
To evaluate both intended and unintended consequences of interventions. We can easily enlarge analysis and incorporate more outcome variables with minimum or no adaptations.
To conduct stratified analyses of subpopulations of individuals and to derive different causal effects. This is critical. We can divide the total population into different sub-groups according to various criteria and examine how each sub-group may behave differently. Social groups are different, and grouping them together may dilute or hide critical information, as positive and negative effects mix together and cancel out (see Harper and Bruckner for examples).
To provide clear and interpretable visual results. Visual inspections are always welcome and should be treated seriously (See my other post for more explanations).

Limitations

Multiple rounds of data entries. A minimum of 8 periods before and 8 after an intervention to evaluate the changes. So, we need a total of 16 data entries, which may not be possible all the time. I think Penfold and Zhang (2013) are being cautious about the number of data entries. It’s still possible to apply ITS with few rounds of data entry. Just the causal power may not as robust as the one with multiple rounds.
Time lag. It takes some unknown time for a program to achieve intended results, which makes it difficult to pinpoint the causal effects of several events that coincide. Let’s say the transportation department in the U.S. adopt three policies within a two-year timespan to curb highway speeding. Playing God, we somehow know it would take 1 yr for Policy A to have any effect, 1.5 ys for Policy B, and 3 yrs for Policy C. In the meantime, it becomes impossible to separate the intertwined effects using ITS.
Inference Level. It’s population-level data, so we can’t make inferences about each individual.

Applications

ITS uses Segmented Regression to examine the effects of the intervention. ITS requires two segments: the one before the intervention and the one after the intervention. Each segment has its own slope and intercept, and we compare the two segmented regression models to derive the effects.

We attribute any changes in the direction (e.g., from positive to negative) and/or the extent (from large effects to small effects) between these two segmented regression models to the intervention variable.

Actually, this is how ITS overcomes the limitations of having only one case and still has a strong inferential power.

Here is an example of ITS analyses using simulated data.

# simulated data

# data preparation
set.seed(1)
CaseID = rep(1:100,6)

# intervention
Intervention = c(rep(0,300), rep(1,300))
Outcome_Variable = c(rnorm(300), abs(rnorm(300)*4))
mydata = cbind(CaseID, Intervention, Outcome_Variable)
mydata = as.data.frame(mydata)

#construct a simple OLS model
model = lm(Outcome_Variable ~ Intervention, data = mydata)
summary(model)

Call:
lm(formula = Outcome_Variable ~ Intervention, data = mydata)

Residuals:
    Min      1Q  Median      3Q     Max 
-3.3050 -1.2315 -0.1734  0.8691 11.9185

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)   0.03358    0.11021   0.305    0.761    
Intervention  3.28903    0.15586  21.103   <2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 1.909 on 598 degrees of freedom
Multiple R-squared:  0.4268, Adjusted R-squared:  0.4259 
F-statistic: 445.3 on 1 and 598 DF,  p-value: < 2.2e-16

As can be seen, the regression result of the intervention variable is statistically significant.

This is a quick intro class to ITS using simulated data. Actually, ITS can do so much more in causal inference, and I’ll elaborate more in a follow-up post soon. Hopefully~~

Medium recently evolved its Writer Partner Program, which supports ordinary writers like myself. If you are not a subscriber yet and sign up via the following link, I’ll receive a portion of the membership fees.

Read every story from Leihua Ye, Ph.D. Researcher (and thousands of other writers on Medium)

As a Medium member, a portion of your membership fee goes to writers you read, and you get full access to every story…

leihua-ye.medium.com

Enjoy reading this one?

Please find me on LinkedIn and Youtube.

Also, check my other posts on Artificial Intelligence and Machine Learning.