Long-Run Event Studies
Long-run event studies measure the impact of an event over months or years rather than days. They are essential for answering whether the market fully and correctly prices an event at the time of announcement or whether abnormal performance persists afterward. This page covers the three main approaches — Buy-and-Hold Abnormal Returns (BHAR), Cumulative Abnormal Returns (CAR), and the Calendar-Time Portfolio method — along with the methodological challenges that make long-run studies more difficult than short-run analyses.
Part of the Methodology Guide
This page is part of the Event Study Methodology Guide. For short-run event study methods, see the Test Statistics Overview.
Long-run event studies extend the analysis horizon from days to months or years, testing whether the market's initial reaction was complete and correct. Research by Barber and Lyon (1997) demonstrates that conventional t-tests applied to 36-month BHARs reject the null hypothesis at rates of 10-15% instead of the nominal 5%, due to compounding-induced skewness. Skewness-adjusted tests and bootstrap inference are therefore essential for credible long-horizon analysis.
When Are Long-Run Event Studies Needed?
A long-run event study is an extension of the standard event study methodology that measures abnormal stock performance over horizons of 12 to 60 months rather than days. It tests whether markets fully and correctly price an event at announcement or whether significant post-event drift persists -- a question central to the debate on market efficiency that has generated over 500 published studies since the 1990s.
Use a long-run event study when your research question concerns the market's long-term assessment of an event, not just the immediate reaction. Typical applications include:
- Post-merger performance: Do acquiring firms earn positive or negative abnormal returns in the 1-3 years after a merger?
- Post-IPO returns: Do IPO firms underperform their benchmarks in the long run (the "IPO underperformance" puzzle)?
- Post-earnings announcement drift (PEAD): Do firms with positive earnings surprises continue to outperform for 60-90 trading days?
- SEO underperformance: Do firms that issue seasoned equity offerings underperform afterward?
- Spin-offs and divestitures: What is the long-run value creation from corporate restructuring?
Methodological challenges
Long-run event studies are significantly more challenging than short-run studies. Results are sensitive to the methodology, the benchmark, and the sample period. Always report results from multiple approaches and discuss the limitations.
Buy-and-Hold Abnormal Returns (BHAR)
The BHAR approach computes the difference between the compounded return of the event firm and the compounded return of a benchmark over the same holding period. It reflects the actual experience of an investor who buys the stock on the event date and holds it for T periods.
The BHAR for firm i over horizon T is:
BHAR_i(T) = Product(1 + R_i,t) - Product(1 + R_benchmark,t)
where the product runs from t = 1 to T (e.g., T = 12 months or 36 months). The benchmark can be a market index, a matched firm, or a portfolio of matched firms.
| Aspect | Details |
|---|---|
| Interpretation | The excess wealth from buying the event firm vs. the benchmark |
| Compounding | Fully compounds returns; captures the real investor experience |
| Statistical distribution | Severely right-skewed, especially over long horizons |
| Typical horizon | 12, 24, or 36 months |
| Benchmark options | Market index, size-matched firm, size/BM-matched firm, industry portfolio |
# Configure the EventStudy package for BHAR
ps <- ParameterSet$new(return_model = BHARModel$new())
# Set long event window in the request table
request <- tibble(
event_id = 1:50,
firm_symbol = firms,
index_symbol = "SP500",
event_date = event_dates,
group = "PostMerger",
event_window_start = 0,
event_window_end = 252, # ~12 months of trading days
shift_estimation_window = -10,
estimation_window_length = 250
)
task <- EventStudyTask$new(firm_data, index_data, request)
task <- run_event_study(task, ps)
# Retrieve BHAR for each event
task$get_car() # CAR over the long window = BHARMatched-firm benchmark
The most common BHAR benchmark is a matched firm selected on size and book-to-market ratio. This controls for the well-known size and value effects. Match each event firm to a non-event firm in the same size decile and book-to-market quintile.
Cumulative Abnormal Returns (CAR)
The CAR approach sums daily or monthly abnormal returns over the holding period without compounding. It is simpler to compute and has better statistical properties than BHAR (the distribution is closer to normal), but it does not reflect the actual investor experience because it ignores compounding.
CAR_i(T) = Sum of AR_i,t from t = 1 to T
| Aspect | Details |
|---|---|
| Interpretation | Sum of daily/monthly excess returns (no compounding) |
| Compounding | Does not compound; may diverge from BHAR over long horizons |
| Statistical distribution | Closer to normal; easier inference |
| Typical horizon | 12, 24, or 36 months |
| Benchmark options | Same as BHAR; Market Model or factor models |
For short horizons (up to about 12 months), CAR and BHAR produce similar results. Over longer horizons of 36-60 months, the compounding difference can exceed 5-10 percentage points, and BHAR is generally preferred because it represents the economically relevant quantity (investor wealth).
Calendar-Time Portfolio Approach
The calendar-time portfolio approach avoids many problems of the BHAR and CAR methods by aggregating event firms into a portfolio. Each calendar month, the portfolio includes all firms that experienced the event within the past T months. The portfolio return is then regressed on a factor model (e.g., Fama-French three-factor or five-factor), and the intercept (alpha) measures the abnormal return.
# Step 1: For each calendar month, identify event firms
# (firms with events in the past 36 months)
# Step 2: Compute equal-weighted or value-weighted portfolio return
# Step 3: Regress portfolio excess returns on factor model
library(lmtest)
library(sandwich)
# Example: Fama-French 3-factor regression
ct_model <- lm(
portfolio_excess ~ mkt_rf + smb + hml,
data = calendar_time_data
)
# Use Newey-West standard errors for autocorrelation
coeftest(ct_model, vcov = NeweyWest(ct_model, lag = 3))
# The intercept (alpha) is the monthly abnormal return
# Multiply by T for cumulative abnormal return| Aspect | Details |
|---|---|
| Interpretation | Monthly alpha from factor model regression |
| Cross-correlation | Naturally handled -- correlated firms are in the same portfolio |
| Varying sample size | Number of firms in portfolio changes monthly; can be weighted |
| Statistical inference | Standard t-test on alpha; Newey-West for autocorrelation |
| Weakness | Low power when few events per month; variable portfolio composition |
Preferred for clustered events
The calendar-time portfolio approach is strongly recommended when events cluster in time (e.g., IPO waves, merger waves). It is the only method that automatically handles cross-sectional correlation without additional adjustments.
Comparing the Three Approaches
| Feature | BHAR | CAR | Calendar-Time Portfolio |
|---|---|---|---|
| Compounding | Yes | No | Monthly returns (implicit) |
| Investor experience | Realistic | Approximation | Portfolio-level |
| Statistical properties | Skewed; inference difficult | Better behaved; standard tests | Well specified; standard regression |
| Cross-correlation | Not handled; requires adjustment | Not handled; requires adjustment | Naturally handled |
| Power | Moderate | Moderate | Lower (monthly frequency) |
| Benchmark sensitivity | High | High | Moderate (factor model) |
| Best for | Individual firm analysis; reporting economic magnitude | Statistical testing; large samples | Clustered events; robustness check |
Methodological Challenges
Long-run event studies face several well-documented problems that do not arise in short-run studies. Being aware of these challenges is essential for producing credible results.
New Listing Bias
When the benchmark is a market index or broad portfolio, firms that enter the benchmark after the event date are included in the benchmark return but not in the event sample. These new listings tend to be smaller and riskier, which biases the benchmark return. This is particularly problematic for IPO studies where the relevant comparison group (other IPOs) is systematically different from the full market.
Skewness in BHAR Returns
Compounded returns are bounded below at -100% but unbounded above. Over long horizons, this produces a severe right skew in the BHAR distribution. According to Barber and Lyon (1997), the skewness coefficient of 36-month BHARs typically exceeds 1.5, making standard t-tests unreliable with actual rejection rates of 10-15% at a nominal 5% significance level. Solutions include using the skewness-adjusted t-test of Lyon, Barber, and Tsai (1999) or bootstrapped p-values.
Cross-Correlation
With long event windows, multiple events frequently overlap in calendar time. This creates cross-sectional dependence that inflates test statistics. The calendar-time portfolio approach is the most effective solution.
Benchmark Sensitivity
Long-run results are highly sensitive to the choice of benchmark. Small differences in benchmark returns compound over time, producing large differences in measured abnormal performance. A study may find significant underperformance with one benchmark and no abnormal performance with another.
| Challenge | Impact | Solution |
|---|---|---|
| New listing bias | Biased benchmark returns | Use matched-firm benchmarks; exclude new listings from index |
| BHAR skewness | Invalid t-tests; over-rejection | Skewness-adjusted t-test; bootstrap; use CAR for inference |
| Cross-correlation | Inflated test statistics | Calendar-time portfolio approach |
| Benchmark sensitivity | Fragile results | Report results with multiple benchmarks |
| Low power | Failure to detect true effects | Larger samples; shorter horizons if possible |
| Survivorship bias | Upward bias in returns | Include delisted firms with delisting returns |
Best Practices
- Report multiple methods: Present BHAR, CAR, and calendar-time portfolio results. If all three agree, the finding is robust. If they disagree, discuss why.
- Use multiple benchmarks: Test with a market index, size-matched firms, and size/book-to-market matched firms. Report all results.
- Address skewness: For BHAR inference, use the skewness-adjusted t-test or bootstrapped p-values rather than standard t-tests.
- Include delisting returns: Firms that delist during the holding period (due to bankruptcy, acquisition, or going private) must be included with their delisting returns. Ignoring them creates survivorship bias.
- Check for confounding: Over 12-36 months, many other events occur. Consider whether your results might be driven by systematic confounding (e.g., all event firms are also in the same industry that experienced a downturn).
- Use the calendar-time approach for clustered events: If your events cluster in time (IPO waves, merger waves), the calendar-time portfolio approach is essential.
Implementation in R
library(EventStudy)
library(tidyverse)
# --- BHAR approach ---
ps_bhar <- ParameterSet$new(return_model = BHARModel$new())
request_long <- tibble(
event_id = 1:nrow(events),
firm_symbol = events$symbol,
index_symbol = "SP500",
event_date = events$date,
group = events$group,
event_window_start = 1, # Start day after event
event_window_end = 252, # 12 months
shift_estimation_window = -10,
estimation_window_length = 250
)
task_bhar <- EventStudyTask$new(firm_data, index_data, request_long)
task_bhar <- run_event_study(task_bhar, ps_bhar)
# BHAR results
bhar_results <- tidy.EventStudyTask(task_bhar, type = "ar")
# --- CAR approach (Market Model) ---
ps_car <- ParameterSet$new(return_model = MarketModel$new())
task_car <- EventStudyTask$new(firm_data, index_data, request_long)
task_car <- run_event_study(task_car, ps_car)
car_results <- tidy.EventStudyTask(task_car, type = "aar")
# --- Compare BHAR and CAR ---
cat("Mean BHAR (12 months):", mean(bhar_results$car), "\n")
cat("Mean CAR (12 months):", tail(car_results$caar, 1), "\n")# Skewness-adjusted t-test (Lyon, Barber, Tsai 1999)
bhar_values <- bhar_results$car
n <- length(bhar_values)
mean_bhar <- mean(bhar_values)
sd_bhar <- sd(bhar_values)
skew <- (1/n) * sum(((bhar_values - mean_bhar) / sd_bhar)^3)
# Adjusted t-statistic
t_conv <- mean_bhar / (sd_bhar / sqrt(n))
t_adj <- t_conv + (1/3) * skew * (1/sqrt(n)) +
(1/6) * skew^2 * t_conv / n
cat("Conventional t:", round(t_conv, 3), "\n")
cat("Skewness-adjusted t:", round(t_adj, 3), "\n")
cat("p-value (adjusted):", round(2 * pt(-abs(t_adj), df = n - 1), 4), "\n")Literature
- Barber, B.M. & Lyon, J.D. (1997). Detecting long-run abnormal stock returns: The empirical power and specification of test statistics. Journal of Financial Economics, 43(3), 341-372.
- Lyon, J.D., Barber, B.M. & Tsai, C.-L. (1999). Improved methods for tests of long-run abnormal stock returns. Journal of Finance, 54(1), 165-201.
- Mitchell, M.L. & Stafford, E. (2000). Managerial decisions and long-term stock price performance. Journal of Business, 73(3), 287-329.
- Fama, E.F. (1998). Market efficiency, long-term returns, and behavioral finance. Journal of Financial Economics, 49(3), 283-306.
- Kothari, S.P. & Warner, J.B. (2007). Econometrics of event studies. In Handbook of Corporate Finance, Vol. 1.
- BHAR (Buy-and-Hold Abnormal Return)
- The difference between the compounded return of the event firm and the compounded return of a benchmark over a holding period. BHAR reflects the actual investor experience but is severely right-skewed over long horizons.
- Calendar-Time Portfolio
- A method that forms a portfolio of all event firms each calendar month and measures abnormal performance as the alpha from a factor model regression. Naturally handles cross-sectional correlation among overlapping events.
- New Listing Bias
- A bias in long-run event studies caused by firms entering the benchmark index after the event date. These new listings tend to be smaller and riskier, distorting benchmark returns.
Implement this with the R package
Access advanced features and full customization through the EventStudy R package.
What Should I Do Next?
- Expected Return Models — the BHAR Model and other benchmarks
- Choosing the Right Test Statistic — select appropriate tests for your study design
- Inference & Robustness — bootstrap and multiple testing corrections
- Cross-Sectional Regression — analyze determinants of long-run abnormal returns