Analysis

20 - OTP → Ridership Causality

Ridership and External Factors

Coverage: 2017-01 to 2025-11 (from otp_monthly, ridership_monthly).

Built 2026-04-03 20:09 UTC · Commit 7c56b9a

Page Navigation

Analysis Navigation

Data Provenance

flowchart LR
  20_otp_ridership_causality(["20 - OTP → Ridership Causality"])
  t_otp_monthly[("otp_monthly")] --> 20_otp_ridership_causality
  01_data_ingestion[["Data Ingestion"]] --> t_otp_monthly
  u1_01_data_ingestion[/"data/routes_by_month.csv"/] --> 01_data_ingestion
  u2_01_data_ingestion[/"data/PRT_Current_Routes_Full_System_de0e48fcbed24ebc8b0d933e47b56682.csv"/] --> 01_data_ingestion
  u3_01_data_ingestion[/"data/Transit_stops_(current)_by_route_e040ee029227468ebf9d217402a82fa9.csv"/] --> 01_data_ingestion
  u4_01_data_ingestion[/"data/PRT_Stop_Reference_Lookup_Table.csv"/] --> 01_data_ingestion
  u5_01_data_ingestion[/"data/average-ridership/12bb84ed-397e-435c-8d1b-8ce543108698.csv"/] --> 01_data_ingestion
  t_ridership_monthly[("ridership_monthly")] --> 20_otp_ridership_causality
  01_data_ingestion[["Data Ingestion"]] --> t_ridership_monthly
  d1_20_otp_ridership_causality(("numpy (lib)")) --> 20_otp_ridership_causality
  d2_20_otp_ridership_causality(("polars (lib)")) --> 20_otp_ridership_causality
  d3_20_otp_ridership_causality(("scipy (lib)")) --> 20_otp_ridership_causality
  d4_20_otp_ridership_causality(("statsmodels (lib)")) --> 20_otp_ridership_causality
  classDef page fill:#dbeafe,stroke:#1d4ed8,color:#1e3a8a,stroke-width:2px;
  classDef table fill:#ecfeff,stroke:#0e7490,color:#164e63;
  classDef dep fill:#fff7ed,stroke:#c2410c,color:#7c2d12,stroke-dasharray: 4 2;
  classDef file fill:#eef2ff,stroke:#6366f1,color:#3730a3;
  classDef api fill:#f0fdf4,stroke:#16a34a,color:#14532d;
  classDef pipeline fill:#f5f3ff,stroke:#7c3aed,color:#4c1d95;
  class 20_otp_ridership_causality page;
  class t_otp_monthly,t_ridership_monthly table;
  class d1_20_otp_ridership_causality,d2_20_otp_ridership_causality,d3_20_otp_ridership_causality,d4_20_otp_ridership_causality dep;
  class u1_01_data_ingestion,u2_01_data_ingestion,u3_01_data_ingestion,u4_01_data_ingestion,u5_01_data_ingestion file;
  class 01_data_ingestion pipeline;

Findings

Findings: OTP -> Ridership Causality

Summary

There is no evidence that OTP declines predict subsequent ridership losses. After detrending and Bonferroni correction, zero of 93 routes show statistically significant Granger causality from OTP to ridership. The raw cross-correlations are weakly negative, suggesting the opposite direction: months with lower ridership tend to have better OTP.

Key Numbers

Granger causality: 8/93 routes significant at p < 0.05 uncorrected; 0/93 after Bonferroni correction
Median cross-correlation at lag 0: r = -0.18 (IQR: [-0.41, +0.10])
Median cross-correlation at lag 1: r = -0.15 (IQR: [-0.38, +0.09])
Cross-correlations are flat across lags 0--6 -- no dominant lag emerges
93 routes with 36+ months of paired OTP + ridership data (57--70 months each)
Both series detrended by subtracting system-wide monthly mean before testing
ADF stationarity test applied per route; routes with non-stationary detrended series are first-differenced before Granger testing to avoid spurious regression

Observations

The p-value histogram is roughly uniform (with a small pile-up near zero), consistent with the null hypothesis being true for most routes.
Of the 8 nominally significant routes, 3 have best lag = 1 month and the rest are scattered across lags 2--6. No consistent lag structure emerges.
The weakly negative contemporaneous correlation (median r = -0.18) is consistent with reverse causality: months when fewer people ride (summer, holidays) see better OTP because of reduced dwell times and less crowding, not because good OTP attracts riders.

Discussion

The hypothesis that poor OTP drives riders away is intuitive, but this data cannot confirm it. Several factors may explain the null result:

Temporal resolution is too coarse: monthly data may be too slow to capture rider responses, which could operate at the trip or week level.
Riders have limited alternatives: in a single-provider transit system, riders may tolerate poor OTP because they have no substitute, especially for commute trips.
Confounders dominate: ridership is driven primarily by employment, gas prices, weather, and COVID -- factors far larger than OTP fluctuations. The detrending removes system-wide trends but not route-specific confounders.
Reverse causality may mask the effect: if high ridership causes poor OTP (crowding, dwell times) while poor OTP also causes ridership loss, the two effects partially cancel.

Caveats

Granger causality tests linear predictive ability, not true causation. A null result does not prove OTP has no effect on ridership.
The 70-month overlap period with monthly granularity gives limited statistical power for 6-lag models (effective sample ~60 per route).
Detrending removes system-wide trends but not route-specific shocks (e.g., service changes, construction).
Bonferroni correction is conservative; a less strict correction (e.g., Benjamini-Hochberg) might yield a few significant routes, but the overall pattern would remain weak.

Review History

2026-02-27: RED-TEAM-REPORTS/2026-02-27-analyses-19-25.md — 1 significant issue. Added ADF stationarity tests; non-stationary routes are first-differenced before Granger testing. Null result confirmed as robust.

Output

image granger_summary.png
histogram of p-values across routes.

image lagged_crosscorr.png
median cross-correlation by lag with IQR band.

No interactive outputs declared.

data granger_results.csv

per-route Granger test results (F-stat, p-value, optimal lag).

Preview CSV

Expand to load preview.

data lagged_crosscorr.csv

Per-route lagged OTP-to-ridership cross-correlation values across tested lags.

Preview CSV

Expand to load preview.

Methods

Methods: OTP → Ridership Causality

Question

Does a decline in on-time performance predict subsequent ridership losses? If so, at what lag and magnitude?

Approach

Join monthly OTP with monthly weekday ridership by route and month over the overlap period.
For each route, compute lagged cross-correlations between OTP and ridership at lags of 1--6 months (OTP leading ridership).
Aggregate cross-correlations across routes (median and IQR) to identify the dominant lag.
Run Granger causality tests (statsmodels) on routes with sufficient data (36+ months), testing whether lagged OTP improves ridership prediction beyond ridership's own autoregressive trend.
Control for system-wide trends by detrending both series (subtract system monthly mean) before testing.
Check stationarity of each route's detrended series using the Augmented Dickey-Fuller (ADF) test. For routes where either series is non-stationary (ADF p >= 0.05), first-difference both series before Granger testing to avoid spurious regression.
Report the share of routes where Granger causality is significant at p < 0.05, with Bonferroni correction.

Data

Name	Description	Source
`otp_monthly`	route_id, month, OTP	`prt.db` table
`ridership_monthly`	route_id, month, day_type='WEEKDAY', avg_riders	`prt.db` table

Notes: Join on route_id and month; overlap period only (Jan 2019 -- Oct 2024). Exclude routes with fewer than 36 months of paired data.

Output

output/lagged_crosscorr.png -- median cross-correlation by lag with IQR band
output/granger_results.csv -- per-route Granger test results (F-stat, p-value, optimal lag)
output/granger_summary.png -- histogram of p-values across routes

  Source Code
    
      
      """Analysis 20: Test whether OTP declines predict subsequent ridership losses using lagged cross-correlation and Granger causality."""

import numpy as np
import polars as pl
from statsmodels.tsa.stattools import adfuller, grangercausalitytests

from prt_otp_analysis.common import analysis_dir, correlate, phase, query_to_polars, run_analysis, save_chart, save_csv, setup_plotting

OUT = analysis_dir(__file__)

MIN_MONTHS = 36
MAX_LAG = 6


def load_data() -> pl.DataFrame:
    """Load paired monthly OTP and weekday ridership for routes with enough data."""
    df = query_to_polars("""
        SELECT o.route_id, o.month, o.otp, r.avg_riders
        FROM otp_monthly o
        JOIN ridership_monthly r
            ON o.route_id = r.route_id AND o.month = r.month
            AND r.day_type = 'WEEKDAY'
        ORDER BY o.route_id, o.month
    """)

    # Filter to routes with at least MIN_MONTHS of paired data
    route_counts = df.group_by("route_id").agg(pl.col("month").count().alias("n"))
    keep = route_counts.filter(pl.col("n") >= MIN_MONTHS)["route_id"].to_list()
    df = df.filter(pl.col("route_id").is_in(keep))

    return df


def detrend(df: pl.DataFrame) -> pl.DataFrame:
    """Subtract system-wide monthly mean from each route's OTP and ridership."""
    system = df.group_by("month").agg(
        otp_sys=pl.col("otp").mean(),
        riders_sys=pl.col("avg_riders").mean(),
    )
    df = df.join(system, on="month")
    df = df.with_columns(
        otp_dt=(pl.col("otp") - pl.col("otp_sys")),
        riders_dt=(pl.col("avg_riders") - pl.col("riders_sys")),
    )
    return df


def compute_lagged_crosscorr(df: pl.DataFrame) -> pl.DataFrame:
    """Compute lagged cross-correlations (OTP leading ridership) for each route."""
    routes = df["route_id"].unique().sort().to_list()
    results = []

    for route in routes:
        rdf = df.filter(pl.col("route_id") == route).sort("month")
        otp = rdf["otp_dt"].to_numpy()
        riders = rdf["riders_dt"].to_numpy()

        for lag in range(0, MAX_LAG + 1):
            if lag == 0:
                lag_df = pl.DataFrame({"otp": otp, "riders": riders})
            else:
                # OTP at time t correlated with ridership at time t+lag
                lag_df = pl.DataFrame({"otp": otp[:-lag], "riders": riders[lag:]})
            corr = correlate(lag_df, "otp", "riders")
            results.append({
                "route_id": route,
                "lag": lag,
                "correlation": corr["pearson_r"],
                "p_value": corr["pearson_p"],
            })

    return pl.DataFrame(results)


def aggregate_crosscorr(ccdf: pl.DataFrame) -> pl.DataFrame:
    """Summarize cross-correlations across routes by lag."""
    return (
        ccdf.group_by("lag")
        .agg(
            median_r=pl.col("correlation").median(),
            q25_r=pl.col("correlation").quantile(0.25),
            q75_r=pl.col("correlation").quantile(0.75),
            mean_r=pl.col("correlation").mean(),
            n_significant=((pl.col("p_value") < 0.05) & (pl.col("correlation") > 0)).sum(),
            n_routes=pl.col("route_id").count(),
        )
        .sort("lag")
    )


def check_stationarity(df: pl.DataFrame, alpha: float = 0.05) -> dict:
    """Run ADF tests on each route's detrended OTP and ridership series.

    Returns a dict with summary statistics and a per-route DataFrame.
    """
    routes = df["route_id"].unique().sort().to_list()
    results = []
    for route in routes:
        rdf = df.filter(pl.col("route_id") == route).sort("month")
        otp = rdf["otp_dt"].to_numpy()
        riders = rdf["riders_dt"].to_numpy()
        try:
            otp_adf_p = adfuller(otp, autolag="AIC")[1]
        except Exception:
            otp_adf_p = None
        try:
            riders_adf_p = adfuller(riders, autolag="AIC")[1]
        except Exception:
            riders_adf_p = None
        results.append({
            "route_id": route,
            "otp_adf_p": otp_adf_p,
            "riders_adf_p": riders_adf_p,
            "otp_stationary": otp_adf_p is not None and otp_adf_p < alpha,
            "riders_stationary": riders_adf_p is not None and riders_adf_p < alpha,
        })
    adf_df = pl.DataFrame(results).with_columns(
        pl.col("otp_stationary").cast(pl.Boolean),
        pl.col("riders_stationary").cast(pl.Boolean),
    )
    n = len(routes)
    n_otp_stat = adf_df.filter(pl.col("otp_stationary")).height
    n_riders_stat = adf_df.filter(pl.col("riders_stationary")).height
    n_both_stat = adf_df.filter(pl.col("otp_stationary") & pl.col("riders_stationary")).height
    return {
        "adf_df": adf_df,
        "n_routes": n,
        "n_otp_stationary": n_otp_stat,
        "n_riders_stationary": n_riders_stat,
        "n_both_stationary": n_both_stat,
    }


def run_granger_tests(df: pl.DataFrame, adf_df: pl.DataFrame) -> pl.DataFrame:
    """Run Granger causality tests (OTP -> ridership) for each route.

    First-differences non-stationary series before testing.
    """
    routes = df["route_id"].unique().sort().to_list()
    n_routes = len(routes)
    adf_lookup = {
        row["route_id"]: (row["otp_stationary"], row["riders_stationary"])
        for row in adf_df.iter_rows(named=True)
    }
    results = []

    for route in routes:
        rdf = df.filter(pl.col("route_id") == route).sort("month")
        otp = rdf["otp_dt"].to_numpy()
        riders = rdf["riders_dt"].to_numpy()

        otp_stat, riders_stat = adf_lookup.get(route, (True, True))
        differenced = not (otp_stat and riders_stat)

        # First-difference if either series is non-stationary
        if differenced:
            otp = np.diff(otp)
            riders = np.diff(riders)

        if len(otp) < MAX_LAG + 5:
            results.append({
                "route_id": route, "best_lag": None, "f_stat": None,
                "p_value": None, "p_bonferroni": None,
                "n_months": len(rdf), "differenced": differenced,
            })
            continue

        # statsmodels expects [y, x] where we test if x Granger-causes y
        data = np.column_stack([riders, otp])

        best_lag = None
        best_p = 1.0
        best_f = 0.0

        try:
            res = grangercausalitytests(data, maxlag=MAX_LAG, verbose=False)
            for lag in range(1, MAX_LAG + 1):
                f_stat = res[lag][0]["ssr_ftest"][0]
                p_val = res[lag][0]["ssr_ftest"][1]
                if p_val < best_p:
                    best_p = p_val
                    best_f = f_stat
                    best_lag = lag
        except Exception:
            best_lag = None
            best_p = None
            best_f = None

        results.append({
            "route_id": route,
            "best_lag": best_lag,
            "f_stat": best_f,
            "p_value": best_p,
            "p_bonferroni": min(best_p * n_routes, 1.0) if best_p is not None else None,
            "n_months": len(rdf),
            "differenced": differenced,
        })

    return pl.DataFrame(results)


def make_crosscorr_chart(agg: pl.DataFrame) -> None:
    """Plot median cross-correlation by lag with IQR band."""
    plt = setup_plotting()
    fig, ax = plt.subplots(figsize=(8, 5))

    lags = agg["lag"].to_list()
    median = agg["median_r"].to_list()
    q25 = agg["q25_r"].to_list()
    q75 = agg["q75_r"].to_list()

    ax.plot(lags, median, "o-", color="#2563eb", linewidth=2, markersize=8, label="Median r")
    ax.fill_between(lags, q25, q75, alpha=0.2, color="#2563eb", label="IQR")
    ax.axhline(0, color="black", linewidth=0.5)
    ax.set_xlabel("Lag (months, OTP leading ridership)")
    ax.set_ylabel("Pearson r (detrended)")
    ax.set_title("Lagged Cross-Correlation: OTP -> Ridership")
    ax.set_xticks(lags)
    ax.legend()

    save_chart(fig, OUT / "lagged_crosscorr.png")


def make_granger_chart(gdf: pl.DataFrame) -> None:
    """Plot histogram of Granger p-values and best-lag distribution."""
    plt = setup_plotting()
    fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(12, 5))

    valid = gdf.filter(pl.col("p_value").is_not_null())
    pvals = valid["p_value"].to_numpy()
    best_lags = valid["best_lag"].to_numpy()

    # P-value histogram
    ax1.hist(pvals, bins=20, range=(0, 1), color="#2563eb", edgecolor="white", alpha=0.8)
    ax1.axvline(0.05, color="#ef4444", linestyle="--", label="p = 0.05")
    n_sig = (pvals < 0.05).sum()
    n_bonf = valid.filter(pl.col("p_bonferroni") < 0.05).height
    ax1.set_xlabel("p-value (best lag, SSR F-test)")
    ax1.set_ylabel("Number of routes")
    ax1.set_title(f"Granger Causality p-values\n{n_sig}/{len(pvals)} sig. at 0.05, "
                  f"{n_bonf}/{len(pvals)} after Bonferroni")
    ax1.legend()

    # Best-lag distribution (for significant routes only)
    sig_lags = best_lags[pvals < 0.05]
    if len(sig_lags) > 0:
        ax2.hist(sig_lags, bins=range(1, MAX_LAG + 2), color="#e11d48",
                 edgecolor="white", alpha=0.8, align="left")
    ax2.set_xlabel("Best lag (months)")
    ax2.set_ylabel("Number of routes")
    ax2.set_title(f"Optimal Lag for Significant Routes (n={len(sig_lags)})")
    ax2.set_xticks(range(1, MAX_LAG + 1))

    save_chart(fig, OUT / "granger_summary.png")


@run_analysis(20, "OTP -> Ridership Causality")
def main() -> None:
    """Entry point: load, detrend, cross-correlate, Granger test, chart, and save."""
    with phase("Loading data"):
        df = load_data()
        n_routes = df["route_id"].n_unique()
        print(f"  {len(df):,} route-month observations ({n_routes} routes, {MIN_MONTHS}+ months each)")

    with phase("Detrending (subtracting system monthly mean)"):
        df = detrend(df)

    with phase("Computing lagged cross-correlations (lags 0--6)"):
        ccdf = compute_lagged_crosscorr(df)
        agg = aggregate_crosscorr(ccdf)
        print("\n  Lag  Median r   IQR              Sig+ routes")
        print("  ---  --------   ---------------  -----------")
        for row in agg.iter_rows(named=True):
            print(f"  {row['lag']:>3d}  {row['median_r']:>8.4f}   "
                  f"[{row['q25_r']:+.4f}, {row['q75_r']:+.4f}]  "
                  f"{row['n_significant']}/{row['n_routes']}")

    with phase("Checking stationarity (ADF tests on detrended series)"):
        adf_results = check_stationarity(df)
        print(f"  OTP stationary:      {adf_results['n_otp_stationary']}/{adf_results['n_routes']} routes")
        print(f"  Ridership stationary: {adf_results['n_riders_stationary']}/{adf_results['n_routes']} routes")
        print(f"  Both stationary:     {adf_results['n_both_stationary']}/{adf_results['n_routes']} routes")
        n_diff = adf_results['n_routes'] - adf_results['n_both_stationary']
        print(f"  Will first-difference {n_diff} routes before Granger testing")

    with phase("Running Granger causality tests"):
        gdf = run_granger_tests(df, adf_results["adf_df"])
        valid = gdf.filter(pl.col("p_value").is_not_null())
        n_sig = valid.filter(pl.col("p_value") < 0.05).height
        n_bonf = valid.filter(pl.col("p_bonferroni") < 0.05).height
        print(f"  {n_sig}/{valid.height} routes significant at p < 0.05")
        print(f"  {n_bonf}/{valid.height} routes significant after Bonferroni correction")

        # Most significant routes
        top = valid.sort("p_value").head(10)
        print("\n  Top 10 routes by Granger significance:")
        print(f"  {'Route':<10} {'Lag':>4} {'F-stat':>8} {'p-value':>10} {'p-Bonf':>10}")
        for row in top.iter_rows(named=True):
            print(f"  {row['route_id']:<10} {row['best_lag']:>4d} {row['f_stat']:>8.2f} "
                  f"{row['p_value']:>10.4f} {row['p_bonferroni']:>10.4f}")

    with phase("Saving CSVs"):
        save_csv(ccdf, OUT / "lagged_crosscorr.csv")
        save_csv(gdf, OUT / "granger_results.csv")

    with phase("Generating charts"):
        make_crosscorr_chart(agg)
        make_granger_chart(gdf)


if __name__ == "__main__":
    main()

    

    

Sources

Name	Type	Why It Matters	Owner	Freshness	Caveat
otp_monthly	table	Primary analytical table used in this page's computations.	Produced by Data Ingestion.	Updated when the producing pipeline step is rerun.	Coverage depends on upstream source availability and ETL assumptions.
Upstream sources (5) file data/routes_by_month.csv — Monthly route OTP source table in wide format. file data/PRT_Current_Routes_Full_System_de0e48fcbed24ebc8b0d933e47b56682.csv — Current route metadata and mode classifications. file data/Transit_stops_(current)_by_route_e040ee029227468ebf9d217402a82fa9.csv — Current stop-to-route coverage and trip counts. file data/PRT_Stop_Reference_Lookup_Table.csv — Historical stop reference file with geography attributes. file data/average-ridership/12bb84ed-397e-435c-8d1b-8ce543108698.csv — Average ridership by route and month.
ridership_monthly	table	Primary analytical table used in this page's computations.	Produced by Data Ingestion.	Updated when the producing pipeline step is rerun.	Coverage depends on upstream source availability and ETL assumptions.
Upstream sources (5) file data/routes_by_month.csv — Monthly route OTP source table in wide format. file data/PRT_Current_Routes_Full_System_de0e48fcbed24ebc8b0d933e47b56682.csv — Current route metadata and mode classifications. file data/Transit_stops_(current)_by_route_e040ee029227468ebf9d217402a82fa9.csv — Current stop-to-route coverage and trip counts. file data/PRT_Stop_Reference_Lookup_Table.csv — Historical stop reference file with geography attributes. file data/average-ridership/12bb84ed-397e-435c-8d1b-8ce543108698.csv — Average ridership by route and month.
numpy	dependency	Runtime dependency required for this page's pipeline or analysis code.	Open-source Python ecosystem maintainers.	Version pinned by project environment until dependency updates are applied.	Library updates may change behavior or defaults.
polars	dependency	Runtime dependency required for this page's pipeline or analysis code.	Open-source Python ecosystem maintainers.	Version pinned by project environment until dependency updates are applied.	Library updates may change behavior or defaults.
scipy	dependency	Runtime dependency required for this page's pipeline or analysis code.	Open-source Python ecosystem maintainers.	Version pinned by project environment until dependency updates are applied.	Library updates may change behavior or defaults.
statsmodels	dependency	Runtime dependency required for this page's pipeline or analysis code.	Open-source Python ecosystem maintainers.	Version pinned by project environment until dependency updates are applied.	Library updates may change behavior or defaults.