Workflows — StatsDoge

⌥ Pipeline composition stream Real-time node

⌥ 4 steps ⑂ 1 branch Index: 195 52 peers

10

Draw the DAG, find the adjustment set (ggdag & dagitty)

Before any estimation: encode your assumptions as a causal graph, enumerate the backdoor paths from treatment to outcome, and let the graph hand you the minimal set of covariates to adjust for.

Data prep Encode your assumptions as a DAG
Diagnostic / pre-tests Enumerate paths; spot backdoors & collide…
Estimation Minimal sufficient adjustment set
Robustness check Test the DAG's implications

Adjustment Set Backdoor Adjustment DAG

@ggdag · Jun 4, 2026

3 reviews
⌥ 4 steps ⑂ 1 branch Index: 207 86 peers

11

Mendelian randomization: genes as instruments for a causal effect (TwoSampleMR)

Use genetic variants as instruments to estimate the causal effect of an exposure on an outcome from GWAS summary data — with IVW plus pleiotropy-robust MR-Egger and weighted-median checks.

Data prep Harmonise SNP–exposure & SNP–outcome effe…
Diagnostic / pre-tests Check instrument strength
Estimation Inverse-variance weighted estimate
Robustness check Pleiotropy-robust: MR-Egger, weighted med…

MR-Egger Mendelian Randomization Pleiotropy

@twosamplemr · Jun 4, 2026

3 reviews
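
To make the estimation step of the Mendelian randomization workflow above concrete, here is a minimal sketch of a fixed-effect inverse-variance weighted (IVW) estimate computed from harmonised summary statistics. It is plain Python, not the TwoSampleMR API; the function name `ivw_estimate` and the three toy SNP effects are assumptions made up for illustration.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW estimate and standard error.

    Equivalent to a weighted regression of SNP-outcome effects on
    SNP-exposure effects through the origin, with weights 1 / se_outcome^2.
    """
    weights = [1.0 / se ** 2 for se in se_outcome]
    num = sum(w * bx * by for w, bx, by in zip(weights, beta_exposure, beta_outcome))
    den = sum(w * bx ** 2 for w, bx in zip(weights, beta_exposure))
    return num / den, math.sqrt(1.0 / den)


# Hypothetical harmonised effects for three SNPs (illustrative numbers only).
beta_x = [0.1, 0.2, 0.4]      # SNP-exposure effects
beta_y = [0.05, 0.10, 0.20]   # SNP-outcome effects
se_y = [0.01, 0.02, 0.02]     # standard errors of the SNP-outcome effects

ivw_beta, ivw_se = ivw_estimate(beta_x, beta_y, se_y)
print(f"IVW estimate: {ivw_beta:.3f} (SE {ivw_se:.3f})")
```

In this toy data every SNP implies the same ratio, so the IVW estimate simply recovers it. With real data the per-SNP ratios disagree, which is where the pleiotropy-robust checks in the last step come in.
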
⌥ 4 steps ⑂ 1 branch Index: 170 65 peers

10

Goodman-Bacon decomposition: what your TWFE estimate is averaging (bacondecomp)

A two-way fixed-effects DiD is a weighted average of all possible 2×2 comparisons — including 'forbidden' ones that use already-treated units as controls. This shows you the weights.

Data prep A staggered-adoption panel
Diagnostic / pre-tests Decompose into 2×2 comparisons
Robustness check Spot the forbidden comparisons
Reporting Read β as a weighted average

DiD Goodman-Bacon TWFE

@bacondecomp · Jun 4, 2026

2 reviews
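
As a small illustration of why the "forbidden" comparisons in the Goodman-Bacon workflow above matter, the sketch below computes two 2×2 DiD estimates by hand on a noise-free toy panel. It shows only the 2×2 building block, not the decomposition weights that bacondecomp reports; the helper `did_2x2` and all the numbers are assumptions for illustration.

```python
from statistics import mean


def did_2x2(treated_pre, treated_post, control_pre, control_post):
    """Classic 2x2 difference-in-differences from four lists of outcomes."""
    return (mean(treated_post) - mean(treated_pre)) - (
        mean(control_post) - mean(control_pre)
    )


# Toy panel, periods 1..5, untreated outcome is 0 for everyone.
# Treatment effects grow by 1 in each period after adoption.
early = [0, 1, 2, 3, 4]  # adopts in period 2
late = [0, 0, 0, 1, 2]   # adopts in period 4

# Clean comparison: early adopters vs not-yet-treated late adopters.
# Pre = period 1, post = periods 2-3.
clean = did_2x2(early[0:1], early[1:3], late[0:1], late[1:3])

# Forbidden comparison: late adopters vs already-treated early adopters.
# Pre = periods 2-3, post = periods 4-5.
forbidden = did_2x2(late[1:3], late[3:5], early[1:3], early[3:5])

print("clean 2x2:", clean)
print("forbidden 2x2:", forbidden)
```

The clean comparison recovers the early group's average effect over its window, while the forbidden one comes out negative even though every true effect is positive, because the control group's own effect is still growing.
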
⌥ 4 steps ⑂ 1 branch Index: 207 12 peers

11

Honest sensitivity bounds for parallel-trends violations (HonestDiD)

Stop betting everything on a pre-trends test. Allow the post-treatment trend to deviate within a transparent class, and report the confidence set — and the breakdown value where the effect would vanish.

Data prep Start from event-study coefficients
Diagnostic / pre-tests Read the pre-trends, don't just test them
Robustness check Bound the deviation: relative magnitudes …
Inference Robust confidence set & breakdown value

Event Study Honest DiD Pre-trends

@honestdid · Jun 4, 2026

3 reviews
⌥ 4 steps ⑂ 1 branch Index: 219 104 peers

12

Model, identify, estimate, refute — the DoWhy four-step recipe (DoWhy)

Make your assumptions explicit: draw a causal graph, identify the estimand by the backdoor criterion, estimate it, then actively try to refute it with placebo and confounding tests.

Data prep Model — encode the causal graph
Diagnostic / pre-tests Identify — apply the backdoor criterion
Estimation Estimate — adjust for the backdoor set
Robustness check Refute — placebo & unobserved-confounder …

Backdoor Adjustment Propensity Score Refutation

@dowhy · Jun 4, 2026

3 reviews
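
The "adjust for the backdoor set" step in the recipe above can be sketched for the simplest case: a binary treatment and a single discrete confounder. This is a hand-rolled stratification estimator, not the DoWhy API; `backdoor_ate` and the eight toy rows are assumptions for illustration, and the sketch requires every stratum to contain both treated and control units.

```python
from collections import defaultdict
from statistics import mean


def backdoor_ate(rows):
    """Backdoor-adjusted ATE from (z, t, y) rows.

    Averages the within-stratum difference in mean outcomes, weighting
    each stratum of z by its share of the sample.
    """
    strata = defaultdict(lambda: {0: [], 1: []})
    for z, t, y in rows:
        strata[z][t].append(y)
    n = sum(len(g[0]) + len(g[1]) for g in strata.values())
    ate = 0.0
    for g in strata.values():
        share = (len(g[0]) + len(g[1])) / n
        ate += share * (mean(g[1]) - mean(g[0]))
    return ate


# Toy data: z raises both the chance of treatment and the outcome.
rows = [(0, 0, 1)] * 3 + [(0, 1, 2)] + [(1, 0, 3)] + [(1, 1, 4)] * 3

naive = mean([y for _, t, y in rows if t == 1]) - mean(
    [y for _, t, y in rows if t == 0]
)
adjusted = backdoor_ate(rows)

print("naive difference:", naive)
print("backdoor-adjusted ATE:", adjusted)
```

The naive contrast is twice the adjusted one here because treated units are concentrated in the high-outcome stratum; conditioning on z removes that backdoor path.
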
⌥ 4 steps ⑂ 1 branch Index: 108 33 peers

9

Sharp regression discontinuity with robust bias correction (rdrobust)

Identify the effect at a cutoff: a local-polynomial RD with an MSE-optimal bandwidth and robust, bias-corrected confidence intervals.

Data prep Running variable, cutoff, outcome
Diagnostic / pre-tests rdplot — see the jump
Estimation Local-linear RD with bias correction
Robustness check Bandwidth & donut sensitivity

Regression Discontinuity

@rdrobust · Jun 4, 2026

0 reviews
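
For the estimation step of the RD workflow above, the following sketch shows the core of a local-linear estimate: a triangular-kernel weighted line fitted separately on each side of the cutoff, with the jump read off as the difference in intercepts. It is deliberately simpler than rdrobust: the bandwidth is hand-picked rather than MSE-optimal, and there is no bias correction or robust confidence interval. The function names and toy data are assumptions for illustration.

```python
def _weighted_intercept(points):
    """Intercept of a weighted least-squares line through (x, y, w) points.

    Needs at least two distinct x values with positive weight.
    """
    sw = sum(w for _, _, w in points)
    xbar = sum(w * x for x, _, w in points) / sw
    ybar = sum(w * y for _, y, w in points) / sw
    sxx = sum(w * (x - xbar) ** 2 for x, _, w in points)
    sxy = sum(w * (x - xbar) * (y - ybar) for x, y, w in points)
    return ybar - (sxy / sxx) * xbar


def sharp_rd_jump(x, y, cutoff, bandwidth):
    """Local-linear sharp RD estimate with a triangular kernel."""
    left, right = [], []
    for xi, yi in zip(x, y):
        dist = xi - cutoff
        weight = 1.0 - abs(dist) / bandwidth  # triangular kernel
        if weight <= 0:
            continue  # outside the bandwidth
        (right if dist >= 0 else left).append((dist, yi, weight))
    return _weighted_intercept(right) - _weighted_intercept(left)


# Toy data: linear in the running variable with a jump of 2 at the cutoff.
x = [-0.8, -0.6, -0.4, -0.2, 0.2, 0.4, 0.6, 0.8]
y = [1 + 0.5 * xi + (2.0 if xi >= 0 else 0.0) for xi in x]

jump = sharp_rd_jump(x, y, cutoff=0.0, bandwidth=1.0)
print(f"estimated jump at the cutoff: {jump:.3f}")
```

Rerunning `sharp_rd_jump` with several bandwidths is a crude version of the bandwidth sensitivity check in the last step.
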
⌥ 4 steps ⑂ 1 branch Index: 96 64 peers

8

Instrumental variables & 2SLS for an endogenous treatment (ivreg)

When treatment is endogenous, an instrument identifies the complier (LATE) effect via two-stage least squares — after you check the instrument is strong.

Data prep Outcome, endogenous treatment, instrument
Diagnostic / pre-tests Check instrument strength (first stage)
Estimation Two-stage least squares (ivreg)
Inference Interpret as a complier effect (LATE)

Instrumental Variables LATE

@ivreg · Jun 4, 2026

0 reviews
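
With one binary instrument and no covariates, two-stage least squares reduces to the Wald ratio: the instrument's effect on the outcome divided by its effect on treatment. The sketch below computes that ratio directly for the IV workflow above; it does not call ivreg, and `wald_estimate` and the toy data are assumptions for illustration.

```python
from statistics import mean


def wald_estimate(z, d, y):
    """Wald / IV estimate for a binary instrument z.

    Returns (late, first_stage), where first_stage is the difference in
    treatment uptake between z = 1 and z = 0.
    """
    d1 = mean([di for zi, di in zip(z, d) if zi == 1])
    d0 = mean([di for zi, di in zip(z, d) if zi == 0])
    y1 = mean([yi for zi, yi in zip(z, y) if zi == 1])
    y0 = mean([yi for zi, yi in zip(z, y) if zi == 0])
    first_stage = d1 - d0
    reduced_form = y1 - y0
    return reduced_form / first_stage, first_stage


# Toy data: the instrument raises uptake from 25% to 75%, and y = 1 + 2 * d.
z = [0, 0, 0, 0, 1, 1, 1, 1]
d = [0, 0, 0, 1, 0, 1, 1, 1]
y = [1, 1, 1, 3, 1, 3, 3, 3]

late, first_stage = wald_estimate(z, d, y)
print(f"first stage: {first_stage:.2f}, Wald/LATE estimate: {late:.2f}")
```

A first stage close to zero makes the ratio unstable, which is why the workflow checks instrument strength before estimating.
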
⌥ 4 steps ⑂ 1 branch Index: 108 69 peers

9

Design & diagnose a randomized experiment (DeclareDesign)

Specify a study as model–inquiry–data–answer, simulate it, and read its diagnosands — bias, power, coverage — before you run it.

Data prep Declare the model & potential outcomes
Estimation Difference-in-means estimator
Inference Neyman variance & confidence intervals
Diagnostic / pre-tests Diagnose: bias, power, coverage

Neyman Randomized Experiment

@declaredesign · Jun 4, 2026

0 reviews
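
The estimation and inference steps of the experiment workflow above can be sketched in a few lines: a difference in means with the conservative Neyman variance estimator and a normal-approximation interval. This is plain Python rather than DeclareDesign; the function name, the 1.96 critical value, and the toy outcomes are assumptions for illustration.

```python
import math
from statistics import mean, variance


def difference_in_means(treated, control, z=1.96):
    """Difference in means with the conservative Neyman standard error.

    Var = s_t^2 / n_t + s_c^2 / n_c, using sample variances.
    """
    estimate = mean(treated) - mean(control)
    se = math.sqrt(
        variance(treated) / len(treated) + variance(control) / len(control)
    )
    return estimate, se, (estimate - z * se, estimate + z * se)


# Hypothetical outcomes from a tiny two-arm experiment.
treated = [5, 7, 9]
control = [2, 4, 6]

estimate, se, ci = difference_in_means(treated, control)
print(f"estimate {estimate:.2f}, SE {se:.3f}, 95% CI ({ci[0]:.2f}, {ci[1]:.2f})")
```

Wrapping this estimator in a loop over simulated datasets is, in spirit, what the diagnosis step does to obtain bias, power, and coverage.
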
⌥ 5 steps ⑂ 1 branch Index: 60 32 peers

5

An observational ATE you can defend (balance → estimate → sensitivity)

My checklist for an observational effect: match, prove balance with cobalt, estimate on the matched sample, then quantify hidden-confounding risk with sensemakr.

Data prep Treatment, covariates, outcome
Data prep matchit() — nearest-neighbour matching
Diagnostic / pre-tests bal.tab() / love.plot() — cobalt
Estimation Estimate the ATT on matched data
Reporting sensemakr() — robustness value + contours

Matching Propensity Score Sensitivity Analysis

@tianzhuqin · Jun 4, 2026

0 reviews
⌥ 4 steps Index: 207 45 peers

11

Matching for causal inference (MatchIt)

Preprocess by matching so groups are comparable, check balance, then estimate the effect on the matched sample — design before analysis.

Data prep Treatment W + covariates X

▼

Estimation [MatchIt] Matching for causal inference —…

▼

Diagnostic / pre-tests Assess balance (summary / plot)

▼

Reporting Estimate the effect on matched data

Matching Propensity Score

@matchit · Jun 3, 2026

3 reviews
⌥ 4 steps Index: 171 107 peers

8

Covariate balance for matching & weighting (cobalt)

Before you trust an observational estimate, prove balance: SMDs, overlap, and a Love plot before vs after adjustment.

Data prep Treatment W + covariates X

▼

Data prep Estimate weights / matches (WeightIt / Ma…

▼

Diagnostic / pre-tests [cobalt] Balance tables & Love plots — ba…

▼

Reporting love.plot()

Matching Propensity Score

@cobalt · Jun 3, 2026

3 reviews
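
The balance check in the workflow above centres on the standardized mean difference (SMD). Here is a minimal sketch of one common definition, which divides the mean difference by the pooled standard deviation. The pooled denominator is an assumption; cobalt's own defaults may differ depending on the estimand. The function name and the toy ages are also made up for illustration.

```python
import math
from statistics import mean, variance


def standardized_mean_difference(treated, control):
    """SMD using the pooled standard deviation of the two groups."""
    pooled_sd = math.sqrt((variance(treated) + variance(control)) / 2)
    return (mean(treated) - mean(control)) / pooled_sd


# Hypothetical covariate (age) before and after matching.
age_treated = [30, 40, 50]
age_control_before = [20, 30, 40]
age_control_after = [28, 41, 51]  # matched controls

smd_before = standardized_mean_difference(age_treated, age_control_before)
smd_after = standardized_mean_difference(age_treated, age_control_after)

print(f"SMD before: {smd_before:.2f}, after: {smd_after:.2f}")
```

A Love plot is essentially this pair of numbers drawn for every covariate, often against a rule-of-thumb threshold such as 0.1.
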
⌥ 5 steps ⑂ 1 branch Index: 108 87 peers

9

Smooth signals with a local linear forest

When the conditional mean is smooth: regression forest baseline → ll_regression_forest → tuning → diagnostics.

Estimation [GRF] Regression forest
Estimation [GRF] Local linear forest

▼

Diagnostic / pre-tests Tune λ via cross-validation

▼

Diagnostic / pre-tests Calibration & boundary plot

▼

Reporting Side-by-side comparison

Machine Learning Random Forest

@grf · Jun 2, 2026

0 reviews
⌥ 6 steps ⑂ 1 branch Index: 132 42 peers

11

Evaluating a causal forest fit

Did the forest actually capture treatment-effect heterogeneity? Calibration → variable importance → BLP → omnibus tests.

Estimation [GRF] Causal forest

▼

Diagnostic / pre-tests test_calibration()
Diagnostic / pre-tests variable_importance()
Heterogeneity best_linear_projection()
Diagnostic / pre-tests OOB residual checks

▼

Reporting Fit-evaluation report

Causal Forest Heterogeneous Effects

@grf · Jun 2, 2026

0 reviews
⌥ 5 steps ⑂ 1 branch Index: 170 30 peers

10

Causal forest with time-to-event data (survival)

Censoring check → causal survival forest → RMST-scale AIPW ATE → calibration → report.

Diagnostic / pre-tests [GRF] Survival forest

▼

Estimation [GRF] Causal survival forest

▼

Inference [GRF] AIPW average treatment effect
Diagnostic / pre-tests test_calibration()

▼

Reporting RMST difference by subgroup

Causal Forest Heterogeneous Effects Survival

@grf · Jun 2, 2026

2 reviews
⌥ 8 steps ⑂ 1 branch Index: 292 48 peers

16

Heterogeneous treatment effects with a causal forest (GRF recipe)

The full GRF HTE playbook: cross-fit nuisances → causal forest → calibration → AIPW ATE → BLP → RATE → policy.

Data prep [GRF] Regression forest

▼

Estimation [GRF] Causal forest

▼

Diagnostic / pre-tests test_calibration()
Inference [GRF] AIPW average treatment effect
Heterogeneity best_linear_projection()
Heterogeneity [GRF] Rank-weighted ATE — RATE / AUTOC / …

▼

Robustness check Policy learning (policytree)

▼

Reporting CATE histogram + targeting report

Causal Forest Doubly Robust Heterogeneous Effects

@grf · Jun 2, 2026

4 reviews
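
The AIPW step in the GRF recipe above combines outcome-model predictions with inverse-propensity corrections. The sketch below computes the doubly robust scores and their average from nuisance estimates that are simply supplied as lists; in practice those would be cross-fitted, as the recipe's first step describes. This is not a reproduction of grf's own routine, and the function name and toy numbers are assumptions for illustration.

```python
import math
from statistics import mean, stdev


def aipw_ate(y, w, e_hat, mu0_hat, mu1_hat):
    """AIPW (doubly robust) ATE estimate and standard error.

    y: outcomes, w: binary treatment, e_hat: propensity estimates,
    mu0_hat / mu1_hat: predicted outcomes under control / treatment.
    """
    scores = [
        m1 - m0 + wi * (yi - m1) / e - (1 - wi) * (yi - m0) / (1 - e)
        for yi, wi, e, m0, m1 in zip(y, w, e_hat, mu0_hat, mu1_hat)
    ]
    return mean(scores), stdev(scores) / math.sqrt(len(scores))


# Toy inputs (illustrative only).
y = [3, 1, 4, 2]
w = [1, 0, 1, 0]
e_hat = [0.5, 0.5, 0.5, 0.5]
mu0_hat = [1, 1, 1, 1]
mu1_hat = [3, 3, 3, 3]

ate, ate_se = aipw_ate(y, w, e_hat, mu0_hat, mu1_hat)
print(f"AIPW ATE: {ate:.2f} (SE {ate_se:.3f})")
```

Propensity estimates near 0 or 1 blow up the correction terms, so an overlap check belongs before this step.
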