σ StatsDoge Causal inference workflows
9
σ Building block · used in 5 workflows

Regression forest

OTHER Machine LearningRandom Forest
Source grf — Athey, Tibshirani & Wager
Summary by StatsDoge

Honest non-parametric regression for E[Y|X], with out-of-bag predictions and pointwise CIs.

You're looking at a building block — one of the estimators a workflow uses inside its pipeline. You reached it from a workflow step; it's used in 5 workflows (listed below).

GRF package logo

Figure: GRF package logo. Source — grf-labs docs.

⚠️ Unofficial community write-up of a method from grf-labs/grf (pinned at v2.6.1). Not affiliated with the grf-labs authors — this summarizes the public documentation for demonstration. All credit & copyright belong to the original authors (Athey, Tibshirani, Wager, et al.).

What it does

The workhorse under everything else: an honest random forest for E[Y | X]. Beyond a point prediction it gives out-of-bag estimates and asymptotically-valid pointwise confidence intervals via the forest's adaptive weights.

rf <- regression_forest(X, Y)
predict(rf, estimate.variance = TRUE)
variable_importance(rf)

Why it matters here

Used to cross-fit the nuisance functions Y.hat / W.hat that orthogonalize the causal and instrumental forests.

Used in these workflows (5)

Discussion (2)

  • 2

    The unsung hero of the whole package. Half my pipelines just use this for the nuisance models.

  • 6

    If your signal is smooth, try ll_regression_forest instead — the boundary bias here can bite.