σ Building block · used in 5 workflows

Regression forest

added by @grf · Jun 2, 2026

OTHER Machine LearningRandom Forest

@misc{grf,
  title        = {grf},
  author       = {Athey and Tibshirani and Wager},
  howpublished = {\url{https://grf-labs.github.io/grf/}},
  note         = {Software / documentation}
}

Summary by StatsDoge

Honest non-parametric regression for E[Y|X], with out-of-bag predictions and pointwise CIs.

You're looking at a building block — one of the estimators a workflow uses inside its pipeline. You reached it from a workflow step; it's used in 5 workflows (listed below).

⌥

https://github.com/grf-labs/grf @ v2.6.1tag

Open repo ↗

Paper ↗

Figure: GRF package logo. Source — grf-labs docs.

⚠️ Unofficial community write-up of a method from grf-labs/grf (pinned at v2.6.1). Not affiliated with the grf-labs authors — this summarizes the public documentation for demonstration. All credit & copyright belong to the original authors (Athey, Tibshirani, Wager, et al.).

What it does

The workhorse under everything else: an honest random forest for E[Y | X]. Beyond a point prediction it gives out-of-bag estimates and asymptotically-valid pointwise confidence intervals via the forest's adaptive weights.

rf <- regression_forest(X, Y)
predict(rf, estimate.variance = TRUE)
variable_importance(rf)

Why it matters here

Used to cross-fit the nuisance functions Y.hat / W.hat that orthogonalize the causal and instrumental forests.

Used in these workflows (5)

Smooth signals with a local linear forest

When the conditional mean is smooth: regression forest baseline → ll_regression_forest → tuning → diagnostics.

@grf
Cross-fold validation of heterogeneity

K-fold cross-fitted CATEs → RATE on out-of-fold priorities → honest verdict on heterogeneity strength.

@grf
An introduction to GRF (getting started)

A minimal first-contact recipe: regression forest, quantile forest, and a causal forest on the same data.

@grf
Assessing heterogeneity with RATE (AUTOC & Qini)

Causal forest → train/eval split → RATE with both AUTOC and Qini → TOC plot.

@grf
Heterogeneous treatment effects with a causal forest (GRF recipe)

The full GRF HTE playbook: cross-fit nuisances → causal forest → calibration → AIPW ATE → BLP → RATE → policy.

@grf

Discussion (2)

2

@oob_oscar · Jun 2, 2026

The unsung hero of the whole package. Half my pipelines just use this for the nuisance models.
6

@bandwidth_bo · Jun 2, 2026

If your signal is smooth, try ll_regression_forest instead — the boundary bias here can bite.

GRFRegression forest

What it does

Why it matters here

Used in these workflows (5)

Smooth signals with a local linear forest

Cross-fold validation of heterogeneity

An introduction to GRF (getting started)

Assessing heterogeneity with RATE (AUTOC & Qini)

Heterogeneous treatment effects with a causal forest (GRF recipe)

Discussion (2)

Regression forest