Generate training data from real-world sources
Go from messy historical data to verified training datasets — no manual labeling or annotation required.
Real-world data has timestamps, not clean labels.
Turn historical data into verified training datasets automatically using Future-as-Label.
Trusted by teams building AI
Turn messy data into
training-ready datasets
Choose Sources
Public web, news, filings—or your own docs, emails, tickets.
Define Questions
Natural language instructions + examples. No schema required.
Auto-Label
Outcomes from later in the data become ground-truth labels.
Verify
Every row traceable to sources. Full provenance built in.
Simple, powerful API
Generate verified datasets in a few lines of code. Our SDK handles the complexity.
- Grounded in real data, not synthetic generation
- Bootstrap with public feeds: news, SEC filings, Wikipedia
- Full provenance with citations and source docs
from lightningrod import (
    Pipeline,
    NewsSeedGenerator,
    ForwardLookingQuestionGenerator,
    WebSearchLabeler,
)

# Seed from news, generate forward-looking questions, resolve with web search
pipeline = Pipeline([
    NewsSeedGenerator(query="AI regulation"),
    ForwardLookingQuestionGenerator(
        instructions="Generate questions about future AI regulations and rulings"
    ),
    WebSearchLabeler(),
])

dataset = pipeline.run(n_samples=100)

Every Record is Verified
Each data point comes with evidence, citations, and confidence — not just a label.
- Ground-truth labels from real outcomes, not LLM opinions
- Full citations traceable to original sources
- Reasoning chain explaining how each answer was resolved
- Ready for fine-tuning — export as HuggingFace, Parquet, or JSON
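To make the export step concrete, here is a minimal, hedged sketch of writing verified rows out as JSON Lines. The row contents are hypothetical placeholders (the field names mirror the record format shown on this page), and this uses the standard library rather than the SDK's own export API, which is not shown here.

```python
import json

# Hypothetical verified rows; real rows would come from pipeline.run(...)
rows = [
    {
        "question": "Will X happen by 2025?",
        "correct_answer": 0,
        "source_citations": ["example.com/a"],
    }
]

# JSON Lines: one record per line, a common fine-tuning input format
with open("dataset.jsonl", "w") as f:
    for row in rows:
        f.write(json.dumps(row) + "\n")

# A Parquet export would typically go through pandas/pyarrow, e.g.:
# pd.DataFrame(rows).to_parquet("dataset.parquet")
```

A HuggingFace-ready dataset can be built from the same list of dicts, so one in-memory representation serves all three export targets.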
{
"question": "Will the EU AI Act be enforced against a major tech company by Feb 2025?",
"correct_answer": 0,
"resolution_reasoning": "Prohibited practices provisions took effect Feb 2, 2025. No enforcement actions announced...",
"source_citations": [
"reuters.com/...",
"ec.europa.eu/..."
]
}

Proven Results
The future is the label.
We pioneered Future-as-Label training: using the temporal structure of historical data to generate supervision at scale. We used it to beat frontier models 100x larger on live prediction benchmarks.
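The core idea can be sketched in a few lines: a question is posed "as of" some moment in the historical record, and its ground-truth label is resolved from events that occur later in that same record. Everything below (the Event class, resolve_label, and the sample events) is a hypothetical illustration, not the SDK's implementation.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class Event:
    timestamp: datetime
    description: str

def resolve_label(events, as_of, outcome_predicate):
    """Label = 1 if any event strictly AFTER the as-of time matches."""
    future = [e for e in events if e.timestamp > as_of]
    return int(any(outcome_predicate(e) for e in future))

# Toy historical record
events = [
    Event(datetime(2025, 1, 10), "Draft guidance published"),
    Event(datetime(2025, 2, 2), "Prohibited practices provisions take effect"),
]

# Question posed as of Jan 1, 2025; the answer lives in the data's future
label = resolve_label(
    events,
    as_of=datetime(2025, 1, 1),
    outcome_predicate=lambda e: "take effect" in e.description,
)
# label == 1: the outcome is read off later data, with no human annotation
```

Because the "future" is just a later slice of data you already have, supervision scales with the size of the historical record rather than with annotation budget.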