Mentatcurated
Tool

FutureSim

An open-source benchmark that replays real news chronologically so AI agents must forecast unresolved world events and revise their beliefs as the corpus updates.

1 find · high confidence
Accessoss
MakerOpenForecaster (Shashwat Goel, Nikhil Chandak, Arvindh Arun, et al.; ELLIS Institute Tübingen / Max Planck Institute for Intelligent Systems / University of Stuttgart / University of Southampton)
Released2026-05-14
CategoryAI agent benchmark / forecasting evaluation framework
Visit openforecaster.github.io/futuresim →

The lenses

Scored on its own merits — not rolled up from the finds we publish.

Novelty 3
Impact · breadth 3
Impact · depth 3
Actionable 4
Substance 5
Hype 2

How this connects

Tap a node to open it