Skip to content

connachermurphy/srfb

Repository files navigation

srfb: Stochastic Radiant ForecastBench Submission

I use Stochastic Radiant to create a submission for ForecastBench. Both of these tools are intended for demonstration purposes. However, I plan to continue development of Stochastic Radiant, with associated evaluation in this repo.

References

The ForecastBench Wiki is available here.

Usage

The forecast routine requires a .env file in the project root:

ANTHROPIC_API_KEY={your_anthropic_api_key}

Use fb-forecast to generate forecasts on the latest question set (question_set below). Example usage:

make fb-forecast file_prefix="srfb_demo" model="claude-3-5-haiku-20241022"

Arguments:

  • file_prefix controls the output file name (forecasts_file_prefix_question_set)
  • model specifies the model (must be an Anthropic model, as Stochastic Radiant only supports Antrhopic at present)

Use fb-prepare to prepare a ForecastBench submission file. Example usage:

make fb-prepare file_prefix="srfb_demo" question_set="2025-08-03-llm" organization="Stochastic Radiant" model="claude-3-5-haiku-20241022" model_organization="Anthropic"

Arguments:

  • file_prefix: Identifier for the forecast files (e.g., "srfb_demo")
  • question_set: Name of the question set being forecasted (e.g., "2025-08-03-llm")
  • organization: Your organization name (for the ForecastBench leaderboard)
  • model: The model used to generate forecasts
  • model_organization: Organization that created the model

Use fb-post to post a forecast set to the relevant GCS bucket. Example usage:

make fb-post file_prefix="srfb_demo" question_set="2025-08-03-llm" organization="Stochastic Radiant" N=1 upload_to_gcs=True
  • file_prefix: Identifier for the forecast files (e.g., "srfb_demo")
  • question_set: Name of the question set being forecasted (e.g., "2025-08-03-llm")
  • organization: Your organization name (for the ForecastBench leaderboard)
  • N: the submission number (integer)
  • upload_to_gcs: Whether to upload the submission to Google Cloud Storage (True/False). If False, only creates a local copy.

Contributions

Please contact me if you're interested in contributing. These submission tools and Stochastic Radiant are both in active development.

About

Stochastic Radiant: ForecastBench submission

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published