Methodology

Every Indie Bench evaluation is scored against a published rubric. Each rubric has a stable identifier (e.g. IB-CODE-2026.1), a public version history, and a transparent scoring breakdown. We publish the rubric so you can disagree with the weighting and recompute the score from the raw per-task data.

When a rubric changes substantively, we bump the version, mark the previous version superseded, and keep it published. Evaluations scored under the old version stay valid — just labelled with the version they used.

Active rubrics

No active rubrics yet. The first one is being drafted.

Drafts