Methodology
Every Indie Bench evaluation is scored against a published rubric. Each
rubric has a stable identifier (e.g. IB-CODE-2026.1), a public
version history, and a transparent scoring breakdown. We publish the
rubric so you can disagree with the weighting and recompute the score from
the raw per-task data.
When a rubric changes substantively, we bump the version, mark the previous version superseded, and keep it published. Evaluations scored under the old version stay valid — just labelled with the version they used.
Active rubrics
No active rubrics yet. The first one is being drafted.
Drafts
-
IB-CODE-2026.1— The Indie Operator Coding Rubric draftA reproducible rubric for evaluating AI coding tools against the tasks indie hackers and solo SaaS operators actually do. Twelve tasks, six scoring dimensions, weighted to 100. Tools are scored under identical task conditions; per-task scores are published.