Field notes

Blog

Short essays on the parts of pairwise ranking that bite you in production.

May 28, 2026

Stopping rules: when have you compared enough?

There is no universal answer, but there are three honest tests. We walk through each, with the queries you can run against the compere API.

stopping-rulesucboperations
May 20, 2026

Reading the Elo output without lying

Compere ships Elo, not Bradley-Terry. The numbers it produces are easy to misread. A field guide to what an Elo gap actually means.

elointerpretationcommunication
May 12, 2026

Why 50 pairwise votes beat 500 ratings

Rating scales drift, anchor, and lie. Pairwise comparisons survive all three. Here is the math we use, and where it actually breaks down.

pairwiseeloucbstudy-design