Question 1

What happens with my data?

Accepted Answer

We value your privacy and only store data that is relevant for our research. We act in accordance with the EU GDPR. comparity.ai is an ongoing research project. The full dataset will be publicly released alongside the first publication from this project.

Question 2

How much can I use comparity?

Accepted Answer

Effectively, there are no limits to your usage. However, to protect from malicious intent each registered user account and IP address is limited to 500 requests per 24-hour window – which corresponds to approximately 1,775,000 tokens per user on average.

Question 3

How are the scores computed?

Accepted Answer

There are two different leaderboards: One based on vote ELO and one based on Cascading engagement. In the ELO leaderboard, each pairwise vote nudges the two models' ratings via the standard ELO update (K = 32 overall, K = 64 personal). Both-good and both-bad count as draws. Cascading engagement measures how long users dwell on each response before moving on, then strips out position bias by fitting a mathematical model. The score represents engagement relative to average.

Question 4

Why does it look different for different users?

Accepted Answer

comparity.ai has two distinct usage modes. Random mode shows responses from two anonymous models side by side for comparison. Cascading mode shows one response at a time, allowing you to swipe through all models to find the best answer.

FAQ

What happens with my data?

How much can I use comparity.ai?

How are the scores computed?

Why does it look different for different users?