dqbench

DQBench Leaderboard

Published results across all five categories. Higher is better; the score is the tier-weighted composite (0-100).
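
As a rough illustration of how a tier-weighted composite can be computed, the sketch below uses weights of 0.2, 0.4 and 0.4 for T1, T2 and T3. These weights are an assumption: this page does not state them, and they were chosen only because they reproduce the listed scores to within rounding. Under the same guess, T4 in the ER tables does not appear to enter the score. The names composite and ASSUMED_WEIGHTS are hypothetical and are not part of dqbench.

```python
# Assumed weights for T1, T2, T3. Not stated on this page; inferred from
# the published rows, so treat them as a guess rather than the definition.
ASSUMED_WEIGHTS = (0.2, 0.4, 0.4)


def composite(t1, t2, t3, weights=ASSUMED_WEIGHTS):
    """Weighted sum of per-tier percentages, on a 0-100 scale."""
    return sum(w * t for w, t in zip(weights, (t1, t2, t3)))


# Rows copied from the tables on this page:
# (tool, T1, T2, T3, published score)
ROWS = [
    ("GoldenCheck", 94.1, 90.9, 83.0, 88.40),
    ("cuallee (best-effort)", 30.0, 47.6, 13.8, 30.56),
    ("Splink", 66.7, 99.9, 84.6, 87.14),
    ("GoldenPipe", 80.0, 81.7, 56.8, 71.38),
]

# Tier percentages are shown rounded to one decimal place, so the
# estimate can differ from the published score by up to 0.05.
TOLERANCE = 0.05

for tool, t1, t2, t3, published in ROWS:
    estimate = composite(t1, t2, t3)
    assert abs(estimate - published) <= TOLERANCE, tool
    print(f"{tool}: estimated {estimate:.2f}, published {published:.2f}")
```

The small differences between estimated and published values are consistent with the published score being computed from unrounded tier results.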

This page is generated by the dqbench publish command from leaderboard/results/. Do not edit it by hand; to add a result, see how to submit.

Detect

Rank	Tool	Version	T1	T2	T3	Score	Submitter	Source	Date
1	GoldenCheck	1.2.0	94.1%	90.9%	83.0%	88.40	DQBench maintainers	reproduced	2026-05-24
2	Pandera (best-effort)	0.31.1	36.4%	38.1%	25.0%	32.51	DQBench maintainers	reproduced	2026-05-24
3	Pointblank (best-effort)	0.24.0	30.0%	47.6%	14.8%	30.97	DQBench maintainers	reproduced	2026-05-24
4	cuallee (best-effort)	0.15.4	30.0%	47.6%	13.8%	30.56	DQBench maintainers	reproduced	2026-05-24
5	Soda (best-effort)	4.0.0.b1	38.1%	23.5%	13.3%	22.36	DQBench maintainers	reproduced	2026-05-24
6	GX (best-effort)	1.17.2	36.4%	23.5%	12.5%	21.68	DQBench maintainers	reproduced	2026-05-24
7	GX (auto-profiled)	1.17.2	22.2%	42.1%	0.0%	21.29	DQBench maintainers	reproduced	2026-05-24
8	Soda (auto-profiled)	4.0.0.b1	0.0%	11.1%	6.2%	6.94	DQBench maintainers	reproduced	2026-05-24
9	ydata-profiling (auto-profiled)	4.18.4	0.0%	11.8%	0.0%	4.70	DQBench maintainers	reproduced	2026-05-24
10	frictionless (schema-inferred)	5.19.0	11.1%	0.0%	0.0%	2.22	DQBench maintainers	reproduced	2026-05-24
11	GX (zero-config)	1.17.2	0.0%	0.0%	0.0%	0.00	DQBench maintainers	reproduced	2026-05-24
12	Pandera (auto-profiled)	0.31.1	0.0%	0.0%	0.0%	0.00	DQBench maintainers	reproduced	2026-05-24
13	Pandera (zero-config)	0.31.1	0.0%	0.0%	0.0%	0.00	DQBench maintainers	reproduced	2026-05-24
14	Soda (zero-config)	4.0.0.b1	0.0%	0.0%	0.0%	0.00	DQBench maintainers	reproduced	2026-05-24

Transform

Rank	Tool	Version	T1	T2	T3	Score	Submitter	Source	Date
1	GoldenFlow	1.1.6	100.0%	100.0%	100.0%	100.00	DQBench maintainers	reproduced	2026-05-24
2	pandas (cleaning baseline)	3.0.3	100.0%	100.0%	100.0%	100.00	DQBench maintainers	reproduced	2026-05-24

ER

Rank	Tool	Version	T1	T2	T3	T4	Score	Submitter	Source	Date
1	GoldenMatch (auto-config)	1.18.1	89.3%	97.8%	88.4%	82.3%	92.36	DQBench maintainers	auto-config	2026-05-24
2	Splink	4.0.16	66.7%	99.9%	84.6%	66.7%	87.14	DQBench maintainers	reproduced	2026-05-24
3	recordlinkage	0.16	80.8%	83.8%	76.5%	33.3%	80.28	DQBench maintainers	reproduced	2026-05-24
4	GoldenMatch	1.18.1	87.0%	81.0%	67.8%	67.8%	76.91	DQBench maintainers	reproduced	2026-05-24

Pipeline

Rank	Tool	Version	T1	T2	T3	Score	Submitter	Source	Date
1	GoldenSuite (tuned)	1.2.0	80.0%	81.7%	67.3%	75.59	DQBench maintainers	reproduced	2026-05-24
2	GoldenPipe	1.2.0	80.0%	81.7%	56.8%	71.38	DQBench maintainers	reproduced	2026-05-24

Reference — not gate-verified

⚠️ These runs are not reproducible and are not enforced by CI. The tools are non-deterministic (auto-config that learns or samples, or active-learning matchers), so their numbers vary from run to run. They are shown for reference only; see each entry's notes for the observed range.

ER

Rank	Tool	Version	T1	T2	T3	T4	Score	Submitter	Source	Date
1	dedupe	3.0.3	91.6%	61.0%	53.4%	97.4%	64.08	DQBench maintainers	third-party	2026-05-24

Pipeline

Rank	Tool	Version	T1	T2	T3	Score	Submitter	Source	Date
1	GoldenSuite (zero-config)	1.2.0	49.8%	28.9%	30.9%	33.85	DQBench maintainers	auto-config	2026-05-24
