dqbench

DQBench Leaderboard

Published results across all five categories. Higher is better; the score is the tier-weighted composite (0-100).

Generated by dqbench publish from leaderboard/results/. Do not edit by hand — see how to submit.

Detect

Rank Tool Version T1 T2 T3 Score Submitter Source Date
1 GoldenCheck 1.2.0 94.1% 90.9% 83.0% 88.40 DQBench maintainers reproduced 2026-05-24
2 Pandera (best-effort) 0.31.1 36.4% 38.1% 25.0% 32.51 DQBench maintainers reproduced 2026-05-24
3 Pointblank (best-effort) 0.24.0 30.0% 47.6% 14.8% 30.97 DQBench maintainers reproduced 2026-05-24
4 cuallee (best-effort) 0.15.4 30.0% 47.6% 13.8% 30.56 DQBench maintainers reproduced 2026-05-24
5 Soda (best-effort) 4.0.0.b1 38.1% 23.5% 13.3% 22.36 DQBench maintainers reproduced 2026-05-24
6 GX (best-effort) 1.17.2 36.4% 23.5% 12.5% 21.68 DQBench maintainers reproduced 2026-05-24
7 GX (auto-profiled) 1.17.2 22.2% 42.1% 0.0% 21.29 DQBench maintainers reproduced 2026-05-24
8 Soda (auto-profiled) 4.0.0.b1 0.0% 11.1% 6.2% 6.94 DQBench maintainers reproduced 2026-05-24
9 ydata-profiling (auto-profiled) 4.18.4 0.0% 11.8% 0.0% 4.70 DQBench maintainers reproduced 2026-05-24
10 frictionless (schema-inferred) 5.19.0 11.1% 0.0% 0.0% 2.22 DQBench maintainers reproduced 2026-05-24
11 GX (zero-config) 1.17.2 0.0% 0.0% 0.0% 0.00 DQBench maintainers reproduced 2026-05-24
12 Pandera (auto-profiled) 0.31.1 0.0% 0.0% 0.0% 0.00 DQBench maintainers reproduced 2026-05-24
13 Pandera (zero-config) 0.31.1 0.0% 0.0% 0.0% 0.00 DQBench maintainers reproduced 2026-05-24
14 Soda (zero-config) 4.0.0.b1 0.0% 0.0% 0.0% 0.00 DQBench maintainers reproduced 2026-05-24

Transform

Rank Tool Version T1 T2 T3 Score Submitter Source Date
1 GoldenFlow 1.1.6 100.0% 100.0% 100.0% 100.00 DQBench maintainers reproduced 2026-05-24
2 pandas (cleaning baseline) 3.0.3 100.0% 100.0% 100.0% 100.00 DQBench maintainers reproduced 2026-05-24

ER

Rank Tool Version T1 T2 T3 T4 Score Submitter Source Date
1 GoldenMatch (auto-config) 1.18.1 89.3% 97.8% 88.4% 82.3% 92.36 DQBench maintainers auto-config 2026-05-24
2 Splink 4.0.16 66.7% 99.9% 84.6% 66.7% 87.14 DQBench maintainers reproduced 2026-05-24
3 recordlinkage 0.16 80.8% 83.8% 76.5% 33.3% 80.28 DQBench maintainers reproduced 2026-05-24
4 GoldenMatch 1.18.1 87.0% 81.0% 67.8% 67.8% 76.91 DQBench maintainers reproduced 2026-05-24

Pipeline

Rank Tool Version T1 T2 T3 Score Submitter Source Date
1 GoldenSuite (tuned) 1.2.0 80.0% 81.7% 67.3% 75.59 DQBench maintainers reproduced 2026-05-24
2 GoldenPipe 1.2.0 80.0% 81.7% 56.8% 71.38 DQBench maintainers reproduced 2026-05-24

Reference — not gate-verified

⚠️ These runs are not reproducible and are not enforced by CI — the tools are non-deterministic (auto-config that learns/samples, or active-learning matchers), so they produce different numbers across runs. Shown for reference only; see each entry’s notes for the observed range.

ER

Rank Tool Version T1 T2 T3 T4 Score Submitter Source Date
1 dedupe 3.0.3 91.6% 61.0% 53.4% 97.4% 64.08 DQBench maintainers third-party 2026-05-24

Pipeline

Rank Tool Version T1 T2 T3 Score Submitter Source Date
1 GoldenSuite (zero-config) 1.2.0 49.8% 28.9% 30.9% 33.85 DQBench maintainers auto-config 2026-05-24