huggingface / dataset-dedupe-estimator
parquet dedupe estimator
GitHub Repo 
1.4K
lines of main code
11 files
0
lines of test code
0 files
0.4K
lines of other code
1 files
<1y
age
159 days
100%
main code touched
1 year (1.4K LOC)
100%
new main code
1 year (1.4K LOC)
0.8K
py
0.4K
rs
JINJA2
0.2K
jinja2
0.05K
toml

26

1

2025

generated by sokrates.dev (configuration) on 2025-06-30