huggingface / dataset-dedupe-estimator
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 67% | 19% | 13%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 62% | 22% | 15%
rs0% | 0% | 68% | 26% | 5%
jinja20% | 0% | 100% | 0% | 0%
toml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
de0% | 0% | 62% | 22% | 15%
src0% | 0% | 68% | 26% | 5%
ROOT0% | 0% | 81% | 0% | 18%
Longest Files (Top 11)
File# lines# units
cli.py
in de
486 12
show.rs
in src
265 3
jinja2
201 -
177 15
store.rs
in src
102 -
83 5
35 2
30 -
lib.rs
in src
22 1
Cargo.toml
in root
16 -
1 -
Files With Most Units (Top 6)
File# lines# units
177 15
cli.py
in de
486 12
83 5
show.rs
in src
265 3
35 2
lib.rs
in src
22 1
Files With Long Lines (Top 2)

There are 2 files with lines longer than 120 characters. In total, there are 13 long lines.

File# lines# units# long lines
jinja2
201 - 7
cli.py
in de
486 12 6