mozilla / smart-tab-grouping
File Size

The distribution of file sizes, measured in lines of code.

File Size Overall
1001+: 51% | 501-1000: 9% | 201-500: 13% | 101-200: 14% | 1-100: 9%
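As a rough illustration of how such a distribution could be computed, the sketch below assigns each file to a size bucket and reports each bucket's share of all lines. The bucket boundaries come from the legend and the line counts are copied from the Longest Files table of this report. The function names are hypothetical, and the weighting by lines (rather than by file count) with downward rounding is an assumption inferred from the figures, not something the report states.

```python
from collections import Counter

# Upper bounds (inclusive) and labels, copied from the report legend.
BUCKETS = [(100, "1-100"), (200, "101-200"), (500, "201-500"), (1000, "501-1000")]
LABELS = ["1001+", "501-1000", "201-500", "101-200", "1-100"]


def bucket_for(line_count):
    """Return the legend label for a file with the given number of lines."""
    for upper, label in BUCKETS:
        if line_count <= upper:
            return label
    return "1001+"


def size_distribution(line_counts):
    """Share of all lines that fall in each bucket, as whole percentages.

    Assumption: shares are weighted by lines (not by file count) and
    rounded down. This is inferred from the figures, not documented.
    """
    lines_per_bucket = Counter()
    for n in line_counts:
        lines_per_bucket[bucket_for(n)] += n
    total = sum(lines_per_bucket.values())
    return {label: lines_per_bucket[label] * 100 // total for label in LABELS}


# Line counts copied from the "Longest Files (Top 49)" table.
LINE_COUNTS = [
    2805, 2639, 1285, 760, 524, 429, 359, 267, 267, 256, 228, 196, 195,
    176, 176, 167, 153, 147, 131, 116, 116, 113, 113, 111, 97, 97, 87,
    76, 76, 70, 68, 68, 68, 68, 57, 50, 50, 38, 37, 37, 34, 32, 30, 29,
    29, 21, 16, 16, 16,
]

distribution = size_distribution(LINE_COUNTS)
print(" | ".join(f"{label}: {distribution[label]}%" for label in LABELS))
```

Under these assumptions the sketch prints `1001+: 51% | 501-1000: 9% | 201-500: 13% | 101-200: 14% | 1-100: 9%`, which matches the overall figures; rounding down would also explain why the shares do not add up to 100%.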
File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
ipynb | 78% | 14% | 4% | 2% | <1%
py | 0% | 0% | 33% | 39% | 27%
toml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
notebooks | 78% | 14% | 4% | 2% | <1%
src | 0% | 0% | 33% | 39% | 27%
ROOT | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 49)
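File | # lines | # units
Common Crawl.ipynb (in notebooks) | 2805 | -
[name missing] | 2639 | -
[name missing] | 1285 | -
[name missing] | 760 | -
Benchmarking.ipynb (in notebooks) | 524 | -
[name missing] | 429 | 13
[name missing] | 359 | -
grouping_pipeline.py (in src/jobs/util) | 267 | 20
[name missing] | 267 | 20
tune_t5.py (in src/jobs) | 256 | 6
[name missing] | 228 | 6
[name missing] | 196 | 8
[name missing] | 195 | -
tab_titles.py (in src/jobs/util) | 176 | 23
tab_titles.py (in src/util) | 176 | 23
[name missing] | 167 | 8
tune_bart.py (in src/jobs) | 153 | 3
distill_t5.py (in src/jobs) | 147 | 4
tune_gpt2.py (in src/jobs) | 131 | 4
language_model.py (in src/jobs/util) | 116 | 14
[name missing] | 116 | 14
labeled_data_utils.py (in src/jobs/util) | 113 | 4
[name missing] | 113 | 4
[name missing] | 111 | 6
shorten_topic_length.py (in src/jobs/util) | 97 | 6
[name missing] | 97 | 6
tune_base.py (in src/jobs) | 87 | 5
key_document_finder.py (in src/jobs/util) | 76 | 9
[name missing] | 76 | 9
[name missing] | 70 | 5
secrets.py (in src/jobs/util) | 68 | 10
silhouette.py (in src/jobs/util) | 68 | 4
secrets.py (in src/util) | 68 | 10
silhouette.py (in src/util) | 68 | 4
Onnx.ipynb (in notebooks) | 57 | -
evaluate.py (in src/jobs/util) | 50 | 3
evaluate.py (in src/util) | 50 | 3
[name missing] | 38 | -
topic_utils.py (in src/jobs/util) | 37 | 2
topic_utils.py (in src/util) | 37 | 2
[name missing] | 34 | 5
utils.py (in src/jobs) | 32 | 2
[name missing] | 30 | 2
storage.py (in src/jobs/util) | 29 | 3
storage.py (in src/util) | 29 | 3
[name missing] | 21 | 1
[name missing] | 16 | -
[name missing] | 16 | -
[name missing] | 16 | -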
Files With Most Units (Top 37)
File | # lines | # units
tab_titles.py (in src/jobs/util) | 176 | 23
tab_titles.py (in src/util) | 176 | 23
grouping_pipeline.py (in src/jobs/util) | 267 | 20
[name missing] | 267 | 20
language_model.py (in src/jobs/util) | 116 | 14
[name missing] | 116 | 14
[name missing] | 429 | 13
secrets.py (in src/jobs/util) | 68 | 10
secrets.py (in src/util) | 68 | 10
key_document_finder.py (in src/jobs/util) | 76 | 9
[name missing] | 76 | 9
[name missing] | 196 | 8
[name missing] | 167 | 8
[name missing] | 111 | 6
[name missing] | 228 | 6
shorten_topic_length.py (in src/jobs/util) | 97 | 6
tune_t5.py (in src/jobs) | 256 | 6
[name missing] | 97 | 6
[name missing] | 34 | 5
[name missing] | 70 | 5
tune_base.py (in src/jobs) | 87 | 5
labeled_data_utils.py (in src/jobs/util) | 113 | 4
silhouette.py (in src/jobs/util) | 68 | 4
distill_t5.py (in src/jobs) | 147 | 4
tune_gpt2.py (in src/jobs) | 131 | 4
[name missing] | 113 | 4
silhouette.py (in src/util) | 68 | 4
tune_bart.py (in src/jobs) | 153 | 3
evaluate.py (in src/jobs/util) | 50 | 3
storage.py (in src/jobs/util) | 29 | 3
evaluate.py (in src/util) | 50 | 3
storage.py (in src/util) | 29 | 3
utils.py (in src/jobs) | 32 | 2
topic_utils.py (in src/jobs/util) | 37 | 2
topic_utils.py (in src/util) | 37 | 2
[name missing] | 30 | 2
[name missing] | 21 | 1
Files With Long Lines (Top 24)

There are 24 files with lines longer than 120 characters. In total, there are 831 long lines.

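File | # lines | # units | # long lines
[name missing] | 1285 | - | 744
[name missing] | 2639 | - | 29
Benchmarking.ipynb (in notebooks) | 524 | - | 6
Common Crawl.ipynb (in notebooks) | 2805 | - | 6
[name missing] | 429 | 13 | 4
utils.py (in src/jobs) | 32 | 2 | 4
[name missing] | 167 | 8 | 4
Onnx.ipynb (in notebooks) | 57 | - | 3
topic_utils.py (in src/jobs/util) | 37 | 2 | 3
language_model.py (in src/jobs/util) | 116 | 14 | 3
grouping_pipeline.py (in src/jobs/util) | 267 | 20 | 3
topic_utils.py (in src/util) | 37 | 2 | 3
[name missing] | 116 | 14 | 3
[name missing] | 267 | 20 | 3
tab_titles.py (in src/jobs/util) | 176 | 23 | 2
tune_gpt2.py (in src/jobs) | 131 | 4 | 2
tab_titles.py (in src/util) | 176 | 23 | 2
[name missing] | 760 | - | 1
[name missing] | 359 | - | 1
[name missing] | 16 | - | 1
[name missing] | 16 | - | 1
distill_t5.py (in src/jobs) | 147 | 4 | 1
tune_base.py (in src/jobs) | 87 | 5 | 1
[name missing] | 16 | - | 1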
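
A minimal sketch of how long-line counts like these could be gathered, assuming a line counts as long when it has more than 120 characters (the threshold stated in this section). The function names and the `files` mapping are hypothetical. How the report itself treats tabs, trailing whitespace, or notebook JSON is not stated, so the numbers it produces may differ.

```python
MAX_LINE_LENGTH = 120  # threshold stated in the report


def count_long_lines(text, limit=MAX_LINE_LENGTH):
    """Count lines whose length exceeds `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def summarize_long_lines(files, limit=MAX_LINE_LENGTH):
    """Map file name -> long-line count, keeping only files with long lines.

    `files` is a hypothetical mapping of file name to file contents.
    """
    counts = {name: count_long_lines(text, limit) for name, text in files.items()}
    return {name: n for name, n in counts.items() if n > 0}


# Made-up sample input, not taken from the repository.
sample = {
    "short.py": "x = 1\n",
    "wide.py": "y = '" + "a" * 130 + "'\nz = 2\n",
}
summary = summarize_long_lines(sample)
print(f"{len(summary)} file(s), {sum(summary.values())} long line(s)")
```

The two summary figures correspond to the file count and the total at the top of this section: the number of entries in the mapping and the sum of its values.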