huggingface / docmatix
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 0% | 67% | 32%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 0% | 67% | 32%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
generation0% | 0% | 0% | 84% | 15%
clean_and_create0% | 0% | 0% | 100% | 0%
create_only_with_pdfs0% | 0% | 0% | 83% | 16%
florence_2_dataset0% | 0% | 0% | 0% | 100%
analysis0% | 0% | 0% | 0% | 100%
zero_shot_exp0% | 0% | 0% | 0% | 100%
Longest Files (Top 9)
File# lines# units
llm_swarm_script.py
in generation
197 9
load_data.py
in clean_and_create
173 8
load_data.py
in create_only_with_pdfs
111 4
create_florence_2_dataset.py
in florence_2_dataset
71 2
plot.py
in analysis
55 -
base_prompts.py
in generation
36 1
zero_shot.py
in zero_shot_exp
29 -
upload_data.py
in create_only_with_pdfs
22 1
14 1
Files With Most Units (Top 7)
File# lines# units
llm_swarm_script.py
in generation
197 9
load_data.py
in clean_and_create
173 8
load_data.py
in create_only_with_pdfs
111 4
create_florence_2_dataset.py
in florence_2_dataset
71 2
14 1
base_prompts.py
in generation
36 1
upload_data.py
in create_only_with_pdfs
22 1
Files With Long Lines (Top 6)

There are 6 files with lines longer than 120 characters. In total, there are 17 long lines.

File# lines# units# long lines
base_prompts.py
in generation
36 1 5
create_florence_2_dataset.py
in florence_2_dataset
71 2 3
load_data.py
in create_only_with_pdfs
111 4 3
load_data.py
in clean_and_create
173 8 3
llm_swarm_script.py
in generation
197 9 2
plot.py
in analysis
55 - 1