awslabs / mlmax
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show how the files are distributed by size.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such gargantuan proportions that it is hard to work with.
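
The size categories listed above can be expressed as a small lookup. The following is a minimal sketch, not the report tool's actual implementation; the names `UPPER_BOUNDS`, `LABELS`, and `size_category` are hypothetical and chosen for illustration only.

```python
from bisect import bisect_left

# Inclusive upper bounds of the first four categories (assumed from the
# ranges in the intro); anything above the last bound is "very long".
UPPER_BOUNDS = [100, 200, 500, 1000]
LABELS = ["very small", "small", "medium size", "long", "very long"]


def size_category(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("a file must have at least one line of code")
    # bisect_left finds the first bound that is >= lines_of_code,
    # which is exactly the index of the matching category.
    return LABELS[bisect_left(UPPER_BOUNDS, lines_of_code)]


if __name__ == "__main__":
    for loc in (4, 100, 101, 265, 1001):
        print(loc, "->", size_category(loc))
```

With these bounds, a 265-line file such as my_training_pipeline.yaml falls into the "medium size" category, and a 100-line file is still "very small".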
File Size Overall
  • There are 42 files with 3,856 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 4 medium size files (972 lines of code)
    • 13 small files (1,614 lines of code)
    • 25 very small files (1,270 lines of code)
Share of lines of code per category:
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 0% | 25% | 41% | 32%
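
The percentages above appear to be each category's share of the 3,856 total lines of code. The sketch below reproduces them from the per-category line counts in the list; it assumes the report truncates fractions rather than rounding them (an assumption that happens to match the figures shown, and the reason they add up to 98%). The function name `share_percentages` is hypothetical.

```python
# Lines of code per category, taken from the "File Size Overall" list.
LOC_PER_CATEGORY = {
    "1001+": 0,
    "501-1000": 0,
    "201-500": 972,
    "101-200": 1614,
    "1-100": 1270,
}


def share_percentages(loc_per_category: dict[str, int]) -> dict[str, int]:
    """Return each category's share of total lines, truncated to whole percent."""
    total = sum(loc_per_category.values())
    if total == 0:
        return {name: 0 for name in loc_per_category}
    # Integer division truncates, e.g. 1614 / 3856 = 41.86% is reported as 41%.
    return {name: loc * 100 // total for name, loc in loc_per_category.items()}


if __name__ == "__main__":
    shares = share_percentages(LOC_PER_CATEGORY)
    print(" | ".join(f"{pct}%" for pct in shares.values()))
```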


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
yaml | 0% | 0% | 29% | 42% | 27%
py | 0% | 0% | 21% | 41% | 37%
toml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
Decomposition: primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
modules/pipeline/templates | 0% | 0% | 44% | 49% | 5%
modules/environment | 0% | 0% | 43% | 16% | 39%
modules/pipeline | 0% | 0% | 27% | 59% | 13%
src/mlmax | 0% | 0% | 43% | 24% | 32%
modules/monitoring/templates | 0% | 0% | 0% | 89% | 10%
modules/data/templates | 0% | 0% | 0% | 41% | 58%
modules/monitoring | 0% | 0% | 0% | 46% | 53%
modules/environment/util | 0% | 0% | 0% | 83% | 16%
modules/data | 0% | 0% | 0% | 0% | 100%
modules/data/src | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 42)
File | Location | # lines | # units
my_training_pipeline.yaml | modules/pipeline/templates | 265 | -
vpc.yaml | modules/environment | 261 | -
custom_steps.py | modules/pipeline | 232 | 4
monitoring.py | src/mlmax | 214 | 16
my_inference_pipeline.yaml | modules/pipeline/templates | 193 | -
training_pipeline_define.py | modules/pipeline | 156 | 1
roles.yaml | modules/data/templates | 135 | -
inference_pipeline_define.py | modules/pipeline | 125 | 1
preprocessing.py | src/mlmax | 121 | 7
smconfig.py | modules/environment/util/screening | 120 | 14
monitor_pipeline_define.py | modules/monitoring | 120 | 1
my_monitor_pipeline.yaml | modules/monitoring/templates | 113 | -
inference_pipeline_run.py | modules/pipeline | 110 | 3
training_pipeline_run.py | modules/pipeline | 108 | 2
roles.yaml | modules/pipeline/templates | 106 | -
roles.yaml | modules/monitoring/templates | 106 | -
ec2.yaml | modules/environment | 101 | -
my_data_pipeline.yaml | modules/data/templates | 94 | -
data_pipeline_define.py | modules/data | 78 | 1
sagemaker_nb.yaml | modules/environment | 77 | -
monitor_pipeline_create.py | modules/monitoring | 71 | 3
monitor_pipeline_run.py | modules/monitoring | 68 | 2
s3.yaml | modules/environment | 62 | -
stacks.yaml | modules/environment | 61 | -
data_pipeline_create.py | modules/data | 61 | 3
custom_steps.py | modules/data | 61 | 1
data_pipeline_run.py | modules/data | 58 | 2
train.py | src/mlmax | 56 | 7
evaluation.py | src/mlmax | 56 | 5
training_pipeline_create.py | modules/pipeline | 56 | 3
inference_pipeline_create.py | modules/pipeline | 56 | 3
preprocessing.py | modules/data/src | 55 | 1
master.yaml | modules/data/templates | 52 | -
scheduler.yaml | modules/data/templates | 44 | -
inference.py | src/mlmax | 39 | 5
setup.py | root | 35 | 1
master.yaml | modules/pipeline/templates | 35 | -
keys.yaml | modules/environment | 34 | -
master.yaml | modules/monitoring/templates | 25 | -
screen.py | modules/environment/util/screening/sourcedir_screen | 24 | -
__init__.py | src/mlmax | 8 | -
pyproject.toml | root | 4 | -
Files With Most Units (Top 20)
File | Location | # lines | # units
monitoring.py | src/mlmax | 214 | 16
smconfig.py | modules/environment/util/screening | 120 | 14
train.py | src/mlmax | 56 | 7
preprocessing.py | src/mlmax | 121 | 7
inference.py | src/mlmax | 39 | 5
evaluation.py | src/mlmax | 56 | 5
custom_steps.py | modules/pipeline | 232 | 4
inference_pipeline_run.py | modules/pipeline | 110 | 3
training_pipeline_create.py | modules/pipeline | 56 | 3
inference_pipeline_create.py | modules/pipeline | 56 | 3
monitor_pipeline_create.py | modules/monitoring | 71 | 3
data_pipeline_create.py | modules/data | 61 | 3
training_pipeline_run.py | modules/pipeline | 108 | 2
monitor_pipeline_run.py | modules/monitoring | 68 | 2
data_pipeline_run.py | modules/data | 58 | 2
setup.py | root | 35 | 1
training_pipeline_define.py | modules/pipeline | 156 | 1
inference_pipeline_define.py | modules/pipeline | 125 | 1
monitor_pipeline_define.py | modules/monitoring | 120 | 1
data_pipeline_define.py | modules/data | 78 | 1
Files With Long Lines (Top 0)

There are 0 files with lines longer than 120 characters. In total, there are 0 long lines.
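
As an illustration of the long-line check, the sketch below counts lines that exceed the 120-character limit in a piece of source text. It is an assumption about how such a count could be made, not the report tool's own logic; `count_long_lines` and `LONG_LINE_LIMIT` are hypothetical names.

```python
# Threshold taken from the report: lines longer than 120 characters are "long".
LONG_LINE_LIMIT = 120


def count_long_lines(text: str, limit: int = LONG_LINE_LIMIT) -> int:
    """Count lines in text whose length (excluding the newline) exceeds limit."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


if __name__ == "__main__":
    sample = "x = 1\n" + "y = " + "1 + " * 40 + "1\n"
    print(count_long_lines(sample))
```

A line of exactly 120 characters is not counted, since the report speaks of lines longer than 120 characters.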
