aws-samples / amazon-sagemaker-build-train-deploy
File Size

The distribution of file sizes, measured in lines of code.

Intro
  • File size measurements show how the code base is distributed across files of different sizes.
  • Files are classified into five categories based on their size in lines of code: 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown so large that it is hard to work with.
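The size categories above can be sketched as a small lookup. This is a minimal illustration, assuming the bucket boundaries are inclusive as listed; the names `BOUNDS`, `LABELS`, and `size_category` are hypothetical and are not taken from the analysis tool.

```python
from bisect import bisect_left

# Inclusive upper bounds of the first four buckets, as listed in the report.
# Anything above the last bound falls into the "very long" bucket.
BOUNDS = [100, 200, 500, 1000]
LABELS = ["very small", "small", "medium", "long", "very long"]


def size_category(lines_of_code: int) -> str:
    """Return the size bucket for a file with the given number of lines of code."""
    if lines_of_code < 1:
        raise ValueError("expected at least one line of code")
    # bisect_left keeps a value equal to a bound inside the lower bucket,
    # so 100 is "very small" and 101 is "small".
    return LABELS[bisect_left(BOUNDS, lines_of_code)]
```

For example, `size_category(495)` returns `"medium"`, which matches how this report classifies create_studio_domain_vpc.yaml.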
File Size Overall
  • There are 22 files with 1,601 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 3 medium size files (997 lines of code)
    • 0 small files (0 lines of code)
    • 19 very small files (604 lines of code)
Share of lines of code per size category:
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 0% | 62% | 0% | 37%
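The percentages in the overall distribution can be reproduced from the line counts listed under File Size Overall. The sketch below assumes the shares are computed over lines of code (not file counts) and truncated to whole numbers; that assumption is consistent with the figures shown, since 604 / 1,601 is about 37.7% and is reported as 37%. The function name `share_percent` is hypothetical.

```python
def share_percent(bucket_lines: int, total_lines: int) -> int:
    """Share of total lines of code in one bucket, truncated to a whole percent."""
    return bucket_lines * 100 // total_lines


# Lines of code per size category, copied from the overall summary.
lines_per_bucket = {
    "1001+": 0,
    "501-1000": 0,
    "201-500": 997,
    "101-200": 0,
    "1-100": 604,
}

total = sum(lines_per_bucket.values())  # 1601 lines of code in 22 files
shares = {bucket: share_percent(n, total) for bucket, n in lines_per_bucket.items()}
```

With these inputs `shares["201-500"]` is 62 and `shares["1-100"]` is 37. Because of truncation the shares add up to 99 rather than 100.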


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
yaml | 0% | 0% | 100% | 0% | 0%
py | 0% | 0% | 27% | 0% | 72%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
setup | 0% | 0% | 100% | 0% | 0%
08_projects/modelbuild/pipelines/endtoendmlsm | 0% | 0% | 99% | 0% | <1%
03_train_model/source_dir | 0% | 0% | 0% | 0% | 100%
04_deploy_model | 0% | 0% | 0% | 0% | 100%
08_projects/modelbuild/pipelines/endtoendmlsm/deploy | 0% | 0% | 0% | 0% | 100%
08_projects/modelbuild/pipelines/endtoendmlsm/dataprep | 0% | 0% | 0% | 0% | 100%
02_data_exploration_and_feature_eng/source_dir | 0% | 0% | 0% | 0% | 100%
08_projects/modelbuild/pipelines/endtoendmlsm/train | 0% | 0% | 0% | 0% | 100%
04_deploy_model/sklearn_source_dir | 0% | 0% | 0% | 0% | 100%
04_deploy_model/xgboost_source_dir | 0% | 0% | 0% | 0% | 100%
02_data_exploration_and_feature_eng | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 22)
File | Directory | # lines | # units
create_studio_domain_vpc.yaml | setup | 495 | -
create_studio_domain.yaml | setup | 270 | -
workflow.py | 08_projects/modelbuild/pipelines/endtoendmlsm | 232 | 5
monitoringjob_utils.py | 04_deploy_model | 83 | 3
training_debug.py | 03_train_model/source_dir | 70 | 2
preprocess.py | 08_projects/modelbuild/pipelines/endtoendmlsm/dataprep | 66 | -
preprocessor.py | 02_data_exploration_and_feature_eng/source_dir | 66 | -
train.py | 08_projects/modelbuild/pipelines/endtoendmlsm/train | 64 | 2
training.py | 03_train_model/source_dir | 64 | 2
inference.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy/sklearn | 38 | 4
inference.py | 04_deploy_model/sklearn_source_dir | 38 | 4
inference.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy/xgboost | 34 | 3
inference.py | 04_deploy_model/xgboost_source_dir | 34 | 3
notebook_utilities.py | 02_data_exploration_and_feature_eng | 25 | 1
notebook_utilities.py | 04_deploy_model | 14 | 2
postprocessor.py | 04_deploy_model | 2 | 1
__init__.py | 08_projects/modelbuild/pipelines/endtoendmlsm/train | 1 | -
__init__.py | 08_projects/modelbuild/pipelines/endtoendmlsm/dataprep | 1 | -
__init__.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy/sklearn | 1 | -
__init__.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy/xgboost | 1 | -
__init__.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy | 1 | -
__init__.py | 08_projects/modelbuild/pipelines/endtoendmlsm | 1 | -
Files With Most Units (Top 12)
File | Directory | # lines | # units
workflow.py | 08_projects/modelbuild/pipelines/endtoendmlsm | 232 | 5
inference.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy/sklearn | 38 | 4
inference.py | 04_deploy_model/sklearn_source_dir | 38 | 4
inference.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy/xgboost | 34 | 3
inference.py | 04_deploy_model/xgboost_source_dir | 34 | 3
monitoringjob_utils.py | 04_deploy_model | 83 | 3
train.py | 08_projects/modelbuild/pipelines/endtoendmlsm/train | 64 | 2
training_debug.py | 03_train_model/source_dir | 70 | 2
training.py | 03_train_model/source_dir | 64 | 2
notebook_utilities.py | 04_deploy_model | 14 | 2
postprocessor.py | 04_deploy_model | 2 | 1
notebook_utilities.py | 02_data_exploration_and_feature_eng | 25 | 1
Files With Long Lines (Top 6)

There are 6 files with lines longer than 120 characters. In total, there are 10 long lines.
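A long-line count of this kind can be approximated with a few lines of Python. This is only a sketch: the 120-character limit comes from the report, while the function name `count_long_lines` and the choice to measure raw line length (including indentation, excluding the line terminator) are assumptions about how the tool counts.

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count the lines in `text` that are strictly longer than `limit` characters."""
    # splitlines() drops the line terminators, so they do not count toward the length.
    return sum(1 for line in text.splitlines() if len(line) > limit)


# Small self-contained demonstration with synthetic content.
sample = "\n".join([
    "x" * 121,      # one character over the limit: counted
    "y" * 120,      # exactly at the limit: not counted
    "short line",   # well under the limit: not counted
])
long_lines_in_sample = count_long_lines(sample)
```

To apply it to a real file, read the file's contents as text and pass them to `count_long_lines`.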

File | Directory | # lines | # units | # long lines
monitoringjob_utils.py | 04_deploy_model | 83 | 3 | 3
preprocess.py | 08_projects/modelbuild/pipelines/endtoendmlsm/dataprep | 66 | - | 2
preprocessor.py | 02_data_exploration_and_feature_eng/source_dir | 66 | - | 2
inference.py | 08_projects/modelbuild/pipelines/endtoendmlsm/deploy/sklearn | 38 | 4 | 1
inference.py | 04_deploy_model/sklearn_source_dir | 38 | 4 | 1
create_studio_domain_vpc.yaml | setup | 495 | - | 1