aws-samples / amazon-sagemaker-mlops-with-featurestore-and-datawrangler
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
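The size categories above can be sketched as a small lookup. This is a minimal illustration, not the report tool's implementation; the names `SIZE_BUCKETS` and `classify_file_size` are hypothetical, and only the thresholds come from the report.

```python
# Lower bounds and labels taken from the category definitions in the report.
# The names SIZE_BUCKETS and classify_file_size are illustrative only.
SIZE_BUCKETS = [
    (1001, "very long"),
    (501, "long"),
    (201, "medium size"),
    (101, "small"),
    (1, "very small"),
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the size category for a file with the given lines of code."""
    for lower_bound, label in SIZE_BUCKETS:
        if lines_of_code >= lower_bound:
            return label
    raise ValueError("expected at least 1 line of code")


if __name__ == "__main__":
    # Line counts taken from the longest-files table in this report.
    for name, loc in [("xgboost_pipeline.py", 572), ("cicd_construct.py", 268)]:
        print(name, "->", classify_file_size(loc))
```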
File Size Overall
  • There are 61 files with 4,570 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long file (572 lines of code)
    • 6 medium size files (1,652 lines of code)
    • 5 small files (665 lines of code)
    • 49 very small files (1,681 lines of code)
Share of lines of code per size category:
1001+: 0% | 501-1000: 12% | 201-500: 36% | 101-200: 14% | 1-100: 36%
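The percentages appear to be each category's share of the total lines of code, truncated to whole numbers (they sum to 98%, not 100%). The sketch below reproduces them under that assumption; the truncation rule is inferred from the figures and is not stated in the report, and the function name `category_shares` is hypothetical.

```python
# Lines of code per category, as listed under "File Size Overall".
LINES_PER_CATEGORY = {
    "1001+": 0,
    "501-1000": 572,
    "201-500": 1652,
    "101-200": 665,
    "1-100": 1681,
}


def category_shares(lines_per_category: dict) -> dict:
    """Return each category's share of total lines as a truncated percentage."""
    total = sum(lines_per_category.values())
    # Integer division truncates, which is an assumption about the report tool.
    return {name: loc * 100 // total for name, loc in lines_per_category.items()}


if __name__ == "__main__":
    shares = category_shares(LINES_PER_CATEGORY)
    print(" | ".join(f"{name}: {pct}%" for name, pct in shares.items()))
```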
File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 12% | 36% | 14% | 35%
toml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
Decomposition: primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
repos/build_pipeline/pipelines | 0% | 100% | 0% | 0% | 0%
infra | 0% | 0% | 92% | 0% | 7%
repos/serving/infra | 0% | 0% | 58% | 13% | 28%
demo-workspace/tensorflow | 0% | 0% | 82% | 0% | 17%
repos/serving/pipelines | 0% | 0% | 0% | 100% | 0%
repos/build_pipeline/infra | 0% | 0% | 0% | 60% | 39%
repos/features_ingestion_pipeline/infra | 0% | 0% | 0% | 45% | 54%
repos/serving/lambdas | 0% | 0% | 0% | 27% | 72%
demo-workspace/utils | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
repos/features_ingestion_pipeline/pipelines | 0% | 0% | 0% | 0% | 100%
repos/build_pipeline/scripts | 0% | 0% | 0% | 0% | 100%
demo-workspace/scripts | 0% | 0% | 0% | 0% | 100%
repos/serving/scripts | 0% | 0% | 0% | 0% | 100%
repos/serving | 0% | 0% | 0% | 0% | 100%
lambdas/functions/auto_approval | 0% | 0% | 0% | 0% | 100%
repos/build_pipeline | 0% | 0% | 0% | 0% | 100%
repos/features_ingestion_pipeline | 0% | 0% | 0% | 0% | 100%
repos/build_pipeline/lambdas | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Location | # lines | # units
xgboost_pipeline.py | repos/build_pipeline/pipelines | 572 | 9
mlops_featurestore_construct.py | infra | 376 | 3
model_endpoint_construct.py | repos/serving/infra | 275 | 4
cicd_construct.py | infra | 268 | 1
tf_pipeline.py | demo-workspace/tensorflow | 264 | 1
dynamodb_construct.py | repos/serving/infra | 238 | 1
service_catalog_stack.py | infra | 231 | 2
batch_transform_serving_pipeline.py | repos/serving/pipelines | 177 | 2
build_model_stack.py | repos/build_pipeline/infra | 130 | 1
features_ingestion_stack.py | repos/features_ingestion_pipeline/infra | 122 | 2
batch_transform_construct.py | repos/serving/infra | 121 | 1
lambda_function.py | repos/serving/lambdas/functions/xgboost_inference | 115 | 1
sm_pipeline_utils.py | repos/features_ingestion_pipeline/infra | 83 | 3
sm_pipeline_utils.py | repos/build_pipeline/infra | 83 | 3
sm_pipeline_utils.py | repos/serving/infra | 83 | 3
serving_stack.py | repos/serving/infra | 80 | 2
utils.py | infra | 70 | 6
xgboost_starter_script.py | repos/build_pipeline/scripts | 67 | -
lambda_function.py | repos/serving/lambdas/functions/processing-job-execution | 66 | 3
feature_store_utils.py | repos/features_ingestion_pipeline/infra | 65 | 4
xgboost_starter_script.py | demo-workspace/scripts | 64 | -
feature_ingestion_pipeline.py | repos/features_ingestion_pipeline/pipelines | 60 | 1
lambda_function.py | repos/serving/lambdas/functions/processing-job-status-check | 60 | 4
create_dataset.py | demo-workspace/utils | 60 | -
redeploy_construct.py | repos/serving/infra | 58 | 1
direct_deploy.py | root | 58 | 1
tensorflow_starter_script.py | demo-workspace/tensorflow | 57 | -
lambda_function.py | repos/serving/lambdas/functions/read-ddb | 54 | 2
lambda_function.py | repos/serving/lambdas/functions/read-sm | 54 | 2
lambda.py | lambdas/functions/auto_approval | 53 | 2
load-ddb-table.py | repos/serving/scripts/glue | 52 | -
feature_store_utils.py | demo-workspace/utils | 52 | 4
lambda_function.py | repos/serving/lambdas/functions/execute-state-machine | 51 | 2
pyproject.toml | repos/serving | 32 | -
serving_stack_utils.py | repos/serving/infra | 26 | 1
get_datasets.py | demo-workspace/utils | 24 | 1
pyproject.toml | root | 23 | -
parse_flow.py | repos/features_ingestion_pipeline/pipelines | 22 | 4
parse_flow.py | demo-workspace/utils | 22 | 4
app.py | repos/features_ingestion_pipeline | 20 | -
app.py | repos/build_pipeline | 20 | -
extract_metrics.py | repos/build_pipeline/lambdas/extract_metrics | 20 | 2
app.py | repos/serving | 20 | -
pyproject.toml | repos/build_pipeline | 16 | -
create_dataset.py | repos/build_pipeline/scripts | 16 | -
create_dataset.py | demo-workspace/scripts | 16 | -
pyproject.toml | repos/features_ingestion_pipeline | 15 | -
lambda_function.py | repos/serving/lambdas/functions/datafreshness-check | 15 | 1
app.py | root | 12 | -
create_dataset.py | repos/serving/scripts | 10 | -
Files With Most Units (Top 20)
File | Location | # lines | # units
xgboost_pipeline.py | repos/build_pipeline/pipelines | 572 | 9
utils.py | infra | 70 | 6
feature_store_utils.py | repos/features_ingestion_pipeline/infra | 65 | 4
parse_flow.py | repos/features_ingestion_pipeline/pipelines | 22 | 4
model_endpoint_construct.py | repos/serving/infra | 275 | 4
lambda_function.py | repos/serving/lambdas/functions/processing-job-status-check | 60 | 4
feature_store_utils.py | demo-workspace/utils | 52 | 4
parse_flow.py | demo-workspace/utils | 22 | 4
sm_pipeline_utils.py | repos/features_ingestion_pipeline/infra | 83 | 3
sm_pipeline_utils.py | repos/build_pipeline/infra | 83 | 3
sm_pipeline_utils.py | repos/serving/infra | 83 | 3
lambda_function.py | repos/serving/lambdas/functions/processing-job-execution | 66 | 3
mlops_featurestore_construct.py | infra | 376 | 3
features_ingestion_stack.py | repos/features_ingestion_pipeline/infra | 122 | 2
extract_metrics.py | repos/build_pipeline/lambdas/extract_metrics | 20 | 2
serving_stack.py | repos/serving/infra | 80 | 2
batch_transform_serving_pipeline.py | repos/serving/pipelines | 177 | 2
lambda_function.py | repos/serving/lambdas/functions/read-ddb | 54 | 2
lambda_function.py | repos/serving/lambdas/functions/execute-state-machine | 51 | 2
lambda_function.py | repos/serving/lambdas/functions/read-sm | 54 | 2
Files With Long Lines (Top 3)

There are 3 files with lines longer than 120 characters. In total, there are 3 long lines.

File | Location | # lines | # units | # long lines
features_ingestion_stack.py | repos/features_ingestion_pipeline/infra | 122 | 2 | 1
build_model_stack.py | repos/build_pipeline/infra | 130 | 1 | 1
redeploy_construct.py | repos/serving/infra | 58 | 1 | 1
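The long-line count above can be approximated with a few lines of Python. This is a rough sketch, assuming a "long line" is any line with more than 120 characters excluding the line terminator; the report tool may count differently (for example, how it treats tabs or trailing whitespace), and the function name `count_long_lines` is hypothetical.

```python
def count_long_lines(source: str, limit: int = 120) -> int:
    """Count lines whose length exceeds `limit` characters.

    Line terminators are not counted; the 120-character default
    matches the threshold stated in the report.
    """
    return sum(1 for line in source.splitlines() if len(line) > limit)


if __name__ == "__main__":
    sample = "short line\n" + "x" * 121 + "\n" + "y" * 120 + "\n"
    print(count_long_lines(sample))
```

To check a real file, read it with `pathlib.Path(path).read_text()` and pass the result to the function.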