aws-samples / amazon-sagemaker-feature-store-end-to-end-workshop
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
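
The size categories listed above can be sketched as a small lookup. This is a minimal illustration, not the report tool's own code: the names `SIZE_BUCKETS` and `classify_file_size` are hypothetical, and only the bucket boundaries come from the list above.

```python
# Upper bound (inclusive, in lines of code) and label for each bounded category.
# The boundaries are the ones stated in the Intro; the names are illustrative.
SIZE_BUCKETS = [
    (100, "1-100"),      # very small files
    (200, "101-200"),    # small files
    (500, "201-500"),    # medium size files
    (1000, "501-1000"),  # long files
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the size category label for a file with the given line count."""
    for upper_bound, label in SIZE_BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    # Anything above the last bound is a very long file.
    return "1001+"


if __name__ == "__main__":
    for loc in (1036, 505, 485, 116, 85):
        print(loc, classify_file_size(loc))
```

A sorted list of upper bounds keeps the boundaries in one place, so changing a threshold does not require touching the classification logic.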
File Size Overall
  • There are 14 files with 2,660 lines of code.
    • 1 very long file (1,036 lines of code)
    • 1 long file (505 lines of code)
    • 2 medium size files (686 lines of code)
    • 1 small file (116 lines of code)
    • 9 very small files (317 lines of code)
Size category | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
Share of lines of code | 38% | 18% | 25% | 4% | 11%
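
The percentages above can be reproduced from the per-file line counts reported on this page. The sketch below assumes that each share is the category's lines of code divided by the total and truncated to a whole percent; truncation is an inference from the numbers (1,036 / 2,660 is about 38.9%, shown as 38%), not something the report states. The names `BOUNDS`, `LABELS`, and `size_distribution` are hypothetical.

```python
from bisect import bisect_left

# Inclusive upper bounds of the bounded categories, and the labels of all five.
BOUNDS = [100, 200, 500, 1000]
LABELS = ["1-100", "101-200", "201-500", "501-1000", "1001+"]

# Lines of code per file, as listed in the "Longest Files" table of this report.
LINE_COUNTS = [1036, 505, 485, 201, 116, 85, 60, 45, 38, 38, 25, 18, 5, 3]


def size_distribution(line_counts):
    """Return {label: truncated percentage of total lines of code}."""
    totals = {label: 0 for label in LABELS}
    for loc in line_counts:
        # bisect_left maps e.g. 100 -> index 0 and 101 -> index 1.
        totals[LABELS[bisect_left(BOUNDS, loc)]] += loc
    grand_total = sum(line_counts)
    # Integer division truncates, which is assumed to match the report.
    return {label: total * 100 // grand_total for label, total in totals.items()}


if __name__ == "__main__":
    for label, share in size_distribution(LINE_COUNTS).items():
        print(f"{label}: {share}%")
```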


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 38% | 18% | 25% | 4% | 11%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
utilities | 62% | 0% | 29% | 0% | 8%
07-module-feature-monitoring | 0% | 100% | 0% | 0% | 0%
06-module-automated-pipeline | 0% | 0% | 84% | 0% | 15%
04-module-working-with-online-store/custom_library | 0% | 0% | 0% | 100% | 0%
05-module-scalable-batch-ingestion/scripts | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 14)
File | Location | # lines | # units
feature_store_helper.py | utilities | 1036 | 50
feature_monitoring_utils.py | 07-module-feature-monitoring | 505 | 19
Utils.py | utilities | 485 | 31
pipeline_utils.py | 06-module-automated-pipeline | 201 | 12
helper.py | 04-module-working-with-online-store/custom_library | 116 | 5
batch_ingest_sm_pyspark.py | 05-module-scalable-batch-ingestion/scripts | 85 | 6
batch_ingest_sm_sklearn.py | 05-module-scalable-batch-ingestion/scripts | 60 | 5
batch_ingest_sm_pyspark_top.py | utilities | 45 | 3
lambda_function.py | 06-module-automated-pipeline | 38 | 1
batch_ingest_sm_sklearn_bottom.py | utilities | 38 | 3
batch_ingest_sm_pyspark_bottom.py | utilities | 25 | 2
batch_ingest_sm_sklearn_top.py | utilities | 18 | -
batch_ingest_sm_pyspark_empty.py | utilities | 5 | 1
batch_ingest_sm_sklearn_empty.py | utilities | 3 | 1
Files With Most Units (Top 13)
File | Location | # lines | # units
feature_store_helper.py | utilities | 1036 | 50
Utils.py | utilities | 485 | 31
feature_monitoring_utils.py | 07-module-feature-monitoring | 505 | 19
pipeline_utils.py | 06-module-automated-pipeline | 201 | 12
batch_ingest_sm_pyspark.py | 05-module-scalable-batch-ingestion/scripts | 85 | 6
helper.py | 04-module-working-with-online-store/custom_library | 116 | 5
batch_ingest_sm_sklearn.py | 05-module-scalable-batch-ingestion/scripts | 60 | 5
batch_ingest_sm_sklearn_bottom.py | utilities | 38 | 3
batch_ingest_sm_pyspark_top.py | utilities | 45 | 3
batch_ingest_sm_pyspark_bottom.py | utilities | 25 | 2
lambda_function.py | 06-module-automated-pipeline | 38 | 1
batch_ingest_sm_sklearn_empty.py | utilities | 3 | 1
batch_ingest_sm_pyspark_empty.py | utilities | 5 | 1
Files With Long Lines (Top 9)

There are 9 files with lines longer than 120 characters. In total, there are 29 long lines.

File | Location | # lines | # units | # long lines
feature_store_helper.py | utilities | 1036 | 50 | 11
feature_monitoring_utils.py | 07-module-feature-monitoring | 505 | 19 | 6
Utils.py | utilities | 485 | 31 | 4
pipeline_utils.py | 06-module-automated-pipeline | 201 | 12 | 2
batch_ingest_sm_pyspark.py | 05-module-scalable-batch-ingestion/scripts | 85 | 6 | 2
lambda_function.py | 06-module-automated-pipeline | 38 | 1 | 1
batch_ingest_sm_sklearn.py | 05-module-scalable-batch-ingestion/scripts | 60 | 5 | 1
batch_ingest_sm_sklearn_bottom.py | utilities | 38 | 3 | 1
batch_ingest_sm_pyspark_bottom.py | utilities | 25 | 2 | 1
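
The long-line count in this section can be approximated with a few lines of Python. This is a rough sketch under assumptions: the 120-character limit comes from the text above, but the report does not say whether it counts characters, bytes, or expanded tabs, so the numbers may differ slightly. The name `count_long_lines` is hypothetical.

```python
MAX_LINE_LENGTH = 120  # threshold stated in this section of the report


def count_long_lines(source: str, limit: int = MAX_LINE_LENGTH) -> int:
    """Count lines whose length (in characters, newline excluded) exceeds limit."""
    return sum(1 for line in source.splitlines() if len(line) > limit)


if __name__ == "__main__":
    # A small in-memory sample: one line over the limit, one exactly at it.
    sample = "x" * 121 + "\n" + "y" * 120 + "\nshort line\n"
    print(count_long_lines(sample))
```

To apply it to a real file, read the file's text (for example with `pathlib.Path(path).read_text()`) and pass it to `count_long_lines`.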