aws-samples / aws-research-workshops
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 60 files with 5,582 lines of code.
    • 0 very long files (0 lines of code)
    • 2 long files (1,336 lines of code)
    • 7 medium size files (2,031 lines of codeclsfd_ftr_w_mp_ins)
    • 11 small files (1,353 lines of code)
    • 40 very small files (862 lines of code)
0% | 23% | 36% | 24% | 15%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 27% | 26% | 30% | 16%
yaml0% | 36% | 33% | 20% | 8%
js0% | 0% | 98% | 0% | 1%
yml0% | 0% | 48% | 42% | 8%
html0% | 0% | 0% | 0% | 100%
css0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
lib0% | 94% | 0% | 0% | 5%
notebooks/building_data_lakes0% | 43% | 41% | 8% | 6%
notebooks/iot_greengress0% | 0% | 52% | 35% | 12%
notebooks/serverless_apps0% | 0% | 51% | 20% | 28%
notebooks/parallelcluster0% | 0% | 67% | 0% | 32%
notebooks/escience_series0% | 0% | 0% | 81% | 18%
ROOT0% | 0% | 0% | 100% | 0%
notebooks/ml_tensorflow0% | 0% | 0% | 52% | 47%
notebooks/hpc0% | 0% | 0% | 0% | 100%
notebooks/intro_to_aws0% | 0% | 0% | 0% | 100%
notebooks/container0% | 0% | 0% | 0% | 100%
notebooks/genomics_pipeline0% | 0% | 0% | 0% | 100%
notebooks/spot0% | 0% | 0% | 0% | 100%
notebooks0% | 0% | 0% | 0% | 100%
notebooks/account-factory0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
workshop.py
in lib
675 36
kinesis-swiss-army.yaml
in notebooks/building_data_lakes/cfn
661 -
smoothie.js
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/static
489 27
sf_orch_glue.yaml
in notebooks/building_data_lakes/cfn
315 -
emr.yaml
in notebooks/building_data_lakes/cfn
303 -
code-pipeline.yml
in notebooks/serverless_apps
260 -
web.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
228 15
pcluster_athena.py
in notebooks/parallelcluster
221 18
lambda_setup.py
in notebooks/iot_greengress/health_tracker
215 15
utils.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
161 8
ec2_ssm.yaml
in notebooks/escience_series/cfn
141 -
group_setup.py
in notebooks/iot_greengress/health_tracker
137 6
redshift.yaml
in notebooks/building_data_lakes/cfn
133 -
research-env.yml
in root
126 -
heartbeat.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
121 2
mnist.py
in notebooks/escience_series
114 9
mnist.py
in notebooks/ml_tensorflow
109 8
greengrass-device.yaml
in notebooks/iot_greengress/health_tracker
106 -
heartrate.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
103 2
serverless.yml
in notebooks/serverless_apps
102 -
sqlserver-rds.yaml
in notebooks/building_data_lakes/cfn
87 -
fastq-batch-roles.yaml
in notebooks/hpc
73 -
pcluster_cost_estimator.py
in notebooks/parallelcluster
70 7
input.html
in notebooks/ml_tensorflow
58 -
input.html
in notebooks/escience_series
58 -
tracker_brain.py
in notebooks/iot_greengress/health_tracker/lambda/TrackerBrain
52 2
index.html
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/templates
46 -
style.css
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/static/css
42 -
tf_scripts.py
in lib
40 3
update.py
in notebooks/serverless_apps/todos
36 1
pipemode.py
in notebooks/ml_tensorflow
34 4
db-create.yml
in notebooks/parallelcluster
31 -
create.py
in notebooks/serverless_apps/todos
27 1
error_detector.py
in notebooks/iot_greengress/health_tracker/lambda/TrackerErrorDetector
22 2
frequency.html
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/templates
22 -
get.py
in notebooks/serverless_apps/todos
18 1
topic.html
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/templates
17 -
buildspec.yml
in notebooks/serverless_apps
17 -
delete.py
in notebooks/serverless_apps/todos
14 1
list.py
in notebooks/serverless_apps/todos
13 1
hello.html
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/templates
7 -
decimalencoder.py
in notebooks/serverless_apps/todos
7 1
get-sls-prefix.js
in notebooks/serverless_apps
6 -
project_path.py
in notebooks/ml_tensorflow
5 -
project_path.py
in notebooks/intro_to_aws
5 -
project_path.py
in notebooks/container
5 -
project_path.py
in notebooks/genomics_pipeline
5 -
project_path.py
in notebooks/iot_greengress
5 -
project_path.py
in notebooks/spot
5 -
project_path.py
in notebooks/building_data_lakes
5 -
Files With Most Units (Top 20)
File# lines# units
workshop.py
in lib
675 36
smoothie.js
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/static
489 27
pcluster_athena.py
in notebooks/parallelcluster
221 18
web.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
228 15
lambda_setup.py
in notebooks/iot_greengress/health_tracker
215 15
mnist.py
in notebooks/escience_series
114 9
mnist.py
in notebooks/ml_tensorflow
109 8
utils.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
161 8
pcluster_cost_estimator.py
in notebooks/parallelcluster
70 7
group_setup.py
in notebooks/iot_greengress/health_tracker
137 6
pipemode.py
in notebooks/ml_tensorflow
34 4
tf_scripts.py
in lib
40 3
tracker_brain.py
in notebooks/iot_greengress/health_tracker/lambda/TrackerBrain
52 2
error_detector.py
in notebooks/iot_greengress/health_tracker/lambda/TrackerErrorDetector
22 2
heartrate.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
103 2
heartbeat.py
in notebooks/iot_greengress/health_tracker/tracker/ggd
121 2
update.py
in notebooks/serverless_apps/todos
36 1
decimalencoder.py
in notebooks/serverless_apps/todos
7 1
get.py
in notebooks/serverless_apps/todos
18 1
delete.py
in notebooks/serverless_apps/todos
14 1
Files With Long Lines (Top 13)

There are 13 files with lines longer than 120 characters. In total, there are 47 long lines.

File# lines# units# long lines
workshop.py
in lib
675 36 13
pcluster_athena.py
in notebooks/parallelcluster
221 18 6
index.html
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/templates
46 - 4
smoothie.js
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/static
489 27 4
kinesis-swiss-army.yaml
in notebooks/building_data_lakes/cfn
661 - 4
pcluster_cost_estimator.py
in notebooks/parallelcluster
70 7 4
ec2_ssm.yaml
in notebooks/escience_series/cfn
141 - 3
greengrass-device.yaml
in notebooks/iot_greengress/health_tracker
106 - 2
emr.yaml
in notebooks/building_data_lakes/cfn
303 - 2
sf_orch_glue.yaml
in notebooks/building_data_lakes/cfn
315 - 2
frequency.html
in notebooks/iot_greengress/health_tracker/tracker/ggd/flask/templates
22 - 1
redshift.yaml
in notebooks/building_data_lakes/cfn
133 - 1
serverless.yml
in notebooks/serverless_apps
102 - 1