aws-samples / sql-based-etl-on-amazon-eks
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 35 files with 1,764 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 1 medium size files (310 lines of codeclsfd_ftr_w_mp_ins)
    • 2 small files (256 lines of code)
    • 32 very small files (1,198 lines of code)
0% | 0% | 17% | 14% | 67%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
yaml0% | 0% | 30% | 14% | 54%
py0% | 0% | 0% | 16% | 83%
js0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
source/app_resources0% | 0% | 38% | 17% | 43%
source/lib0% | 0% | 0% | 71% | 28%
source/lib/cdk_infra0% | 0% | 0% | 0% | 100%
source/example0% | 0% | 0% | 0% | 100%
deployment/cdk-solution-helper0% | 0% | 0% | 0% | 100%
deployment/app_code/job0% | 0% | 0% | 0% | 100%
source/lib/util0% | 0% | 0% | 0% | 100%
source0% | 0% | 0% | 0% | 100%
Longest Files (Top 35)
File# lines# units
spark-template.yaml
in spark-on-eks/source/app_resources
310 -
alb-iam-role.yaml
in spark-on-eks/source/app_resources
145 -
spark_on_eks_stack.py
in spark-on-eks/source/lib
111 4
jupyter-values.yaml
in spark-on-eks/source/app_resources
100 -
index.js
in spark-on-eks/deployment/cdk-solution-helper
99 2
spark_permission.py
in spark-on-eks/source/lib/cdk_infra
80 2
native-spark-job-scheduler.yaml
in spark-on-eks/source/example
77 -
manifest_reader.py
in spark-on-eks/source/lib/util
69 4
eks_base_app.py
in spark-on-eks/source/lib/cdk_infra
67 2
scd2-job-scheduler.yaml
in spark-on-eks/source/example
61 -
jupyter-config.yaml
in spark-on-eks/source/app_resources
52 -
cloud_front_stack.py
in spark-on-eks/source/lib
45 4
msk_consumer.py
in spark-on-eks/deployment/app_code/job
45 1
iam_roles.py
in spark-on-eks/source/lib/cdk_infra
44 3
eks_cluster.py
in spark-on-eks/source/lib/cdk_infra
43 2
setup.py
in spark-on-eks/source
43 -
etl-rbac.yaml
in spark-on-eks/source/app_resources
38 -
etl-iam-role.yaml
in spark-on-eks/source/app_resources
37 -
eks_service_account.py
in spark-on-eks/source/lib/cdk_infra
36 1
nyctaxi-job-scheduler.yaml
in spark-on-eks/source/example
28 -
native-spark-iam-role.yaml
in spark-on-eks/source/app_resources
27 -
s3_app_code.py
in spark-on-eks/source/lib/cdk_infra
26 2
network_sg.py
in spark-on-eks/source/lib/cdk_infra
26 2
argo-values.yaml
in spark-on-eks/source/app_resources
22 -
autoscaler-values.yaml
in spark-on-eks/source/app_resources
17 -
executor-pod-template.yaml
in spark-on-eks/deployment/app_code/job
17 -
driver-pod-template.yaml
in spark-on-eks/deployment/app_code/job
17 -
app.py
in spark-on-eks/source
13 -
native-spark-rbac.yaml
in spark-on-eks/source/app_resources
13 -
ex-secret-iam-role.yaml
in spark-on-eks/source/app_resources
13 -
autoscaler-iam-role.yaml
in spark-on-eks/source/app_resources
11 -
spark-operator-values.yaml
in spark-on-eks/source/app_resources
10 -
ex-secret-values.yaml
in spark-on-eks/source/app_resources
10 -
alb-values.yaml
in spark-on-eks/source/app_resources
6 -
wordcount.py
in spark-on-eks/deployment/app_code/job
6 -
Files With Most Units (Top 12)
File# lines# units
spark_on_eks_stack.py
in spark-on-eks/source/lib
111 4
manifest_reader.py
in spark-on-eks/source/lib/util
69 4
cloud_front_stack.py
in spark-on-eks/source/lib
45 4
iam_roles.py
in spark-on-eks/source/lib/cdk_infra
44 3
eks_base_app.py
in spark-on-eks/source/lib/cdk_infra
67 2
spark_permission.py
in spark-on-eks/source/lib/cdk_infra
80 2
eks_cluster.py
in spark-on-eks/source/lib/cdk_infra
43 2
s3_app_code.py
in spark-on-eks/source/lib/cdk_infra
26 2
network_sg.py
in spark-on-eks/source/lib/cdk_infra
26 2
index.js
in spark-on-eks/deployment/cdk-solution-helper
99 2
eks_service_account.py
in spark-on-eks/source/lib/cdk_infra
36 1
msk_consumer.py
in spark-on-eks/deployment/app_code/job
45 1
Files With Long Lines (Top 10)

There are 10 files with lines longer than 120 characters. In total, there are 20 long lines.

File# lines# units# long lines
spark-template.yaml
in spark-on-eks/source/app_resources
310 - 4
index.js
in spark-on-eks/deployment/cdk-solution-helper
99 2 4
eks_cluster.py
in spark-on-eks/source/lib/cdk_infra
43 2 3
network_sg.py
in spark-on-eks/source/lib/cdk_infra
26 2 3
cloud_front_stack.py
in spark-on-eks/source/lib
45 4 1
eks_base_app.py
in spark-on-eks/source/lib/cdk_infra
67 2 1
spark_permission.py
in spark-on-eks/source/lib/cdk_infra
80 2 1
app.py
in spark-on-eks/source
13 - 1
nyctaxi-job-scheduler.yaml
in spark-on-eks/source/example
28 - 1
wordcount.py
in spark-on-eks/deployment/app_code/job
6 - 1