awslabs / amazon-redshift-utils
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 59 files with 11,588 lines of code.
    • 3 very long files (3,600 lines of code)
    • 3 long files (2,392 lines of code)
    • 7 medium size files (1,836 lines of codeclsfd_ftr_w_mp_ins)
    • 21 small files (2,987 lines of code)
    • 25 very small files (773 lines of code)
31% | 20% | 15% | 25% | 6%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py28% | 19% | 12% | 30% | 8%
yaml40% | 26% | 17% | 12% | 3%
js0% | 0% | 94% | 0% | 5%
html0% | 0% | 0% | 100% | 0%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src/SimpleReplay71% | 16% | 4% | 5% | 2%
src/ColumnEncodingUtility0% | 100% | 0% | 0% | 0%
src/AnalyzeVacuumUtility0% | 84% | 0% | 15% | 0%
src/UnloadCopyUtility0% | 0% | 47% | 45% | 6%
src/RedshiftAutomation0% | 0% | 34% | 44% | 20%
src/SnapshotManager0% | 0% | 54% | 33% | 12%
src/QMRNotificationUtility0% | 0% | 70% | 29% | 0%
src/MetadataTransfer0% | 0% | 0% | 87% | 12%
src/SystemTablePersistence0% | 0% | 0% | 100% | 0%
src/UnloadAutoPartitions0% | 0% | 0% | 100% | 0%
src/MultipleTableRestoreUtility0% | 0% | 0% | 100% | 0%
src0% | 0% | 0% | 69% | 30%
src/WorkloadManagementScheduler0% | 0% | 0% | 100% | 0%
src/UserLastLogin0% | 0% | 0% | 0% | 100%
src/BlogContent0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
replay.py
in src/SimpleReplay
1303 47
ra3-migration-replay.yaml
in src/SimpleReplay/cloudformation
1294 -
extract.py
in src/SimpleReplay
1003 35
analyze-schema-compression.py
in src/ColumnEncodingUtility
943 20
ra3-migration-extract.yaml
in src/SimpleReplay/cloudformation
857 -
analyze_vacuum.py
in src/AnalyzeVacuumUtility/lib
592 10
resources.py
in src/UnloadCopyUtility/util
325 56
deploy.yaml
in src/RedshiftAutomation
299 -
global_config.py
in src/UnloadCopyUtility
275 42
snapshotManager.js
in src/SnapshotManager
263 18
qmr-action-notification-utility.yaml
in src/QMRNotificationUtility/cloudformation
250 -
redshift_cluster.py
in src/UnloadCopyUtility/util
220 32
LambdaReplay.py
in src/SimpleReplay/cloudformation
204 11
ddl_generators.py
in src/UnloadCopyUtility/util/sql
189 27
snapshot_system_stats.py
in src/SystemTablePersistence
188 7
tasks.py
in src/UnloadCopyUtility/util
177 33
deploy.yaml
in src/SnapshotManager
161 -
ConfigurationForm.html
in src/RedshiftAutomation
160 -
userprivs.py
in src/MetadataTransfer
160 4
metadatacopy.py
in src/MetadataTransfer
159 4
sql_text_helpers.py
in src/UnloadCopyUtility/util/sql
158 15
genunload.py
in src/UnloadAutoPartitions
157 6
redshift_unload_copy.py
in src/UnloadCopyUtility
148 7
multitablerestore.py
in src/MultipleTableRestoreUtility
145 7
ParseUserActivityLog.py
in src/SimpleReplay/PreviousVersion
136 9
deploy-systable-standalone.yaml
in src/SystemTablePersistence
131 -
config_constants.py
in src
130 2
s3_utils.py
in src/UnloadCopyUtility/util
125 13
LambdaSimpleReplayExtract.py
in src/SimpleReplay/cloudformation
121 8
lambda_function.py
in src/RedshiftAutomation
120 2
analyze-vacuum-schema.py
in src/AnalyzeVacuumUtility
109 1
lambda_function.py
in src/QMRNotificationUtility/lambda
105 5
deploy-function-and-schedule.yaml
in src/RedshiftAutomation
104 -
wlm_scheduler.py
in src/WorkloadManagementScheduler
104 6
user_last_login.py
in src/UserLastLogin
84 1
util.py
in src/SimpleReplay
80 7
ParallelExecute.py
in src/BlogContent/SyntheticMixedBenchmark
79 1
pgpass.py
in src/UnloadCopyUtility/util
65 13
encrypt_password.py
in src/RedshiftAutomation
64 2
common.py
in src/RedshiftAutomation
58 2
redshift_utils_helper.py
in src
54 3
deploy-schedule.yaml
in src/RedshiftAutomation
52 -
schedule.py
in src/SnapshotManager
46 1
dbconstring.py
in src/MetadataTransfer
29 3
replay.yaml
in src/SimpleReplay
28 -
kms_helper.py
in src/UnloadCopyUtility/util
25 4
utils.py
in src/UserLastLogin/lib
18 1
queries.py
in src/MetadataTransfer
18 -
child_object.py
in src/UnloadCopyUtility/util
13 3
extract.yaml
in src/SimpleReplay
12 -
Files With Most Units (Top 20)
File# lines# units
resources.py
in src/UnloadCopyUtility/util
325 56
replay.py
in src/SimpleReplay
1303 47
global_config.py
in src/UnloadCopyUtility
275 42
extract.py
in src/SimpleReplay
1003 35
tasks.py
in src/UnloadCopyUtility/util
177 33
redshift_cluster.py
in src/UnloadCopyUtility/util
220 32
ddl_generators.py
in src/UnloadCopyUtility/util/sql
189 27
analyze-schema-compression.py
in src/ColumnEncodingUtility
943 20
snapshotManager.js
in src/SnapshotManager
263 18
sql_text_helpers.py
in src/UnloadCopyUtility/util/sql
158 15
pgpass.py
in src/UnloadCopyUtility/util
65 13
s3_utils.py
in src/UnloadCopyUtility/util
125 13
LambdaReplay.py
in src/SimpleReplay/cloudformation
204 11
analyze_vacuum.py
in src/AnalyzeVacuumUtility/lib
592 10
ParseUserActivityLog.py
in src/SimpleReplay/PreviousVersion
136 9
LambdaSimpleReplayExtract.py
in src/SimpleReplay/cloudformation
121 8
util.py
in src/SimpleReplay
80 7
multitablerestore.py
in src/MultipleTableRestoreUtility
145 7
redshift_unload_copy.py
in src/UnloadCopyUtility
148 7
snapshot_system_stats.py
in src/SystemTablePersistence
188 7
Files With Long Lines (Top 20)

There are 25 files with lines longer than 120 characters. In total, there are 233 long lines.

File# lines# units# long lines
replay.py
in src/SimpleReplay
1303 47 51
ra3-migration-replay.yaml
in src/SimpleReplay/cloudformation
1294 - 40
ra3-migration-extract.yaml
in src/SimpleReplay/cloudformation
857 - 26
extract.py
in src/SimpleReplay
1003 35 24
ConfigurationForm.html
in src/RedshiftAutomation
160 - 16
analyze_vacuum.py
in src/AnalyzeVacuumUtility/lib
592 10 14
analyze-schema-compression.py
in src/ColumnEncodingUtility
943 20 11
genunload.py
in src/UnloadAutoPartitions
157 6 11
deploy.yaml
in src/RedshiftAutomation
299 - 7
ParseUserActivityLog.py
in src/SimpleReplay/PreviousVersion
136 9 7
LambdaReplay.py
in src/SimpleReplay/cloudformation
204 11 4
deploy-function-and-schedule.yaml
in src/RedshiftAutomation
104 - 3
user_last_login.py
in src/UserLastLogin
84 1 3
deploy-schedule.yaml
in src/RedshiftAutomation
52 - 2
wlm_scheduler.py
in src/WorkloadManagementScheduler
104 6 2
LambdaSimpleReplayExtract.py
in src/SimpleReplay/cloudformation
121 8 2
deploy-systable-standalone.yaml
in src/SystemTablePersistence
131 - 2
common.py
in src/RedshiftAutomation
58 2 1
snapshotManager.js
in src/SnapshotManager
263 18 1
deploy.yaml
in src/SnapshotManager
161 - 1