aws-samples / aws-cdk-pipelines-datalake-etl
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • A duplicate is any place in the code where 6 or more lines are exactly the same as lines elsewhere.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (<5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
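The cleaning and matching steps above can be sketched in a few lines of Python. This is a minimal illustration, not the report tool's actual implementation: the function names `clean` and `find_duplicate_windows` are hypothetical, the cleaning rules are simplified (only `#` comments and `import`/`from` lines are dropped), and indentation is stripped before comparing.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report


def clean(source: str) -> list[str]:
    """Drop blank lines, '#' comments, and import statements (a simplification)."""
    kept = []
    for line in source.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        if stripped.startswith(("import ", "from ")):
            continue
        kept.append(stripped)
    return kept


def find_duplicate_windows(files: dict[str, str], min_block: int = MIN_BLOCK):
    """Map each window of `min_block` cleaned lines that occurs more than once
    to the list of (file name, cleaned-line index) positions where it starts."""
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - min_block + 1):
            window = tuple(lines[start:start + min_block])
            seen[window].append((name, start))
    return {w: locs for w, locs in seen.items() if len(locs) > 1}
```

Overlapping windows that belong to one longer duplicate would still need to be merged into a single block to arrive at entries like those in the list of longest duplicates; that step is left out here.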
Duplication Overall
  • 26% duplication:
    • 1,323 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 353 duplicated lines
  • 22 duplicates
system: 26% (353 lines)
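As a quick check of the headline figure, the ratio can be recomputed from the two line counts above. The variable names are illustrative, and the remark about truncation is an inference from the numbers, not something the report states.

```python
# Figures copied from the report above.
cleaned_lines = 1323
duplicated_lines = 353

ratio = duplicated_lines / cleaned_lines
# Prints 26.7%; the report shows 26%, so it appears to truncate rather than round.
print(f"duplication: {ratio:.1%}")

# Largest duplicated-line count that would stay under the 5% target
# for a codebase of this size.
budget = int(0.05 * cleaned_lines)
print(f"5% budget: {budget} lines")  # 66
```

At this size, getting under the recommended threshold would mean removing roughly 290 of the 353 duplicated lines.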
Duplication per Extension
py: 26% (353 lines)
Duplication per Component (primary)
lib/glue_scripts: 59% (178 lines)
lib: 16% (127 lines)
lib/etl_job_auditor: 58% (41 lines)
lib/state_machine_trigger: 6% (7 lines)
ROOT: 0% (0 lines)
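The per-component line counts can be cross-checked against the overall total. The short sketch below uses only the numbers listed above; the dictionary name is illustrative.

```python
# Duplicated lines per component, copied from the report above.
duplicated_by_component = {
    "lib/glue_scripts": 178,
    "lib": 127,
    "lib/etl_job_auditor": 41,
    "lib/state_machine_trigger": 7,
    "ROOT": 0,
}

total = sum(duplicated_by_component.values())
print(total)  # 353, matching the overall duplicated-line count

# Share of all duplicated lines contributed by each component.
for component, lines in duplicated_by_component.items():
    print(f"{component}: {lines / total:.0%} of duplicated lines")
```

Note that the percentages in the report are relative to each component's own size, which is why a small component such as lib/etl_job_auditor can show 58% while contributing only 41 lines.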
Longest Duplicates
The 20 longest of the 22 duplicates. Each duplicate is listed with both of its occurrences.
Size  #  Folders                    Files                             Lines
28    2  lib/glue_scripts           etl_conformed_to_purposebuilt.py  137:167 (19%)
         lib/glue_scripts           etl_raw_to_conformed.py           133:163 (18%)
20    2  lib/glue_scripts           etl_conformed_to_purposebuilt.py  114:135 (13%)
         lib/glue_scripts           etl_raw_to_conformed.py           110:131 (13%)
17    2  lib/glue_scripts           etl_conformed_to_purposebuilt.py  71:89 (11%)
         lib/glue_scripts           etl_raw_to_conformed.py           68:86 (11%)
10    2  lib                        glue_stack.py                     127:136 (3%)
         lib                        glue_stack.py                     158:167 (3%)
9     2  lib/etl_job_auditor        lambda_handler.py                 69:77 (12%)
         lib/etl_job_auditor        lambda_handler.py                 99:107 (12%)
8     2  lib/etl_job_auditor        lambda_handler.py                 56:64 (11%)
         lib/etl_job_auditor        lambda_handler.py                 84:93 (11%)
8     2  lib/glue_scripts           etl_conformed_to_purposebuilt.py  28:37 (5%)
         lib/glue_scripts           etl_raw_to_conformed.py           27:35 (5%)
7     2  lib                        glue_stack.py                     296:302 (2%)
         lib                        glue_stack.py                     307:314 (2%)
7     2  lib                        glue_stack.py                     282:288 (2%)
         lib                        glue_stack.py                     307:314 (2%)
7     2  lib/etl_job_auditor        lambda_handler.py                 15:40 (10%)
         lib/state_machine_trigger  lambda_handler.py                 50:63 (6%)
7     2  lib                        glue_stack.py                     196:202 (2%)
         lib                        glue_stack.py                     238:244 (2%)
7     2  lib/glue_scripts           etl_conformed_to_purposebuilt.py  55:68 (4%)
         lib/glue_scripts           etl_raw_to_conformed.py           51:65 (4%)
7     2  lib                        glue_stack.py                     282:288 (2%)
         lib                        glue_stack.py                     296:302 (2%)
6     2  lib/glue_scripts           etl_conformed_to_purposebuilt.py  172:180 (4%)
         lib/glue_scripts           etl_raw_to_conformed.py           168:176 (3%)
6     2  lib                        step_functions_stack.py           236:241 (2%)
         lib                        step_functions_stack.py           243:248 (2%)
6     2  lib                        step_functions_stack.py           119:124 (2%)
         lib                        step_functions_stack.py           138:143 (2%)
6     2  lib/glue_scripts           etl_conformed_to_purposebuilt.py  192:197 (4%)
         lib/glue_scripts           etl_raw_to_conformed.py           155:160 (3%)
6     2  lib                        glue_stack.py                     80:85 (2%)
         lib                        glue_stack.py                     87:92 (2%)
6     2  lib                        glue_stack.py                     25:42 (2%)
         lib                        step_functions_stack.py           32:51 (2%)
6     2  lib                        glue_stack.py                     117:122 (2%)
         lib                        glue_stack.py                     146:151 (2%)