awslabs / aws-dataset-ingestion-metrics-collection-framework
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 12% duplication:
    • 2,513 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 320 duplicated lines
  • 21 duplicates
system12% (320 lines)
Duplication per Extension
py12% (320 lines)
Duplication per Component (primary)
cdk_constructs21% (267 lines)
stacks7% (39 lines)
lambda4% (14 lines)
definitions0% (0 lines)
definitions/account_1234123412340% (0 lines)
ROOT0% (0 lines)
dataquality0% (0 lines)
accounts0% (0 lines)
glue0% (0 lines)
Longest Duplicates
The list of 20 longest duplicates.
See data for all 21 duplicates...
Size#FoldersFilesLinesCode
25 x 2 cdk_constructs
cdk_constructs
glue_catalog_construct.py
glue_catalog_construct.py
48:72 (6%)
131:155 (6%)
view
16 x 2 cdk_constructs
cdk_constructs
metric_streamer_construct.py
sla_streamer_construct.py
64:79 (23%)
63:78 (23%)
view
12 x 2 cdk_constructs
cdk_constructs
metric_streamer_construct.py
sla_streamer_construct.py
31:44 (17%)
26:39 (17%)
view
12 x 2 cdk_constructs
cdk_constructs
sla_parse_construct.py
sla_streamer_construct.py
58:69 (20%)
67:78 (17%)
view
12 x 2 cdk_constructs
cdk_constructs
metric_streamer_construct.py
sla_parse_construct.py
31:44 (17%)
26:39 (20%)
view
12 x 2 cdk_constructs
cdk_constructs
sla_parse_construct.py
sla_streamer_construct.py
26:39 (20%)
26:39 (17%)
view
12 x 2 cdk_constructs
cdk_constructs
metric_streamer_construct.py
sla_parse_construct.py
68:79 (17%)
58:69 (20%)
view
11 x 2 cdk_constructs
cdk_constructs
glue_catalog_construct.py
glue_catalog_construct.py
112:122 (2%)
201:211 (2%)
view
10 x 2 cdk_constructs
cdk_constructs
glue_job_construct.py
lambda_construct.py
218:227 (4%)
136:145 (5%)
view
10 x 2 cdk_constructs
cdk_constructs
glue_job_construct.py
lambda_construct.py
203:212 (4%)
121:130 (5%)
view
9 x 2 cdk_constructs
cdk_constructs
glue_job_construct.py
lambda_construct.py
142:150 (3%)
137:145 (5%)
view
9 x 2 cdk_constructs
cdk_constructs
glue_job_construct.py
glue_job_construct.py
142:150 (3%)
219:227 (3%)
view
8 x 2 stacks
stacks
centralized_resources.py
centralized_resources.py
94:101 (4%)
109:116 (4%)
view
8 x 2 cdk_constructs
cdk_constructs
glue_catalog_construct.py
glue_catalog_construct.py
288:295 (2%)
375:382 (2%)
view
7 x 2 lambda
lambda
metric_stream_producer.py
sla_stream_producer.py
30:38 (5%)
23:31 (9%)
view
7 x 2 stacks
stacks
centralized_resources.py
centralized_resources.py
90:96 (4%)
159:165 (4%)
view
7 x 2 cdk_constructs
cdk_constructs
glue_catalog_construct.py
glue_catalog_construct.py
251:257 (1%)
351:357 (1%)
view
6 x 2 cdk_constructs
cdk_constructs
glue_job_construct.py
glue_job_construct.py
113:118 (2%)
135:140 (2%)
view
6 x 2 cdk_constructs
cdk_constructs
glue_catalog_construct.py
glue_catalog_construct.py
220:225 (1%)
304:309 (1%)
view
6 x 2 stacks
stacks
centralized_resources.py
metrics_streamer.py
26:31 (3%)
42:47 (1%)
view