awslabs / sagemaker-graph-entity-resolution
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 9% duplication:
    • 1,343 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 122 duplicated lines
  • 14 duplicates
system9% (122 lines)
Duplication per Extension
py11% (102 lines)
yaml4% (20 lines)
Duplication per Component (primary)
source/sagemaker/sagemaker_graph_entity_resolution/dgl_entity_resolution7% (38 lines)
source/sagemaker/baseline16% (22 lines)
deployment4% (20 lines)
source/sagemaker/data-preparation22% (16 lines)
deployment/solution-assistant/src34% (14 lines)
source/sagemaker/data-preprocessing13% (12 lines)
source/sagemaker/sagemaker_graph_entity_resolution0% (0 lines)
deployment/solution-assistant0% (0 lines)
Longest Duplicates
The list of 14 longest duplicates.
See data for all 14 duplicates...
Size#FoldersFilesLinesCode
10 x 2 source/sagemaker/data-preparation
source/sagemaker/data-preprocessing
data_prep.py
data_preprocessing.py
8:19 (25%)
20:31 (11%)
view
9 x 2 source/sagemaker/baseline
source/sagemaker/sagemak...n/dgl_entity_resolution
train_pytorch_mlp_entity_resolution.py
utils.py
66:74 (6%)
161:169 (6%)
view
8 x 2 source/sagemaker/sagemak...n/dgl_entity_resolution
source/sagemaker/sagemak...n/dgl_entity_resolution
utils.py
utils.py
87:94 (5%)
110:117 (5%)
view
8 x 2 source/sagemaker/baseline
source/sagemaker/data-preparation
train_pytorch_mlp_entity_resolution.py
data_prep.py
104:113 (6%)
8:17 (20%)
view
8 x 2 source/sagemaker/baseline
source/sagemaker/data-preprocessing
train_pytorch_mlp_entity_resolution.py
data_preprocessing.py
104:113 (6%)
20:29 (9%)
view
7 x 2 deployment/solution-assistant/src
deployment/solution-assistant/src
lambda_function.py
lambda_function.py
25:31 (17%)
39:45 (17%)
view
7 x 2 source/sagemaker/baseline
source/sagemaker/sagemak...n/dgl_entity_resolution
train_pytorch_mlp_entity_resolution.py
estimator_fns.py
112:119 (5%)
6:13 (21%)
view
7 x 2 deployment
deployment
sagemaker-notebook-instance-stack.yaml
sagemaker-permissions-stack.yaml
32:39 (8%)
15:22 (5%)
view
6 x 2 source/sagemaker/data-preprocessing
source/sagemaker/sagemak...n/dgl_entity_resolution
data_preprocessing.py
estimator_fns.py
20:25 (6%)
37:42 (18%)
view
6 x 2 source/sagemaker/data-preparation
source/sagemaker/sagemak...n/dgl_entity_resolution
data_prep.py
estimator_fns.py
8:13 (15%)
37:42 (18%)
view
6 x 2 source/sagemaker/data-preparation
source/sagemaker/data-preprocessing
data_sampling.py
data_preprocessing.py
9:14 (20%)
28:33 (6%)
view
6 x 2 deployment
deployment
sagemaker-graph-entity-resolution.yaml
sagemaker-notebook-instance-stack.yaml
99:104 (2%)
34:39 (6%)
view
6 x 2 source/sagemaker/baseline
source/sagemaker/sagemak...n/dgl_entity_resolution
train_pytorch_mlp_entity_resolution.py
estimator_fns.py
104:109 (4%)
37:42 (18%)
view
6 x 2 deployment
deployment
sagemaker-graph-entity-resolution.yaml
sagemaker-permissions-stack.yaml
99:104 (2%)
17:22 (4%)
view