awslabs / sagemaker-privacy-for-nlp
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim for as little duplicated code as possible (<5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
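The rule described above (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is a minimal illustration, not the analyzer's actual implementation: the function names `clean` and `find_duplicate_windows` are hypothetical, and the cleaning step is simplified to Python-style `#` comments and `import`/`from` lines.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean(source: str) -> list[str]:
    """Drop empty lines, '#' comment lines and import statements (simplified cleaning)."""
    kept = []
    for raw in source.splitlines():
        stripped = raw.strip()
        if not stripped or stripped.startswith("#"):
            continue
        if stripped.startswith("import ") or stripped.startswith("from "):
            continue
        # Keep indentation so that only exactly identical lines match.
        kept.append(raw.rstrip())
    return kept


def find_duplicate_windows(files: dict[str, str], min_block: int = MIN_BLOCK):
    """Return every window of `min_block` cleaned lines that occurs more than once.

    The result maps the window (a tuple of lines) to a list of
    (file name, index of the first cleaned line) locations.
    """
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - min_block + 1):
            window = tuple(lines[start:start + min_block])
            seen[window].append((name, start))
    return {window: locs for window, locs in seen.items() if len(locs) > 1}
```

A real tool would additionally merge overlapping windows into maximal blocks (which is how a single 140-line duplicate can be reported) and map cleaned-line indices back to original line numbers; both steps are omitted here.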
Duplication Overall
  • 20% duplication:
    • 1,639 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 338 duplicated lines
  • 5 duplicates
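As a quick check of the headline figure, the ratio can be recomputed from the two line counts above. The sketch below uses only those two numbers. That the report truncates rather than rounds the percentage is an assumption inferred from the displayed 20%.

```python
cleaned_lines = 1_639     # lines of cleaned code, from the report
duplicated_lines = 338    # duplicated lines, from the report

percent = 100 * duplicated_lines / cleaned_lines
print(f"{percent:.1f}%")  # prints 20.6%

# Assumption: the report shows the truncated integer part (20%), not the rounded value (21%).
displayed = int(percent)

# The intro's target is below 5%, so this is roughly four times the target.
times_over_target = percent / 5
```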
  • system: 20% (338 lines)
Duplication per Extension
  • py: 34% (300 lines)
  • yaml: 6% (38 lines)
Duplication per Component (primary)
  • source/sagemaker/src/package/model: 60% (146 lines)
  • source/sagemaker/src/package/data_privatization: 54% (140 lines)
  • deployment: 6% (38 lines)
  • deployment/solution-assistant/src: 15% (14 lines)
  • ROOT: 0% (0 lines)
  • source: 0% (0 lines)
  • source/scripts: 0% (0 lines)
  • source/sagemaker/src/package: 0% (0 lines)
  • source/sagemaker/src/package/container_build: 0% (0 lines)
  • source/sagemaker: 0% (0 lines)
  • deployment/solution-assistant: 0% (0 lines)

Duplication Between Components (50+ lines)

  • source/sagemaker/src/package/data_privatization and source/sagemaker/src/package/model: 286 duplicated lines

Longest Duplicates
The list of 5 longest duplicates.
| Size | # | Folder | File | Lines |
|---|---|---|---|---|
| 140 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 25:267 (100%) |
| | | source/sagemaker/src/package/model | train.py | 25:267 (100%) |
| 19 | 2 | deployment | sagemaker-privacy-for-nlp.yaml | 96:114 (3%) |
| | | deployment | sagemaker-privacy-for-nlp.yaml | 141:159 (3%) |
| 7 | 2 | deployment/solution-assistant/src | lambda_function.py | 78:84 (7%) |
| | | deployment/solution-assistant/src | lambda_function.py | 93:99 (7%) |
| 6 | 2 | source/sagemaker/src/package/model | inference.py | 64:72 (6%) |
| | | source/sagemaker/src/package/model | train.py | 75:83 (4%) |
| 6 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 75:83 (4%) |
| | | source/sagemaker/src/package/model | inference.py | 64:72 (6%) |
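The five entries above add up to more than the 338 duplicated lines reported overall, because some of them overlap. The sketch below shows one way to reconcile the numbers. It assumes that the 6-line block at train.py 75:83 is already contained in the 140-line train.py duplicate (25:267) in both folders, and that inference.py 64:72 is the same block in both 6-line entries. These are readings of the listed ranges, not documented behaviour of the analyzer.

```python
# (cleaned size, number of instances) for each entry in the list above
duplicates = [(140, 2), (19, 2), (7, 2), (6, 2), (6, 2)]

naive_total = sum(size * count for size, count in duplicates)  # 356

# Assumed overlaps (see lead-in):
#  - train.py 75:83 in both folders lies inside the 140-line duplicate
#  - inference.py 64:72 is listed in both 6-line entries
overcounted = 6 * 2 + 6

unique_total = naive_total - overcounted
print(unique_total)  # prints 338
```

Under the same assumptions the per-extension split also works out: 280 + 14 + 6 = 300 lines of py and 38 lines of yaml.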
Duplicated Units
The list of top 7 duplicated units.
| Size | # | Folder | File | Lines |
|---|---|---|---|---|
| 20 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 0:0 |
| | | source/sagemaker/src/package/model | train.py | 0:0 |
| 13 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 0:0 |
| | | source/sagemaker/src/package/model | train.py | 0:0 |
| 10 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 0:0 |
| | | source/sagemaker/src/package/model | train.py | 0:0 |
| 9 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 0:0 |
| | | source/sagemaker/src/package/model | train.py | 0:0 |
| 6 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 0:0 |
| | | source/sagemaker/src/package/model | train.py | 0:0 |
| 8 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 0:0 |
| | | source/sagemaker/src/package/model | train.py | 0:0 |
| 8 | 2 | source/sagemaker/src/pac...privatization/container | train.py | 0:0 |
| | | source/sagemaker/src/package/model | train.py | 0:0 |