aws-samples / aim317-uncover-insights-customer-conversations
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 13% duplication:
    • 871 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 116 duplicated lines
  • 6 duplicates
system13% (116 lines)
Duplication per Extension
yaml52% (76 lines)
py10% (40 lines)
Duplication per Component (primary)
cloudformation15% (76 lines)
src10% (40 lines)
Longest Duplicates
The list of 6 longest duplicates.
See data for all 6 duplicates...
Size#FoldersFilesLinesCode
14 x 2 cloudformation
cloudformation
sagemakerNotebookEventEngineTemplate....
sagemakerNotebookTemplate.yaml
53:66 (15%)
39:52 (24%)
view
12 x 2 cloudformation
cloudformation
sagemakerNotebookEventEngineTemplate....
sagemakerNotebookTemplate.yaml
21:33 (13%)
18:30 (21%)
view
12 x 2 cloudformation
cloudformation
sagemakerNotebookEventEngineTemplate....
sagemakerNotebookTemplate.yaml
68:81 (13%)
53:66 (21%)
view
7 x 2 src
src
buildTrainTest.py
createDocumentClassifier.py
52:58 (16%)
18:24 (31%)
view
7 x 2 src
src
buildTrainTest.py
paginateProcessDataTrainTestFiles.py
32:39 (16%)
18:25 (36%)
view
6 x 2 src
src
classifyDocument.py
detectEntities.py
21:27 (18%)
23:29 (17%)
view