aws-samples / amazon-textract-a2i-pdf
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim for as little duplicated code as possible (<5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
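The cleaning and matching steps described above can be sketched as follows. This is a minimal illustration, not the report tool's actual implementation: the function names `clean` and `find_duplicate_blocks` are hypothetical, and the cleaning rules (dropping `#` comment lines and Python `import`/`from` statements) are a simplified assumption about what "frequently duplicated constructs" covers.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean(source: str) -> list[str]:
    """Drop empty lines, '#' comment lines, and import statements.

    Assumption: a simplified stand-in for the report's cleaning step.
    """
    cleaned = []
    for raw in source.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith("import ") or line.startswith("from "):
            continue
        cleaned.append(line)
    return cleaned


def find_duplicate_blocks(files: dict[str, str], min_block: int = MIN_BLOCK):
    """Map each window of `min_block` cleaned lines to every place it occurs.

    Only windows that occur more than once are returned. Positions are
    indexes into the cleaned line list, not original line numbers.
    """
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for i in range(len(lines) - min_block + 1):
            window = tuple(lines[i:i + min_block])
            seen[window].append((name, i))
    return {block: places for block, places in seen.items() if len(places) > 1}
```

A sliding window of exactly `min_block` lines reports a longer duplicate as several overlapping windows. Merging those into a single maximal block is left out to keep the sketch short.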
Duplication Overall
  • 12% duplication:
    • 774 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 100 duplicated lines
  • 14 duplicates
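The overall figure follows directly from the two line counts above. The short calculation below reproduces it and works out how many duplicated lines would fit under the <5% target. That the report drops the fractional part when displaying the percentage is an assumption, made here only because 100 / 774 is about 12.9% while the report shows 12%.

```python
import math

cleaned_lines = 774      # from the report
duplicated_lines = 100   # from the report

exact_percent = 100 * duplicated_lines / cleaned_lines
shown_percent = int(exact_percent)  # assumption: fractional part is dropped

# Largest duplicated-line count that stays strictly below 5% of cleaned lines.
max_lines_under_target = math.ceil(cleaned_lines * 5 / 100) - 1

print(f"exact: {exact_percent:.1f}%, shown: {shown_percent}%")
print(f"duplicated lines allowed under 5%: {max_lines_under_target}")
```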
system: 12% (100 lines)
Duplication per Extension
py: 13% (100 lines)
Duplication per Component (primary)
multipagepdfa2i: 15% (46 lines)
deploy_code/multipagepdfa2i_analyzepdf: 19% (27 lines)
deploy_code/multipagepdfa2i_humancomplete: 21% (27 lines)
ROOT: 0% (0 lines)
deploy_code/multipagepdfa2i_wrapup: 0% (0 lines)
deploy_code/multipagepdfa2i_kickoff: 0% (0 lines)
deploy_code/multipagepdfa2i_pngextract/src/main/java: 0% (0 lines)

Duplication Between Components (50+ lines)

deploy_code/multipagepdfa2i_analyzepdf -- deploy_code/multipagepdfa2i_humancomplete: 54 lines

Longest Duplicates
The list of 14 longest duplicates.
Size | # | Folders | Files | Lines
11 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 24:35 (17%)
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 24:35 (17%)
8 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 38:46 (12%)
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 38:46 (12%)
8 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 55:63 (12%)
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 55:63 (12%)
7 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 167:173 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 217:223 (2%)
7 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 167:173 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 195:201 (2%)
7 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 195:201 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 238:244 (2%)
7 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 167:173 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 238:244 (2%)
7 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 217:223 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 238:244 (2%)
7 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 195:201 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 217:223 (2%)
6 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 152:157 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 239:244 (2%)
6 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 182:187 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 209:214 (2%)
6 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 152:157 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 168:173 (2%)
6 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 152:157 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 196:201 (2%)
6 | x 2 | multipagepdfa2i | multipagepdfa2i_stack.py | 152:157 (2%)
 | | multipagepdfa2i | multipagepdfa2i_stack.py | 218:223 (2%)
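Because duplicates are listed pairwise, one block repeated several times shows up as many rows. The six 7-line entries for multipagepdfa2i_stack.py above cover every pairing of four line ranges, which suggests a single 7-line block occurring four times. The sketch below groups pairwise entries into such clusters with a small union-find. The function name `group_pairs` is hypothetical and this is an illustration of how to read the list, not part of the report tool.

```python
def group_pairs(pairs):
    """Group pairwise duplicate entries into clusters of matching locations."""
    parent = {}

    def find(x):
        # Register unseen locations as their own cluster root.
        parent.setdefault(x, x)
        # Walk up to the root, shortening the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    groups = {}
    for location in parent:
        groups.setdefault(find(location), set()).add(location)
    return sorted(sorted(group) for group in groups.values())


# Line ranges of the six 7-line duplicates in multipagepdfa2i_stack.py.
seven_line_pairs = [
    ("167:173", "217:223"),
    ("167:173", "195:201"),
    ("195:201", "238:244"),
    ("167:173", "238:244"),
    ("217:223", "238:244"),
    ("195:201", "217:223"),
]

clusters = group_pairs(seven_line_pairs)
print(clusters)  # one cluster containing four line ranges
```

Grouping this way is only valid because the report counts exact line-for-line matches of equal size, so if range A matches B and B matches C, then A matches C.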
Duplicated Units
The list of top 10 duplicated units.
Size | # | Folders | Files | Lines
17 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
17 | x 2 | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
9 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
9 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
9 | x 2 | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
9 | x 2 | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
8 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
8 | x 2 | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
8 | x 2 | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0
8 | x 2 | deploy_code/multipagepdfa2i_humancomplete | clean_data.py | 0:0
 | | deploy_code/multipagepdfa2i_analyzepdf | clean_data.py | 0:0