aws-samples / amazon-textract-textractor
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (below 5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
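The rule above (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is a minimal illustration, not the analyzer's actual implementation: the function names `clean` and `find_duplicate_windows` are hypothetical, the cleaning step only handles Python-style `#` comments and import statements, and lines are compared after stripping surrounding whitespace, which is an assumption.

```python
from collections import defaultdict

MIN_LINES = 6  # threshold stated in the report


def clean(source):
    """Drop blank lines, '#' comments and import statements (a simplification)."""
    kept = []
    for raw in source.splitlines():
        line = raw.strip()  # assumption: whitespace is ignored when comparing
        if not line or line.startswith("#"):
            continue
        if line.startswith(("import ", "from ")):
            continue
        kept.append(line)
    return kept


def find_duplicate_windows(files, size=MIN_LINES):
    """Map each window of `size` cleaned lines to every place it occurs.

    `files` maps a file name to its source text. Only windows that occur
    more than once are returned; positions index into the cleaned lines.
    """
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - size + 1):
            window = tuple(lines[start:start + size])
            seen[window].append((name, start))
    return {w: locs for w, locs in seen.items() if len(locs) > 1}
```

A real tool would also merge overlapping windows into longer duplicates and map positions back to original line numbers; the sketch stops at detecting the fixed-size matches.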
Duplication Overall
  • 14% duplication:
    • 3,704 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 522 duplicated lines
  • 104 duplicates
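As a quick check of the headline figure, the duplication percentage is simply duplicated lines divided by cleaned lines. The snippet below uses only the two counts reported above; the variable names are illustrative, and the 5% target is the one stated in the intro.

```python
cleaned_lines = 3704     # lines of cleaned code, from the report
duplicated_lines = 522   # duplicated lines, from the report

ratio = duplicated_lines / cleaned_lines
print(f"{ratio:.1%}")    # 14.1%, shown rounded as 14% in the report

# Largest duplicated-line count that would still be below the 5% target
# for a code base of this size (integer arithmetic avoids float rounding).
max_duplicated = cleaned_lines * 5 // 100
print(max_duplicated)    # 185
```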
system: 14% (522 lines)
Duplication per Extension
  • py: 13% (480 lines)
  • cfg: 46% (42 lines)
Duplication per Component (primary)
  • src: 16% (208 lines)
  • tpipelinegeofinder/textractgeofinder: 7% (90 lines)
  • tpipelinepagedimensions: 47% (33 lines)
  • prettyprinter/textractprettyprinter: 12% (30 lines)
  • tpipelinegeofinder: 39% (24 lines)
  • caller: 39% (24 lines)
  • overlayer: 40% (24 lines)
  • prettyprinter: 41% (24 lines)
  • helper: 24% (23 lines)
  • overlayer/textractoverlayer: 7% (15 lines)
  • tpipelinepagedimensions/textractpagedimensions: 21% (15 lines)
  • caller/textractcaller: 3% (12 lines)
  • helper/textracthelper: 0% (0 lines)
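The per-extension and per-component breakdowns can be checked against the overall total. The sketch below simply sums the duplicated-line counts listed in this report; the dictionary names are illustrative, and it assumes each duplicated line is attributed to exactly one extension and one component.

```python
overall_duplicated = 522  # from "Duplication Overall"

per_extension = {"py": 480, "cfg": 42}

per_component = {
    "src": 208,
    "tpipelinegeofinder/textractgeofinder": 90,
    "tpipelinepagedimensions": 33,
    "prettyprinter/textractprettyprinter": 30,
    "tpipelinegeofinder": 24,
    "caller": 24,
    "overlayer": 24,
    "prettyprinter": 24,
    "helper": 23,
    "overlayer/textractoverlayer": 15,
    "tpipelinepagedimensions/textractpagedimensions": 15,
    "caller/textractcaller": 12,
    "helper/textracthelper": 0,
}

print(sum(per_extension.values()))   # 522
print(sum(per_component.values()))   # 522
```

Both breakdowns add up to the 522 duplicated lines reported overall, so the three views are consistent with each other.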
Longest Duplicates
The list of 20 longest duplicates.
Size x # Folder                                File           Lines
22 x 2   src                                   trp.py         194:222 (4%)
         src                                   trp.py         246:274 (4%)
15 x 2   src                                   trp.py         256:274 (2%)
         src                                   trp.py         394:412 (2%)
15 x 2   src                                   trp.py         204:222 (2%)
         src                                   trp.py         394:412 (2%)
14 x 2   src                                   trp.py         177:192 (2%)
         src                                   trp.py         225:240 (2%)
12 x 2   src                                   trp.py         79:94 (2%)
         src                                   trp.py         249:264 (2%)
12 x 2   src                                   trp.py         79:94 (2%)
         src                                   trp.py         197:212 (2%)
11 x 2   caller                                setup.py       42:52 (26%)
         tpipelinegeofinder                    setup.py       42:52 (26%)
11 x 2   prettyprinter                         setup.py       39:49 (28%)
         tpipelinepagedimensions               setup.py       50:60 (22%)
11 x 2   caller                                setup.py       42:52 (26%)
         prettyprinter                         setup.py       39:49 (28%)
11 x 2   overlayer                             setup.py       41:51 (26%)
         tpipelinegeofinder                    setup.py       42:52 (26%)
11 x 2   overlayer                             setup.py       41:51 (26%)
         tpipelinepagedimensions               setup.py       50:60 (22%)
11 x 2   overlayer                             setup.py       41:51 (26%)
         prettyprinter                         setup.py       39:49 (28%)
11 x 2   caller                                setup.py       42:52 (26%)
         tpipelinepagedimensions               setup.py       50:60 (22%)
11 x 2   tpipelinegeofinder                    setup.py       42:52 (26%)
         tpipelinepagedimensions               setup.py       50:60 (22%)
11 x 2   tpipelinegeofinder/textractgeofinder  tgeofinder.py  195:205 (1%)
         tpipelinegeofinder/textractgeofinder  tgeofinder.py  210:220 (1%)
11 x 2   src                                   trp.py         127:141 (2%)
         src                                   trp.py         461:475 (2%)
11 x 2   prettyprinter                         setup.py       39:49 (28%)
         tpipelinegeofinder                    setup.py       42:52 (26%)
11 x 2   caller                                setup.py       42:52 (26%)
         overlayer                             setup.py       41:51 (26%)
10 x 2   src                                   trp.py         129:141 (1%)
         src                                   trp.py         200:212 (1%)
10 x 2   src                                   trp.py         200:212 (1%)
         src                                   trp.py         463:475 (1%)
Duplicated Units
The list of top 2 duplicated units.
Size x # Folder                                          File                 Lines
9 x 2    overlayer/textractoverlayer                     image_tools.py       0:0
         tpipelinepagedimensions/textractpagedimensions  t_pagedimensions.py  0:0
6 x 2    overlayer/textractoverlayer                     image_tools.py       0:0
         tpipelinepagedimensions/textractpagedimensions  t_pagedimensions.py  0:0