aws-samples / bioimage-search
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim to keep duplicated code as low as possible (<5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
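The two steps described above (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is only an illustration: the report does not state how the analysis tool implements detection, and the names `clean` and `find_duplicate_windows`, the Python-only cleaning rules, and the sliding-window approach are all assumptions made for this sketch.

```python
import re
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean(source: str) -> list[str]:
    """Simplified cleaning (assumption): drop empty lines, '#' comment lines,
    and import statements. Indentation is stripped before comparison."""
    cleaned = []
    for line in source.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        if re.match(r"(import|from)\s", stripped):
            continue
        cleaned.append(stripped)
    return cleaned


def find_duplicate_windows(files: dict[str, str], min_block: int = MIN_BLOCK):
    """Return every run of `min_block` cleaned lines that occurs more than once,
    mapped to its (file name, cleaned-line offset) occurrences."""
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - min_block + 1):
            window = tuple(lines[start:start + min_block])
            seen[window].append((name, start))
    return {window: locs for window, locs in seen.items() if len(locs) > 1}


if __name__ == "__main__":
    # Hypothetical inputs: two small files that share six cleaned lines.
    shared = "x = 1\ny = 2\nz = 3\nu = 4\nv = 5\nw = 6\n"
    files = {
        "a.py": "import os\n# setup\n" + shared,
        "b.py": "from sys import argv\n\n" + shared + "print(x)\n",
    }
    for window, locations in find_duplicate_windows(files).items():
        print(len(window), "duplicated lines at", locations)
```

A real tool would also merge overlapping windows into the longest possible duplicate and map cleaned-line offsets back to original line numbers; both are left out here to keep the sketch short.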
Duplication Overall
  • 24% duplication:
    • 8,497 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 2,059 duplicated lines
  • 2,046 duplicates
  • system: 24% (2,059 lines)
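The overall percentage follows directly from the two line counts above. As a quick check, the short sketch below recomputes it and works out how many duplicated lines the <5% guideline would allow for a codebase of this size. The variable names are illustrative only.

```python
cleaned_lines = 8_497     # lines of cleaned code, from the report
duplicated_lines = 2_059  # duplicated lines, from the report

ratio = duplicated_lines / cleaned_lines
print(f"duplication: {ratio:.1%}")

# Largest whole number of duplicated lines that stays below 5%.
max_lines_under_target = cleaned_lines * 5 // 100
print("lines allowed under 5%:", max_lines_under_target)
print("lines to eliminate:", duplicated_lines - max_lines_under_target)
```

With the figures from this report, reaching the guideline would mean removing roughly four fifths of the currently duplicated lines.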
Duplication per Extension
  • ts: 24% (1,198 lines)
  • py: 26% (841 lines)
  • java: 4% (20 lines)
Duplication per Component (primary)
  • cli/bioims/src: 54% (618 lines)
  • main/cdk: 25% (511 lines)
  • main/src/image-management: 29% (228 lines)
  • datasets/bbbc-021/scripts: 20% (167 lines)
  • main/src/search: 19% (100 lines)
  • main/src/training-configuration: 25% (76 lines)
  • main/src/label: 24% (66 lines)
  • main/src/message: 29% (47 lines)
  • main/src/process-plate: 38% (42 lines)
  • main/src/configuration: 24% (34 lines)
  • main/src/training-compute: 23% (31 lines)
  • main/src/train: 21% (27 lines)
  • main/src/embedding-management: 58% (23 lines)
  • main/src/search-service: 4% (20 lines)
  • main/src/common-node: 20% (20 lines)
  • main/src/tag: 10% (15 lines)
  • main/scripts: 68% (11 lines)
  • main/src/artifact: 6% (9 lines)
  • main/src/image-inspector: 8% (7 lines)
  • main/src/training-build: 7% (7 lines)
  • datasets/bbbc-021/cdk: 0% (0 lines)
  • main/src/image-artifact: 0% (0 lines)
  • main/src/common: 0% (0 lines)
  • main/src/plate-preprocessing: 0% (0 lines)
  • main/src/embedding-compute: 0% (0 lines)
  • main/src/image-preprocessing: 0% (0 lines)

Duplication Between Components (50+ lines)

  • main/src/image-management and main/src/search: 151 lines
  • main/src/search and main/src/training-configuration: 74 lines
  • main/src/configuration and main/src/message: 57 lines
  • main/src/image-management and main/src/process-plate: 56 lines
  • main/src/process-plate and main/src/search: 54 lines
  • main/src/embedding-management and main/src/process-plate: 52 lines

Longest Duplicates
The list of 20 longest duplicates.
Size | # | Folders | Files | Lines
52 | x 2 | main/cdk, main/cdk | process-plate-stack.ts, process-plate-stack.ts | 531:599 (9%), 627:689 (9%)
40 | x 2 | main/cdk, main/cdk | embedding-stack.ts, train-stack.ts | 168:219 (19%), 71:122 (18%)
21 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | embedding-compute-helper.py, run-training-helper.py | 25:45 (67%), 28:48 (65%)
20 | x 2 | main/src/image-management, main/src/image-management | image-management.ts, image-management.ts | 417:437 (2%), 528:548 (2%)
20 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | apply-tags.py, generate_train_filters.py | 7:37 (24%), 55:85 (28%)
14 | x 2 | main/src/image-management, main/src/image-management | image-management.ts, image-management.ts | 451:467 (1%), 562:578 (1%)
13 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | generate_tags.py, generate_train_filters.py | 7:29 (20%), 55:77 (18%)
13 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | apply-tags.py, generate_tags.py | 7:29 (15%), 7:29 (20%)
13 | x 2 | main/src/image-management, main/src/image-management | image-management.ts, image-management.ts | 309:321 (1%), 345:357 (1%)
13 | x 2 | main/cdk, main/cdk | embedding-stack.ts, train-stack.ts | 238:253 (6%), 142:157 (6%)
12 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | generate_tags.py, get-train-input-helper.py | 7:25 (19%), 7:25 (46%)
12 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | embedding-compute-helper.py, embedding-upload-helper.py | 25:36 (38%), 25:36 (40%)
12 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | generate_train_filters.py, get-train-input-helper.py | 55:73 (17%), 7:25 (46%)
12 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | apply-tags.py, get-train-input-helper.py | 7:25 (14%), 7:25 (46%)
12 | x 2 | datasets/bbbc-021/scripts, datasets/bbbc-021/scripts | embedding-upload-helper.py, run-training-helper.py | 25:36 (40%), 28:39 (37%)
11 | x 2 | main/scripts, main/src/training-compute | fsxl-helper.py, training-compute.py | 7:19 (68%), 35:45 (8%)
11 | x 2 | main/src/image-management, main/src/search | image-management.ts, search.ts | 728:739 (1%), 435:446 (2%)
11 | x 2 | main/src/image-management, main/src/image-management | image-management.ts, image-management.ts | 112:123 (1%), 728:739 (1%)
11 | x 2 | main/src/image-management, main/src/image-management | image-management.ts, image-management.ts | 112:123 (1%), 567:578 (1%)
11 | x 2 | main/src/image-management, main/src/image-management | image-management.ts, image-management.ts | 456:467 (1%), 728:739 (1%)
Duplicated Units
The list of top 5 duplicated units.
Size | # | Folders | Files | Lines
13 | x 2 | main/src/configuration, main/src/message | configuration.ts, message.ts | 47:60, 90:103
12 | x 2 | main/src/configuration, main/src/tag | configuration.ts, tag.ts | 62:74, 115:127
10 | x 2 | main/src/search, main/src/search | search.ts, search.ts | 86:99, 101:114
9 | x 2 | main/src/image-management, main/src/image-management | image-management.ts, image-management.ts | 88:97, 293:302
7 | x 2 | main/src/embedding-management, main/src/process-plate | embedding-management.ts, process-plate.ts | 8:16, 57:65