microsoft / AzureML-BERT
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim for as little duplicated code as possible (below 5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
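The steps in the intro (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is a minimal illustration, not the tool's actual algorithm: the `clean` and `find_duplicate_windows` helpers are hypothetical names, the cleaning rules are simplified to blank lines, `#` comments, and import statements, and indentation is ignored when comparing lines.

```python
from collections import defaultdict

MIN_LINES = 6  # threshold stated in the report


def clean(source: str) -> list[str]:
    """Drop blank lines, '#' comments, and import statements (a simplification)."""
    kept = []
    for raw in source.splitlines():
        line = raw.strip()  # assumption: indentation is ignored when comparing
        if not line or line.startswith("#"):
            continue
        if line.startswith("import ") or line.startswith("from "):
            continue
        kept.append(line)
    return kept


def find_duplicate_windows(files: dict[str, str], min_lines: int = MIN_LINES):
    """Return every window of `min_lines` cleaned lines that occurs more than once.

    The result maps the window (a tuple of lines) to a list of
    (file name, index into that file's cleaned lines) pairs.
    """
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - min_lines + 1):
            window = tuple(lines[start:start + min_lines])
            seen[window].append((name, start))
    return {w: locs for w, locs in seen.items() if len(locs) > 1}
```

A real detector would also merge overlapping windows into one longer duplicate and map cleaned-line indices back to original line numbers; both steps are omitted here to keep the sketch short.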
Duplication Overall
  • 20% duplication:
    • 4,140 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 859 duplicated lines
  • 109 duplicates
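The headline figure follows from the two line counts above. A small sketch of the arithmetic, assuming the percentage is simply duplicated lines divided by cleaned lines (the function name `duplication_percent` is illustrative, and the 5% target is the one given in the intro):

```python
def duplication_percent(duplicated_lines: int, cleaned_lines: int) -> float:
    """Share of cleaned lines that belong to a duplicate, in percent."""
    if cleaned_lines <= 0:
        raise ValueError("cleaned_lines must be positive")
    return 100.0 * duplicated_lines / cleaned_lines


TARGET_PERCENT = 5.0  # target stated in the intro

pct = duplication_percent(859, 4140)
print(f"{pct:.1f}% duplicated, target is below {TARGET_PERCENT:.0f}%")
print("within target" if pct < TARGET_PERCENT else "above target")
```

With these counts the ratio is a little under 21%, which the report displays as 20%, roughly four times the stated target.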
system: 20% (859 lines)
Duplication per Extension
py: 20% (859 lines)
Duplication per Component (primary)
finetune: 26% (418 lines)
finetune/PyTorch: 39% (208 lines)
finetune/TensorFlow: 17% (120 lines)
pretrain/PyTorch: 9% (113 lines)
pretrain/PyTorch/dataprep: 0% (0 lines)
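As a quick consistency check, the per-component duplicated line counts above should add up to the system total of 859. The sketch below copies the numbers from this report and also shows how much of the total each component accounts for; the variable names are illustrative only.

```python
# Duplicated lines per component, as listed in this report.
component_dup_lines = {
    "finetune": 418,
    "finetune/PyTorch": 208,
    "finetune/TensorFlow": 120,
    "pretrain/PyTorch": 113,
    "pretrain/PyTorch/dataprep": 0,
}

SYSTEM_DUP_LINES = 859  # system-wide total from the report

total = sum(component_dup_lines.values())
print(f"sum of components: {total} (report total: {SYSTEM_DUP_LINES})")

# Share of all duplicated lines that each component accounts for.
shares = {
    name: 100.0 * lines / total for name, lines in component_dup_lines.items()
}
for name, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {share:.0f}% of duplicated lines")
```

Note that these shares differ from the percentages in the list above, which presumably relate each component's duplicated lines to that component's own cleaned size rather than to the system total.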

Duplication Between Components (50+ lines)

  • finetune/PyTorch and finetune: 397 lines
  • finetune/TensorFlow and finetune: 253 lines
  • finetune/PyTorch and finetune/TensorFlow: 194 lines

Longest Duplicates
The list of 20 longest duplicates.
Size  #  Folders              Files                      Lines
35    2  finetune/PyTorch     run_classifier_azureml.py  208:276 (8%)
         finetune/TensorFlow  run_classifier.py          393:460 (5%)
23    2  finetune/TensorFlow  run_classifier.py          130:181 (3%)
         finetune             run_classifier_azureml.py  160:211 (2%)
18    2  finetune/TensorFlow  run_classifier.py          239:267 (2%)
         finetune             run_classifier_azureml.py  262:290 (2%)
17    2  finetune/PyTorch     run_classifier_azureml.py  47:87 (3%)
         finetune/TensorFlow  run_classifier.py          130:170 (2%)
17    2  finetune             run_classifier_azureml.py  479:500 (2%)
         finetune             run_classifier_azureml.py  516:537 (2%)
17    2  finetune             run_classifier_azureml.py  428:453 (2%)
         finetune             run_classifier_azureml.py  542:567 (2%)
17    2  finetune/PyTorch     run_classifier_azureml.py  47:87 (3%)
         finetune             run_classifier_azureml.py  160:200 (2%)
16    2  finetune             run_classifier_azureml.py  355:379 (2%)
         finetune             run_classifier_azureml.py  542:566 (2%)
16    2  finetune/TensorFlow  run_classifier.py          283:307 (2%)
         finetune             run_classifier_azureml.py  428:452 (2%)
16    2  finetune             run_classifier_azureml.py  355:379 (2%)
         finetune             run_classifier_azureml.py  428:452 (2%)
16    2  finetune             run_classifier_azureml.py  230:253 (2%)
         finetune             run_classifier_azureml.py  357:380 (2%)
16    2  finetune/TensorFlow  run_classifier.py          283:307 (2%)
         finetune             run_classifier_azureml.py  355:379 (2%)
16    2  finetune/TensorFlow  run_classifier.py          283:307 (2%)
         finetune             run_classifier_azureml.py  542:566 (2%)
15    2  finetune/TensorFlow  run_classifier.py          320:345 (2%)
         finetune             run_classifier_azureml.py  316:341 (1%)
15    2  finetune/PyTorch     run_classifier_azureml.py  287:309 (3%)
         finetune             run_classifier_azureml.py  662:684 (1%)
15    2  finetune/TensorFlow  run_classifier.py          285:307 (2%)
         finetune             run_classifier_azureml.py  230:252 (1%)
15    2  finetune             run_classifier_azureml.py  230:252 (1%)
         finetune             run_classifier_azureml.py  544:566 (1%)
15    2  finetune             run_classifier_azureml.py  230:252 (1%)
         finetune             run_classifier_azureml.py  430:452 (1%)
14    2  finetune/TensorFlow  run_classifier.py          283:305 (2%)
         finetune             run_classifier_azureml.py  319:341 (1%)
14    2  finetune/TensorFlow  run_classifier.py          283:305 (2%)
         finetune/TensorFlow  run_classifier.py          323:345 (2%)
Duplicated Units
The list of top 5 duplicated units.
Size  #  Folders              Files                      Lines
12    2  finetune             run_classifier_azureml.py  0:0
         finetune             run_classifier_azureml.py  0:0
12    2  pretrain/PyTorch     dataset.py                 0:0
         pretrain/PyTorch     sources.py                 0:0
13    3  finetune/PyTorch     run_classifier_azureml.py  0:0
         finetune/TensorFlow  run_classifier.py          0:0
         finetune             run_classifier_azureml.py  0:0
8     2  finetune/PyTorch     run_classifier_azureml.py  0:0
         finetune             run_squad_azureml.py       0:0
14    3  finetune/PyTorch     run_classifier_azureml.py  0:0
         finetune/TensorFlow  run_classifier.py          0:0
         finetune             run_classifier_azureml.py  0:0