amazon-research / tanl
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 13% duplication:
    • 3,198 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 442 duplicated lines
  • 31 duplicates
system13% (442 lines)
Duplication per Extension
py13% (442 lines)
Duplication per Component (primary)
ROOT14% (442 lines)
preprocess_multiwoz0% (0 lines)
Longest Duplicates
The list of 20 longest duplicates.
See data for all 31 duplicates...
Size#FoldersFilesLinesCode
35 x 2 ROOT
ROOT
datasets.py
datasets.py
227:271 (1%)
889:933 (1%)
view
16 x 2 ROOT
ROOT
datasets.py
datasets.py
1092:1112 (<1%)
1144:1164 (<1%)
view
12 x 2 ROOT
ROOT
output_formats.py
output_formats.py
247:261 (3%)
444:458 (3%)
view
12 x 2 ROOT
ROOT
datasets.py
datasets.py
290:305 (<1%)
951:966 (<1%)
view
12 x 2 ROOT
ROOT
datasets.py
datasets.py
107:126 (<1%)
1388:1404 (<1%)
view
12 x 2 ROOT
ROOT
datasets.py
datasets.py
757:768 (<1%)
2168:2179 (<1%)
view
11 x 2 ROOT
ROOT
datasets.py
datasets.py
1272:1285 (<1%)
1417:1431 (<1%)
view
9 x 2 ROOT
ROOT
datasets.py
datasets.py
482:490 (<1%)
1063:1071 (<1%)
view
9 x 2 ROOT
ROOT
base_dataset.py
datasets.py
201:210 (5%)
2400:2409 (<1%)
view
9 x 2 ROOT
ROOT
datasets.py
datasets.py
135:146 (<1%)
459:470 (<1%)
view
8 x 2 ROOT
ROOT
datasets.py
datasets.py
209:216 (<1%)
880:887 (<1%)
view
8 x 2 ROOT
ROOT
datasets.py
datasets.py
131:140 (<1%)
1415:1424 (<1%)
view
8 x 2 ROOT
ROOT
datasets.py
datasets.py
1240:1252 (<1%)
1388:1400 (<1%)
view
8 x 2 ROOT
ROOT
output_formats.py
output_formats.py
290:300 (2%)
406:416 (2%)
view
8 x 2 ROOT
ROOT
datasets.py
datasets.py
765:772 (<1%)
1044:1051 (<1%)
view
8 x 2 ROOT
ROOT
datasets.py
datasets.py
107:122 (<1%)
1240:1252 (<1%)
view
7 x 2 ROOT
ROOT
datasets.py
datasets.py
229:239 (<1%)
1333:1343 (<1%)
view
7 x 2 ROOT
ROOT
datasets.py
datasets.py
635:642 (<1%)
1151:1158 (<1%)
view
7 x 2 ROOT
ROOT
datasets.py
datasets.py
635:642 (<1%)
1099:1106 (<1%)
view
7 x 2 ROOT
ROOT
datasets.py
datasets.py
891:901 (<1%)
1333:1343 (<1%)
view
Duplicated Units
The list of top 1 duplicated units.
See data for all 1 unit duplicate
Size#FoldersFilesLinesCode
6 x 2 ROOT
ROOT
datasets.py
datasets.py
0:0 
0:0 
view