amazon-research / sentence-representations
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 6% duplication:
    • 2,322 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 152 duplicated lines
  • 9 duplicates
system6% (152 lines)
Duplication per Extension
py6% (152 lines)
Duplication per Component (primary)
DownstreamEval/SentEval/senteval8% (138 lines)
DownstreamEval/configs12% (7 lines)
PairSupCon/utils8% (7 lines)
DownstreamEval/clustering0% (0 lines)
DownstreamEval/SentEval0% (0 lines)
DownstreamEval0% (0 lines)
DownstreamEval/stseval0% (0 lines)
PairSupCon/models0% (0 lines)
PairSupCon/dataloader0% (0 lines)
PairSupCon0% (0 lines)
Longest Duplicates
The list of 9 longest duplicates.
See data for all 9 duplicates...
Size#FoldersFilesLinesCode
20 x 2 DownstreamEval/SentEval/senteval
DownstreamEval/SentEval/senteval
sick.py
sick.py
55:79 (12%)
159:183 (12%)
view
13 x 2 DownstreamEval/SentEval/senteval
DownstreamEval/SentEval/senteval
sick.py
sts.py
41:54 (8%)
216:229 (7%)
view
10 x 2 DownstreamEval/SentEval/senteval
DownstreamEval/SentEval/senteval
sick.py
sts.py
147:156 (6%)
217:226 (5%)
view
10 x 2 DownstreamEval/SentEval/senteval
DownstreamEval/SentEval/senteval
sick.py
sick.py
42:51 (6%)
147:156 (6%)
view
10 x 2 DownstreamEval/SentEval/senteval/tools
DownstreamEval/SentEval/senteval/tools
ranking.py
ranking.py
279:290 (3%)
319:330 (3%)
view
9 x 2 DownstreamEval/SentEval/senteval
DownstreamEval/SentEval/senteval
sts.py
sts.py
171:180 (5%)
193:202 (5%)
view
7 x 2 DownstreamEval/configs
PairSupCon/utils
utils.py
utils.py
6:12 (100%)
7:13 (24%)
view
6 x 2 DownstreamEval/SentEval/senteval/tools
DownstreamEval/SentEval/senteval/tools
classifier.py
relatedness.py
144:149 (4%)
124:129 (6%)
view
6 x 2 DownstreamEval/SentEval/senteval/tools
DownstreamEval/SentEval/senteval/tools
ranking.py
relatedness.py
194:199 (2%)
92:97 (6%)
view
Duplicated Units
The list of top 1 duplicated units.
See data for all 1 unit duplicate
Size#FoldersFilesLinesCode
6 x 2 DownstreamEval/configs
PairSupCon/utils
utils.py
utils.py
0:0 
0:0 
view