facebookresearch / GDT
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 17% duplication:
    • 5,504 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 967 duplicated lines
  • 63 duplicates
system17% (967 lines)
Duplication per Extension
py18% (967 lines)
Duplication per Component (primary)
ROOT17% (527 lines)
datasets16% (280 lines)
src18% (160 lines)

Duplication Between Components (50+ lines)

G ROOT ROOT src src ROOT--src 232

Download: SVG DOT (open online Graphviz editor)

Open 3D force graph...

Show more details on duplication between components...
Longest Duplicates
The list of 20 longest duplicates.
See data for all 63 duplicates...
Size#FoldersFilesLinesCode
74 x 2 ROOT
src
log_utils.py
logger.py
10:105 (31%)
98:193 (41%)
view
33 x 2 ROOT
ROOT
log_utils.py
log_utils.py
117:149 (14%)
201:233 (14%)
view
32 x 2 ROOT
ROOT
eval_video.py
eval_video.py
57:92 (4%)
117:154 (4%)
view
28 x 2 ROOT
src
log_utils.py
logger.py
118:145 (11%)
196:223 (15%)
view
28 x 2 ROOT
src
log_utils.py
logger.py
202:229 (11%)
196:223 (15%)
view
24 x 2 ROOT
ROOT
log_utils.py
log_utils.py
162:185 (10%)
245:268 (10%)
view
17 x 2 datasets
datasets
AVideoDataset.py
GDTPretrainDataset.py
124:153 (5%)
95:124 (6%)
view
16 x 2 datasets
datasets
AVideoDataset.py
GDTPretrainDataset.py
279:294 (4%)
229:244 (5%)
view
14 x 2 datasets
datasets
AVideoDataset.py
GDTPretrainDataset.py
263:278 (4%)
213:227 (4%)
view
11 x 2 datasets
datasets
AVideoDataset.py
AVideoDataset.py
300:310 (3%)
317:327 (3%)
view
9 x 2 datasets
datasets
GDTPretrainDataset.py
GDTPretrainDataset.py
23:31 (3%)
34:42 (3%)
view
9 x 2 src
src
resnet.py
resnet.py
58:69 (4%)
103:114 (4%)
view
9 x 2 ROOT
ROOT
log_utils.py
log_utils.py
151:159 (3%)
235:243 (3%)
view
9 x 2 ROOT
ROOT
eval_video.py
main_gdt.py
619:627 (1%)
521:529 (1%)
view
8 x 2 ROOT
ROOT
eval_video.py
eval_video.py
264:271 (1%)
274:281 (1%)
view
8 x 2 src
src
vmz.py
vmz.py
104:113 (3%)
148:157 (3%)
view
8 x 2 ROOT
ROOT
model.py
model.py
217:224 (1%)
278:285 (1%)
view
8 x 2 datasets
datasets
AVideoDataset.py
GDTPretrainDataset.py
112:119 (2%)
83:90 (2%)
view
8 x 2 ROOT
ROOT
model.py
model.py
280:287 (1%)
548:555 (1%)
view
8 x 2 ROOT
ROOT
log_utils.py
log_utils.py
107:114 (3%)
192:199 (3%)
view
Duplicated Units
The list of top 6 duplicated units.
See data for all 6 unit duplicates...
Size#FoldersFilesLinesCode
25 x 2 datasets
datasets
GDTPretrainDataset.py
AVideoDataset.py
0:0 
0:0 
view
11 x 2 ROOT
src
log_utils.py
logger.py
0:0 
0:0 
view
6 x 2 ROOT
src
log_utils.py
logger.py
0:0 
0:0 
view
6 x 2 ROOT
src
log_utils.py
logger.py
0:0 
0:0 
view
6 x 2 ROOT
src
log_utils.py
logger.py
0:0 
0:0 
view
6 x 2 ROOT
src
log_utils.py
logger.py
0:0 
0:0 
view