huggingface / transformers-research-projects
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
Learn more...
Duplication Overall
system49% (18,522 lines)
dependency graphs: 2D graph | 3D graph | 3D graph (with duplicates)...
Duplication per Extension
py50% (18,404 lines)
ipynb18% (118 lines)
Duplication per Component (primary)
visual_bert100% (2,367 lines)
lxmert100% (2,367 lines)
movement-pruning56% (1,835 lines)
rag82% (1,472 lines)
rag-end2end-retriever82% (1,463 lines)
tapex75% (1,062 lines)
jax-projects31% (825 lines)
robust-speech-event73% (802 lines)
seq2seq-distillation31% (762 lines)
bert-loses-patience78% (759 lines)
deebert67% (630 lines)
distillation33% (586 lines)
wav2vec245% (474 lines)
quantization-qdqbert33% (453 lines)
bertology64% (387 lines)
xtreme-s51% (368 lines)
mm-imdb59% (319 lines)
performer26% (309 lines)
self-training-text-classification30% (261 lines)
layoutlmv348% (192 lines)
mlm_wwm41% (179 lines)
pplm14% (147 lines)
luke16% (103 lines)
adversarial27% (98 lines)
longform-qa10% (89 lines)
information-gain-filtration12% (59 lines)
codeparrot5% (54 lines)
onnx4% (36 lines)
zero-shot-distillation9% (24 lines)
bertabs2% (22 lines)
synthid_text2% (18 lines)
fsner0% (0 lines)
vqgan-clip0% (0 lines)
decision_transformer0% (0 lines)
token-healing0% (0 lines)

Duplication Between Components (50+ lines)

G lxmert lxmert visual_bert visual_bert lxmert--visual_bert 4734 rag-end2end-retriever rag-end2end-retriever rag rag rag-end2end-retriever--rag 2926 seq2seq-distillation seq2seq-distillation rag-end2end-retriever--seq2seq-distillation 1022 rag--seq2seq-distillation 1040 distillation distillation movement-pruning movement-pruning distillation--movement-pruning 1359 distillation--seq2seq-distillation 86 mm-imdb mm-imdb distillation--mm-imdb 460 quantization-qdqbert quantization-qdqbert distillation--quantization-qdqbert 102 movement-pruning--rag-end2end-retriever 90 movement-pruning--rag 135 movement-pruning--seq2seq-distillation 133 movement-pruning--quantization-qdqbert 108 bert-loses-patience bert-loses-patience bert-loses-patience--rag 95 bert-loses-patience--distillation 500 bert-loses-patience--movement-pruning 1213 deebert deebert bert-loses-patience--deebert 829 bert-loses-patience--mm-imdb 482 bertology bertology bert-loses-patience--bertology 83 deebert--rag 95 deebert--distillation 520 deebert--movement-pruning 1175 deebert--seq2seq-distillation 93 deebert--mm-imdb 521 robust-speech-event robust-speech-event xtreme-s xtreme-s robust-speech-event--xtreme-s 921 tapex tapex robust-speech-event--tapex 204 wav2vec2 wav2vec2 robust-speech-event--wav2vec2 302 mm-imdb--rag 90 mm-imdb--movement-pruning 620 mm-imdb--seq2seq-distillation 88 quantization-qdqbert--tapex 531 tapex--xtreme-s 203 tapex--wav2vec2 191 layoutlmv3 layoutlmv3 layoutlmv3--robust-speech-event 84 layoutlmv3--quantization-qdqbert 252 layoutlmv3--tapex 460 jax-projects jax-projects jax-projects--quantization-qdqbert 175 jax-projects--tapex 295 jax-projects--layoutlmv3 116 performer performer jax-projects--performer 427 jax-projects--wav2vec2 392 mlm_wwm mlm_wwm jax-projects--mlm_wwm 287 wav2vec2--xtreme-s 127 mlm_wwm--quantization-qdqbert 120 mlm_wwm--tapex 264 mlm_wwm--performer 212 bertology--movement-pruning 108 bertology--deebert 83 bertology--mm-imdb 87 adversarial adversarial adversarial--jax-projects 93 luke luke self-training-text-classification self-training-text-classification luke--self-training-text-classification 87

Download: SVG DOT (open online Graphviz editor)

Open 2D force graph... Open 3D force graph...

Show more details on duplication between components...
Longest Duplicates
The list of 50 longest duplicates.
See data for all 1,488 duplicates...
Size#FoldersFilesLinesCode
1324 x 2 lxmert
visual_bert
37:1920 (100%)
37:1920 (100%)
view
440 x 2 lxmert
visual_bert
33:500 (100%)
33:500 (100%)
view
399 x 2 lxmert
visual_bert
47:554 (100%)
47:554 (100%)
view
261 x 2 rag-end2end-retriever
rag
17:320 (100%)
17:320 (100%)
view
212 x 2 tapex
tapex
122:386 (44%)
120:384 (45%)
view
171 x 2 rag-end2end-retriever
rag
21:244 (100%)
21:244 (100%)
view
127 x 2 rag
seq2seq-distillation
11:168 (39%)
11:168 (39%)
view
109 x 2 lxmert
visual_bert
24:147 (100%)
24:147 (100%)
view
103 x 2 tapex
tapex
426:550 (21%)
402:526 (22%)
view
100 x 2 rag-end2end-retriever
seq2seq-distillation
11:134 (30%)
11:134 (31%)
view
100 x 2 rag-end2end-retriever
rag
11:134 (30%)
11:134 (30%)
view
95 x 2 lxmert
visual_bert
31:151 (100%)
31:151 (100%)
view
88 x 2 rag
seq2seq-distillation
172:266 (27%)
172:266 (27%)
view
79 x 2 robust-speech-event
robust-speech-event
59:155 (14%)
58:154 (16%)
view
77 x 2 movement-pruning
movement-pruning
652:735 (9%)
770:853 (8%)
view
75 x 2 tapex
tapex
554:649 (15%)
530:625 (16%)
view
75 x 2 distillation
movement-pruning
539:617 (11%)
682:760 (8%)
view
75 x 2 rag-end2end-retriever
rag
138:227 (22%)
136:225 (23%)
view
73 x 2 rag-end2end-retriever
rag
502:574 (11%)
378:450 (13%)
view
73 x 2 movement-pruning
movement-pruning
106:196 (9%)
113:203 (7%)
view
67 x 2 seq2seq-distillation
seq2seq-distillation
29:97 (18%)
16:84 (49%)
view
66 x 2 rag
seq2seq-distillation
278:354 (20%)
269:345 (20%)
view
61 x 2 rag-end2end-retriever
rag
414:487 (9%)
301:374 (11%)
view
61 x 2 bertology
bertology
304:376 (19%)
279:351 (21%)
view
61 x 2 tapex
tapex
40:120 (12%)
38:118 (13%)
view
59 x 2 rag-end2end-retriever
rag
43:117 (70%)
41:114 (71%)
view
58 x 2 wav2vec2
wav2vec2
223:343 (18%)
142:262 (17%)
view
57 x 2 jax-projects/wav2vec2
wav2vec2
283:363 (14%)
287:367 (22%)
view
57 x 2 mlm_wwm
performer
114:177 (18%)
124:187 (13%)
view
56 x 2 jax-projects/dataset-streaming
performer
109:170 (14%)
124:185 (13%)
view
56 x 2 jax-projects/dataset-streaming
mlm_wwm
109:170 (14%)
114:175 (18%)
view
53 x 2 robust-speech-event
xtreme-s
208:260 (9%)
264:316 (7%)
view
52 x 2 rag-end2end-retriever
rag
642:698 (7%)
487:544 (9%)
view
52 x 2 bert-loses-patience
deebert
587:652 (8%)
540:605 (8%)
view
50 x 2 jax-projects/dataset-streaming
jax-projects/model_parallel
67:128 (12%)
60:121 (11%)
view
50 x 2 robust-speech-event
xtreme-s
81:130 (10%)
94:143 (7%)
view
50 x 2 robust-speech-event
xtreme-s
82:131 (9%)
94:143 (7%)
view
50 x 2 deebert/src
deebert/src
337:397 (20%)
95:154 (52%)
view
49 x 2 rag-end2end-retriever
seq2seq-distillation
58:117 (58%)
22:81 (60%)
view
49 x 2 rag
seq2seq-distillation
55:114 (59%)
22:81 (60%)
view
49 x 2 rag-end2end-retriever
rag
16:102 (43%)
23:110 (37%)
view
49 x 2 rag-end2end-retriever
rag
198:254 (7%)
197:254 (9%)
view
48 x 2 bert-loses-patience
movement-pruning
458:508 (8%)
599:648 (6%)
view
47 x 2 rag-end2end-retriever
seq2seq-distillation
174:227 (14%)
172:225 (14%)
view
47 x 2 distillation
movement-pruning
459:516 (6%)
628:685 (5%)
view
46 x 2 rag-end2end-retriever
rag
229:278 (13%)
227:275 (14%)
view
45 x 2 bert-loses-patience
deebert
398:447 (7%)
385:434 (7%)
view
45 x 2 distillation
mm-imdb
707:766 (6%)
435:494 (10%)
view
44 x 2 self-training-text-classification
self-training-text-classification
280:331 (7%)
342:393 (7%)
view
42 x 2 deebert
mm-imdb
517:567 (7%)
410:460 (9%)
view
Duplicated Units
The list of top 10 duplicated units.
See data for all 10 unit duplicates...
Size#FoldersFilesLinesCode
144 x 2 visual_bert
lxmert
0:0 
0:0 
view
109 x 2 rag
rag-end2end-retriever
0:0 
0:0 
view
58 x 2 seq2seq-distillation
rag
0:0 
0:0 
view
32 x 2 visual_bert
lxmert
0:0 
0:0 
view
29 x 2 visual_bert
lxmert
0:0 
0:0 
view
29 x 2 visual_bert
lxmert
0:0 
0:0 
view
28 x 2 visual_bert
lxmert
0:0 
0:0 
view
23 x 2 rag
rag-end2end-retriever
0:0 
0:0 
view
9 x 2 deebert/src
deebert/src
0:0 
0:0 
view
7 x 2 rag
rag-end2end-retriever
0:0 
0:0 
view