huggingface / datasets
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
Learn more...
Duplication Overall
system16% (3,218 lines)
dependency graphs: 2D graph | 3D graph | 3D graph (with duplicates)...
Duplication per Extension
py16% (3,218 lines)
Duplication per Component (primary)
src17% (3,192 lines)
benchmarks7% (26 lines)
utils0% (0 lines)
ROOT0% (0 lines)
Longest Duplicates
The list of 50 longest duplicates.
See data for all 324 duplicates...
Size#FoldersFilesLinesCode
137 x 2 src/datasets
src/datasets
1774:1920 (13%)
2590:2736 (13%)
view
56 x 2 src/datasets
src/datasets
1440:1506 (4%)
1695:1761 (4%)
view
42 x 2 src/datasets
src/datasets
1619:1756 (4%)
2445:2572 (4%)
view
38 x 2 src/datasets/features
src/datasets/features
240:290 (26%)
207:257 (30%)
view
38 x 2 src/datasets
src/datasets
1882:1920 (3%)
4225:4263 (1%)
view
38 x 2 src/datasets
src/datasets
2698:2736 (3%)
4225:4263 (1%)
view
36 x 2 src/datasets/features
src/datasets/features
196:253 (15%)
224:281 (21%)
view
28 x 2 src/datasets/features
src/datasets/features
250:290 (19%)
253:293 (12%)
view
28 x 2 src/datasets/features
src/datasets/features
253:293 (12%)
217:257 (22%)
view
25 x 2 src/datasets
src/datasets
1409:1437 (2%)
1666:1694 (2%)
view
25 x 2 src/datasets/features
src/datasets/features
173:216 (19%)
228:271 (14%)
view
25 x 2 src/datasets/download
src/datasets
15:44 (25%)
7:35 (33%)
view
25 x 2 src/datasets/features
src/datasets/features
200:243 (10%)
173:216 (19%)
view
24 x 2 src/datasets/io
src/datasets/io
39:66 (21%)
33:60 (50%)
view
23 x 2 src/datasets/packaged_modules/text
src/datasets/packaged_modules/xml
31:62 (30%)
27:58 (58%)
view
22 x 2 src/datasets/io
src/datasets/io
16:37 (19%)
10:31 (45%)
view
22 x 2 src/datasets/io
src/datasets/io
18:39 (24%)
10:31 (45%)
view
22 x 2 src/datasets
src/datasets
124:145 (27%)
190:211 (27%)
view
22 x 2 src/datasets/io
src/datasets/io
16:37 (19%)
18:39 (24%)
view
21 x 2 src/datasets/io
src/datasets/io
45:69 (15%)
36:60 (43%)
view
21 x 2 src/datasets/io
src/datasets/io
42:66 (18%)
45:69 (15%)
view
21 x 2 src/datasets/io
src/datasets/io
42:66 (18%)
46:70 (23%)
view
21 x 2 src/datasets/utils
src/datasets/utils
273:293 (6%)
442:462 (6%)
view
21 x 2 src/datasets/io
src/datasets/io
45:69 (15%)
46:70 (23%)
view
21 x 2 src/datasets/io
src/datasets/io
46:70 (23%)
36:60 (43%)
view
20 x 2 src/datasets
src/datasets
1589:1612 (<1%)
1713:1736 (<1%)
view
18 x 2 src/datasets/packaged_modules/folder_based_builder
src/datasets/packaged_modules/json
314:333 (5%)
135:154 (13%)
view
18 x 2 src/datasets
src/datasets
323:341 (<1%)
395:413 (<1%)
view
18 x 2 src/datasets/packaged_modules/csv
src/datasets/packaged_modules/text
148:168 (11%)
31:55 (24%)
view
18 x 2 src/datasets/utils
src/datasets/utils
279:296 (5%)
430:447 (5%)
view
18 x 2 src/datasets/packaged_modules/csv
src/datasets/packaged_modules/xml
148:168 (11%)
27:51 (46%)
view
17 x 2 src/datasets/utils
src/datasets/utils
361:377 (5%)
385:401 (5%)
view
15 x 2 src/datasets
src/datasets
2659:2674 (1%)
4192:4207 (<1%)
view
15 x 2 src/datasets/packaged_modules/json
src/datasets/packaged_modules/text
71:88 (11%)
32:53 (20%)
view
15 x 2 src/datasets/packaged_modules/json
src/datasets/packaged_modules/xml
71:88 (11%)
28:49 (38%)
view
15 x 2 src/datasets
src/datasets
1843:1858 (1%)
4192:4207 (<1%)
view
15 x 2 src/datasets/utils
src/datasets/utils
430:444 (4%)
448:462 (4%)
view
15 x 2 src/datasets
src/datasets
1588:1603 (1%)
1840:1855 (1%)
view
15 x 2 src/datasets/packaged_modules/csv
src/datasets/packaged_modules/json
149:166 (9%)
71:88 (11%)
view
14 x 2 src/datasets/utils
src/datasets/utils
361:374 (4%)
407:420 (4%)
view
14 x 2 src/datasets
src/datasets
2912:2925 (<1%)
2992:3005 (<1%)
view
14 x 2 src/datasets/packaged_modules/arrow
src/datasets/packaged_modules/parquet
28:45 (28%)
42:59 (17%)
view
14 x 2 src/datasets
src/datasets
2679:2693 (1%)
4209:4223 (<1%)
view
14 x 2 src/datasets/utils
src/datasets/utils
297:310 (4%)
431:444 (4%)
view
14 x 2 src/datasets/utils
src/datasets/utils
297:310 (4%)
449:462 (4%)
view
14 x 2 src/datasets/utils
src/datasets/utils
258:271 (4%)
427:440 (4%)
view
14 x 2 src/datasets
src/datasets
1863:1877 (1%)
4209:4223 (<1%)
view
14 x 2 src/datasets/utils
src/datasets/utils
385:398 (4%)
407:420 (4%)
view
14 x 2 src/datasets/utils
src/datasets/utils
280:293 (4%)
297:310 (4%)
view
13 x 2 src/datasets/utils
src/datasets/utils
259:271 (4%)
409:421 (4%)
view
Duplicated Units
The list of top 1 duplicated units.
See data for all 1 unit duplicate
Size#FoldersFilesLinesCode
19 x 2 src/datasets/io
src/datasets/io
0:0 
0:0 
view