awslabs / open-data-registry
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim at having as little as possible (<5%) of duplicated code as high-level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
Learn more...
Duplication Overall
  • 18% duplication:
    • 13,948 cleaned lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 2,578 duplicated lines
  • 1,082 duplicates
system18% (2,578 lines)
Duplication per Extension
yaml18% (2,578 lines)
Duplication per Component (primary)
datasets18% (2,524 lines)
ROOT8% (54 lines)
Longest Duplicates
The list of 20 longest duplicates.
See data for all 1,082 duplicates...
Size#FoldersFilesLinesCode
52 x 2 datasets
datasets
landsat-8.yaml
usgs-landsat.yaml
64:115 (45%)
73:124 (41%)
view
23 x 2 datasets
datasets
ncar-cesm-lens.yaml
ncar-cesm2-lens.yaml
7:29 (48%)
7:29 (53%)
view
20 x 2 datasets
datasets
deafrica-geomad.yaml
deafrica-sentinel-2.yaml
85:104 (19%)
61:80 (25%)
view
20 x 2 datasets
datasets
deafrica-sentinel-2.yaml
deafrica-wofs.yaml
61:80 (25%)
75:94 (21%)
view
20 x 2 datasets
datasets
deafrica-alos-jers.yaml
deafrica-geomad.yaml
55:74 (27%)
85:104 (19%)
view
20 x 2 datasets
datasets
deafrica-geomad.yaml
deafrica-wofs.yaml
85:104 (19%)
75:94 (21%)
view
20 x 2 datasets
datasets
noaa-cdr-atmospheric.yaml
noaa-cdr-fundamental.yaml
6:25 (15%)
6:25 (16%)
view
20 x 2 datasets
datasets
noaa-cdr-atmospheric.yaml
noaa-cdr-terrestrial.yaml
6:25 (15%)
6:25 (46%)
view
20 x 2 datasets
datasets
noaa-cdr-fundamental.yaml
noaa-cdr-terrestrial.yaml
6:25 (16%)
6:25 (46%)
view
20 x 2 datasets
datasets
deafrica-alos-jers.yaml
deafrica-sentinel-2.yaml
55:74 (27%)
61:80 (25%)
view
20 x 2 datasets
datasets
deafrica-alos-jers.yaml
deafrica-wofs.yaml
55:74 (27%)
75:94 (21%)
view
19 x 2 datasets
datasets
deafrica-geomad.yaml
deafrica-wofs.yaml
13:31 (18%)
9:27 (20%)
view
19 x 2 datasets
datasets
deafrica-sentinel-1.yaml
deafrica-sentinel-2.yaml
11:29 (24%)
12:30 (23%)
view
18 x 2 datasets
datasets
wrf-alaska-snap.yaml
wrf-se-alaska-snap.yaml
5:22 (56%)
5:22 (56%)
view
18 x 2 ROOT
ROOT
schema.yaml
schema.yaml
104:121 (10%)
148:165 (10%)
view
18 x 2 ROOT
ROOT
schema.yaml
schema.yaml
104:121 (10%)
129:146 (10%)
view
18 x 2 ROOT
ROOT
schema.yaml
schema.yaml
129:146 (10%)
148:165 (10%)
view
17 x 2 datasets
datasets
deafrica-alos-jers.yaml
deafrica-sentinel-2.yaml
10:26 (22%)
12:28 (21%)
view
17 x 2 datasets
datasets
deafrica-alos-jers.yaml
deafrica-wofs.yaml
10:26 (22%)
9:25 (18%)
view
17 x 2 datasets
datasets
deafrica-geomad.yaml
deafrica-sentinel-1.yaml
13:29 (16%)
11:27 (21%)
view