awslabs / dgl-lifesci
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • You should aim to keep duplicated code as low as possible (below 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
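The procedure described above can be illustrated with a minimal sketch. It is not the report tool's actual implementation: the function names `clean` and `find_duplicate_windows` are hypothetical, the cleaning rules are a rough stand-in that only handles Python-style `#` comments and import lines, and leading whitespace is ignored when comparing lines, which the real tool may or may not do.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean(source: str) -> list[str]:
    """Drop blank lines, '#' comments, and import statements.

    Assumption: this only approximates the report's cleaning step.
    """
    kept = []
    for raw in source.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith(("import ", "from ")):
            continue
        kept.append(line)
    return kept


def find_duplicate_windows(files: dict[str, str], min_block: int = MIN_BLOCK):
    """Return every window of `min_block` cleaned lines that occurs more than once.

    Keys are the windows (tuples of lines); values are lists of
    (file name, index into that file's cleaned lines).
    """
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for start in range(len(lines) - min_block + 1):
            window = tuple(lines[start:start + min_block])
            seen[window].append((name, start))
    return {w: locs for w, locs in seen.items() if len(locs) > 1}
```

Called with a mapping of file names to source text, the function returns only the six-line windows shared by at least two locations. A fuller implementation would merge overlapping windows into maximal blocks, which is presumably how duplicates longer than six lines are reported as single entries.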
Duplication Overall
  • 13% duplication:
    • 9,408 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 1,273 duplicated lines
  • 1,073 duplicates
system: 13% (1,273 lines)
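As a quick check of the overall figure, the ratio can be recomputed from the two line counts given above. The variable names are illustrative only, and the observation that the report seems to drop the fractional part is an inference from the numbers, not something the report states.

```python
cleaned_lines = 9_408     # lines of cleaned code, from the report
duplicated_lines = 1_273  # duplicated lines, from the report
target = 0.05             # the "<5%" guideline from the intro

ratio = duplicated_lines / cleaned_lines
print(f"duplication: {ratio:.1%}")       # about 13.5%; the report shows 13%
print(f"above target: {ratio > target}")
```

Under these numbers the project sits at roughly two and a half times the suggested 5% ceiling.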
Duplication per Extension
  • py: 13% (1,227 lines)
  • yml: 86% (46 lines)
Duplication per Component (primary)
  • python/dgllife/model/pretrain: 31% (725 lines)
  • python/dgllife/data: 12% (248 lines)
  • python/dgllife/utils: 7% (122 lines)
  • python/dgllife/model/model_zoo: 4% (74 lines)
  • python/dgllife/model/gnn: 5% (58 lines)
  • docker/install/conda_env: 100% (46 lines)
  • conda/dgllife: 0% (0 lines)
  • python: 0% (0 lines)
  • python/dgllife: 0% (0 lines)
  • python/dgllife/model: 0% (0 lines)
  • python/dgllife/model/readout: 0% (0 lines)
  • python/dgllife/utils/jtvae: 0% (0 lines)
  • ROOT: 0% (0 lines)
Longest Duplicates
The list of 20 longest duplicates.
| Size | # | Folders | Files | Lines |
|---|---|---|---|---|
| 23 | 2 | docker/install/conda_env | torch_cpu.yml, torch_gpu.yml | 1:23 (100%), 1:23 (100%) |
| 19 | 2 | python/dgllife/utils | complex_to_graph.py, complex_to_graph.py | 76:141 (9%), 207:255 (9%) |
| 16 | 2 | python/dgllife/utils | mol_to_graph.py, mol_to_graph.py | 95:113 (7%), 876:894 (7%) |
| 16 | 2 | python/dgllife/data | muv.py, pcba.py | 120:157 (43%), 117:154 (43%) |
| 14 | 2 | python/dgllife/model/pretrain/moleculenet | muv.py, pcba.py | 202:216 (7%), 208:222 (7%) |
| 14 | 2 | python/dgllife/model/pretrain/moleculenet | freesolv.py, lipophilicity.py | 212:226 (7%), 213:227 (7%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | freesolv.py, hiv.py | 166:178 (7%), 191:203 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bbbp.py, lipophilicity.py | 208:220 (6%), 167:179 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bbbp.py, lipophilicity.py | 208:220 (6%), 197:209 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bace.py, toxcast.py | 176:188 (6%), 195:207 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | sider.py, tox21.py | 197:209 (6%), 194:206 (5%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | freesolv.py, hiv.py | 196:208 (7%), 161:173 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | esol.py, lipophilicity.py | 158:170 (7%), 212:224 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bace.py, sider.py | 161:173 (6%), 167:179 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | pcba.py, toxcast.py | 207:219 (7%), 195:207 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bbbp.py, bbbp.py | 163:175 (6%), 178:190 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bace.py, toxcast.py | 191:203 (6%), 165:177 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bace.py, toxcast.py | 206:218 (6%), 195:207 (6%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | bace.py, tox21.py | 206:218 (6%), 224:236 (5%) |
| 13 | 2 | python/dgllife/model/pretrain/moleculenet | tox21.py, toxcast.py | 224:236 (5%), 195:207 (6%) |
Duplicated Units
The list of top 8 duplicated units.
| Size | # | Folders | Files | Lines |
|---|---|---|---|---|
| 23 | 3 | python/dgllife/data | pcba.py, muv.py, bace.py | 0:0, 0:0, 0:0 |
| 9 | 2 | python/dgllife/model/model_zoo | dgmg.py, dgmg.py | 0:0, 0:0 |
| 18 | 2 | python/dgllife/model/model_zoo | gcn_predictor.py, gat_predictor.py | 0:0, 0:0 |
| 18 | 2 | python/dgllife/model/gnn | gcn.py, graphsage.py | 0:0, 0:0 |
| 16 | 2 | python/dgllife/model/model_zoo | schnet_predictor.py, mgcn_predictor.py | 0:0, 0:0 |
| 10 | 2 | python/dgllife/data | uspto.py, uspto.py | 0:0, 0:0 |
| 7 | 2 | python/dgllife/data | smiles_inference.py, csv_dataset.py | 0:0, 0:0 |
| 17 | 3 | python/dgllife/data | clintox.py, sider.py, toxcast.py | 0:0, 0:0, 0:0 |