aws-samples / amazon-sagemaker-local-mode
Duplication

Places in code with 6 or more lines that are exactly the same.

Intro
  • For duplication, we look at places in code where there are 6 or more lines of code that are exactly the same.
  • Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
  • Aim for as little duplicated code as possible (<5%), since a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
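The steps described above (clean the code, then look for runs of 6 or more identical lines) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the analyzer's actual algorithm: the cleaning rules (dropping blank lines, full-line `#` comments, and import statements, and comparing lines with surrounding whitespace stripped) and the function names `clean_lines`, `find_duplicate_windows`, and `duplication_ratio` are hypothetical.

```python
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines


def clean_lines(source):
    """Drop empty lines, full-line '#' comments, and import statements (assumed rules)."""
    cleaned = []
    for raw in source.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        if line.startswith("import ") or line.startswith("from "):
            continue
        cleaned.append(line)
    return cleaned


def find_duplicate_windows(files, size=MIN_BLOCK):
    """Return every window of `size` cleaned lines that occurs in more than one place."""
    seen = defaultdict(list)
    for name, source in files.items():
        lines = clean_lines(source)
        for start in range(len(lines) - size + 1):
            window = tuple(lines[start:start + size])
            seen[window].append((name, start))
    return {w: locs for w, locs in seen.items() if len(locs) > 1}


def duplication_ratio(files, size=MIN_BLOCK):
    """Share of cleaned lines that fall inside at least one duplicated window."""
    duplicated = set()
    for locations in find_duplicate_windows(files, size).values():
        for name, start in locations:
            duplicated.update((name, start + i) for i in range(size))
    total = sum(len(clean_lines(src)) for src in files.values())
    return len(duplicated) / total if total else 0.0
```

Called with a dict that maps file names to source text, `duplication_ratio` returns a fraction between 0 and 1 that can be compared against the <5% guideline.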
Duplication Overall
  • 53% duplication:
    • 2,345 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
    • 1,248 duplicated lines
  • 88 duplicates
system: 53% (1,248 lines)
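As a quick check of the arithmetic, the overall percentage is the duplicated line count divided by the cleaned line count. The sketch below uses only the two totals reported in this section; the variable names are illustrative.

```python
cleaned_lines = 2345     # lines of cleaned code, from the summary above
duplicated_lines = 1248  # duplicated lines, from the summary above

ratio = duplicated_lines / cleaned_lines
print(f"duplication: {ratio:.1%}")  # duplication: 53.2%

# Largest duplicated-line count that would still stay under the 5% guideline.
max_duplicated = (cleaned_lines * 5 - 1) // 100
print(max_duplicated)  # 117
```

At the current code size, getting under the 5% guideline would mean going from 1,248 duplicated lines to at most 117.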
Duplication per Extension
py: 53% (1,248 lines)
Duplication per Component (primary)
pytorch_script_mode_local_training_and_serving: 82% (137 lines)
blog: 89% (116 lines)
tensorflow_script_mode_california_housing_local_training_and_batch_transform: 93% (101 lines)
tensorflow_script_mode_california_housing_local_training_and_serving: 89% (93 lines)
tensorflow_script_mode_local_training_and_serving: 65% (85 lines)
tensorflow_script_mode_debug_local_training: 88% (85 lines)
tensorflow_bring_your_own_california_housing_local_training_and_batch_transform: 70% (73 lines)
scikit_learn_local_processing: 100% (58 lines)
tensorflow_bring_your_own_california_housing_local_training_and_serving: 82% (51 lines)
scikit_learn_bring_your_own_container_local_processing: 85% (49 lines)
pytorch_script_mode_local_model_inference: 76% (48 lines)
scikit_learn_bring_your_own_container_and_own_model_local_serving: 61% (44 lines)
catboost_bring_your_own_container_local_training_and_serving: 59% (42 lines)
lightgbm_bring_your_own_container_local_training_and_serving: 48% (36 lines)
delta_lake_bring_your_own_container_local_training_and_serving: 63% (35 lines)
scikit_learn_nltk_local_processing: 42% (27 lines)
prophet_bring_your_own_container_local_training_and_serving: 45% (25 lines)
catboost_scikit_learn_script_mode_local_training_and_serving: 22% (24 lines)
delta_sharing_scikit_learn_local_training_and_serving: 25% (18 lines)
scikit_learn_bring_your_own_model_local_serving: 47% (18 lines)
pytorch_nlp_script_mode_local_model_inference: 37% (18 lines)
scikit_learn_script_mode_local_training_and_serving: 22% (17 lines)
delta_sharing_bring_your_own_container_local_processing: 20% (13 lines)
scikit_learn_script_mode_local_serving_no_model_artifact: 28% (13 lines)
tensorflow_script_mode_local_model_inference: 13% (8 lines)
huggingface_hebert_sentiment_analysis_local_serving: 15% (8 lines)
tensorflow_script_mode_local_gpu_training_resnet50: 6% (6 lines)
xgboost_script_mode_local_training_and_serving: 0% (0 lines)
hdbscan_bring_your_own_container_local_training: 0% (0 lines)

Duplication Between Components (50+ lines)

Component pairs and the number of duplicated lines between them:
  • blog -- pytorch_script_mode_local_training_and_serving: 232 lines
  • tensorflow_script_mode_debug_local_training -- tensorflow_script_mode_local_training_and_serving: 170 lines
  • tensorflow_script_mode_california_housing_local_training_and_batch_transform -- tensorflow_script_mode_california_housing_local_training_and_serving: 142 lines
  • tensorflow_bring_your_own_california_housing_local_training_and_batch_transform -- tensorflow_script_mode_california_housing_local_training_and_batch_transform: 100 lines
  • tensorflow_bring_your_own_california_housing_local_training_and_batch_transform -- tensorflow_bring_your_own_california_housing_local_training_and_serving: 58 lines
  • scikit_learn_bring_your_own_container_local_processing -- scikit_learn_local_processing: 98 lines
  • scikit_learn_local_processing -- scikit_learn_nltk_local_processing: 54 lines
  • tensorflow_bring_your_own_california_housing_local_training_and_serving -- tensorflow_script_mode_california_housing_local_training_and_serving: 80 lines
  • pytorch_script_mode_local_model_inference -- pytorch_script_mode_local_training_and_serving: 76 lines
  • catboost_bring_your_own_container_local_training_and_serving -- delta_lake_bring_your_own_container_local_training_and_serving: 70 lines
  • catboost_bring_your_own_container_local_training_and_serving -- scikit_learn_bring_your_own_container_and_own_model_local_serving: 66 lines
  • catboost_bring_your_own_container_local_training_and_serving -- lightgbm_bring_your_own_container_local_training_and_serving: 60 lines
  • catboost_bring_your_own_container_local_training_and_serving -- prophet_bring_your_own_container_local_training_and_serving: 50 lines
  • delta_lake_bring_your_own_container_local_training_and_serving -- scikit_learn_bring_your_own_container_and_own_model_local_serving: 66 lines
  • delta_lake_bring_your_own_container_local_training_and_serving -- prophet_bring_your_own_container_local_training_and_serving: 50 lines
  • prophet_bring_your_own_container_local_training_and_serving -- scikit_learn_bring_your_own_container_and_own_model_local_serving: 50 lines
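One way to read the graph above is to add up, for each component, the weights of all pairs it takes part in; a component with a high total is a likely candidate for extracting shared code. The sketch below does this for a subset of the graph only: the pairs among the bring-your-own-container serving components, with weights copied from the graph. The names `PAIRS` and `shared_totals` are illustrative.

```python
from collections import Counter

CATBOOST = "catboost_bring_your_own_container_local_training_and_serving"
DELTA_LAKE = "delta_lake_bring_your_own_container_local_training_and_serving"
LIGHTGBM = "lightgbm_bring_your_own_container_local_training_and_serving"
PROPHET = "prophet_bring_your_own_container_local_training_and_serving"
SKLEARN = "scikit_learn_bring_your_own_container_and_own_model_local_serving"

# (component, component, duplicated lines) as reported in the graph; subset only.
PAIRS = [
    (CATBOOST, DELTA_LAKE, 70),
    (CATBOOST, SKLEARN, 66),
    (CATBOOST, LIGHTGBM, 60),
    (CATBOOST, PROPHET, 50),
    (DELTA_LAKE, SKLEARN, 66),
    (DELTA_LAKE, PROPHET, 50),
    (PROPHET, SKLEARN, 50),
]


def shared_totals(pairs):
    """Sum the pair weights per component."""
    totals = Counter()
    for left, right, lines in pairs:
        totals[left] += lines
        totals[right] += lines
    return totals


totals = shared_totals(PAIRS)
for component, lines in totals.most_common():
    print(f"{lines:4d}  {component}")
```

Within this subset the catboost component has the highest total (246), which is consistent with its predictor.py appearing in many of the longest duplicates.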

Longest Duplicates
The list of 20 longest duplicates.
Size x # of occurrences, then per occurrence: Folder | File | Lines
  • 99 x 2
    • blog/pytorch_cnn_cifar10/source | cifar10.py | 23:155 (91%)
    • pytorch_script_mode_loca...aining_and_serving/code | cifar10_pytorch.py | 18:150 (94%)
  • 36 x 2
    • tensorflow_script_mode_d...cal_training/source_dir | mnist_tf2.py | 25:81 (97%)
    • tensorflow_script_mode_l...aining_and_serving/code | mnist_tf2.py | 21:77 (100%)
  • 35 x 2
    • catboost_bring_your_own_...iner/catboost_regressor | predictor.py | 18:83 (100%)
    • delta_lake_bring_your_ow...iner/catboost_regressor | predictor.py | 18:83 (100%)
  • 35 x 2
    • tensorflow_bring_your_ow...ing_and_batch_transform | tensorflow_bring_your_own_california_... | 21:70 (53%)
    • tensorflow_script_mode_c...ing_and_batch_transform | tensorflow_script_mode_california_hou... | 23:72 (56%)
  • 33 x 2
    • scikit_learn_bring_your_...tainer_local_processing | processing_script.py | 5:50 (100%)
    • scikit_learn_local_processing | processing_script.py | 5:50 (100%)
  • 32 x 2
    • tensorflow_bring_your_ow...al_training_and_serving | tensorflow_bring_your_own_california_... | 24:69 (52%)
    • tensorflow_script_mode_c...al_training_and_serving | tensorflow_script_mode_california_hou... | 25:70 (55%)
  • 29 x 2
    • tensorflow_script_mode_c...nd_batch_transform/code | california_housing_tf2.py | 40:83 (63%)
    • tensorflow_script_mode_c...aining_and_serving/code | california_housing_tf2.py | 38:81 (63%)
  • 26 x 2
    • catboost_bring_your_own_...iner/catboost_regressor | predictor.py | 33:83 (74%)
    • scikit_learn_bring_your_...er/sklearn_rf_regressor | predictor.py | 32:82 (76%)
  • 26 x 2
    • delta_lake_bring_your_ow...iner/catboost_regressor | predictor.py | 33:83 (74%)
    • scikit_learn_bring_your_...er/sklearn_rf_regressor | predictor.py | 32:82 (76%)
  • 21 x 2
    • pytorch_script_mode_local_model_inference | utils_cifar.py | 7:36 (100%)
    • pytorch_script_mode_local_training_and_serving | utils_cifar.py | 7:36 (100%)
  • 18 x 2
    • prophet_bring_your_own_c...rving/container/prophet | predictor.py | 54:89 (46%)
    • scikit_learn_bring_your_...er/sklearn_rf_regressor | predictor.py | 45:78 (52%)
  • 18 x 2
    • delta_lake_bring_your_ow...iner/catboost_regressor | predictor.py | 46:79 (51%)
    • prophet_bring_your_own_c...rving/container/prophet | predictor.py | 54:89 (46%)
  • 18 x 2
    • catboost_bring_your_own_...iner/catboost_regressor | predictor.py | 46:79 (51%)
    • prophet_bring_your_own_c...rving/container/prophet | predictor.py | 54:89 (46%)
  • 17 x 2
    • pytorch_script_mode_local_model_inference | pytorch_script_mode_local_model_infer... | 25:47 (40%)
    • pytorch_script_mode_local_training_and_serving | pytorch_script_mode_local_training_an... | 23:45 (41%)
  • 16 x 2
    • catboost_bring_your_own_...iner/catboost_regressor | predictor.py | 33:67 (45%)
    • lightgbm_bring_your_own_...ner/lightgbm_regression | predictor.py | 33:70 (47%)
  • 16 x 2
    • scikit_learn_bring_your_...tainer_local_processing | scikit_learn_bring_your_own_container... | 28:47 (66%)
    • scikit_learn_local_processing | SKLearnProcessor_local_processing.py | 31:50 (64%)
  • 16 x 2
    • delta_lake_bring_your_ow...iner/catboost_regressor | predictor.py | 33:67 (45%)
    • lightgbm_bring_your_own_...ner/lightgbm_regression | predictor.py | 33:70 (47%)
  • 16 x 2
    • lightgbm_bring_your_own_...ner/lightgbm_regression | predictor.py | 33:70 (47%)
    • scikit_learn_bring_your_...er/sklearn_rf_regressor | predictor.py | 32:66 (47%)
  • 15 x 2
    • tensorflow_script_mode_c...ing_and_batch_transform | tensorflow_script_mode_california_hou... | 71:87 (24%)
    • tensorflow_script_mode_c...al_training_and_serving | tensorflow_script_mode_california_hou... | 78:94 (25%)
  • 14 x 2
    • tensorflow_script_mode_debug_local_training | tensorflow_script_mode_debug_local_tr... | 22:37 (23%)
    • tensorflow_script_mode_local_training_and_serving | tensorflow_script_mode_local_training... | 26:41 (15%)
Duplicated Units
The list of top 17 duplicated units.
Size x # of occurrences, then per occurrence: Folder | File (the report lists the line range as 0:0 for every unit)
  • 57 x 2
    • blog/pytorch_cnn_cifar10/source | cifar10.py
    • pytorch_script_mode_loca...aining_and_serving/code | cifar10_pytorch.py
  • 32 x 2
    • tensorflow_bring_your_ow...ing_and_batch_transform | tensorflow_bring_your_own_california_...
    • tensorflow_script_mode_c...ing_and_batch_transform | tensorflow_script_mode_california_hou...
  • 29 x 2
    • scikit_learn_local_processing | processing_script.py
    • scikit_learn_bring_your_...tainer_local_processing | processing_script.py
  • 27 x 2
    • tensorflow_bring_your_ow...al_training_and_serving | tensorflow_bring_your_own_california_...
    • tensorflow_script_mode_c...al_training_and_serving | tensorflow_script_mode_california_hou...
  • 20 x 3
    • delta_lake_bring_your_ow...iner/catboost_regressor | predictor.py
    • scikit_learn_bring_your_...er/sklearn_rf_regressor | predictor.py
    • catboost_bring_your_own_...iner/catboost_regressor | predictor.py
  • 13 x 2
    • tensorflow_script_mode_l...aining_and_serving/code | mnist_tf2.py
    • tensorflow_script_mode_d...cal_training/source_dir | mnist_tf2.py
  • 11 x 2
    • tensorflow_script_mode_local_training_and_serving | tensorflow_script_mode_local_training...
    • tensorflow_script_mode_debug_local_training | tensorflow_script_mode_debug_local_tr...
  • 10 x 2
    • tensorflow_script_mode_local_training_and_serving | tensorflow_script_mode_local_training...
    • tensorflow_script_mode_debug_local_training | tensorflow_script_mode_debug_local_tr...
  • 9 x 2
    • blog/pytorch_cnn_cifar10/source | cifar10.py
    • pytorch_script_mode_loca...aining_and_serving/code | cifar10_pytorch.py
  • 8 x 2
    • pytorch_script_mode_local_training_and_serving | pytorch_script_mode_local_training_an...
    • pytorch_script_mode_local_model_inference | pytorch_script_mode_local_model_infer...
  • 11 x 2
    • tensorflow_script_mode_c...nd_batch_transform/code | california_housing_tf2.py
    • tensorflow_script_mode_c...aining_and_serving/code | california_housing_tf2.py
  • 9 x 2
    • tensorflow_script_mode_l...aining_and_serving/code | mnist_tf2.py
    • tensorflow_script_mode_d...cal_training/source_dir | mnist_tf2.py
  • 7 x 2
    • blog/pytorch_cnn_cifar10/source | cifar10.py
    • pytorch_script_mode_loca...aining_and_serving/code | cifar10_pytorch.py
  • 7 x 2
    • blog/pytorch_cnn_cifar10/source | cifar10.py
    • pytorch_script_mode_loca...aining_and_serving/code | cifar10_pytorch.py
  • 7 x 2
    • pytorch_script_mode_local_training_and_serving | pytorch_script_mode_local_training_an...
    • pytorch_script_mode_local_model_inference | pytorch_script_mode_local_model_infer...
  • 7 x 2
    • huggingface_hebert_senti...ysis_local_serving/code | inference.py
    • pytorch_nlp_script_mode_local_model_inference/code | inference.py
  • 6 x 4
    • delta_lake_bring_your_ow...iner/catboost_regressor | predictor.py
    • lightgbm_bring_your_own_...ner/lightgbm_regression | predictor.py
    • scikit_learn_bring_your_...er/sklearn_rf_regressor | predictor.py
    • catboost_bring_your_own_...iner/catboost_regressor | predictor.py