awslabs / sagemaker-graph-entity-resolution

Places in code with 6 or more lines that are exactly the same.

Intro

Duplication is measured by finding places in the code where 6 or more consecutive lines are exactly the same.
Before duplication is calculated, the code is cleaned to remove empty lines, comments, and frequently duplicated constructs such as imports.
Aim to keep duplicated code as low as possible (below 5%), because a high level of duplication can lead to maintenance difficulties, poor factoring, and logical contradictions.
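The cleaning and matching steps described above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: the function names (`clean`, `find_duplicates`), the Python-only comment and import heuristic, and the choice to ignore indentation when comparing lines are all assumptions made for the example.

```python
import re
from collections import defaultdict

MIN_BLOCK = 6  # threshold stated in the report: 6 or more identical lines

# Assumption: a crude Python-only heuristic for comments and imports.
# Docstrings and multi-line imports are not handled.
_SKIP = re.compile(r"^(#|import\s|from\s+\S+\s+import\s)")


def clean(source):
    """Return (original_line_number, text) pairs for lines that survive cleaning."""
    kept = []
    for number, raw in enumerate(source.splitlines(), start=1):
        text = raw.strip()  # assumption: indentation is ignored when comparing
        if text and not _SKIP.match(text):
            kept.append((number, text))
    return kept


def find_duplicates(files, min_block=MIN_BLOCK):
    """Map each window of min_block cleaned lines to the places where it occurs.

    `files` maps a file name to its source text. Only windows that occur in
    more than one place are returned. Overlapping windows are not merged into
    longer duplicates, which a real tool would likely do.
    """
    windows = defaultdict(list)
    for name, source in files.items():
        lines = clean(source)
        for i in range(len(lines) - min_block + 1):
            chunk = tuple(text for _, text in lines[i:i + min_block])
            start, end = lines[i][0], lines[i + min_block - 1][0]
            windows[chunk].append((name, start, end))
    return {chunk: places for chunk, places in windows.items() if len(places) > 1}
```

Each reported place is a `(file, first_line, last_line)` tuple in original line numbers, which is why a duplicate of 6 cleaned lines can span more than 6 lines in the file.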

Duplication Overall

9% duplication:

1,343 lines of cleaned code (without empty lines, comments, and frequently duplicated constructs such as imports)
122 duplicated lines

14 duplicates
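The headline percentage follows from the two line counts above. A small sketch of the arithmetic, assuming the figure is simply duplicated lines divided by cleaned lines (the helper name `duplication_percent` is hypothetical):

```python
def duplication_percent(duplicated_lines, cleaned_lines):
    """Share of cleaned lines that are part of a duplicate, in percent."""
    if cleaned_lines <= 0:
        raise ValueError("cleaned_lines must be positive")
    return 100.0 * duplicated_lines / cleaned_lines


# Counts taken from the report: 122 duplicated lines out of 1,343 cleaned lines.
percent = duplication_percent(122, 1343)
print(f"{percent:.1f}% duplicated; under 5% target: {percent < 5}")
# prints "9.1% duplicated; under 5% target: False"
```

The unrounded value is about 9.08%, which the report shows as 9%.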

Duplication per Extension

Duplication per Component (primary)

Longest Duplicates

The list of 14 longest duplicates.

Size (lines)	Copies	Folder 1	Folder 2	File 1	File 2	Lines 1	Lines 2
10	x 2	source/sagemaker/data-preparation	source/sagemaker/data-preprocessing	data_prep.py	data_preprocessing.py	8:19 (25%)	20:31 (11%)
9	x 2	source/sagemaker/baseline	source/sagemaker/sagemak...n/dgl_entity_resolution	train_pytorch_mlp_entity_resolution.py	utils.py	66:74 (6%)	161:169 (6%)
8	x 2	source/sagemaker/sagemak...n/dgl_entity_resolution	source/sagemaker/sagemak...n/dgl_entity_resolution	utils.py	utils.py	87:94 (5%)	110:117 (5%)
8	x 2	source/sagemaker/baseline	source/sagemaker/data-preparation	train_pytorch_mlp_entity_resolution.py	data_prep.py	104:113 (6%)	8:17 (20%)
8	x 2	source/sagemaker/baseline	source/sagemaker/data-preprocessing	train_pytorch_mlp_entity_resolution.py	data_preprocessing.py	104:113 (6%)	20:29 (9%)
7	x 2	deployment/solution-assistant/src	deployment/solution-assistant/src	lambda_function.py	lambda_function.py	25:31 (17%)	39:45 (17%)
7	x 2	source/sagemaker/baseline	source/sagemaker/sagemak...n/dgl_entity_resolution	train_pytorch_mlp_entity_resolution.py	estimator_fns.py	112:119 (5%)	6:13 (21%)
7	x 2	deployment	deployment	sagemaker-notebook-instance-stack.yaml	sagemaker-permissions-stack.yaml	32:39 (8%)	15:22 (5%)
6	x 2	source/sagemaker/data-preprocessing	source/sagemaker/sagemak...n/dgl_entity_resolution	data_preprocessing.py	estimator_fns.py	20:25 (6%)	37:42 (18%)
6	x 2	source/sagemaker/data-preparation	source/sagemaker/sagemak...n/dgl_entity_resolution	data_prep.py	estimator_fns.py	8:13 (15%)	37:42 (18%)
6	x 2	source/sagemaker/data-preparation	source/sagemaker/data-preprocessing	data_sampling.py	data_preprocessing.py	9:14 (20%)	28:33 (6%)
6	x 2	deployment	deployment	sagemaker-graph-entity-resolution.yaml	sagemaker-notebook-instance-stack.yaml	99:104 (2%)	34:39 (6%)
6	x 2	source/sagemaker/baseline	source/sagemaker/sagemak...n/dgl_entity_resolution	train_pytorch_mlp_entity_resolution.py	estimator_fns.py	104:109 (4%)	37:42 (18%)
6	x 2	deployment	deployment	sagemaker-graph-entity-resolution.yaml	sagemaker-permissions-stack.yaml	99:104 (2%)	17:22 (4%)