aws / sagemaker-scikit-learn-extension
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 28 files with 2,001 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 4 medium size files (1,080 lines of codeclsfd_ftr_w_mp_ins)
    • 2 small files (232 lines of code)
    • 22 very small files (689 lines of code)
0% | 0% | 53% | 11% | 34%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 55% | 11% | 32%
yml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src/sagemaker_sklearn_extension/preprocessing0% | 0% | 59% | 0% | 40%
src/sagemaker_sklearn_extension/contrib/taei0% | 0% | 64% | 0% | 35%
src/sagemaker_sklearn_extension/feature_extraction0% | 0% | 51% | 47% | <1%
src/sagemaker_sklearn_extension/externals0% | 0% | 67% | 0% | 32%
src/sagemaker_sklearn_extension/impute0% | 0% | 0% | 0% | 100%
ci0% | 0% | 0% | 0% | 100%
src/sagemaker_sklearn_extension/decomposition0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
src/sagemaker_sklearn_extension/contrib0% | 0% | 0% | 0% | 100%
src/sagemaker_sklearn_extension0% | 0% | 0% | 0% | 100%
Longest Files (Top 28)
File# lines# units
encoders.py
in src/sagemaker_sklearn_extension/preprocessing
303 33
models.py
in src/sagemaker_sklearn_extension/contrib/taei
274 15
sequences.py
in src/sagemaker_sklearn_extension/feature_extraction
252 21
read_data.py
in src/sagemaker_sklearn_extension/externals
251 41
date_time.py
in src/sagemaker_sklearn_extension/feature_extraction
124 19
text.py
in src/sagemaker_sklearn_extension/feature_extraction
108 6
base.py
in src/sagemaker_sklearn_extension/preprocessing
96 16
base.py
in src/sagemaker_sklearn_extension/impute
85 14
data.py
in src/sagemaker_sklearn_extension/preprocessing
79 8
header.py
in src/sagemaker_sklearn_extension/externals
67 6
nn_utils.py
in src/sagemaker_sklearn_extension/contrib/taei
61 7
latent_space_oversampler.py
in src/sagemaker_sklearn_extension/contrib/taei
57 6
automl_transformer.py
in src/sagemaker_sklearn_extension/externals
47 6
robust_pca.py
in src/sagemaker_sklearn_extension/decomposition
41 3
__init__.py
in src/sagemaker_sklearn_extension/preprocessing
30 -
star_oversampler.py
in src/sagemaker_sklearn_extension/contrib/taei
27 4
setup.py
in root
24 2
buildspec-release.yml
in ci
21 -
buildspec-pr.yml
in ci
16 -
buildspec-deploy.yml
in ci
10 -
__init__.py
in src/sagemaker_sklearn_extension/externals
8 -
__init__.py
in src/sagemaker_sklearn_extension/contrib/taei
4 -
__init__.py
in src/sagemaker_sklearn_extension/decomposition
4 -
__init__.py
in src/sagemaker_sklearn_extension/feature_extraction
4 -
in
MANIFEST.in
in root
4 -
__init__.py
in src/sagemaker_sklearn_extension/impute
2 -
__init__.py
in src/sagemaker_sklearn_extension/contrib
1 -
__init__.py
in src/sagemaker_sklearn_extension
1 -
Files With Most Units (Top 16)
File# lines# units
read_data.py
in src/sagemaker_sklearn_extension/externals
251 41
encoders.py
in src/sagemaker_sklearn_extension/preprocessing
303 33
sequences.py
in src/sagemaker_sklearn_extension/feature_extraction
252 21
date_time.py
in src/sagemaker_sklearn_extension/feature_extraction
124 19
base.py
in src/sagemaker_sklearn_extension/preprocessing
96 16
models.py
in src/sagemaker_sklearn_extension/contrib/taei
274 15
base.py
in src/sagemaker_sklearn_extension/impute
85 14
data.py
in src/sagemaker_sklearn_extension/preprocessing
79 8
nn_utils.py
in src/sagemaker_sklearn_extension/contrib/taei
61 7
latent_space_oversampler.py
in src/sagemaker_sklearn_extension/contrib/taei
57 6
text.py
in src/sagemaker_sklearn_extension/feature_extraction
108 6
automl_transformer.py
in src/sagemaker_sklearn_extension/externals
47 6
header.py
in src/sagemaker_sklearn_extension/externals
67 6
star_oversampler.py
in src/sagemaker_sklearn_extension/contrib/taei
27 4
robust_pca.py
in src/sagemaker_sklearn_extension/decomposition
41 3
setup.py
in root
24 2
Files With Long Lines (Top 1)

There is only one file with lines longer than 120 characters. In total, there are 2 long lines.

File# lines# units# long lines
buildspec-deploy.yml
in ci
10 - 2