awslabs / predictive-maintenance-using-machine-learning
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
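The size categories listed above can be expressed as a small lookup. This is a minimal sketch, not the report tool's own implementation; the function name `size_category` and the label strings are illustrative assumptions, and only the bucket boundaries come from the report.

```python
from bisect import bisect_left

# Inclusive upper bounds of the first four buckets, as listed in the report.
UPPER_BOUNDS = [100, 200, 500, 1000]
# Labels in ascending size order; the last one covers 1001+ lines.
LABELS = ["very small", "small", "medium size", "long", "very long"]


def size_category(lines_of_code: int) -> str:
    """Return the report's size category for a file (hypothetical helper)."""
    if lines_of_code < 1:
        raise ValueError("a file needs at least one line of code to be classified")
    # bisect_left finds the first bucket whose upper bound is >= lines_of_code.
    return LABELS[bisect_left(UPPER_BOUNDS, lines_of_code)]


if __name__ == "__main__":
    for loc in (39, 116, 293, 508, 3002):
        print(loc, "->", size_category(loc))
```

Using inclusive upper bounds with `bisect_left` keeps the boundary cases (100, 200, 500, 1000) in the lower bucket, which matches the ranges as written.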
File Size Overall
  • There are 451 files with 125,427 lines of code.
    • 28 very long files (46,414 lines of code)
    • 46 long files (32,609 lines of code)
    • 93 medium size files (31,016 lines of code)
    • 56 small files (8,111 lines of code)
    • 228 very small files (7,277 lines of code)
Share of lines of code per size category:
1001+: 37% | 501-1000: 25% | 201-500: 24% | 101-200: 6% | 1-100: 5%
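The overall percentages can be reproduced from the line counts listed in this section. The sketch below assumes the shares are truncated to whole percents. That is an inference from the numbers, not something the report states, although truncation does match all five displayed values. The variable names are illustrative.

```python
# Lines of code per size category, copied from the overall figures in the report.
lines_per_category = {
    "1001+": 46_414,
    "501-1000": 32_609,
    "201-500": 31_016,
    "101-200": 8_111,
    "1-100": 7_277,
}

total = sum(lines_per_category.values())

# Assumption: whole-percent shares are obtained by truncation (floor division).
shares = {label: 100 * loc // total for label, loc in lines_per_category.items()}

if __name__ == "__main__":
    print("total lines of code:", total)
    print(" | ".join(f"{label}: {pct}%" for label, pct in shares.items()))
```

Because each share is truncated, the five values add up to 97% rather than 100%.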


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 38% | 25% | 25% | 5% | 5%
h | 24% | 31% | 19% | 15% | 8%
c | 0% | 99% | 0% | 0% | <1%
yaml | 0% | 0% | 52% | 26% | 20%
tpl | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
Logical decomposition: primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
source/predictive_maintenance/pandas/core | 38% | 32% | 22% | 3% | 2%
source/predictive_maintenance/pandas/io | 56% | 18% | 13% | 5% | 5%
source/predictive_maintenance/numpy/f2py | 47% | 27% | 18% | 4% | 2%
source/predictive_maintenance/numpy/distutils | 31% | 17% | 24% | 11% | 15%
source/predictive_maintenance/numpy/ma | 62% | 13% | 17% | 3% | 2%
source/predictive_maintenance/numpy/lib | 30% | 14% | 40% | 8% | 6%
source/predictive_maintenance/pandas/plotting | 50% | 23% | 21% | 3% | <1%
source/predictive_maintenance/numpy/core | 11% | 42% | 27% | 13% | 5%
source/predictive_maintenance/pandas/util | 66% | 0% | 0% | 29% | 4%
source/predictive_maintenance/pandas/tseries | 71% | 0% | 27% | 0% | <1%
source/predictive_maintenance/pytz | 63% | 0% | 19% | 6% | 10%
source/predictive_maintenance/numpy/testing | 0% | 51% | 36% | 0% | 12%
source/predictive_maintenance/numpy/linalg | 0% | 93% | 0% | 0% | 6%
source/predictive_maintenance/numpy/polynomial | 0% | 16% | 80% | 0% | 2%
source/predictive_maintenance/pandas/compat | 0% | 0% | 63% | 17% | 19%
deployment | 0% | 0% | 58% | 29% | 12%
source/predictive_maintenance/pandas | 0% | 0% | 82% | 0% | 17%
source/predictive_maintenance/numpy | 0% | 0% | 42% | 0% | 57%
source/predictive_maintenance/numpy/matrixlib | 0% | 0% | 94% | 0% | 5%
source/notebooks/sagemaker_predictive_maintenance/sagemaker_predictive_maintenance_entry_point | 0% | 0% | 100% | 0% | 0%
source/predictive_maintenance/numpy/fft | 0% | 0% | 0% | 59% | 40%
source/predictive_maintenance/numpy/compat | 0% | 0% | 0% | 58% | 41%
source/predictive_maintenance/numpy/random | 0% | 0% | 0% | 0% | 100%
source/notebooks/sagemaker_predictive_maintenance | 0% | 0% | 0% | 0% | 100%
source/predictive_maintenance | 0% | 0% | 0% | 0% | 100%
source/predictive_maintenance/numpy/doc | 0% | 0% | 0% | 0% | 100%
deployment/solution-assistant/src | 0% | 0% | 0% | 0% | 100%
deployment/solution-assistant | 0% | 0% | 0% | 0% | 100%
source/predictive_maintenance/pandas/errors | 0% | 0% | 0% | 0% | 100%
source/predictive_maintenance/pandas/api | 0% | 0% | 0% | 0% | 100%
source/predictive_maintenance/pandas/arrays | 0% | 0% | 0% | 0% | 100%
source/predictive_maintenance/pandas/_libs | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Location | # lines | # units
pytables.py | source/predictive_maintenance/pandas/io | 3002 | 269
generic.py | source/predictive_maintenance/pandas/core | 2902 | 209
crackfortran.py | source/predictive_maintenance/numpy/f2py | 2899 | 62
core.py | source/predictive_maintenance/numpy/ma | 2802 | 260
frame.py | source/predictive_maintenance/pandas/core | 2335 | 133
base.py | source/predictive_maintenance/pandas/core/indexes | 2186 | 209
parsers.py | source/predictive_maintenance/pandas/io | 2121 | 89
system_info.py | source/predictive_maintenance/numpy/distutils | 1875 | 73
blocks.py | source/predictive_maintenance/pandas/core/internals | 1847 | 175
stata.py | source/predictive_maintenance/pandas/io | 1802 | 109
_core.py | source/predictive_maintenance/pandas/plotting | 1723 | 136
multi.py | source/predictive_maintenance/pandas/core/indexes | 1502 | 110
__multiarray_api.h | source/predictive_maintenance/numpy/core/include/numpy | 1484 | -
function_base.py | source/predictive_maintenance/numpy/lib | 1466 | 84
testing.py | source/predictive_maintenance/pandas/util | 1451 | 122
offsets.py | source/predictive_maintenance/pandas/tseries | 1442 | 134
misc_util.py | source/predictive_maintenance/numpy/distutils | 1359 | 104
__init__.py | source/predictive_maintenance/pytz | 1291 | 31
series.py | source/predictive_maintenance/pandas/core | 1254 | 123
indexing.py | source/predictive_maintenance/pandas/core | 1234 | 83
managers.py | source/predictive_maintenance/pandas/core/internals | 1213 | 116
window.py | source/predictive_maintenance/pandas/core | 1059 | 106
rules.py | source/predictive_maintenance/numpy/f2py | 1058 | 2
npyio.py | source/predictive_maintenance/numpy/lib | 1046 | 33
format.py | source/predictive_maintenance/pandas/io/formats | 1038 | 66
ops.py | source/predictive_maintenance/pandas/core | 1019 | 55
sparse.py | source/predictive_maintenance/pandas/core/arrays | 1002 | 91
excel.py | source/predictive_maintenance/pandas/io | 1002 | 59
generic.py | source/predictive_maintenance/pandas/core/groupby | 997 | 57
categorical.py | source/predictive_maintenance/pandas/core/arrays | 992 | 102
ndarraytypes.h | source/predictive_maintenance/numpy/core/include/numpy | 972 | 1
merge.py | source/predictive_maintenance/pandas/core/reshape | 970 | 38
utils.py | source/predictive_maintenance/numpy/testing/_private | 957 | 58
npy_common.h | source/predictive_maintenance/numpy/core/include/numpy | 934 | -
fortranobject.c | source/predictive_maintenance/numpy/f2py/src | 919 | 16
strings.py | source/predictive_maintenance/pandas/core | 898 | 75
algorithms.py | source/predictive_maintenance/pandas/core | 857 | 33
datetimes.py | source/predictive_maintenance/pandas/core/arrays | 842 | 53
panel.py | source/predictive_maintenance/pandas/core | 822 | 65
groupby.py | source/predictive_maintenance/pandas/core/groupby | 820 | 69
datetimes.py | source/predictive_maintenance/pandas/core/indexes | 808 | 53
sql.py | source/predictive_maintenance/pandas/io | 796 | 66
_converter.py | source/predictive_maintenance/pandas/plotting | 790 | 46
linalg.py | source/predictive_maintenance/numpy/linalg | 745 | 61
capi_maps.py | source/predictive_maintenance/numpy/f2py | 728 | 12
resample.py | source/predictive_maintenance/pandas/core | 726 | 72
__init__.py | source/predictive_maintenance/numpy/distutils/fcompiler | 725 | 47
arrayprint.py | source/predictive_maintenance/numpy/core | 717 | 55
datetimelike.py | source/predictive_maintenance/pandas/core/arrays | 703 | 84
interval.py | source/predictive_maintenance/pandas/core/indexes | 700 | 79
Files With Most Units (Top 20)
File | Location | # lines | # units
pytables.py | source/predictive_maintenance/pandas/io | 3002 | 269
core.py | source/predictive_maintenance/numpy/ma | 2802 | 260
generic.py | source/predictive_maintenance/pandas/core | 2902 | 209
base.py | source/predictive_maintenance/pandas/core/indexes | 2186 | 209
blocks.py | source/predictive_maintenance/pandas/core/internals | 1847 | 175
cpuinfo.py | source/predictive_maintenance/numpy/distutils | 508 | 174
_core.py | source/predictive_maintenance/pandas/plotting | 1723 | 136
offsets.py | source/predictive_maintenance/pandas/tseries | 1442 | 134
frame.py | source/predictive_maintenance/pandas/core | 2335 | 133
defchararray.py | source/predictive_maintenance/numpy/core | 511 | 130
series.py | source/predictive_maintenance/pandas/core | 1254 | 123
testing.py | source/predictive_maintenance/pandas/util | 1451 | 122
managers.py | source/predictive_maintenance/pandas/core/internals | 1213 | 116
multi.py | source/predictive_maintenance/pandas/core/indexes | 1502 | 110
stata.py | source/predictive_maintenance/pandas/io | 1802 | 109
window.py | source/predictive_maintenance/pandas/core | 1059 | 106
misc_util.py | source/predictive_maintenance/numpy/distutils | 1359 | 104
auxfuncs.py | source/predictive_maintenance/numpy/f2py | 613 | 103
categorical.py | source/predictive_maintenance/pandas/core/arrays | 992 | 102
sparse.py | source/predictive_maintenance/pandas/core/arrays | 1002 | 91
Files With Long Lines (Top 19)

There are 19 files with lines longer than 120 characters. In total, there are 94 long lines.

File | Location | # lines | # units | # long lines
rules.py | source/predictive_maintenance/numpy/f2py | 1058 | 2 | 25
crackfortran.py | source/predictive_maintenance/numpy/f2py | 2899 | 62 | 14
cfuncs.py | source/predictive_maintenance/numpy/f2py | 290 | 3 | 12
__multiarray_api.h | source/predictive_maintenance/numpy/core/include/numpy | 1484 | - | 7
predictive-maintenance-using-machine-learning.yaml | deployment | 293 | - | 5
__config__.py | source/predictive_maintenance/numpy/distutils | 28 | 2 | 4
__config__.py | source/predictive_maintenance/numpy | 28 | 2 | 4
__ufunc_api.h | source/predictive_maintenance/numpy/core/include/numpy | 309 | - | 4
capi_maps.py | source/predictive_maintenance/numpy/f2py | 728 | 12 | 4
cb_rules.py | source/predictive_maintenance/numpy/f2py | 349 | 2 | 4
npy_common.h | source/predictive_maintenance/numpy/core/include/numpy | 934 | - | 2
predictive-maintenance-sagemaker-notebook-instance.yaml | deployment | 63 | - | 2
ibm.py | source/predictive_maintenance/numpy/distutils/fcompiler | 81 | 5 | 1
absoft.py | source/predictive_maintenance/numpy/distutils/fcompiler | 116 | 11 | 1
build_src.py | source/predictive_maintenance/numpy/distutils/command | 639 | 24 | 1
oldnumeric.h | source/predictive_maintenance/numpy/core/include/numpy | 21 | - | 1
f90mod_rules.py | source/predictive_maintenance/numpy/f2py | 201 | 2 | 1
html.tpl | source/predictive_maintenance/pandas/io/formats/templates | 70 | - | 1
preprocess.py | source/notebooks/sagemaker_predictive_maintenance | 39 | 1 | 1
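The long-line count uses a threshold of 120 characters per line. A minimal sketch of such a check follows. The helper name `count_long_lines` is hypothetical, and how the report tool itself treats tabs, trailing whitespace, or line endings is not stated, so this version simply measures each line after splitting on line breaks.

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count the lines in `text` that are strictly longer than `limit` characters."""
    # splitlines() drops the line terminators, so they do not count toward the length.
    return sum(1 for line in text.splitlines() if len(line) > limit)


if __name__ == "__main__":
    sample = "\n".join(["x" * 120, "y" * 121, "short line"])
    print(count_long_lines(sample))
```

A line of exactly 120 characters is not counted, since the report speaks of lines longer than 120 characters.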