aws-samples / machine-learning-samples
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 25 files with 1,601 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 0 medium size files (0 lines of codeclsfd_ftr_w_mp_ins)
    • 6 small files (854 lines of code)
    • 19 very small files (747 lines of code)
0% | 0% | 0% | 53% | 46%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 0% | 57% | 42%
m0% | 0% | 0% | 0% | 100%
java0% | 0% | 0% | 0% | 100%
h0% | 0% | 0% | 0% | 100%
sbt0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
social-media0% | 0% | 0% | 63% | 36%
ml-tools-python0% | 0% | 0% | 64% | 35%
cost-based-ml0% | 0% | 0% | 100% | 0%
k-fold-cross-validation0% | 0% | 0% | 46% | 53%
targeted-marketing-python0% | 0% | 0% | 0% | 100%
mobile-ios0% | 0% | 0% | 0% | 100%
mobile-android0% | 0% | 0% | 0% | 100%
targeted-marketing-scala0% | 0% | 0% | 0% | 100%
Longest Files (Top 25)
File# lines# units
awspyml.py
in ml-tools-python
195 29
gather-data.py
in social-media
148 7
cost_based_ml.py
in cost-based-ml
145 11
create-aml-model.py
in social-media
129 6
build-aml-training-dataset.py
in social-media
123 1
fold.py
in k-fold-cross-validation
114 8
create-lambda-function.py
in social-media
98 9
build_model.py
in targeted-marketing-python
87 4
collect_perf.py
in k-fold-cross-validation
68 1
use_model.py
in targeted-marketing-python
65 3
build_folds.py
in k-fold-cross-validation
59 1
realtime.py
in ml-tools-python
55 3
ViewController.m
in mobile-ios
50 -
scanner.py
in social-media
45 2
push-json-to-kinesis.py
in social-media
40 2
AndroidRealtimePrediction.java
in mobile-android
40 3
wait_for_entity.py
in ml-tools-python
39 1
build-mturk-csv.py
in social-media
28 -
AppDelegate.m
in mobile-ios
26 -
config.py
in social-media
16 -
guess_schema.py
in ml-tools-python
13 -
config.py
in k-fold-cross-validation
6 -
build.sbt
in targeted-marketing-scala
5 -
AppDelegate.h
in mobile-ios
4 -
ViewController.h
in mobile-ios
3 -
Files With Most Units (Top 16)
File# lines# units
awspyml.py
in ml-tools-python
195 29
cost_based_ml.py
in cost-based-ml
145 11
create-lambda-function.py
in social-media
98 9
fold.py
in k-fold-cross-validation
114 8
gather-data.py
in social-media
148 7
create-aml-model.py
in social-media
129 6
build_model.py
in targeted-marketing-python
87 4
use_model.py
in targeted-marketing-python
65 3
AndroidRealtimePrediction.java
in mobile-android
40 3
realtime.py
in ml-tools-python
55 3
scanner.py
in social-media
45 2
push-json-to-kinesis.py
in social-media
40 2
build-aml-training-dataset.py
in social-media
123 1
wait_for_entity.py
in ml-tools-python
39 1
collect_perf.py
in k-fold-cross-validation
68 1
build_folds.py
in k-fold-cross-validation
59 1
Files With Long Lines (Top 6)

There are 6 files with lines longer than 120 characters. In total, there are 18 long lines.

File# lines# units# long lines
cost_based_ml.py
in cost-based-ml
145 11 12
build-aml-training-dataset.py
in social-media
123 1 2
create-aml-model.py
in social-media
129 6 1
gather-data.py
in social-media
148 7 1
create-lambda-function.py
in social-media
98 9 1
AndroidRealtimePrediction.java
in mobile-android
40 3 1