awslabs / aws-cv-unique-information
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how lines of code are distributed across files of different sizes.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
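The size categories listed above can be sketched as a small lookup. This is only an illustration of the thresholds stated in this report, not the reporting tool's own code; the names `BUCKETS` and `size_category` are hypothetical.

```python
# Lower bounds and labels follow the categories listed in the report.
# BUCKETS and size_category are hypothetical names used for illustration.
BUCKETS = [
    (1001, "very long"),
    (501, "long"),
    (201, "medium"),
    (101, "small"),
    (1, "very small"),
]


def size_category(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    for lower_bound, label in BUCKETS:
        if lines_of_code >= lower_bound:
            return label
    # Assumption: a file with no lines of code falls outside all categories.
    return "empty"


# Example with two line counts taken from this report.
print(size_category(604))  # generate_commands.py -> long
print(size_category(388))  # data_utils.py -> medium
```

Checking from the largest threshold downward keeps the function to a single comparison per category.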
File Size Overall
  • There are 25 files with 3,247 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long file (604 lines of code)
    • 3 medium size files (871 lines of code)
    • 8 small files (1,019 lines of code)
    • 13 very small files (753 lines of code)
Share of lines of code per category: 1001+: 0% | 501-1000: 18% | 201-500: 26% | 101-200: 31% | 1-100: 23%
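The percentages can be reproduced from the line counts reported in this section. The sketch below assumes each share is the category's lines of code divided by the total and rounded down; the rounding rule is an assumption that happens to match the figures shown, and the variable names are hypothetical.

```python
# Line counts per category, copied from the "File Size Overall" figures.
lines_per_bucket = {
    "1001+": 0,
    "501-1000": 604,
    "201-500": 871,
    "101-200": 1019,
    "1-100": 753,
}

total = sum(lines_per_bucket.values())

# Assumption: shares are truncated to whole percentages (integer division).
shares = {bucket: lines * 100 // total for bucket, lines in lines_per_bucket.items()}

print(total)   # 3247
print(shares)  # {'1001+': 0, '501-1000': 18, '201-500': 26, '101-200': 31, '1-100': 23}
```

Because the shares are rounded down, they do not have to add up to exactly 100%.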
File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 18% | 26% | 31% | 23%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
scripts | 0% | 35% | 0% | 42% | 21%
modules | 0% | 0% | 52% | 24% | 22%
methods | 0% | 0% | 90% | 0% | 9%
archive | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 25)
File | Location | # lines | # units
generate_commands.py | sample_info/scripts | 604 | 34
data_utils.py | sample_info/modules | 388 | 47
ntk.py | sample_info/modules | 265 | 12
classifiers.py | sample_info/methods | 218 | 14
stability.py | sample_info/modules | 179 | 4
data_summarization.py | sample_info/scripts | 129 | 1
aggregate_ground_truth_results.py | sample_info/scripts | 127 | 5
visualizations.py | sample_info/modules | 123 | 4
compute_influence_functions_brute_force.py | sample_info/scripts | 121 | 1
ground_truth_effects.py | sample_info/scripts | 118 | 1
compute_informativeness.py | sample_info/scripts | 112 | 1
compute_influence_functions.py | sample_info/scripts | 110 | 1
prepare_informativeness_orders_for_data_summarization.py | sample_info/scripts | 99 | 1
aggregate_data_summarization_results.py | sample_info/scripts | 95 | 3
total_gradient.py | sample_info/archive | 94 | 1
misc.py | sample_info/modules | 93 | 8
synthetic_example_make_informativeness_video.py | sample_info/scripts | 82 | 2
train_classifier.py | sample_info/scripts | 80 | 1
sgd.py | sample_info/modules | 76 | 2
influence_functions.py | sample_info/modules | 55 | 4
nn_utils.py | sample_info/modules | 53 | 3
base.py | sample_info/methods | 21 | 4
__init__.py | sample_info/methods | 3 | -
__init__.py | sample_info/modules | 1 | -
__init__.py | sample_info/scripts | 1 | -
Files With Most Units (Top 20)
File | Location | # lines | # units
data_utils.py | sample_info/modules | 388 | 47
generate_commands.py | sample_info/scripts | 604 | 34
classifiers.py | sample_info/methods | 218 | 14
ntk.py | sample_info/modules | 265 | 12
misc.py | sample_info/modules | 93 | 8
aggregate_ground_truth_results.py | sample_info/scripts | 127 | 5
base.py | sample_info/methods | 21 | 4
stability.py | sample_info/modules | 179 | 4
influence_functions.py | sample_info/modules | 55 | 4
visualizations.py | sample_info/modules | 123 | 4
nn_utils.py | sample_info/modules | 53 | 3
aggregate_data_summarization_results.py | sample_info/scripts | 95 | 3
sgd.py | sample_info/modules | 76 | 2
synthetic_example_make_informativeness_video.py | sample_info/scripts | 82 | 2
total_gradient.py | sample_info/archive | 94 | 1
compute_influence_functions.py | sample_info/scripts | 110 | 1
prepare_informativeness_orders_for_data_summarization.py | sample_info/scripts | 99 | 1
data_summarization.py | sample_info/scripts | 129 | 1
ground_truth_effects.py | sample_info/scripts | 118 | 1
compute_informativeness.py | sample_info/scripts | 112 | 1
Files With Long Lines (Top 1)

There is only one file with lines longer than 120 characters. In total, there are 6 long lines.

File | Location | # lines | # units | # long lines
generate_commands.py | sample_info/scripts | 604 | 34 | 6
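A count like the one above can be approximated with a few lines of Python. This is a minimal sketch assuming a "long line" is any line with more than 120 characters, excluding the line terminator; the function name `count_long_lines` is hypothetical, and the reporting tool may count differently (for example, after stripping comments).

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count the lines in `text` that are longer than `limit` characters."""
    # splitlines() drops line terminators, so they do not count toward length.
    return sum(1 for line in text.splitlines() if len(line) > limit)


# Small self-contained example: one line of 121 characters, one of exactly 120.
sample = "a" * 121 + "\n" + "b" * 120 + "\nshort line\n"
print(count_long_lines(sample))  # 1
```

To apply it to a real file, read the file's contents into a string and pass them to the function.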