amazon-research / read-up
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 75 files with 12,210 lines of code.
    • 1 very long files (1,274 lines of code)
    • 3 long files (2,129 lines of code)
    • 11 medium size files (2,535 lines of codeclsfd_ftr_w_mp_ins)
    • 30 small files (4,471 lines of code)
    • 30 very small files (1,801 lines of code)
10% | 17% | 20% | 36% | 14%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py11% | 18% | 22% | 37% | 10%
yaml0% | 0% | 0% | 30% | 69%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src/model/vilbert36% | 60% | 0% | 3% | <1%
solver0% | 0% | 50% | 48% | 1%
src/model0% | 0% | 46% | 29% | 24%
src/data0% | 0% | 0% | 73% | 26%
src/tools0% | 0% | 0% | 86% | 13%
config_files0% | 0% | 0% | 30% | 69%
ROOT0% | 0% | 0% | 100% | 0%
Longest Files (Top 50)
File# lines# units
vilbert.py
in src/model/vilbert
1274 84
task_utils.py
in src/model/vilbert
749 7
utils.py
in src/model/vilbert
723 40
basebert.py
in src/model/vilbert
657 53
qgen_vilbert.py
in src/model
356 12
self_play_all_vilbert.py
in src/model
229 5
qgen.py
in solver
223 10
oracle_vilbert.py
in solver
221 9
qgen_vdst.py
in src/model
221 7
self_play_qgen_vdst_oracle_vilbert_guesser_vilbert.py
in src/model
219 5
self_play_qgen_vdst_oracle_vilbert_guesser_vilbert.py
in solver
217 8
self_play_all_vilbert.py
in solver
216 8
qgen_vilbert.py
in solver
216 8
self_play_qgen_vdst_oracle_vilbert.py
in solver
213 8
self_play_qgen_vdst_guesser_vilbert.py
in solver
204 8
self_play_qgen_vdst_guesser_vilbert.py
in src/model
199 5
self_play.py
in solver
195 8
self_play_qgen_vdst_oracle_vilbert.py
in src/model
187 5
self_play_qgen_vdst.py
in solver
184 8
image_features_reader.py
in src/data
184 16
guesser_vilbert.py
in solver
172 9
qgen_vdst.py
in solver
171 8
utils.py
in src/tools
171 14
oracle_rcnn.py
in solver
170 9
main.py
in root
168 2
guesser_vilbert.py
in src/model
163 6
oracle.py
in solver
160 9
self_play_qgen_vdst_oracle_vilbert_guesser_vilbert.py
in src/data
160 5
self_play_all_vilbert.py
in src/data
158 5
self_play_qgen_vilbert.py
in solver
154 8
self_play_qgen_vdst_oracle_vilbert.py
in src/data
151 5
self_play_all_vilbert.yaml
in config_files
150 -
self_play_qgen_vdst_guesser_vilbert.py
in src/data
147 5
guesser.py
in solver
136 9
guesser_vilbert.py
in src/data
135 5
optimization.py
in src/model/vilbert
132 6
tokenizer.py
in src/tools
130 21
qgen_vilbert.py
in src/data
128 5
self_play_qgen_vdst.py
in src/data
114 5
solver.py
in solver
112 11
qgen_vdst.py
in src/data
112 5
self_play_qgen_vdst_oracle_vilbert_guesser_vilbert.yaml
in config_files
111 -
self_play_qgen_vilbert.py
in src/data
109 5
self_play.py
in src/data
105 5
oracle_rcnn.py
in src/model
103 2
self_play_qgen_vilbert.py
in src/model
100 4
oracle_vilbert.py
in src/data
100 5
self_play.py
in src/model
99 5
qgen.py
in src/model
98 5
qgen.py
in src/data
89 5
Files With Most Units (Top 20)
File# lines# units
vilbert.py
in src/model/vilbert
1274 84
basebert.py
in src/model/vilbert
657 53
utils.py
in src/model/vilbert
723 40
tokenizer.py
in src/tools
130 21
image_features_reader.py
in src/data
184 16
utils.py
in src/tools
171 14
qgen_vilbert.py
in src/model
356 12
solver.py
in solver
112 11
qgen.py
in solver
223 10
guesser_vilbert.py
in solver
172 9
oracle_rcnn.py
in solver
170 9
oracle.py
in solver
160 9
oracle_vilbert.py
in solver
221 9
guesser.py
in solver
136 9
self_play_all_vilbert.py
in solver
216 8
self_play_qgen_vilbert.py
in solver
154 8
self_play.py
in solver
195 8
self_play_qgen_vdst_oracle_vilbert_guesser_vilbert.py
in solver
217 8
qgen_vdst.py
in solver
171 8
self_play_qgen_vdst_guesser_vilbert.py
in solver
204 8
Files With Long Lines (Top 19)

There are 19 files with lines longer than 120 characters. In total, there are 48 long lines.

File# lines# units# long lines
vilbert.py
in src/model/vilbert
1274 84 8
task_utils.py
in src/model/vilbert
749 7 6
self_play_qgen_vdst_oracle_vilbert_guesser_vilbert.py
in solver
217 8 4
self_play_qgen_vdst_oracle_vilbert.py
in solver
213 8 4
self_play_all_vilbert.py
in solver
216 8 3
guesser_vilbert.py
in solver
172 9 3
self_play_qgen_vdst_guesser_vilbert.py
in solver
204 8 3
self_play.py
in solver
195 8 2
oracle_vilbert.py
in solver
221 9 2
self_play_qgen_vdst.py
in solver
184 8 2
qgen_vdst.py
in src/model
221 7 2
basebert.py
in src/model/vilbert
657 53 2
qgen.py
in solver
223 10 1
self_play_qgen_vilbert.py
in solver
154 8 1
utils.py
in solver
43 7 1
qgen_vilbert.py
in solver
216 8 1
optimizer.py
in src/tools
48 6 1
qgen_vilbert.py
in src/model
356 12 1
guesser_vilbert.py
in src/data
135 5 1