amazon-research / nlu-slot-constraints
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 9 files with 32,324 lines of code.
    • 4 very long files (31,437 lines of code)
    • 0 long files (0 lines of code)
    • 2 medium size files (732 lines of codeclsfd_ftr_w_mp_ins)
    • 0 small files (0 lines of code)
    • 3 very small files (155 lines of code)
97% | 0% | 2% | 0% | <1%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
in100% | 0% | 0% | 0% | 0%
py0% | 0% | 82% | 0% | 17%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
data/insurance/train100% | 0% | 0% | 0% | 0%
data/fastfood/train100% | 0% | 0% | 0% | 0%
data/insurance/dev100% | 0% | 0% | 0% | 0%
data/fastfood/dev100% | 0% | 0% | 0% | 0%
ROOT0% | 0% | 85% | 0% | 14%
data0% | 0% | 0% | 0% | 100%
Longest Files (Top 9)
File# lines# units
in
seq.in
in data/insurance/train
14492 -
in
seq.in
in data/fastfood/train
13038 -
in
seq.in
in data/insurance/dev
2039 -
in
seq.in
in data/fastfood/dev
1868 -
main.py
in root
478 11
entity_linking.py
in root
254 21
violation_detection.py
in root
69 5
generate_result_table.py
in root
55 1
vocab_process.py
in data
31 1
Files With Most Units (Top 5)
File# lines# units
entity_linking.py
in root
254 21
main.py
in root
478 11
violation_detection.py
in root
69 5
generate_result_table.py
in root
55 1
vocab_process.py
in data
31 1
Files With Long Lines (Top 7)

There are 7 files with lines longer than 120 characters. In total, there are 52 long lines.

File# lines# units# long lines
in
seq.in
in data/fastfood/train
13038 - 30
main.py
in root
478 11 9
in
seq.in
in data/fastfood/dev
1868 - 5
generate_result_table.py
in root
55 1 2
entity_linking.py
in root
254 21 2
in
seq.in
in data/insurance/train
14492 - 2
vocab_process.py
in data
31 1 2