facebookresearch / Clinical-Trial-Parser
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how files are distributed across size categories.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such gargantuan proportions that it is hard to work with.
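The size categories above can be sketched as a small lookup. This is a minimal illustration, not the tool's actual implementation; the names `CATEGORIES` and `classify` are hypothetical, and only the thresholds come from the list above.

```python
# Upper bound (in lines of code) and label for each category,
# taken from the thresholds listed in the Intro.
CATEGORIES = [
    (100, "very small"),
    (200, "small"),
    (500, "medium size"),
    (1000, "long"),
]


def classify(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("a file needs at least one line of code")
    for upper_bound, label in CATEGORIES:
        if lines_of_code <= upper_bound:
            return label
    # Anything above the last bound falls into the open-ended 1001+ bucket.
    return "very long"
```

For example, `classify(402)` places a 402-line file such as relation.go in the "medium size" category, and `classify(99)` returns "very small".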
File Size Overall
  • There are 63 files with 5,742 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 7 medium size files (2,022 lines of code)
    • 16 small files (2,262 lines of code)
    • 40 very small files (1,458 lines of code)
1001+: 0% | 501-1000: 0% | 201-500: 35% | 101-200: 39% | 1-100: 25%
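The percentages for the overall distribution follow from the line counts per category. The sketch below reproduces them under the assumption that shares are rounded down to whole percent; the report does not state its rounding rule, and the function name `shares` is hypothetical.

```python
# Lines of code per size category, as listed under "File Size Overall".
lines_per_category = {
    "1001+": 0,
    "501-1000": 0,
    "201-500": 2022,
    "101-200": 2262,
    "1-100": 1458,
}


def shares(lines_by_category: dict) -> dict:
    """Return each category's share of the total, in whole percent (rounded down)."""
    total = sum(lines_by_category.values())
    if total == 0:
        return {name: 0 for name in lines_by_category}
    # Integer division truncates, which is an assumption about the report's rounding.
    return {name: 100 * count // total for name, count in lines_by_category.items()}


overall = shares(lines_per_category)
```

With a total of 5,742 lines this gives 35% for 201-500, 39% for 101-200, and 25% for 1-100. The shares add up to 99% rather than 100% because each value is truncated.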


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
go | 0% | 0% | 37% | 37% | 24%
py | 0% | 0% | 0% | 61% | 38%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
ct/parser | 0% | 0% | 76% | 9% | 13%
ct/relation | 0% | 0% | 100% | 0% | 0%
cmd/nel | 0% | 0% | 100% | 0% | 0%
vocabularies/mesh | 0% | 0% | 51% | 0% | 48%
vocabularies/taxonomy | 0% | 0% | 0% | 94% | 5%
ie | 0% | 0% | 0% | 99% | <1%
common/util | 0% | 0% | 0% | 57% | 42%
ct/variables | 0% | 0% | 0% | 70% | 29%
ct/studies | 0% | 0% | 0% | 91% | 8%
ct/units | 0% | 0% | 0% | 92% | 7%
cmd/cfg | 0% | 0% | 0% | 100% | 0%
common/conf | 0% | 0% | 0% | 100% | 0%
common/trie | 0% | 0% | 0% | 85% | 14%
cmd/search | 0% | 0% | 0% | 100% | 0%
cmd/extract | 0% | 0% | 0% | 100% | 0%
common/col | 0% | 0% | 0% | 0% | 100%
ct/criteria | 0% | 0% | 0% | 0% | 100%
embedding | 0% | 0% | 0% | 0% | 100%
common/lsh | 0% | 0% | 0% | 0% | 100%
text | 0% | 0% | 0% | 0% | 100%
vocabularies/umls | 0% | 0% | 0% | 0% | 100%
common/resource | 0% | 0% | 0% | 0% | 100%
vocabularies | 0% | 0% | 0% | 0% | 100%
ct/eligibility | 0% | 0% | 0% | 0% | 100%
common/param | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
relation.go (in src/ct/relation) | 402 | 32
tree.go (in src/ct/parser) | 317 | 21
parser.go (in src/ct/parser) | 306 | 15
item.go (in src/ct/parser) | 263 | 26
lexer.go (in src/ct/parser) | 258 | 24
main.go (in src/cmd/nel) | 255 | 14
normalize.go (in src/vocabularies/mesh) | 221 | 1
node.go (in src/vocabularies/taxonomy) | 195 | 19
variables.go (in src/ct/variables) | 185 | 13
study.go (in src/ct/studies) | 171 | 12
term.go (in src/vocabularies/taxonomy) | 163 | 19
units.go (in src/ct/units) | 163 | 13
taxonomy.go (in src/vocabularies/taxonomy) | 160 | 15
cfg.go (in src/ct/parser) | 148 | 6
ner.py (in src/ie) | 133 | 8
main.go (in src/cmd/cfg) | 129 | 7
file.go (in src/common/util/fio) | 122 | 7
conf.go (in src/common/conf) | 121 | 12
process_ner_data.py (in src/ie) | 121 | 3
main.go (in src/cmd/search) | 119 | 7
trie.go (in src/common/trie) | 119 | 7
main.go (in src/cmd/extract) | 109 | 6
text.go (in src/common/util/text) | 104 | 13
mesh.go (in src/vocabularies/mesh) | 99 | 3
minhash.go (in src/common/lsh) | 99 | 9
rules.go (in src/ct/parser) | 88 | 1
set.go (in src/common/col/set) | 87 | 14
categories.go (in src/vocabularies/mesh) | 83 | 6
parse_criteria.go (in src/ct/criteria) | 81 | 8
train_embeddings.py (in src/embedding) | 80 | 7
criteria.go (in src/ct/criteria) | 68 | 10
slice.go (in src/common/util/slice) | 67 | 6
tuple.go (in src/common/col/tuple) | 66 | 11
variable_type.go (in src/ct/variables) | 54 | 5
transformer.py (in src/text) | 54 | 8
umls.go (in src/vocabularies/umls) | 52 | 1
resource.go (in src/common/resource) | 43 | 2
token.go (in src/ct/parser) | 42 | 3
interpreter.go (in src/ct/parser) | 34 | 5
priority_queue.go (in src/vocabularies/taxonomy) | 30 | 4
source.go (in src/vocabularies) | 30 | 2
type.go (in src/ct/eligibility) | 30 | 2
animal.go (in src/vocabularies/mesh) | 25 | 1
criterion.go (in src/ct/parser/production) | 25 | -
timer.go (in src/common/util/timer) | 24 | 3
exponential_multiplier.go (in src/common/util/text) | 24 | 1
param.py (in src/embedding) | 22 | -
intmath.go (in src/common/util/intmath) | 22 | 3
value.go (in src/common/trie) | 20 | 3
variable.go (in src/ct/variables) | 19 | 2
Files With Most Units (Top 20)
File | # lines | # units
relation.go (in src/ct/relation) | 402 | 32
item.go (in src/ct/parser) | 263 | 26
lexer.go (in src/ct/parser) | 258 | 24
tree.go (in src/ct/parser) | 317 | 21
term.go (in src/vocabularies/taxonomy) | 163 | 19
node.go (in src/vocabularies/taxonomy) | 195 | 19
taxonomy.go (in src/vocabularies/taxonomy) | 160 | 15
parser.go (in src/ct/parser) | 306 | 15
main.go (in src/cmd/nel) | 255 | 14
set.go (in src/common/col/set) | 87 | 14
units.go (in src/ct/units) | 163 | 13
variables.go (in src/ct/variables) | 185 | 13
text.go (in src/common/util/text) | 104 | 13
study.go (in src/ct/studies) | 171 | 12
conf.go (in src/common/conf) | 121 | 12
tuple.go (in src/common/col/tuple) | 66 | 11
criteria.go (in src/ct/criteria) | 68 | 10
minhash.go (in src/common/lsh) | 99 | 9
parse_criteria.go (in src/ct/criteria) | 81 | 8
transformer.py (in src/text) | 54 | 8
Files With Long Lines (Top 7)

There are 7 files with lines longer than 120 characters. In total, there are 12 long lines.

File | # lines | # units | # long lines
parse_criteria.go (in src/ct/criteria) | 81 | 8 | 3
main.go (in src/cmd/nel) | 255 | 14 | 2
variables.go (in src/ct/variables) | 185 | 13 | 2
variable.go (in src/ct/variables) | 19 | 2 | 2
taxonomy.go (in src/vocabularies/taxonomy) | 160 | 15 | 1
main.go (in src/cmd/cfg) | 129 | 7 | 1
text.go (in src/common/util/text) | 104 | 13 | 1
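A long-line count like the one in this table can be approximated with a short check. This is a sketch under assumptions: the report does not say how line length is measured, so the code counts characters per line excluding the line terminator, and the function name `count_long_lines` is hypothetical. Only the 120-character limit comes from the report.

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count the lines in `text` that are longer than `limit` characters."""
    # splitlines() drops the line terminators, so they do not count toward length.
    return sum(1 for line in text.splitlines() if len(line) > limit)


# Small self-contained sample: one line over the limit, two at or under it.
sample = "x" * 121 + "\n" + "y" * 120 + "\n" + "short line\n"
long_lines_in_sample = count_long_lines(sample)
```

To apply it to a source file, read the file's contents into a string and pass it to `count_long_lines`. A line of exactly 120 characters is not counted, since the report describes lines "longer than 120 characters".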