microsoft / checkedc-llvm-test-suite
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 4,761 files with 1,262,823 lines of code.
    • 131 very long files (579,125 lines of code)
    • 335 long files (230,666 lines of code)
    • 754 medium size files (235,599 lines of codeclsfd_ftr_w_mp_ins)
    • 806 small files (114,741 lines of code)
    • 2,735 very small files (102,692 lines of code)
45% | 18% | 18% | 9% | 8%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
awk99% | <1% | <1% | <1% | <1%
c31% | 25% | 25% | 11% | 6%
in96% | 2% | <1% | <1% | <1%
cpp17% | 34% | 27% | 12% | 7%
h20% | 7% | 22% | 16% | 32%
p98% | 0% | 1% | 0% | 0%
cc36% | 19% | 24% | 14% | 4%
ld45% | 38% | 11% | 4% | 0%
inc76% | 0% | 5% | 5% | 12%
cu74% | 0% | 20% | 0% | 5%
el100% | 0% | 0% | 0% | 0%
f45% | 0% | 22% | 15% | 17%
cxx36% | 25% | 30% | 0% | 7%
y100% | 0% | 0% | 0% | 0%
perl70% | 0% | 16% | 10% | 3%
hpp0% | 11% | 28% | 29% | 30%
pro0% | 0% | 100% | 0% | 0%
hxx0% | 0% | 74% | 24% | 1%
py0% | 0% | 25% | 44% | 30%
TXT0% | 0% | 65% | 0% | 34%
lua0% | 0% | 100% | 0% | 0%
proto0% | 0% | 100% | 0% | 0%
cmake0% | 0% | 24% | 19% | 56%
pl0% | 0% | 66% | 33% | 0%
spec0% | 0% | 100% | 0% | 0%
b0% | 0% | 71% | 0% | 28%
hh0% | 0% | 11% | 41% | 47%
ll0% | 0% | 0% | 0% | 100%
cfg0% | 0% | 0% | 0% | 100%
gs0% | 0% | 0% | 0% | 100%
yml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
MultiSource52% | 15% | 16% | 7% | 6%
CTMark22% | 30% | 24% | 11% | 10%
External63% | 0% | 28% | 0% | 7%
MicroBenchmarks13% | 20% | 26% | 18% | 21%
SingleSource0% | 11% | 34% | 30% | 24%
tools0% | 0% | 99% | 0% | <1%
ROOT0% | 0% | 42% | 48% | 8%
utils0% | 0% | 65% | 34% | 0%
cmake0% | 0% | 0% | 46% | 53%
litsupport0% | 0% | 0% | 33% | 66%
LNTBased0% | 0% | 0% | 100% | 0%
LLVMSource0% | 0% | 0% | 0% | 100%
autoconf0% | 0% | 0% | 0% | 100%
include0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
in
ref.in
in MultiSource/Benchmarks/FreeBench/analyzer
108612 -
awk
words-large.awk
in MultiSource/Benchmarks/MallocBench/gawk/INPUT
69964 -
awk
words-large.awk
in MultiSource/Benchmarks/MallocBench/perl/INPUT
69964 -
awk
words-small.awk
in MultiSource/Benchmarks/MallocBench/gawk/INPUT
25144 -
awk
words-small.awk
in MultiSource/Benchmarks/MallocBench/perl/INPUT
25144 -
p
mf.p
in MultiSource/Benchmarks/MallocBench/p2c/INPUT
18881 -
packet_lengths.h
in MultiSource/Benchmarks/Trimaran/netbench-crc
10002 -
packet_lengths.h
in MultiSource/Benchmarks/Trimaran/netbench-url
10002 -
p
ptc.p
in MultiSource/Benchmarks/MallocBench/p2c/INPUT
8509 -
mesh.cpp
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
8277 68
in
test.in
in MultiSource/Benchmarks/FreeBench/analyzer
8065 -
mltaln9.c
in CTMark/mafft
5879 104
mltaln9.c
in MultiSource/Benchmarks/mafft
5879 104
decl.c
in MultiSource/Benchmarks/MallocBench/p2c
4799 67
expr.c
in MultiSource/Benchmarks/MallocBench/p2c
4724 59
funcs.c
in MultiSource/Benchmarks/MallocBench/p2c
4554 5
tsc.inc
in MultiSource/Benchmarks/TSVC
4046 166
parse.c
in MultiSource/Benchmarks/MallocBench/p2c
3861 54
io.c
in CTMark/mafft
3205 82
io.c
in MultiSource/Benchmarks/mafft
3205 82
pexpr.c
in MultiSource/Benchmarks/MallocBench/p2c
3196 17
awk.tab.c
in MultiSource/Benchmarks/MallocBench/gawk
3106 -
state.cpp
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
3024 47
lex.c
in MultiSource/Benchmarks/MallocBench/p2c
3018 67
perly.c
in MultiSource/Benchmarks/MallocBench/perl
3000 1
paq8p.cpp
in MultiSource/Benchmarks/PAQ8p
2945 105
eval.c
in MultiSource/Benchmarks/MallocBench/perl
2846 -
pbmsrch.c
in MultiSource/Benchmarks/MiBench/office-stringsearch
2724 1
Falign.c
in CTMark/mafft
2692 15
Falign.c
in MultiSource/Benchmarks/mafft
2692 15
btSoftBody.cpp
in CTMark/Bullet
2514 106
btSoftBody.cpp
in MultiSource/Benchmarks/Bullet
2514 106
doio.c
in MultiSource/Benchmarks/MallocBench/perl
2451 1
toke.c
in MultiSource/Benchmarks/MallocBench/perl
2420 -
Cmd.cc
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
2328 82
nbench1.c
in MultiSource/Benchmarks/nbench
2252 65
fftsg.c
in MultiSource/Benchmarks/FreeBench/pifft
2245 37
ld
standard.ld
in CTMark/consumer-typeset/data/data
2174 -
ld
standard.ld
in MultiSource/Benchmarks/MiBench/consumer-typeset/data/data
2174 -
el
ispell.el
in MultiSource/Benchmarks/MiBench/office-ispell
2156 -
PowerParser.cc
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
2145 97
Lalignmm.c
in CTMark/mafft
2105 8
Lalignmm.c
in MultiSource/Benchmarks/mafft
2105 8
z48.c
in CTMark/consumer-typeset
2085 53
z48.c
in MultiSource/Benchmarks/MiBench/consumer-typeset
2085 53
in
file1.in
in MultiSource/Benchmarks/PAQ8p
2052 -
tperly.c
in MultiSource/Benchmarks/MallocBench/perl
2031 -
f
mpi_stubs.f
in MultiSource/Benchmarks/ASC_Sequoia/sphot
2012 -
externs.h
in CTMark/consumer-typeset
2002 -
externs.h
in MultiSource/Benchmarks/MiBench/consumer-typeset
2002 -
Files With Most Units (Top 20)
File# lines# units
tsc.inc
in MultiSource/Benchmarks/TSVC
4046 166
btSoftBody.cpp
in CTMark/Bullet
2514 106
btSoftBody.cpp
in MultiSource/Benchmarks/Bullet
2514 106
paq8p.cpp
in MultiSource/Benchmarks/PAQ8p
2945 105
mltaln9.c
in CTMark/mafft
5879 104
mltaln9.c
in MultiSource/Benchmarks/mafft
5879 104
PowerParser.cc
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
2145 97
io.c
in CTMark/mafft
3205 82
Cmd.cc
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
2328 82
io.c
in MultiSource/Benchmarks/mafft
3205 82
btGImpactShape.h
in CTMark/Bullet/include/BulletCollision/Gimpact
751 81
btGImpactShape.h
in MultiSource/Benchmarks/Bullet/include/BulletCollision/Gimpact
751 81
LzmaEnc.c
in CTMark/7zip/C
1965 68
LzmaEnc.c
in MultiSource/Benchmarks/7zip/C
1965 68
mesh.cpp
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
8277 68
decl.c
in MultiSource/Benchmarks/MallocBench/p2c
4799 67
lex.c
in MultiSource/Benchmarks/MallocBench/p2c
3018 67
btSoftBodyInternals.h
in CTMark/Bullet/include/BulletSoftBody
762 66
btSoftBodyInternals.h
in MultiSource/Benchmarks/Bullet/include/BulletSoftBody
762 66
nbench1.c
in MultiSource/Benchmarks/nbench
2252 65
Files With Long Lines (Top 20)

There are 726 files with lines longer than 120 characters. In total, there are 5345 long lines.

File# lines# units# long lines
in
file1.in
in MultiSource/Benchmarks/PAQ8p
2052 - 752
mesh.cpp
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
8277 68 425
awk
prog-small-data.awk
in MultiSource/Benchmarks/MallocBench/gawk/INPUT
629 - 151
simple_types_loop_invariant.cpp
in SingleSource/Benchmarks/Adobe-C++
275 2 132
state.cpp
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
3024 47 129
headers.h
in MultiSource/Benchmarks/Trimaran/netbench-crc
102 - 100
headers.h
in MultiSource/Benchmarks/Trimaran/netbench-url
102 - 100
btSequentialImpulseConstraintSolver.cpp
in CTMark/Bullet
883 18 92
btSequentialImpulseConstraintSolver.cpp
in MultiSource/Benchmarks/Bullet
883 18 92
p
mf.p
in MultiSource/Benchmarks/MallocBench/p2c/INPUT
18881 - 77
struct_ls.h
in MultiSource/Benchmarks/ASCI_Purple/SMG2000
331 - 72
mg.c
in MultiSource/Benchmarks/DOE-ProxyApps-C/miniGMG
628 10 71
hash.c
in MultiSource/Benchmarks/DOE-ProxyApps-C++/CLAMR
1028 50 69
btQuantizedBvh.cpp
in CTMark/Bullet
769 20 67
btQuantizedBvh.cpp
in MultiSource/Benchmarks/Bullet
769 20 67
simple_types_constant_folding.cpp
in SingleSource/Benchmarks/Adobe-C++
350 1 61
functions.h
in CTMark/mafft
286 - 47
functions.h
in MultiSource/Benchmarks/mafft
286 - 47
BenchmarkDemo.cpp
in CTMark/Bullet
919 22 43
BenchmarkDemo.cpp
in MultiSource/Benchmarks/Bullet
919 22 43