google / riegeli
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
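The size categories above can be sketched as a small lookup. This is a minimal illustration, not the analysis tool's own code: the names `BUCKETS` and `classify_file_size` are hypothetical, and only the thresholds come from the report.

```python
# Upper bounds (inclusive, in lines of code) and labels, as listed in the report.
# BUCKETS and classify_file_size are hypothetical names used for illustration.
BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium"),
    (1000, "long"),
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the report's size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("lines_of_code must be at least 1")
    for upper_bound, label in BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    # Anything above 1000 lines falls into the last category.
    return "very long"


if __name__ == "__main__":
    for name, loc in [("chain.cc", 2389), ("endian_writing.h", 917), ("reader.h", 313)]:
        print(name, loc, classify_file_size(loc))
```

With the line counts listed later in this report, chain.cc (2389 lines) lands in "very long", endian_writing.h (917) in "long", and reader.h (313) in "medium".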
File Size Overall
  • There are 272 files with 55,000 lines of code.
    • 5 very long files (7,385 lines of code)
    • 9 long files (6,304 lines of code)
    • 90 medium size files (26,835 lines of code)
    • 66 small files (9,940 lines of code)
    • 102 very small files (4,536 lines of code)
Share of lines of code by file size:
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
13% | 11% | 48% | 18% | 8%
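The percentage breakdown can be reproduced from the per-category line counts listed above. The sketch below is an illustration only; `TOTALS` and `bucket_shares` are hypothetical names. It truncates to whole percents, since that matches the displayed figures (26,835 of 55,000 is about 48.8%, shown as 48%); whether the tool itself truncates is an assumption.

```python
# Lines of code per size category, copied from the "File Size Overall" list.
# TOTALS and bucket_shares are hypothetical names used for illustration.
TOTALS = {
    "1001+": 7385,
    "501-1000": 6304,
    "201-500": 26835,
    "101-200": 9940,
    "1-100": 4536,
}


def bucket_shares(totals: dict[str, int]) -> dict[str, int]:
    """Return each category's share of all lines, truncated to a whole percent."""
    grand_total = sum(totals.values())
    return {name: loc * 100 // grand_total for name, loc in totals.items()}


if __name__ == "__main__":
    shares = bucket_shares(TOTALS)
    print(sum(TOTALS.values()))  # 55000
    print(" | ".join(f"{pct}%" for pct in shares.values()))  # 13% | 11% | 48% | 18% | 8%
```

Because each share is truncated, the displayed percentages add up to 98% rather than 100%.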


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
cc | 21% | 11% | 43% | 18% | 5%
h | 5% | 11% | 55% | 17% | 10%
bzl | 0% | 0% | 60% | 32% | 7%
py | 0% | 0% | 0% | 0% | 100%
proto | 0% | 0% | 0% | 0% | 100%
tpl | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
riegeli/base | 54% | 0% | 23% | 12% | 8%
riegeli/chunk_encoding | 52% | 0% | 0% | 29% | 18%
python/riegeli | 27% | 16% | 44% | 4% | 5%
riegeli/endian | 0% | 99% | 0% | 0% | <1%
riegeli/records | 0% | 30% | 47% | 13% | 8%
riegeli/csv | 0% | 49% | 42% | 4% | 2%
riegeli/bytes | 0% | 5% | 66% | 21% | 6%
riegeli/snappy | 0% | 0% | 67% | 24% | 7%
riegeli/tensorflow | 0% | 0% | 85% | 12% | 1%
riegeli/zlib | 0% | 0% | 95% | 0% | 4%
riegeli/zstd | 0% | 0% | 85% | 9% | 4%
riegeli/brotli | 0% | 0% | 62% | 28% | 9%
riegeli/varint | 0% | 0% | 53% | 23% | 22%
riegeli/messages | 0% | 0% | 29% | 70% | 0%
riegeli/lines | 0% | 0% | 0% | 55% | 44%
tf_dependency | 0% | 0% | 0% | 92% | 7%
riegeli/ordered_varint | 0% | 0% | 0% | 35% | 64%
python | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Location | # lines | # units
chain.cc | riegeli/base | 2389 | 128
chain.h | riegeli/base | 1449 | 116
transpose_decoder.cc | riegeli/chunk_encoding | 1251 | 21
record_reader.cc | python/riegeli/records | 1226 | 22
transpose_encoder.cc | riegeli/chunk_encoding | 1070 | 25
endian_writing.h | riegeli/endian | 917 | 130
record_writer.cc | riegeli/records | 893 | 72
endian_reading.h | riegeli/endian | 833 | 118
csv_record.h | riegeli/csv | 778 | 49
record_writer.cc | python/riegeli/records | 741 | 14
record_reader.cc | riegeli/records | 573 | 33
fd_reader.h | riegeli/bytes | 538 | 46
pullable_reader.cc | riegeli/bytes | 517 | 23
csv_reader.cc | riegeli/csv | 514 | 13
reader_factory.cc | riegeli/bytes | 497 | 19
chunk_reader.cc | riegeli/records | 496 | 18
records_benchmark.cc | riegeli/records/tools | 494 | 18
file_reader.cc | riegeli/tensorflow/io | 480 | 20
base.h | riegeli/base | 451 | 33
reader.cc | riegeli/bytes | 450 | 27
pushable_writer.cc | riegeli/bytes | 405 | 28
record_writer.h | riegeli/records | 403 | 22
limiting_reader.h | riegeli/bytes | 396 | 33
pushable_backward_writer.cc | riegeli/bytes | 395 | 24
csv_record.cc | riegeli/csv | 381 | 38
zstd_writer.h | riegeli/zstd | 379 | 27
fd_reader.cc | riegeli/bytes | 376 | 24
snappy_writer.h | riegeli/snappy | 373 | 30
csv_reader.h | riegeli/csv | 372 | 26
csv_writer.h | riegeli/csv | 369 | 26
record_position.cc | python/riegeli/records | 367 | 5
writer.h | riegeli/bytes | 354 | 42
zlib_writer.h | riegeli/zlib | 354 | 24
brotli_writer.h | riegeli/brotli | 350 | 21
python_configure.bzl | python/riegeli | 349 | -
zlib_reader.h | riegeli/zlib | 347 | 24
writer_ostream.cc | riegeli/bytes | 344 | 16
utils.cc | python/riegeli/base | 337 | 21
fd_writer.cc | riegeli/bytes | 334 | 16
describe_riegeli_file.cc | riegeli/records/tools | 332 | 6
limiting_writer.h | riegeli/bytes | 326 | 28
limiting_backward_writer.h | riegeli/bytes | 325 | 28
any_dependency.h | riegeli/base | 321 | 44
buffered_reader.cc | riegeli/bytes | 321 | 15
fd_writer.h | riegeli/bytes | 321 | 25
snappy_reader.h | riegeli/snappy | 320 | 28
file_writer.h | riegeli/tensorflow/io | 316 | 24
cfile_writer.h | riegeli/bytes | 315 | 25
string_writer.h | riegeli/bytes | 315 | 28
reader.h | riegeli/bytes | 313 | 36
Files With Most Units (Top 20)
File | Location | # lines | # units
endian_writing.h | riegeli/endian | 917 | 130
chain.cc | riegeli/base | 2389 | 128
endian_reading.h | riegeli/endian | 833 | 118
chain.h | riegeli/base | 1449 | 116
record_writer.cc | riegeli/records | 893 | 72
csv_record.h | riegeli/csv | 778 | 49
fd_reader.h | riegeli/bytes | 538 | 46
any_dependency.h | riegeli/base | 321 | 44
writer.h | riegeli/bytes | 354 | 42
csv_record.cc | riegeli/csv | 381 | 38
reader.h | riegeli/bytes | 313 | 36
base.h | riegeli/base | 451 | 33
limiting_reader.h | riegeli/bytes | 396 | 33
record_reader.cc | riegeli/records | 573 | 33
backward_writer.h | riegeli/bytes | 280 | 32
snappy_writer.h | riegeli/snappy | 373 | 30
reader_istream.h | riegeli/bytes | 267 | 29
limiting_backward_writer.h | riegeli/bytes | 325 | 28
limiting_writer.h | riegeli/bytes | 326 | 28
pushable_writer.cc | riegeli/bytes | 405 | 28
Files With Long Lines (Top 0)

There are 0 files with lines longer than 120 characters. In total, there are 0 long lines.
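A long-line check of this kind can be sketched in a few lines. The function name `count_long_lines` is hypothetical, and measuring length in characters with the line terminator excluded is an assumption; the report does not say how the tool treats tabs or multi-byte characters.

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count lines whose length exceeds `limit` characters.

    Hypothetical helper: lengths are measured in characters, without the
    trailing newline, and the 120-character limit is taken from the report.
    """
    return sum(1 for line in text.splitlines() if len(line) > limit)


if __name__ == "__main__":
    sample = "x" * 120 + "\n" + "y" * 121 + "\nshort line\n"
    # Only the 121-character line is over the limit.
    print(count_long_lines(sample))  # 1
```

A line of exactly 120 characters is not counted, which matches the report's wording "longer than 120 characters".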
