tensorflow / tfx-bsl
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how file sizes are distributed across the codebase.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown so large that it is hard to work with.
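The classification described above can be sketched in a few lines of Python. This is a minimal illustration, not the report tool's implementation: the function name `classify` and the `BUCKETS` table are hypothetical, and how the tool counts lines (for example, whether blank lines or comments are included) is not stated in the report.

```python
from collections import Counter

# Upper bounds and labels taken from the categories listed in the report.
# Anything above the last bound is a "very long" file.
BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium"),
    (1000, "long"),
]


def classify(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("expected at least one line of code")
    for upper_bound, label in BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    return "very long"


# A few line counts copied from the "Longest Files" table of this report.
sample = {
    "example_decoder.cc": 1091,
    "run_inference.py": 861,
    "misragries_sketch.cc": 490,
    "kmv_sketch.cc": 198,
    "sketches.proto": 92,
}

# Tally how many sample files fall into each category.
counts = Counter(classify(loc) for loc in sample.values())
for name, loc in sample.items():
    print(f"{name}: {loc} lines -> {classify(loc)}")
```

Keeping the boundaries in a single table makes it easy to try different thresholds without touching the classification logic.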
File Size Overall
  • There are 94 files with 11,270 lines of code.
    • 1 very long file (1,091 lines of code)
    • 3 long files (2,171 lines of code)
    • 14 medium size files (4,423 lines of code)
    • 12 small files (1,797 lines of code)
    • 64 very small files (1,788 lines of code)
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
9% | 19% | 39% | 15% | 15%
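The percentages above can be reproduced from the per-category line counts in the list. The sketch below assumes the percentages are shares of total lines of code, truncated to whole numbers. That is an inference from the figures (conventional rounding would give 10%, 19%, 39%, 16%, 16%), not something the report states, and the names used here are illustrative.

```python
# Lines of code per size category, copied from the "File Size Overall" list.
loc_per_bucket = {
    "1001+": 1091,
    "501-1000": 2171,
    "201-500": 4423,
    "101-200": 1797,
    "1-100": 1788,
}

total = sum(loc_per_bucket.values())  # 11,270 lines of code in the report


def share(loc: int, total_loc: int) -> int:
    """Percentage of total lines, truncated toward zero (assumed behavior)."""
    return loc * 100 // total_loc


shares = {bucket: share(loc, total) for bucket, loc in loc_per_bucket.items()}
print(" | ".join(f"{pct}%" for pct in shares.values()))
# prints: 9% | 19% | 39% | 15% | 15%
```

Because each share is truncated, the printed values sum to 97% rather than 100%.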


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
cc | 21% | 13% | 52% | 6% | 6%
py | 0% | 32% | 33% | 19% | 14%
h | 0% | 0% | 17% | 34% | 47%
bzl | 0% | 0% | 0% | 100% | 0%
proto | 0% | 0% | 0% | 0% | 100%
toml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
tfx_bsl/cc | 16% | 10% | 44% | 11% | 15%
tfx_bsl/beam | 0% | 99% | 0% | 0% | <1%
tfx_bsl/tfxio | 0% | 22% | 43% | 27% | 6%
tfx_bsl/coders | 0% | 0% | 70% | 0% | 29%
tfx_bsl | 0% | 0% | 0% | 93% | 6%
ROOT | 0% | 0% | 0% | 84% | 15%
tfx_bsl/arrow | 0% | 0% | 0% | 0% | 100%
tfx_bsl/public | 0% | 0% | 0% | 0% | 100%
tfx_bsl/tools | 0% | 0% | 0% | 0% | 100%
tfx_bsl/statistics | 0% | 0% | 0% | 0% | 100%
tfx_bsl/types | 0% | 0% | 0% | 0% | 100%
tfx_bsl/sketches | 0% | 0% | 0% | 0% | 100%
tfx_bsl/telemetry | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
example_decoder.cc (in tfx_bsl/cc/coders) | 1091 | 51
run_inference.py (in tfx_bsl/beam) | 861 | 61
array_util.cc (in tfx_bsl/cc/arrow) | 697 | 65
tensor_adapter.py (in tfx_bsl/tfxio) | 613 | 43
misragries_sketch.cc (in tfx_bsl/cc/sketches) | 490 | 37
sql_util.cc (in tfx_bsl/cc/arrow) | 390 | 32
tensor_representation_util.py (in tfx_bsl/tfxio) | 367 | 17
example_encoder.cc (in tfx_bsl/cc/coders) | 340 | 22
tensor_to_arrow.py (in tfx_bsl/tfxio) | 337 | 34
sketches_submodule.cc (in tfx_bsl/cc/sketches) | 337 | 4
quantiles_sketch.cc (in tfx_bsl/cc/sketches) | 331 | 17
csv_decoder.py (in tfx_bsl/coders) | 312 | 24
table_util.cc (in tfx_bsl/cc/arrow) | 310 | 15
arrow_submodule.cc (in tfx_bsl/cc/arrow) | 269 | 3
tf_example_record.py (in tfx_bsl/tfxio) | 261 | 24
merge_util.cc (in tfx_bsl/cc/statistics) | 235 | 14
record_to_tensor_tfxio.py (in tfx_bsl/tfxio) | 222 | 21
weighted_quantiles_summary.h (in tfx_bsl/cc/sketches) | 222 | 14
kmv_sketch.cc (in tfx_bsl/cc/sketches) | 198 | 24
tf_sequence_example_record.py (in tfx_bsl/tfxio) | 177 | 20
weighted_quantiles_stream.h (in tfx_bsl/cc/sketches) | 167 | 12
telemetry.py (in tfx_bsl/tfxio) | 162 | 12
csv_tfxio.py (in tfx_bsl/tfxio) | 159 | 17
record_based_tfxio.py (in tfx_bsl/tfxio) | 153 | 17
build_macros.bzl (in tfx_bsl) | 148 | -
coders_submodule.cc (in tfx_bsl/cc/coders) | 143 | 1
example_coder.h (in tfx_bsl/cc/coders) | 133 | -
arrow_casters_c_abi.h (in tfx_bsl/cc/pybind11) | 129 | 6
setup.py (in root) | 124 | 8
raw_tf_record.py (in tfx_bsl/tfxio) | 104 | 14
sketches.proto (in tfx_bsl/cc/sketches) | 92 | -
model_spec.proto (in tfx_bsl/public/proto) | 90 | -
weighted_quantiles_buffer.h (in tfx_bsl/cc/sketches) | 84 | 6
c_abi_bridge.cc (in tfx_bsl/cc/pybind11) | 77 | 7
example_numpy_decoder.cc (in tfx_bsl/cc/coders) | 75 | 1
table_util.py (in tfx_bsl/arrow) | 74 | 5
tf_graph_record_decoder.py (in tfx_bsl/coders) | 74 | 11
tfxio.py (in tfx_bsl/tfxio) | 65 | 19
arrow_casters_py_c_api.h (in tfx_bsl/cc/pybind11) | 63 | 6
statistics_submodule.cc (in tfx_bsl/cc/statistics) | 63 | 2
misragries_sketch.h (in tfx_bsl/cc/sketches) | 58 | 1
build_docs.py (in tfx_bsl/tools) | 52 | 2
merge_util.h (in tfx_bsl/cc/statistics) | 50 | 1
dataset_options.py (in tfx_bsl/tfxio) | 49 | 2
utf8.cc (in tfx_bsl/cc/util) | 47 | 2
c_abi_bridge.h (in tfx_bsl/cc/pybind11) | 45 | -
path.py (in tfx_bsl/arrow) | 43 | 16
sql_util.h (in tfx_bsl/cc/arrow) | 43 | -
example_coder.py (in tfx_bsl/coders) | 42 | 5
dataset_util.py (in tfx_bsl/tfxio) | 39 | 2
Files With Most Units (Top 20)
File | # lines | # units
array_util.cc (in tfx_bsl/cc/arrow) | 697 | 65
run_inference.py (in tfx_bsl/beam) | 861 | 61
example_decoder.cc (in tfx_bsl/cc/coders) | 1091 | 51
tensor_adapter.py (in tfx_bsl/tfxio) | 613 | 43
misragries_sketch.cc (in tfx_bsl/cc/sketches) | 490 | 37
tensor_to_arrow.py (in tfx_bsl/tfxio) | 337 | 34
sql_util.cc (in tfx_bsl/cc/arrow) | 390 | 32
tf_example_record.py (in tfx_bsl/tfxio) | 261 | 24
kmv_sketch.cc (in tfx_bsl/cc/sketches) | 198 | 24
csv_decoder.py (in tfx_bsl/coders) | 312 | 24
example_encoder.cc (in tfx_bsl/cc/coders) | 340 | 22
record_to_tensor_tfxio.py (in tfx_bsl/tfxio) | 222 | 21
tf_sequence_example_record.py (in tfx_bsl/tfxio) | 177 | 20
tfxio.py (in tfx_bsl/tfxio) | 65 | 19
csv_tfxio.py (in tfx_bsl/tfxio) | 159 | 17
record_based_tfxio.py (in tfx_bsl/tfxio) | 153 | 17
tensor_representation_util.py (in tfx_bsl/tfxio) | 367 | 17
quantiles_sketch.cc (in tfx_bsl/cc/sketches) | 331 | 17
path.py (in tfx_bsl/arrow) | 43 | 16
table_util.cc (in tfx_bsl/cc/arrow) | 310 | 15
Files With Long Lines (Top 2)

There are 2 files with lines longer than 120 characters. In total, there are 2 long lines.

File | # lines | # units | # long lines
array_util.py (in tfx_bsl/arrow) | 38 | 2 | 1
types_compat.py (in tfx_bsl) | 8 | - | 1
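
The long-line check reported above can be approximated with a short Python function. This is a sketch under stated assumptions: the 120-character limit comes from the report, but the helper name `count_long_lines` is hypothetical, and the report does not say how the tool measures line length (for example, how tabs or trailing whitespace are treated).

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count lines whose length exceeds `limit` characters.

    Line terminators are not counted toward the length.
    """
    return sum(1 for line in text.splitlines() if len(line) > limit)


# Illustrative input: one 121-character line, one exactly at the limit,
# and one short line. Only the first exceeds the limit.
example = "\n".join(["x" * 121, "y" * 120, "short line"])
print(count_long_lines(example))
```

To apply it to a file on disk, read the file as text first, for example with `pathlib.Path(path).read_text()`, and pass the result to the function.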