tensorflow / tfx
File Size

The distribution of file sizes, measured in lines of code.

Intro
  • File size measurements show how file sizes are distributed across the codebase.
  • Files are classified into five categories based on their size in lines of code: 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), and 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown so large that it is hard to work with.
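The five size categories can be expressed as a simple threshold lookup. The following is a minimal sketch, not the report tool's own implementation; the names `BUCKETS` and `classify` are hypothetical and chosen only for illustration.

```python
# Hypothetical helper: thresholds follow the five categories listed above.
# Each entry is (exclusive lower bound in lines of code, category label).
BUCKETS = [
    (1000, "very long"),   # 1001+
    (500, "long"),         # 501-1000
    (200, "medium"),       # 201-500
    (100, "small"),        # 101-200
    (0, "very small"),     # 1-100
]


def classify(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    for lower_exclusive, label in BUCKETS:
        if lines_of_code > lower_exclusive:
            return label
    raise ValueError("expected at least one line of code")
```

For example, a 1050-line file falls into "very long", while a file of exactly 1000 lines is still "long".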
File Size Overall
  • There are 537 files with 40,351 lines of code.
    • 2 very long files (6,994 lines of code)
    • 4 long files (2,326 lines of code)
    • 30 medium size files (9,570 lines of code)
    • 70 small files (10,326 lines of code)
    • 431 very small files (11,135 lines of code)
17% | 5% | 23% | 25% | 27%
Legend: 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
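The percentage bar above can be reproduced from the per-category line counts. The sketch below assumes the percentages are each category's share of total lines of code, rounded down to a whole number; that is an inference from the reported figures, not something the report states. The names `LINES_PER_CATEGORY` and `truncated_shares` are hypothetical.

```python
# Lines of code per size category, copied from the overall summary above.
LINES_PER_CATEGORY = {
    "1001+": 6994,
    "501-1000": 2326,
    "201-500": 9570,
    "101-200": 10326,
    "1-100": 11135,
}


def truncated_shares(lines_per_category):
    """Return each category's share of total lines as a whole percentage, rounded down."""
    total = sum(lines_per_category.values())
    # Integer division truncates, which is the assumed rounding behaviour.
    return {name: 100 * count // total for name, count in lines_per_category.items()}


shares = truncated_shares(LINES_PER_CATEGORY)
```

Under this assumption the shares add up to 97% rather than 100%, because each value is rounded down independently.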


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
pb | 100% | 0% | 0% | 0% | 0%
py | 3% | 7% | 26% | 31% | 31%
proto | 0% | 0% | 48% | 9% | 41%
yaml | 0% | 0% | 0% | 0% | 100%
bzl | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
cfg | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
tfx/benchmarks | 82% | 0% | 9% | 5% | 2%
tfx/components | 16% | 0% | 21% | 34% | 28%
tfx/orchestration | 0% | 20% | 22% | 25% | 32%
tfx/dsl | 0% | 0% | 32% | 31% | 35%
tfx/types | 0% | 0% | 64% | 26% | 9%
tfx/proto | 0% | 0% | 52% | 10% | 37%
tfx/tools | 0% | 0% | 29% | 38% | 31%
tfx/experimental | 0% | 0% | 22% | 46% | 30%
tfx/extensions | 0% | 0% | 18% | 38% | 43%
ROOT | 0% | 0% | 87% | 0% | 12%
tfx/utils | 0% | 0% | 0% | 35% | 64%
tfx | 0% | 0% | 0% | 52% | 47%
tfx/scripts | 0% | 0% | 0% | 51% | 48%
tfx/v1 | 0% | 0% | 0% | 0% | 100%
package_build/ml-pipelines-sdk | 0% | 0% | 0% | 0% | 100%
package_build/tfx | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Location | # lines | # units
saved_model.pb | tfx/benchmarks/datasets/chicago_taxi/model/tft_saved_model | 5944 | -
executor.py | tfx/components/transform | 1050 | 56
metadata.py | tfx/orchestration | 715 | 36
pipeline_state.py | tfx/orchestration/experimental/core | 557 | 64
pipeline_ops.py | tfx/orchestration/experimental/core | 527 | 22
metadata_test_utils.py | tfx/orchestration | 527 | 19
pipeline.proto | tfx/proto/orchestration | 478 | -
placeholder.py | tfx/dsl/placeholder | 475 | 71
artifact.py | tfx/types | 474 | 46
step_builder.py | tfx/orchestration/kubeflow/v2 | 463 | 12
utils.py | tfx/components/example_gen | 436 | 17
tfma_v2_benchmark_base.py | tfx/benchmarks | 424 | 29
compiler.py | tfx/dsl/compiler | 419 | 22
launcher.py | tfx/orchestration/portable | 414 | 15
placeholder_utils.py | tfx/dsl/compiler | 408 | 20
partial_run_utils.py | tfx/orchestration/portable | 396 | 29
standard_component_specs.py | tfx/types | 382 | -
prediction_clients.py | tfx/extensions/google_cloud_ai_platform | 330 | 14
sync_pipeline_task_gen.py | tfx/orchestration/experimental/core | 321 | 14
tft_benchmark_base.py | tfx/benchmarks | 284 | 17
pipeline.py | tfx/tools/cli/commands | 280 | 8
compiler_utils.py | tfx/orchestration/kubeflow/v2 | 276 | 14
container_entrypoint.py | tfx/orchestration/kubeflow | 275 | 12
executor.py | tfx/components/infra_validator | 266 | 14
executor.py | tfx/components/evaluator | 255 | 2
run.py | tfx/tools/cli/commands | 253 | 7
component_spec.py | tfx/types | 249 | 21
task_manager.py | tfx/orchestration/experimental/core | 244 | 19
kubernetes_runner.py | tfx/components/infra_validator/model_server_runners | 243 | 14
container_based_test_case.py | tfx/experimental/templates | 241 | 26
infra_validator.proto | tfx/proto | 233 | -
placeholder.proto | tfx/proto/orchestration | 222 | -
async_pipeline_task_gen.py | tfx/orchestration/experimental/core | 214 | 8
udf_utils.py | tfx/components/util | 208 | 14
graph_partition.py | tfx/experimental/distributed_inference/graphdef_experiments/subgraph_partitioning | 205 | 22
setup.py | root | 202 | 8
kubeflow_dag_runner.py | tfx/orchestration/kubeflow | 195 | 15
training_clients.py | tfx/extensions/google_cloud_ai_platform | 194 | 20
request_builder.py | tfx/components/infra_validator | 194 | 13
tfma_benchmark_base.py | tfx/benchmarks | 194 | 8
executor.py | tfx/extensions/google_cloud_ai_platform/tuner | 193 | 8
data_types_utils.py | tfx/orchestration | 192 | 12
outputs_utils.py | tfx/orchestration/portable | 191 | 15
dataset.py | tfx/benchmarks/datasets/chicago_taxi | 190 | 14
executor.py | tfx/components/bulk_inferrer | 189 | 6
base_driver.py | tfx/dsl/components/base | 187 | 9
base_handler.py | tfx/tools/cli/handler | 187 | 22
tfxio_utils.py | tfx/components/util | 187 | 10
beam_dag_runner.py | tfx/orchestration/beam | 187 | 12
kubeflow_handler.py | tfx/tools/cli/handler | 182 | 20
Files With Most Units (Top 20)
File | Location | # lines | # units
placeholder.py | tfx/dsl/placeholder | 475 | 71
pipeline_state.py | tfx/orchestration/experimental/core | 557 | 64
executor.py | tfx/components/transform | 1050 | 56
artifact.py | tfx/types | 474 | 46
metadata.py | tfx/orchestration | 715 | 36
partial_run_utils.py | tfx/orchestration/portable | 396 | 29
tfma_v2_benchmark_base.py | tfx/benchmarks | 424 | 29
container_based_test_case.py | tfx/experimental/templates | 241 | 26
context_manager.py | tfx/dsl/context_managers | 95 | 25
resolver_op.py | tfx/dsl/input_resolution | 170 | 25
graph_partition.py | tfx/experimental/distributed_inference/graphdef_experiments/subgraph_partitioning | 205 | 22
compiler.py | tfx/dsl/compiler | 419 | 22
base_handler.py | tfx/tools/cli/handler | 187 | 22
pipeline_ops.py | tfx/orchestration/experimental/core | 527 | 22
base_node.py | tfx/dsl/components/base | 123 | 21
component_spec.py | tfx/types | 249 | 21
training_clients.py | tfx/extensions/google_cloud_ai_platform | 194 | 20
placeholder_utils.py | tfx/dsl/compiler | 408 | 20
kubeflow_handler.py | tfx/tools/cli/handler | 182 | 20
task_manager.py | tfx/orchestration/experimental/core | 244 | 19
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 227 long lines.

File | Location | # lines | # units | # long lines
saved_model.pb | tfx/benchmarks/datasets/chicago_taxi/model/tft_saved_model | 5944 | - | 223
base_handler.py | tfx/tools/cli/handler | 187 | 22 | 2
infra_validator.proto | tfx/proto | 233 | - | 1
__init__.py | tfx/v1/extensions/google_cloud_ai_platform/experimental | 5 | - | 1
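
The long-line figures above count lines longer than 120 characters. A minimal sketch of such a count follows; it is an illustration only, and the report tool may treat tabs, encodings, or line endings differently. The names `MAX_LINE_LENGTH`, `count_long_lines`, and `count_long_lines_in_file` are hypothetical.

```python
from pathlib import Path

MAX_LINE_LENGTH = 120  # threshold stated in the report


def count_long_lines(text: str, limit: int = MAX_LINE_LENGTH) -> int:
    """Return how many lines in `text` exceed `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def count_long_lines_in_file(path: str, limit: int = MAX_LINE_LENGTH) -> int:
    """Read a file as UTF-8 (replacing undecodable bytes) and count its long lines."""
    text = Path(path).read_text(encoding="utf-8", errors="replace")
    return count_long_lines(text, limit)
```

A line of exactly 120 characters is not counted; only lines of 121 characters or more are.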