tensorflow / tfx

File Size

The distribution of size of files (measured in lines of code).

Intro

File size measurements show the distribution of size of files.
Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.

Learn more...

File Size Overall

There are 537 files with 40,351 lines of code.

2 very long files (6,994 lines of code)
4 long files (2,326 lines of code)
30 medium size files (9,570 lines of codeclsfd_ftr_w_mp_ins)
70 small files (10,326 lines of code)
431 very small files (11,135 lines of code)

Legend:

1001+

501-1000

201-500

101-200

1-100

explore: zoomable circles | sunburst | 3D view

File Size per Extension

1001+

501-1000

201-500

101-200

1-100

File Size per Logical Decomposition

primary

1001+

501-1000

201-500

101-200

1-100

Longest Files (Top 50)

File	# lines	# units
pb saved_model.pb in tfx/benchmarks/datasets/chicago_taxi/model/tft_saved_model	5944	-
executor.py in tfx/components/transform	1050	56
metadata.py in tfx/orchestration	715	36
pipeline_state.py in tfx/orchestration/experimental/core	557	64
pipeline_ops.py in tfx/orchestration/experimental/core	527	22
metadata_test_utils.py in tfx/orchestration	527	19
proto pipeline.proto in tfx/proto/orchestration	478	-
placeholder.py in tfx/dsl/placeholder	475	71
artifact.py in tfx/types	474	46
step_builder.py in tfx/orchestration/kubeflow/v2	463	12
utils.py in tfx/components/example_gen	436	17
tfma_v2_benchmark_base.py in tfx/benchmarks	424	29
compiler.py in tfx/dsl/compiler	419	22
launcher.py in tfx/orchestration/portable	414	15
placeholder_utils.py in tfx/dsl/compiler	408	20
partial_run_utils.py in tfx/orchestration/portable	396	29
standard_component_specs.py in tfx/types	382	-
prediction_clients.py in tfx/extensions/google_cloud_ai_platform	330	14
sync_pipeline_task_gen.py in tfx/orchestration/experimental/core	321	14
tft_benchmark_base.py in tfx/benchmarks	284	17
pipeline.py in tfx/tools/cli/commands	280	8
compiler_utils.py in tfx/orchestration/kubeflow/v2	276	14
container_entrypoint.py in tfx/orchestration/kubeflow	275	12
executor.py in tfx/components/infra_validator	266	14
executor.py in tfx/components/evaluator	255	2
run.py in tfx/tools/cli/commands	253	7
component_spec.py in tfx/types	249	21
task_manager.py in tfx/orchestration/experimental/core	244	19
kubernetes_runner.py in tfx/components/infra_validator/model_server_runners	243	14
container_based_test_case.py in tfx/experimental/templates	241	26
proto infra_validator.proto in tfx/proto	233	-
proto placeholder.proto in tfx/proto/orchestration	222	-
async_pipeline_task_gen.py in tfx/orchestration/experimental/core	214	8
udf_utils.py in tfx/components/util	208	14
graph_partition.py in tfx/experimental/distributed_inference/graphdef_experiments/subgraph_partitioning	205	22
setup.py in root	202	8
kubeflow_dag_runner.py in tfx/orchestration/kubeflow	195	15
training_clients.py in tfx/extensions/google_cloud_ai_platform	194	20
request_builder.py in tfx/components/infra_validator	194	13
tfma_benchmark_base.py in tfx/benchmarks	194	8
executor.py in tfx/extensions/google_cloud_ai_platform/tuner	193	8
data_types_utils.py in tfx/orchestration	192	12
outputs_utils.py in tfx/orchestration/portable	191	15
dataset.py in tfx/benchmarks/datasets/chicago_taxi	190	14
executor.py in tfx/components/bulk_inferrer	189	6
base_driver.py in tfx/dsl/components/base	187	9
base_handler.py in tfx/tools/cli/handler	187	22
tfxio_utils.py in tfx/components/util	187	10
beam_dag_runner.py in tfx/orchestration/beam	187	12
kubeflow_handler.py in tfx/tools/cli/handler	182	20

Files With Most Units (Top 20)

File	# lines	# units
placeholder.py in tfx/dsl/placeholder	475	71
pipeline_state.py in tfx/orchestration/experimental/core	557	64
executor.py in tfx/components/transform	1050	56
artifact.py in tfx/types	474	46
metadata.py in tfx/orchestration	715	36
partial_run_utils.py in tfx/orchestration/portable	396	29
tfma_v2_benchmark_base.py in tfx/benchmarks	424	29
container_based_test_case.py in tfx/experimental/templates	241	26
context_manager.py in tfx/dsl/context_managers	95	25
resolver_op.py in tfx/dsl/input_resolution	170	25
graph_partition.py in tfx/experimental/distributed_inference/graphdef_experiments/subgraph_partitioning	205	22
compiler.py in tfx/dsl/compiler	419	22
base_handler.py in tfx/tools/cli/handler	187	22
pipeline_ops.py in tfx/orchestration/experimental/core	527	22
base_node.py in tfx/dsl/components/base	123	21
component_spec.py in tfx/types	249	21
training_clients.py in tfx/extensions/google_cloud_ai_platform	194	20
placeholder_utils.py in tfx/dsl/compiler	408	20
kubeflow_handler.py in tfx/tools/cli/handler	182	20
task_manager.py in tfx/orchestration/experimental/core	244	19

Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 227 long lines.

File	# lines	# units	# long lines
pb saved_model.pb in tfx/benchmarks/datasets/chicago_taxi/model/tft_saved_model	5944	-	223
base_handler.py in tfx/tools/cli/handler	187	22	2
proto infra_validator.proto in tfx/proto	233	-	1
__init__.py in tfx/v1/extensions/google_cloud_ai_platform/experimental	5	-	1