tensorflow / ecosystem
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how the lines of code are distributed across files of different sizes.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
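The classification described above can be sketched in a few lines of Python. This is only an illustration of the bucket boundaries listed in the intro, not the analysis tool's actual implementation; the names `BUCKETS`, `classify` and `summarize` are hypothetical.

```python
from collections import Counter

# Inclusive upper bounds of each size bucket, taken from the intro above.
# Anything above the last bound counts as "very long" (1001+).
BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium"),
    (1000, "long"),
]


def classify(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    for upper_bound, name in BUCKETS:
        if lines_of_code <= upper_bound:
            return name
    return "very long"


def summarize(line_counts):
    """Count files and lines of code per category for a list of line counts."""
    files = Counter()
    lines = Counter()
    for count in line_counts:
        category = classify(count)
        files[category] += 1
        lines[category] += count
    return files, lines
```

For example, `classify(243)` returns `"medium"`, which matches the single medium size file in this report. The report's smallest bucket starts at 1 line; this sketch assumes an empty file would also fall into the smallest bucket.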
File Size Overall
  • There are 35 files with 2,247 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 1 medium size file (243 lines of code)
    • 5 small files (768 lines of code)
    • 29 very small files (1,236 lines of code)
Share of lines of code per size category (1001+ | 501-1000 | 201-500 | 101-200 | 1-100): 0% | 0% | 10% | 34% | 55%
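The overall percentages can be reproduced from the line counts in the list above. The sketch below assumes the shares are truncated to whole percentages rather than rounded. That assumption is inferred from the figures (243 of 2,247 lines is about 10.8% but is reported as 10%), and it also explains why the reported shares add up to 99%. The function name `share_percent` is hypothetical.

```python
def share_percent(part: int, total: int) -> int:
    """Return part/total as a whole percentage, truncated (not rounded)."""
    return part * 100 // total


# Line counts per category, copied from the overall breakdown in this report.
TOTAL_LINES = 2247
LINES_PER_CATEGORY = {
    "1001+": 0,
    "501-1000": 0,
    "201-500": 243,
    "101-200": 768,
    "1-100": 1236,
}

shares = {
    category: share_percent(lines, TOTAL_LINES)
    for category, lines in LINES_PER_CATEGORY.items()
}
print(" | ".join(f"{value}%" for value in shares.values()))
```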


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 0% | 44% | 0% | 55%
scala | 0% | 0% | 0% | 72% | 27%
jinja | 0% | 0% | 0% | 28% | 71%
java | 0% | 0% | 0% | 0% | 100%
sbt | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
spark/spark-tensorflow-distributor | 0% | 0% | 89% | 0% | 10%
spark/spark-tensorflow-connector | 0% | 0% | 0% | 72% | 27%
distribution_strategy | 0% | 0% | 0% | 44% | 55%
hadoop/src | 0% | 0% | 0% | 0% | 100%
data_service | 0% | 0% | 0% | 0% | 100%
docker | 0% | 0% | 0% | 0% | 100%
kubernetes | 0% | 0% | 0% | 0% | 100%
marathon | 0% | 0% | 0% | 0% | 100%
swarm | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 35)
File | # lines | # units
mirrored_strategy_runner.py
in spark/spark-tensorflow-distributor/spark_tensorflow_distributor
243 9
FeatureDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
195 15
TensorFlowInferSchema.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
167 12
DefaultSource.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
157 8
DefaultTfRecordRowEncoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
133 5
template.yaml.jinja
in distribution_strategy
116 -
mnist.py
in docker
96 4
template.yaml.jinja
in kubernetes
90 -
TFRecordFileMRExample.java
in hadoop/src/main/java/org/tensorflow/hadoop/example
87 2
keras_model_to_estimator_client.py
in distribution_strategy
82 4
DefaultTfRecordRowDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
79 5
TFRecordReader.java
in hadoop/src/main/java/org/tensorflow/hadoop/util
73 4
TFRecordFileInputFormat.java
in hadoop/src/main/java/org/tensorflow/hadoop/io
69 1
template.json.jinja
in marathon
60 -
data_service.yaml.jinja
in data_service
54 -
template.yaml.jinja
in swarm
47 -
TFRecordFileOutputFormat.java
in hadoop/src/main/java/org/tensorflow/hadoop/io
47 -
Crc32C.java
in hadoop/src/main/java/org/tensorflow/hadoop/util
43 7
keras_model_to_estimator.py
in distribution_strategy
43 2
FeatureListDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
42 8
TFRecordFileOutputFormatV1.java
in hadoop/src/main/java/org/tensorflow/hadoop/io
38 1
TensorflowRelation.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
37 -
FeatureEncoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
36 4
FeatureListEncoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
35 4
TFRecordWriter.java
in hadoop/src/main/java/org/tensorflow/hadoop/util
34 3
data_service_interfaces.yaml.jinja
in data_service
34 -
setup.py
in spark/spark-tensorflow-distributor
28 -
tf_std_data_server.py
in data_service
28 1
DataFrameTfrConverter.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/udf
15 2
TFRecordIOConf.java
in hadoop/src/main/java/org/tensorflow/hadoop/io
10 2
tf_std_server.py
in distribution_strategy
9 1
render_template.py
in distribution_strategy
8 -
render_template.py
in root
8 -
plugins.sbt
in spark/spark-tensorflow-connector/project
3 -
__init__.py
in spark/spark-tensorflow-distributor/spark_tensorflow_distributor
1 -
Files With Most Units (Top 20)
File | # lines | # units
FeatureDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
195 15
TensorFlowInferSchema.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
167 12
mirrored_strategy_runner.py
in spark/spark-tensorflow-distributor/spark_tensorflow_distributor
243 9
DefaultSource.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
157 8
FeatureListDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
42 8
Crc32C.java
in hadoop/src/main/java/org/tensorflow/hadoop/util
43 7
DefaultTfRecordRowEncoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
133 5
DefaultTfRecordRowDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
79 5
FeatureListEncoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
35 4
FeatureEncoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
36 4
mnist.py
in docker
96 4
TFRecordReader.java
in hadoop/src/main/java/org/tensorflow/hadoop/util
73 4
keras_model_to_estimator_client.py
in distribution_strategy
82 4
TFRecordWriter.java
in hadoop/src/main/java/org/tensorflow/hadoop/util
34 3
DataFrameTfrConverter.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/udf
15 2
TFRecordFileMRExample.java
in hadoop/src/main/java/org/tensorflow/hadoop/example
87 2
TFRecordIOConf.java
in hadoop/src/main/java/org/tensorflow/hadoop/io
10 2
keras_model_to_estimator.py
in distribution_strategy
43 2
TFRecordFileOutputFormatV1.java
in hadoop/src/main/java/org/tensorflow/hadoop/io
38 1
TFRecordFileInputFormat.java
in hadoop/src/main/java/org/tensorflow/hadoop/io
69 1
Files With Long Lines (Top 8)

There are 8 files with lines longer than 120 characters. In total, there are 26 long lines.
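A minimal sketch of how such a long-line count could be computed is shown below. It uses the 120-character threshold stated above. The function name `count_long_lines` is hypothetical, and the analysis tool may count characters differently (for example, in how it treats tabs or trailing whitespace).

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count the lines in text that are strictly longer than limit characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


# Small in-memory example: one line of 121 characters, one of exactly 120.
sample = "a" * 121 + "\n" + "b" * 120 + "\n"
print(count_long_lines(sample))
```

Only the 121-character line exceeds the threshold, so a line of exactly 120 characters is not counted as long under this definition.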

File | # lines | # units | # long lines
FeatureDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
195 15 14
TensorflowRelation.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
37 - 3
DefaultSource.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
157 8 2
DefaultTfRecordRowDecoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
79 5 2
tf_std_data_server.py
in data_service
28 1 2
DefaultTfRecordRowEncoder.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords/serde
133 5 1
TensorFlowInferSchema.scala
in spark/spark-tensorflow-connector/src/main/scala/org/tensorflow/spark/datasources/tfrecords
167 12 1
template.json.jinja
in marathon
60 - 1