apache / spark-connect-go
File Size

The distribution of file sizes, measured in lines of code.

File Size Overall
Size bucket (lines of code): 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
Share: 38% | 0% | 34% | 12% | 14%


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
go | 42% | 0% | 30% | 14% | 13%
py | 0% | 0% | 81% | 0% | 18%
scala | 0% | 0% | 0% | 0% | 100%
sbt | 0% | 0% | 0% | 0% | 100%
yaml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
spark | 43% | 0% | 31% | 11% | 13%
ROOT | 0% | 0% | 97% | 0% | 2%
cmd | 0% | 0% | 0% | 80% | 19%
dev | 0% | 0% | 0% | 0% | 100%
java | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 39)
File | Location | # lines | # units
dataframe.go | spark/sql | 1388 | 96
generated.go | spark/sql/functions | 1181 | 356
merge_connect_go_pr.py | (repository root) | 437 | 15
client.go | spark/client | 362 | 21
arrow.go | spark/sql/types | 341 | 7
retry.go | spark/client | 340 | 18
builtin.go | spark/sql/types | 300 | 52
expressions.go | spark/sql/column | 285 | 31
sparksession.go | spark/sql | 212 | 11
datatype.go | spark/sql/types | 177 | 43
main.go | cmd/spark-connect-example-spark-session | 156 | 1
channel.go | spark/client/channel | 154 | 9
group.go | spark/sql | 148 | 9
errors.go | spark/sparkerrors | 117 | 10
consts.go | spark/sql/utils | 106 | 3
column.go | spark/sql/column | 100 | 18
gen.py | dev | 98 | 1
conf.go | spark/client | 95 | 7
dataframewriter.go | spark/sql | 80 | 5
conversion.go | spark/sql/types | 64 | 4
utils.go | spark/client/testutils | 59 | 9
plan.go | spark/sql | 58 | 5
dataframestatfunctions.go | spark/sql | 57 | 11
buiitins.go | spark/sql/functions | 44 | 13
structtype.go | spark/sql/types | 41 | 6
row.go | spark/sql/types | 41 | 5
dataframereader.go | spark/sql | 41 | 5
dataframenafunctions.go | spark/sql | 39 | 6
main.go | cmd/spark-connect-example-raw-grpc-client | 38 | 1
base.go | spark/client/base | 27 | -
Runner.scala | java/src/main/scala/org/apache/spark/golang | 21 | 1
options.go | spark/client/options | 14 | 1
build.sbt | java | 11 | -
buf.gen.yaml | (repository root) | 6 | -
check.go | spark/sql/utils | 6 | 1
version.go | spark | 4 | 1
buf.work.yaml | (repository root) | 3 | -
compat.go | spark/client/channel | 2 | -
executeplanclient.go | spark/sql | 1 | -
Files With Most Units (Top 33)
File | Location | # lines | # units
generated.go | spark/sql/functions | 1181 | 356
dataframe.go | spark/sql | 1388 | 96
builtin.go | spark/sql/types | 300 | 52
datatype.go | spark/sql/types | 177 | 43
expressions.go | spark/sql/column | 285 | 31
client.go | spark/client | 362 | 21
column.go | spark/sql/column | 100 | 18
retry.go | spark/client | 340 | 18
merge_connect_go_pr.py | (repository root) | 437 | 15
buiitins.go | spark/sql/functions | 44 | 13
dataframestatfunctions.go | spark/sql | 57 | 11
sparksession.go | spark/sql | 212 | 11
errors.go | spark/sparkerrors | 117 | 10
group.go | spark/sql | 148 | 9
utils.go | spark/client/testutils | 59 | 9
channel.go | spark/client/channel | 154 | 9
arrow.go | spark/sql/types | 341 | 7
conf.go | spark/client | 95 | 7
dataframenafunctions.go | spark/sql | 39 | 6
structtype.go | spark/sql/types | 41 | 6
plan.go | spark/sql | 58 | 5
row.go | spark/sql/types | 41 | 5
dataframereader.go | spark/sql | 41 | 5
dataframewriter.go | spark/sql | 80 | 5
conversion.go | spark/sql/types | 64 | 4
consts.go | spark/sql/utils | 106 | 3
Runner.scala | java/src/main/scala/org/apache/spark/golang | 21 | 1
check.go | spark/sql/utils | 6 | 1
version.go | spark | 4 | 1
options.go | spark/client/options | 14 | 1
gen.py | dev | 98 | 1
main.go | cmd/spark-connect-example-raw-grpc-client | 38 | 1
main.go | cmd/spark-connect-example-spark-session | 156 | 1
Files With Long Lines (Top 13)

There are 13 files with lines longer than 120 characters. In total, there are 44 long lines.

File | Location | # lines | # units | # long lines
dataframe.go | spark/sql | 1388 | 96 | 11
client.go | spark/client | 362 | 21 | 6
dataframestatfunctions.go | spark/sql | 57 | 11 | 5
generated.go | spark/sql/functions | 1181 | 356 | 5
retry.go | spark/client | 340 | 18 | 4
dataframenafunctions.go | spark/sql | 39 | 6 | 2
sparksession.go | spark/sql | 212 | 11 | 2
utils.go | spark/client/testutils | 59 | 9 | 2
gen.py | dev | 98 | 1 | 2
main.go | cmd/spark-connect-example-spark-session | 156 | 1 | 2
group.go | spark/sql | 148 | 9 | 1
base.go | spark/client/base | 27 | - | 1
channel.go | spark/client/channel | 154 | 9 | 1
Correlations

File Size vs. Commits (all time): 39 points

File | commits (all time) | lines of code
spark/sql/types/arrow.go | 7 | 341
spark/client/channel/channel.go | 7 | 154
spark/client/client.go | 11 | 362
spark/client/options/options.go | 2 | 14
spark/sql/sparksession.go | 9 | 212
spark/version.go | 1 | 4
spark/sql/types/row.go | 4 | 41
spark/sql/dataframe.go | 24 | 1388
spark/sql/dataframestatfunctions.go | 2 | 57
spark/sql/group.go | 4 | 148
spark/sql/dataframenafunctions.go | 1 | 39
spark/sql/column/column.go | 8 | 100
spark/sql/column/expressions.go | 7 | 285
java/build.sbt | 1 | 11
java/src/main/scala/org/apache/spark/golang/Runner.scala | 1 | 21
spark/client/conf.go | 2 | 95
cmd/spark-connect-example-spark-session/main.go | 24 | 156
spark/sql/functions/buiitins.go | 3 | 44
spark/sql/functions/generated.go | 4 | 1181
spark/sql/types/builtin.go | 1 | 300
spark/client/base/base.go | 4 | 27
spark/sql/dataframereader.go | 5 | 41
spark/sql/plan.go | 3 | 58
spark/sql/types/datatype.go | 4 | 177
spark/sql/utils/consts.go | 2 | 106
spark/sparkerrors/errors.go | 10 | 117
spark/sql/types/conversion.go | 2 | 64
spark/client/retry.go | 1 | 340
spark/sql/dataframewriter.go | 4 | 80
merge_connect_go_pr.py | 3 | 437
spark/sql/executeplanclient.go | 3 | 1
cmd/spark-connect-example-raw-grpc-client/main.go | 12 | 38
spark/sql/utils/check.go | 1 | 6
buf.gen.yaml | 2 | 6

lines of code
min: 1.0 | average: 170.62 | 25th percentile: 27.0 | median: 64.0 | 75th percentile: 177.0 | max: 1388.0
commits (all time)
min: 1.0 | average: 4.97 | 25th percentile: 2.0 | median: 3.0 | 75th percentile: 7.0 | max: 24.0

File Size vs. Contributors (all time): 39 points

File | contributors (all time) | lines of code
spark/sql/types/arrow.go | 4 | 341
spark/client/channel/channel.go | 1 | 154
spark/client/client.go | 4 | 362
spark/client/options/options.go | 1 | 14
spark/sql/sparksession.go | 3 | 212
spark/version.go | 1 | 4
spark/sql/types/row.go | 3 | 41
spark/sql/dataframe.go | 4 | 1388
spark/sql/dataframestatfunctions.go | 2 | 57
spark/sql/group.go | 1 | 148
spark/sql/dataframenafunctions.go | 1 | 39
spark/sql/column/column.go | 2 | 100
spark/sql/column/expressions.go | 2 | 285
java/src/main/scala/org/apache/spark/golang/Runner.scala | 1 | 21
spark/client/conf.go | 2 | 95
cmd/spark-connect-example-spark-session/main.go | 7 | 156
dev/gen.py | 1 | 98
spark/sql/functions/buiitins.go | 1 | 44
spark/sql/functions/generated.go | 1 | 1181
spark/sql/types/builtin.go | 1 | 300
spark/client/base/base.go | 3 | 27
spark/sql/dataframereader.go | 2 | 41
spark/sql/types/datatype.go | 2 | 177
spark/sql/utils/consts.go | 1 | 106
spark/sparkerrors/errors.go | 2 | 117
spark/sql/types/conversion.go | 1 | 64
spark/client/retry.go | 1 | 340
spark/client/testutils/utils.go | 1 | 59
spark/sql/dataframewriter.go | 1 | 80
merge_connect_go_pr.py | 2 | 437
cmd/spark-connect-example-raw-grpc-client/main.go | 5 | 38
spark/sql/utils/check.go | 1 | 6
buf.work.yaml | 2 | 3
buf.gen.yaml | 2 | 6

lines of code
min: 1.0 | average: 170.62 | 25th percentile: 27.0 | median: 64.0 | 75th percentile: 177.0 | max: 1388.0
contributors (all time)
min: 1.0 | average: 1.95 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 2.0 | max: 7.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".