apache / datafusion-ballista
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
13% | 30% | 31% | 11% | 12%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
rs11% | 35% | 35% | 11% | 6%
proto48% | 27% | 22% | 0% | 2%
scala0% | 0% | 100% | 0% | 0%
py0% | 0% | 0% | 38% | 61%
toml0% | 0% | 0% | 32% | 67%
sql0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
ballista15% | 35% | 34% | 8% | 6%
benchmarks0% | 0% | 18% | 23% | 58%
python0% | 0% | 35% | 27% | 36%
ballista-cli0% | 0% | 0% | 92% | 7%
dev0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
execution_graph.rs
in ballista/scheduler/src/state
2094 10
proto
datafusion.proto
in ballista/core/proto
1069 -
mod.rs
in ballista/scheduler/src/cluster
809 13
grpc.rs
in ballista/scheduler/src/scheduler_server
775 1
execution_stage.rs
in ballista/scheduler/src/state
721 5
executor_server.rs
in ballista/executor/src
688 8
execution_graph_dot.rs
in ballista/scheduler/src/state
623 9
shuffle_reader.rs
in ballista/core/src/execution_plans
613 22
proto
ballista.proto
in ballista/core/proto
603 -
mod.rs
in ballista/scheduler/src/scheduler_server
584 3
executor_process.rs
in ballista/executor/src
578 1
task_manager.rs
in ballista/scheduler/src/state
568 2
memory.rs
in ballista/scheduler/src/cluster
523 6
shuffle_writer.rs
in ballista/core/src/execution_plans
491 14
proto
datafusion_common.proto
in ballista/core/proto
489 -
planner.rs
in ballista/scheduler/src
403 7
mod.rs
in ballista/scheduler/src/state
388 -
from_proto.rs
in ballista/core/src/serde/scheduler
382 11
query_stage_scheduler.rs
in ballista/scheduler/src/scheduler_server
380 4
handlers.rs
in ballista/scheduler/src/api
363 2
executor_manager.rs
in ballista/scheduler/src/state
347 -
executor.rs
in ballista/executor/src
338 13
distributed_query.rs
in ballista/core/src/execution_plans
333 11
extension.rs
in ballista/core/src
331 28
execution_loop.rs
in ballista/executor/src
267 -
config.rs
in ballista/core/src
256 16
cpu_bound_executor.rs
in ballista/executor/src
250 5
mod.rs
in ballista/scheduler/src/cluster/event
242 3
mod.rs
in ballista/core/src/consistent_hash
241 8
object_store.rs
in ballista/core/src
238 9
config.rs
in ballista/scheduler/src
237 4
mod.rs
in ballista/core/src/serde
231 12
mod.rs
in ballista/core/src/serde/scheduler
226 1
SparkTpch.scala
in benchmarks/spark/src/main/scala/org/apache/arrow
222 3
to_proto.rs
in ballista/core/src/serde/scheduler
218 9
cluster.rs
in python/src
211 -
planner.rs
in ballista/core/src
206 3
command.rs
in ballista-cli/src
200 4
flight_service.rs
in ballista/executor/src
198 4
error.rs
in ballista/core/src
195 14
extension.rs
in ballista/client/src
189 1
client.rs
in ballista/core/src
176 -
groupby-datafusion.py
in benchmarks/db-benchmark
166 1
main.rs
in ballista-cli/src
165 4
codec.rs
in python/src
162 14
exec.rs
in ballista-cli/src
155 -
scheduler_config_spec.toml
in ballista/scheduler
144 -
prometheus.rs
in ballista/scheduler/src/metrics
128 6
standalone.rs
in ballista/executor/src
128 -
display.rs
in ballista/scheduler/src
121 4
Files With Most Units (Top 50)
File# lines# units
extension.rs
in ballista/core/src
331 28
shuffle_reader.rs
in ballista/core/src/execution_plans
613 22
config.rs
in ballista/core/src
256 16
codec.rs
in python/src
162 14
shuffle_writer.rs
in ballista/core/src/execution_plans
491 14
error.rs
in ballista/core/src
195 14
mod.rs
in ballista/scheduler/src/cluster
809 13
executor.rs
in ballista/executor/src
338 13
mod.rs
in ballista/core/src/serde
231 12
from_proto.rs
in ballista/core/src/serde/scheduler
382 11
distributed_query.rs
in ballista/core/src/execution_plans
333 11
execution_graph.rs
in ballista/scheduler/src/state
2094 10
execution_graph_dot.rs
in ballista/scheduler/src/state
623 9
object_store.rs
in ballista/core/src
238 9
to_proto.rs
in ballista/core/src/serde/scheduler
218 9
unresolved_shuffle.rs
in ballista/core/src/execution_plans
87 9
collect.rs
in ballista/executor/src
97 9
mod.rs
in ballista/core/src/consistent_hash
241 8
executor_server.rs
in ballista/executor/src
688 8
mod.rs
in ballista/scheduler/src/metrics
34 7
planner.rs
in ballista/scheduler/src
403 7
registry.rs
in ballista/core/src
80 7
prometheus.rs
in ballista/scheduler/src/metrics
128 6
memory.rs
in ballista/scheduler/src/cluster
523 6
56 5
execution_stage.rs
in ballista/scheduler/src/state
721 5
cpu_bound_executor.rs
in ballista/executor/src
250 5
command.rs
in ballista-cli/src
200 4
main.rs
in ballista-cli/src
165 4
74 4
config.rs
in ballista/scheduler/src
237 4
query_stage_scheduler.rs
in ballista/scheduler/src/scheduler_server
380 4
display.rs
in ballista/scheduler/src
121 4
flight_service.rs
in ballista/executor/src
198 4
SparkTpch.scala
in benchmarks/spark/src/main/scala/org/apache/arrow
222 3
generate-changelog.py
in dev/release
73 3
65 3
mod.rs
in ballista/scheduler/src/cluster/event
242 3
mod.rs
in ballista/scheduler/src/scheduler_server
584 3
planner.rs
in ballista/core/src
206 3
execution_engine.rs
in ballista/executor/src
67 3
join-datafusion.py
in benchmarks/db-benchmark
118 2
handlers.rs
in ballista/scheduler/src/api
363 2
task_manager.rs
in ballista/scheduler/src/state
568 2
mod.rs
in ballista/executor/src/metrics
28 2
groupby-datafusion.py
in benchmarks/db-benchmark
166 1
lib.rs
in python/src
60 1
80 1
grpc.rs
in ballista/scheduler/src/scheduler_server
775 1
build.rs
in ballista/scheduler
14 1
Files With Long Lines (Top 18)

There are 18 files with lines longer than 120 characters. In total, there are 63 long lines.

File# lines# units# long lines
execution_graph.rs
in ballista/scheduler/src/state
2094 10 14
join-datafusion.py
in benchmarks/db-benchmark
118 2 7
execution_stage.rs
in ballista/scheduler/src/state
721 5 7
scheduler_config_spec.toml
in ballista/scheduler
144 - 5
executor_config_spec.toml
in ballista/executor
118 - 5
planner.rs
in ballista/scheduler/src
403 7 4
config.rs
in ballista/core/src
256 16 4
groupby-datafusion.py
in benchmarks/db-benchmark
166 1 3
task_manager.rs
in ballista/scheduler/src/state
568 2 3
proto
datafusion.proto
in ballista/core/proto
1069 - 2
executor_process.rs
in ballista/executor/src
578 1 2
cluster.rs
in python/src
211 - 1
Cargo.toml
in root
74 - 1
mod.rs
in ballista/scheduler/src/scheduler_server
584 3 1
Cargo.toml
in ballista/scheduler
72 - 1
object_store.rs
in ballista/core/src
238 9 1
to_proto.rs
in ballista/core/src/serde/scheduler
218 9 1
execution_loop.rs
in ballista/executor/src
267 - 1
Correlations

File Size vs. Commits (all time): 126 points

ballista/scheduler/src/state/execution_graph.rs x: 24 commits (all time) y: 2094 lines of code ballista/core/src/object_store.rs x: 2 commits (all time) y: 238 lines of code ballista/core/src/execution_plans/distributed_query.rs x: 19 commits (all time) y: 333 lines of code ballista/scheduler/src/cluster/mod.rs x: 22 commits (all time) y: 809 lines of code ballista/executor/src/cpu_bound_executor.rs x: 2 commits (all time) y: 250 lines of code ballista/scheduler/Cargo.toml x: 46 commits (all time) y: 72 lines of code ballista/scheduler/src/lib.rs x: 9 commits (all time) y: 15 lines of code ballista/scheduler/src/scheduler_process.rs x: 17 commits (all time) y: 76 lines of code ballista/scheduler/src/state/mod.rs x: 29 commits (all time) y: 388 lines of code ballista/scheduler/src/cluster/memory.rs x: 16 commits (all time) y: 523 lines of code ballista/scheduler/src/config.rs x: 22 commits (all time) y: 237 lines of code ballista/scheduler/src/scheduler_server/grpc.rs x: 32 commits (all time) y: 775 lines of code ballista/scheduler/src/state/execution_stage.rs x: 1 commits (all time) y: 721 lines of code ballista/scheduler/src/state/executor_manager.rs x: 18 commits (all time) y: 347 lines of code ballista/core/Cargo.toml x: 47 commits (all time) y: 66 lines of code ballista/core/src/extension.rs x: 9 commits (all time) y: 331 lines of code ballista/core/src/lib.rs x: 13 commits (all time) y: 25 lines of code ballista/executor/src/flight_service.rs x: 11 commits (all time) y: 198 lines of code python/Cargo.toml x: 57 commits (all time) y: 43 lines of code python/pyproject.toml x: 16 commits (all time) y: 59 lines of code python/src/codec.rs x: 2 commits (all time) y: 162 lines of code ballista/scheduler/src/state/task_manager.rs x: 26 commits (all time) y: 568 lines of code Cargo.toml x: 72 commits (all time) y: 74 lines of code ballista-cli/Cargo.toml x: 65 commits (all time) y: 38 lines of code ballista-cli/src/command.rs x: 8 commits (all time) y: 200 lines of code ballista-cli/src/exec.rs x: 11 commits (all time) y: 155 lines of code ballista/client/Cargo.toml x: 39 commits (all time) y: 47 lines of code ballista/core/proto/datafusion.proto x: 13 commits (all time) y: 1069 lines of code ballista/core/proto/datafusion_common.proto x: 6 commits (all time) y: 489 lines of code ballista/core/src/diagram.rs x: 3 commits (all time) y: 119 lines of code ballista/core/src/execution_plans/shuffle_reader.rs x: 22 commits (all time) y: 613 lines of code ballista/core/src/execution_plans/shuffle_writer.rs x: 17 commits (all time) y: 491 lines of code ballista/executor/Cargo.toml x: 38 commits (all time) y: 62 lines of code ballista/scheduler/src/planner.rs x: 20 commits (all time) y: 403 lines of code ballista/scheduler/src/state/execution_graph_dot.rs x: 19 commits (all time) y: 623 lines of code benchmarks/Cargo.toml x: 74 commits (all time) y: 50 lines of code ballista/executor/executor_config_spec.toml x: 9 commits (all time) y: 118 lines of code ballista/core/proto/ballista.proto x: 19 commits (all time) y: 603 lines of code dev/release/download-python-wheels.py x: 3 commits (all time) y: 80 lines of code dev/release/generate-changelog.py x: 2 commits (all time) y: 73 lines of code rustfmt.toml x: 3 commits (all time) y: 18 lines of code ballista/executor/src/executor.rs x: 22 commits (all time) y: 338 lines of code ballista/executor/src/executor_server.rs x: 25 commits (all time) y: 688 lines of code ballista/scheduler/src/scheduler_server/mod.rs x: 33 commits (all time) y: 584 lines of code benchmarks/queries/q10.sql x: 2 commits (all time) y: 32 lines of code benchmarks/queries/q18.sql x: 2 commits (all time) y: 33 lines of code benchmarks/queries/q2.sql x: 2 commits (all time) y: 44 lines of code benchmarks/queries/q3.sql x: 2 commits (all time) y: 23 lines of code ballista/scheduler/src/api/handlers.rs x: 9 commits (all time) y: 363 lines of code ballista/scheduler/src/api/mod.rs x: 5 commits (all time) y: 38 lines of code ballista/executor/src/execution_engine.rs x: 6 commits (all time) y: 67 lines of code ballista/executor/src/metrics/mod.rs x: 3 commits (all time) y: 28 lines of code ballista/core/src/client.rs x: 8 commits (all time) y: 176 lines of code ballista/core/src/config.rs x: 19 commits (all time) y: 256 lines of code ballista/executor/src/executor_process.rs x: 27 commits (all time) y: 578 lines of code ballista/executor/src/standalone.rs x: 16 commits (all time) y: 128 lines of code ballista/scheduler/src/standalone.rs x: 16 commits (all time) y: 85 lines of code ballista/executor/src/execution_loop.rs x: 21 commits (all time) y: 267 lines of code ballista/executor/src/lib.rs x: 8 commits (all time) y: 87 lines of code ballista/client/src/lib.rs x: 5 commits (all time) y: 4 lines of code ballista/core/src/execution_plans/unresolved_shuffle.rs x: 10 commits (all time) y: 87 lines of code ballista/core/src/planner.rs x: 2 commits (all time) y: 206 lines of code ballista/core/src/utils.rs x: 30 commits (all time) y: 121 lines of code ballista/executor/src/collect.rs x: 9 commits (all time) y: 97 lines of code ballista/scheduler/build.rs x: 5 commits (all time) y: 14 lines of code ballista/core/src/consistent_hash/mod.rs x: 2 commits (all time) y: 241 lines of code ballista/core/src/serde/mod.rs x: 19 commits (all time) y: 231 lines of code ballista/core/src/error.rs x: 8 commits (all time) y: 195 lines of code ballista/executor/build.rs x: 3 commits (all time) y: 8 lines of code ballista/scheduler/src/scheduler_server/query_stage_scheduler.rs x: 16 commits (all time) y: 380 lines of code python/src/cluster.rs x: 2 commits (all time) y: 211 lines of code python/ballista/__init__.py x: 3 commits (all time) y: 14 lines of code python/src/lib.rs x: 12 commits (all time) y: 60 lines of code ballista/scheduler/src/display.rs x: 5 commits (all time) y: 121 lines of code ballista/scheduler/src/state/session_manager.rs x: 10 commits (all time) y: 54 lines of code ballista/client/src/extension.rs x: 6 commits (all time) y: 189 lines of code ballista/core/src/serde/scheduler/from_proto.rs x: 12 commits (all time) y: 382 lines of code ballista/core/src/registry.rs x: 1 commits (all time) y: 80 lines of code ballista/core/src/serde/scheduler/mod.rs x: 9 commits (all time) y: 226 lines of code ballista/scheduler/scheduler_config_spec.toml x: 21 commits (all time) y: 144 lines of code ballista-cli/src/main.rs x: 13 commits (all time) y: 165 lines of code ballista/client/src/prelude.rs x: 9 commits (all time) y: 1 lines of code ballista/core/src/serde/scheduler/to_proto.rs x: 8 commits (all time) y: 218 lines of code ballista/scheduler/src/scheduler_server/event.rs x: 9 commits (all time) y: 47 lines of code ballista/core/src/consistent_hash/node.rs x: 1 commits (all time) y: 4 lines of code ballista/scheduler/src/scheduler_server/external_scaler.rs x: 4 commits (all time) y: 49 lines of code dev/update_ballista_versions.py x: 13 commits (all time) y: 65 lines of code ballista/scheduler/src/metrics/prometheus.rs x: 2 commits (all time) y: 128 lines of code benchmarks/spark/src/main/scala/org/apache/arrow/SparkTpch.scala x: 1 commits (all time) y: 222 lines of code ballista/executor/src/shutdown.rs x: 1 commits (all time) y: 48 lines of code ballista/executor/src/terminate.rs x: 1 commits (all time) y: 14 lines of code ballista/scheduler/proto/keda.proto x: 1 commits (all time) y: 50 lines of code benchmarks/db-benchmark/groupby-datafusion.py x: 1 commits (all time) y: 166 lines of code benchmarks/db-benchmark/join-datafusion.py x: 1 commits (all time) y: 118 lines of code dev/update_arrow_deps.py x: 4 commits (all time) y: 56 lines of code benchmarks/queries/q15.sql x: 1 commits (all time) y: 31 lines of code benchmarks/queries/q1.sql x: 1 commits (all time) y: 21 lines of code benchmarks/queries/q19.sql x: 1 commits (all time) y: 35 lines of code
2094.0
lines of code
  min: 1.0
  average: 182.46
  25th percentile: 32.75
  median: 73.5
  75th percentile: 232.5
  max: 2094.0
0 74.0
commits (all time)
min: 1.0 | average: 11.33 | 25th percentile: 2.0 | median: 5.5 | 75th percentile: 16.25 | max: 74.0

File Size vs. Contributors (all time): 126 points

ballista/scheduler/src/state/execution_graph.rs x: 9 contributors (all time) y: 2094 lines of code ballista/core/src/object_store.rs x: 1 contributors (all time) y: 238 lines of code ballista/core/src/execution_plans/distributed_query.rs x: 10 contributors (all time) y: 333 lines of code ballista/scheduler/src/cluster/mod.rs x: 8 contributors (all time) y: 809 lines of code ballista/executor/src/cpu_bound_executor.rs x: 2 contributors (all time) y: 250 lines of code ballista/scheduler/Cargo.toml x: 13 contributors (all time) y: 72 lines of code ballista/scheduler/src/lib.rs x: 4 contributors (all time) y: 15 lines of code ballista/scheduler/src/scheduler_process.rs x: 6 contributors (all time) y: 76 lines of code ballista/scheduler/src/state/mod.rs x: 11 contributors (all time) y: 388 lines of code ballista/scheduler/src/cluster/memory.rs x: 4 contributors (all time) y: 523 lines of code ballista/scheduler/src/config.rs x: 7 contributors (all time) y: 237 lines of code ballista/scheduler/src/scheduler_server/grpc.rs x: 9 contributors (all time) y: 775 lines of code ballista/scheduler/src/state/execution_stage.rs x: 1 contributors (all time) y: 721 lines of code ballista/scheduler/src/state/executor_manager.rs x: 7 contributors (all time) y: 347 lines of code ballista/core/Cargo.toml x: 12 contributors (all time) y: 66 lines of code ballista/core/src/extension.rs x: 2 contributors (all time) y: 331 lines of code ballista/core/src/lib.rs x: 4 contributors (all time) y: 25 lines of code ballista/executor/src/flight_service.rs x: 7 contributors (all time) y: 198 lines of code python/Cargo.toml x: 18 contributors (all time) y: 43 lines of code python/pyproject.toml x: 9 contributors (all time) y: 59 lines of code python/src/codec.rs x: 1 contributors (all time) y: 162 lines of code ballista/scheduler/src/state/task_manager.rs x: 9 contributors (all time) y: 568 lines of code Cargo.toml x: 23 contributors (all time) y: 74 lines of code ballista-cli/Cargo.toml x: 17 contributors (all time) y: 38 lines of code ballista-cli/src/command.rs x: 4 contributors (all time) y: 200 lines of code ballista-cli/src/exec.rs x: 7 contributors (all time) y: 155 lines of code ballista/client/Cargo.toml x: 11 contributors (all time) y: 47 lines of code ballista/core/proto/datafusion.proto x: 7 contributors (all time) y: 1069 lines of code ballista/core/proto/datafusion_common.proto x: 3 contributors (all time) y: 489 lines of code ballista/core/src/diagram.rs x: 1 contributors (all time) y: 119 lines of code ballista/core/src/execution_plans/shuffle_reader.rs x: 12 contributors (all time) y: 613 lines of code ballista/core/src/execution_plans/shuffle_writer.rs x: 12 contributors (all time) y: 491 lines of code ballista/executor/Cargo.toml x: 10 contributors (all time) y: 62 lines of code ballista/scheduler/src/planner.rs x: 10 contributors (all time) y: 403 lines of code ballista/scheduler/src/state/execution_graph_dot.rs x: 9 contributors (all time) y: 623 lines of code benchmarks/Cargo.toml x: 25 contributors (all time) y: 50 lines of code ballista/executor/executor_config_spec.toml x: 3 contributors (all time) y: 118 lines of code ballista/core/proto/ballista.proto x: 10 contributors (all time) y: 603 lines of code dev/release/download-python-wheels.py x: 3 contributors (all time) y: 80 lines of code dev/release/generate-changelog.py x: 2 contributors (all time) y: 73 lines of code rustfmt.toml x: 3 contributors (all time) y: 18 lines of code ballista/executor/src/executor.rs x: 9 contributors (all time) y: 338 lines of code ballista/executor/src/executor_server.rs x: 8 contributors (all time) y: 688 lines of code ballista/scheduler/src/scheduler_server/mod.rs x: 10 contributors (all time) y: 584 lines of code benchmarks/queries/q10.sql x: 2 contributors (all time) y: 32 lines of code benchmarks/queries/q18.sql x: 2 contributors (all time) y: 33 lines of code benchmarks/queries/q2.sql x: 2 contributors (all time) y: 44 lines of code benchmarks/queries/q3.sql x: 2 contributors (all time) y: 23 lines of code ballista/scheduler/src/api/handlers.rs x: 7 contributors (all time) y: 363 lines of code ballista/scheduler/src/api/mod.rs x: 5 contributors (all time) y: 38 lines of code ballista/executor/src/execution_engine.rs x: 4 contributors (all time) y: 67 lines of code ballista/core/src/client.rs x: 6 contributors (all time) y: 176 lines of code ballista/core/src/config.rs x: 7 contributors (all time) y: 256 lines of code ballista/executor/src/executor_process.rs x: 7 contributors (all time) y: 578 lines of code ballista/executor/src/standalone.rs x: 5 contributors (all time) y: 128 lines of code ballista/scheduler/src/standalone.rs x: 6 contributors (all time) y: 85 lines of code ballista/executor/src/execution_loop.rs x: 8 contributors (all time) y: 267 lines of code ballista/executor/src/config.rs x: 1 contributors (all time) y: 36 lines of code ballista/executor/src/lib.rs x: 2 contributors (all time) y: 87 lines of code ballista/client/src/lib.rs x: 2 contributors (all time) y: 4 lines of code ballista/core/src/execution_plans/unresolved_shuffle.rs x: 10 contributors (all time) y: 87 lines of code ballista/core/src/planner.rs x: 1 contributors (all time) y: 206 lines of code ballista/core/src/utils.rs x: 11 contributors (all time) y: 121 lines of code ballista/executor/src/collect.rs x: 8 contributors (all time) y: 97 lines of code ballista/scheduler/build.rs x: 3 contributors (all time) y: 14 lines of code ballista/core/src/consistent_hash/mod.rs x: 2 contributors (all time) y: 241 lines of code ballista/core/src/serde/mod.rs x: 10 contributors (all time) y: 231 lines of code ballista/core/src/error.rs x: 4 contributors (all time) y: 195 lines of code ballista/executor/build.rs x: 3 contributors (all time) y: 8 lines of code ballista/scheduler/src/scheduler_server/query_stage_scheduler.rs x: 7 contributors (all time) y: 380 lines of code python/src/cluster.rs x: 1 contributors (all time) y: 211 lines of code python/src/utils.rs x: 4 contributors (all time) y: 38 lines of code ballista/scheduler/src/display.rs x: 4 contributors (all time) y: 121 lines of code ballista/scheduler/src/state/session_manager.rs x: 7 contributors (all time) y: 54 lines of code ballista/client/src/extension.rs x: 1 contributors (all time) y: 189 lines of code ballista/core/src/serde/scheduler/from_proto.rs x: 8 contributors (all time) y: 382 lines of code ballista/core/src/registry.rs x: 1 contributors (all time) y: 80 lines of code ballista/core/src/serde/scheduler/mod.rs x: 6 contributors (all time) y: 226 lines of code ballista/scheduler/scheduler_config_spec.toml x: 6 contributors (all time) y: 144 lines of code ballista-cli/src/main.rs x: 9 contributors (all time) y: 165 lines of code ballista/client/src/prelude.rs x: 4 contributors (all time) y: 1 lines of code ballista/core/src/serde/scheduler/to_proto.rs x: 7 contributors (all time) y: 218 lines of code ballista/scheduler/src/scheduler_server/event.rs x: 5 contributors (all time) y: 47 lines of code ballista/core/src/consistent_hash/node.rs x: 1 contributors (all time) y: 4 lines of code ballista/scheduler/src/scheduler_server/external_scaler.rs x: 4 contributors (all time) y: 49 lines of code dev/update_ballista_versions.py x: 4 contributors (all time) y: 65 lines of code ballista/scheduler/src/metrics/prometheus.rs x: 2 contributors (all time) y: 128 lines of code benchmarks/spark/src/main/scala/org/apache/arrow/SparkTpch.scala x: 1 contributors (all time) y: 222 lines of code ballista/executor/src/shutdown.rs x: 1 contributors (all time) y: 48 lines of code ballista/executor/src/terminate.rs x: 1 contributors (all time) y: 14 lines of code ballista/scheduler/proto/keda.proto x: 1 contributors (all time) y: 50 lines of code dev/update_datafusion_versions.py x: 3 contributors (all time) y: 74 lines of code benchmarks/db-benchmark/groupby-datafusion.py x: 1 contributors (all time) y: 166 lines of code dev/update_arrow_deps.py x: 3 contributors (all time) y: 56 lines of code benchmarks/queries/q15.sql x: 1 contributors (all time) y: 31 lines of code benchmarks/queries/q1.sql x: 1 contributors (all time) y: 21 lines of code
2094.0
lines of code
  min: 1.0
  average: 182.46
  25th percentile: 32.75
  median: 73.5
  75th percentile: 232.5
  max: 2094.0
0 25.0
contributors (all time)
min: 1.0 | average: 4.94 | 25th percentile: 1.0 | median: 3.0 | 75th percentile: 7.25 | max: 25.0

File Size vs. Commits (30 days): 41 points

ballista/scheduler/src/state/execution_graph.rs x: 5 commits (30d) y: 2094 lines of code ballista/core/src/object_store.rs x: 2 commits (30d) y: 238 lines of code ballista/core/src/execution_plans/distributed_query.rs x: 1 commits (30d) y: 333 lines of code ballista/scheduler/src/cluster/mod.rs x: 3 commits (30d) y: 809 lines of code ballista/executor/src/cpu_bound_executor.rs x: 1 commits (30d) y: 250 lines of code ballista/scheduler/Cargo.toml x: 2 commits (30d) y: 72 lines of code ballista/scheduler/src/lib.rs x: 1 commits (30d) y: 15 lines of code ballista/scheduler/src/scheduler_process.rs x: 2 commits (30d) y: 76 lines of code ballista/scheduler/src/state/mod.rs x: 2 commits (30d) y: 388 lines of code ballista/scheduler/src/cluster/memory.rs x: 1 commits (30d) y: 523 lines of code ballista/scheduler/src/scheduler_server/grpc.rs x: 1 commits (30d) y: 775 lines of code ballista/scheduler/src/state/execution_stage.rs x: 1 commits (30d) y: 721 lines of code ballista/scheduler/src/state/executor_manager.rs x: 1 commits (30d) y: 347 lines of code ballista/core/src/lib.rs x: 1 commits (30d) y: 25 lines of code ballista/executor/src/flight_service.rs x: 1 commits (30d) y: 198 lines of code python/Cargo.toml x: 1 commits (30d) y: 43 lines of code python/pyproject.toml x: 1 commits (30d) y: 59 lines of code python/src/codec.rs x: 1 commits (30d) y: 162 lines of code ballista/scheduler/src/state/task_manager.rs x: 2 commits (30d) y: 568 lines of code Cargo.toml x: 1 commits (30d) y: 74 lines of code ballista-cli/Cargo.toml x: 1 commits (30d) y: 38 lines of code ballista-cli/src/exec.rs x: 1 commits (30d) y: 155 lines of code ballista/core/proto/datafusion.proto x: 2 commits (30d) y: 1069 lines of code ballista/core/proto/datafusion_common.proto x: 1 commits (30d) y: 489 lines of code ballista/core/src/diagram.rs x: 1 commits (30d) y: 119 lines of code ballista/core/src/execution_plans/shuffle_reader.rs x: 1 commits (30d) y: 613 lines of code ballista/scheduler/src/planner.rs x: 2 commits (30d) y: 403 lines of code ballista/scheduler/src/state/execution_graph_dot.rs x: 2 commits (30d) y: 623 lines of code benchmarks/Cargo.toml x: 1 commits (30d) y: 50 lines of code ballista/core/proto/ballista.proto x: 1 commits (30d) y: 603 lines of code dev/release/download-python-wheels.py x: 1 commits (30d) y: 80 lines of code rustfmt.toml x: 1 commits (30d) y: 18 lines of code
2094.0
lines of code
  min: 15.0
  average: 333.05
  25th percentile: 69.0
  median: 200.0
  75th percentile: 507.0
  max: 2094.0
0 5.0
commits (30d)
min: 1.0 | average: 1.39 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 5.0

File Size vs. Contributors (30 days): 41 points

ballista/scheduler/src/state/execution_graph.rs x: 1 contributors (30d) y: 2094 lines of code ballista/core/src/object_store.rs x: 1 contributors (30d) y: 238 lines of code ballista/core/src/execution_plans/distributed_query.rs x: 1 contributors (30d) y: 333 lines of code ballista/scheduler/src/cluster/mod.rs x: 1 contributors (30d) y: 809 lines of code ballista/executor/src/cpu_bound_executor.rs x: 1 contributors (30d) y: 250 lines of code ballista/scheduler/Cargo.toml x: 1 contributors (30d) y: 72 lines of code ballista/scheduler/src/lib.rs x: 1 contributors (30d) y: 15 lines of code ballista/scheduler/src/scheduler_process.rs x: 1 contributors (30d) y: 76 lines of code ballista/scheduler/src/state/mod.rs x: 1 contributors (30d) y: 388 lines of code ballista/scheduler/src/cluster/memory.rs x: 1 contributors (30d) y: 523 lines of code ballista/scheduler/src/scheduler_server/grpc.rs x: 1 contributors (30d) y: 775 lines of code ballista/scheduler/src/state/execution_stage.rs x: 1 contributors (30d) y: 721 lines of code ballista/scheduler/src/state/executor_manager.rs x: 1 contributors (30d) y: 347 lines of code ballista/core/src/lib.rs x: 1 contributors (30d) y: 25 lines of code ballista/executor/src/flight_service.rs x: 1 contributors (30d) y: 198 lines of code python/Cargo.toml x: 1 contributors (30d) y: 43 lines of code python/pyproject.toml x: 1 contributors (30d) y: 59 lines of code python/src/codec.rs x: 1 contributors (30d) y: 162 lines of code ballista/scheduler/src/state/task_manager.rs x: 2 contributors (30d) y: 568 lines of code ballista-cli/Cargo.toml x: 1 contributors (30d) y: 38 lines of code ballista-cli/src/exec.rs x: 1 contributors (30d) y: 155 lines of code ballista/core/proto/datafusion.proto x: 2 contributors (30d) y: 1069 lines of code ballista/core/proto/datafusion_common.proto x: 1 contributors (30d) y: 489 lines of code ballista/core/src/diagram.rs x: 1 contributors (30d) y: 119 lines of code ballista/core/src/execution_plans/shuffle_reader.rs x: 1 contributors (30d) y: 613 lines of code ballista/scheduler/src/planner.rs x: 1 contributors (30d) y: 403 lines of code ballista/scheduler/src/state/execution_graph_dot.rs x: 1 contributors (30d) y: 623 lines of code benchmarks/Cargo.toml x: 1 contributors (30d) y: 50 lines of code ballista/core/proto/ballista.proto x: 1 contributors (30d) y: 603 lines of code rustfmt.toml x: 1 contributors (30d) y: 18 lines of code
2094.0
lines of code
  min: 15.0
  average: 333.05
  25th percentile: 69.0
  median: 200.0
  75th percentile: 507.0
  max: 2094.0
0 2.0
contributors (30d)
min: 1.0 | average: 1.05 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 2.0

File Size vs. Commits (90 days): 60 points

ballista/scheduler/src/state/execution_graph.rs x: 5 commits (90d) y: 2094 lines of code ballista/core/src/object_store.rs x: 2 commits (90d) y: 238 lines of code ballista/core/src/execution_plans/distributed_query.rs x: 2 commits (90d) y: 333 lines of code ballista/scheduler/src/cluster/mod.rs x: 4 commits (90d) y: 809 lines of code ballista/executor/src/cpu_bound_executor.rs x: 1 commits (90d) y: 250 lines of code ballista/scheduler/Cargo.toml x: 5 commits (90d) y: 72 lines of code ballista/scheduler/src/lib.rs x: 1 commits (90d) y: 15 lines of code ballista/scheduler/src/scheduler_process.rs x: 3 commits (90d) y: 76 lines of code ballista/scheduler/src/state/mod.rs x: 2 commits (90d) y: 388 lines of code ballista/scheduler/src/cluster/memory.rs x: 2 commits (90d) y: 523 lines of code ballista/scheduler/src/scheduler_server/grpc.rs x: 1 commits (90d) y: 775 lines of code ballista/scheduler/src/state/execution_stage.rs x: 1 commits (90d) y: 721 lines of code ballista/scheduler/src/state/executor_manager.rs x: 3 commits (90d) y: 347 lines of code ballista/core/Cargo.toml x: 4 commits (90d) y: 66 lines of code ballista/core/src/extension.rs x: 3 commits (90d) y: 331 lines of code ballista/core/src/lib.rs x: 1 commits (90d) y: 25 lines of code ballista/executor/src/flight_service.rs x: 2 commits (90d) y: 198 lines of code python/Cargo.toml x: 3 commits (90d) y: 43 lines of code python/pyproject.toml x: 3 commits (90d) y: 59 lines of code python/src/codec.rs x: 1 commits (90d) y: 162 lines of code ballista/scheduler/src/state/task_manager.rs x: 3 commits (90d) y: 568 lines of code Cargo.toml x: 2 commits (90d) y: 74 lines of code ballista-cli/Cargo.toml x: 3 commits (90d) y: 38 lines of code ballista-cli/src/command.rs x: 1 commits (90d) y: 200 lines of code ballista-cli/src/exec.rs x: 1 commits (90d) y: 155 lines of code ballista/core/proto/datafusion.proto x: 3 commits (90d) y: 1069 lines of code ballista/core/proto/datafusion_common.proto x: 2 commits (90d) y: 489 lines of code ballista/core/src/diagram.rs x: 1 commits (90d) y: 119 lines of code ballista/core/src/execution_plans/shuffle_reader.rs x: 4 commits (90d) y: 613 lines of code ballista/scheduler/src/planner.rs x: 3 commits (90d) y: 403 lines of code ballista/scheduler/src/state/execution_graph_dot.rs x: 2 commits (90d) y: 623 lines of code benchmarks/Cargo.toml x: 3 commits (90d) y: 50 lines of code ballista/core/proto/ballista.proto x: 1 commits (90d) y: 603 lines of code dev/release/download-python-wheels.py x: 1 commits (90d) y: 80 lines of code dev/release/generate-changelog.py x: 1 commits (90d) y: 73 lines of code rustfmt.toml x: 1 commits (90d) y: 18 lines of code ballista/executor/src/executor.rs x: 1 commits (90d) y: 338 lines of code ballista/executor/src/executor_server.rs x: 1 commits (90d) y: 688 lines of code ballista/scheduler/src/scheduler_server/mod.rs x: 1 commits (90d) y: 584 lines of code benchmarks/queries/q18.sql x: 1 commits (90d) y: 33 lines of code benchmarks/queries/q2.sql x: 1 commits (90d) y: 44 lines of code ballista/scheduler/src/api/handlers.rs x: 2 commits (90d) y: 363 lines of code ballista/core/src/client.rs x: 1 commits (90d) y: 176 lines of code ballista/core/src/config.rs x: 2 commits (90d) y: 256 lines of code ballista/executor/src/standalone.rs x: 1 commits (90d) y: 128 lines of code ballista/scheduler/src/standalone.rs x: 1 commits (90d) y: 85 lines of code ballista/executor/src/execution_loop.rs x: 1 commits (90d) y: 267 lines of code
2094.0
lines of code
  min: 15.0
  average: 290.87
  25th percentile: 52.25
  median: 169.0
  75th percentile: 467.5
  max: 2094.0
0 5.0
commits (90d)
min: 1.0 | average: 1.88 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 3.0 | max: 5.0

File Size vs. Contributors (90 days): 60 points

ballista/scheduler/src/state/execution_graph.rs x: 1 contributors (90d) y: 2094 lines of code ballista/core/src/object_store.rs x: 1 contributors (90d) y: 238 lines of code ballista/core/src/execution_plans/distributed_query.rs x: 1 contributors (90d) y: 333 lines of code ballista/scheduler/src/cluster/mod.rs x: 1 contributors (90d) y: 809 lines of code ballista/executor/src/cpu_bound_executor.rs x: 1 contributors (90d) y: 250 lines of code ballista/scheduler/Cargo.toml x: 1 contributors (90d) y: 72 lines of code ballista/scheduler/src/lib.rs x: 1 contributors (90d) y: 15 lines of code ballista/scheduler/src/scheduler_process.rs x: 1 contributors (90d) y: 76 lines of code ballista/scheduler/src/state/mod.rs x: 1 contributors (90d) y: 388 lines of code ballista/scheduler/src/cluster/memory.rs x: 1 contributors (90d) y: 523 lines of code ballista/scheduler/src/scheduler_server/grpc.rs x: 1 contributors (90d) y: 775 lines of code ballista/scheduler/src/state/execution_stage.rs x: 1 contributors (90d) y: 721 lines of code ballista/scheduler/src/state/executor_manager.rs x: 1 contributors (90d) y: 347 lines of code ballista/core/src/lib.rs x: 1 contributors (90d) y: 25 lines of code ballista/executor/src/flight_service.rs x: 1 contributors (90d) y: 198 lines of code python/Cargo.toml x: 1 contributors (90d) y: 43 lines of code python/pyproject.toml x: 1 contributors (90d) y: 59 lines of code python/src/codec.rs x: 1 contributors (90d) y: 162 lines of code ballista/scheduler/src/state/task_manager.rs x: 2 contributors (90d) y: 568 lines of code ballista-cli/Cargo.toml x: 1 contributors (90d) y: 38 lines of code ballista-cli/src/exec.rs x: 1 contributors (90d) y: 155 lines of code ballista/core/proto/datafusion.proto x: 2 contributors (90d) y: 1069 lines of code ballista/core/proto/datafusion_common.proto x: 1 contributors (90d) y: 489 lines of code ballista/core/src/diagram.rs x: 1 contributors (90d) y: 119 lines of code ballista/core/src/execution_plans/shuffle_reader.rs x: 1 contributors (90d) y: 613 lines of code ballista/scheduler/src/planner.rs x: 1 contributors (90d) y: 403 lines of code ballista/scheduler/src/state/execution_graph_dot.rs x: 1 contributors (90d) y: 623 lines of code benchmarks/Cargo.toml x: 1 contributors (90d) y: 50 lines of code ballista/core/proto/ballista.proto x: 1 contributors (90d) y: 603 lines of code rustfmt.toml x: 1 contributors (90d) y: 18 lines of code ballista/executor/src/executor_server.rs x: 1 contributors (90d) y: 688 lines of code ballista/scheduler/src/scheduler_server/mod.rs x: 1 contributors (90d) y: 584 lines of code ballista/scheduler/src/api/handlers.rs x: 1 contributors (90d) y: 363 lines of code ballista/core/src/client.rs x: 1 contributors (90d) y: 176 lines of code ballista/core/src/config.rs x: 1 contributors (90d) y: 256 lines of code ballista/executor/src/standalone.rs x: 1 contributors (90d) y: 128 lines of code ballista/scheduler/src/standalone.rs x: 1 contributors (90d) y: 85 lines of code ballista/executor/src/execution_loop.rs x: 1 contributors (90d) y: 267 lines of code
2094.0
lines of code
  min: 15.0
  average: 290.87
  25th percentile: 52.25
  median: 169.0
  75th percentile: 467.5
  max: 2094.0
0 2.0
contributors (90d)
min: 1.0 | average: 1.03 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 2.0