apache / arrow-ballista-python
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 27% | 15% | 56%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
rs0% | 0% | 30% | 32% | 37%
py0% | 0% | 39% | 0% | 60%
cmake0% | 0% | 0% | 0% | 100%
toml0% | 0% | 0% | 0% | 100%
yml0% | 0% | 0% | 0% | 100%
xml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src0% | 0% | 30% | 32% | 37%
ci0% | 0% | 48% | 0% | 51%
dev0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
ballista0% | 0% | 0% | 0% | 100%
Longest Files (Top 41)
File# lines# units
350 11
286 18
135 14
127 8
udaf.rs
in src
104 8
95 13
93 1
errors.rs
in src
81 9
80 4
__init__.py
in ballista
71 8
70 3
65 5
udf.rs
in src
61 3
Cargo.toml
in root
55 -
52 1
51 -
46 1
lib.rs
in src
42 1
hdfs-site.xml
in ci/etc
36 -
check-rat-report.py
in dev/release
36 -
27 -
utils.rs
in src
27 -
26 -
26 -
26 -
25 -
22 -
22 -
21 -
21 -
20 -
20 -
19 -
17 2
14 -
13 -
5 -
4 -
functions.py
in ballista
3 1
2 -
2 -
Files With Most Units (Top 18)
File# lines# units
286 18
135 14
95 13
350 11
errors.rs
in src
81 9
__init__.py
in ballista
71 8
udaf.rs
in src
104 8
127 8
65 5
80 4
70 3
udf.rs
in src
61 3
17 2
functions.py
in ballista
3 1
93 1
lib.rs
in src
42 1
52 1
46 1
Files With Long Lines (Top 2)

There are 2 files with lines longer than 120 characters. In total, there are 15 long lines.

File# lines# units# long lines
350 11 13
Cargo.toml
in root
55 - 2
Correlations

File Size vs. Commits (all time): 41 points

Cargo.toml x: 6 commits (all time) y: 55 lines of code src/executor.rs x: 2 commits (all time) y: 52 lines of code src/scheduler.rs x: 2 commits (all time) y: 46 lines of code src/udaf.rs x: 3 commits (all time) y: 104 lines of code src/udf.rs x: 3 commits (all time) y: 61 lines of code ballista/__init__.py x: 3 commits (all time) y: 71 lines of code src/functions.rs x: 4 commits (all time) y: 350 lines of code src/lib.rs x: 3 commits (all time) y: 42 lines of code src/context.rs x: 3 commits (all time) y: 127 lines of code ci/detect-changes.py x: 2 commits (all time) y: 286 lines of code dev/release/check-rat-report.py x: 2 commits (all time) y: 36 lines of code dev/release/download-python-wheels.py x: 2 commits (all time) y: 93 lines of code dev/update_arrow_deps.py x: 2 commits (all time) y: 65 lines of code dev/update_ballista_versions.py x: 2 commits (all time) y: 70 lines of code dev/update_datafusion_versions.py x: 2 commits (all time) y: 80 lines of code src/dataframe.rs x: 2 commits (all time) y: 135 lines of code src/expression.rs x: 2 commits (all time) y: 95 lines of code src/datatype.rs x: 2 commits (all time) y: 17 lines of code ballista/functions.py x: 1 commits (all time) y: 3 lines of code ci/conda_env_archery.yml x: 1 commits (all time) y: 13 lines of code ci/conda_env_cpp.yml x: 1 commits (all time) y: 25 lines of code ci/conda_env_gandiva.yml x: 1 commits (all time) y: 2 lines of code ci/conda_env_python.yml x: 1 commits (all time) y: 14 lines of code ci/conda_env_r.yml x: 1 commits (all time) y: 19 lines of code ci/conda_env_sphinx.yml x: 1 commits (all time) y: 5 lines of code ci/etc/hdfs-site.xml x: 1 commits (all time) y: 36 lines of code ci/vcpkg/x64-linux-static-debug.cmake x: 1 commits (all time) y: 21 lines of code ci/vcpkg/x64-windows-static-md-debug.cmake x: 1 commits (all time) y: 20 lines of code dev/create_license.py x: 1 commits (all time) y: 27 lines of code pyproject.toml x: 1 commits (all time) y: 51 lines of code src/errors.rs x: 1 commits (all time) y: 81 lines of code
350.0
lines of code
  min: 2.0
  average: 56.05
  25th percentile: 20.0
  median: 27.0
  75th percentile: 70.5
  max: 350.0
0 6.0
commits (all time)
min: 1.0 | average: 1.71 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 6.0

File Size vs. Contributors (all time): 41 points

Cargo.toml x: 2 contributors (all time) y: 55 lines of code src/executor.rs x: 1 contributors (all time) y: 52 lines of code src/scheduler.rs x: 1 contributors (all time) y: 46 lines of code src/udaf.rs x: 2 contributors (all time) y: 104 lines of code src/udf.rs x: 2 contributors (all time) y: 61 lines of code ballista/__init__.py x: 2 contributors (all time) y: 71 lines of code src/functions.rs x: 2 contributors (all time) y: 350 lines of code src/lib.rs x: 2 contributors (all time) y: 42 lines of code src/context.rs x: 2 contributors (all time) y: 127 lines of code ci/detect-changes.py x: 2 contributors (all time) y: 286 lines of code dev/release/check-rat-report.py x: 2 contributors (all time) y: 36 lines of code dev/release/download-python-wheels.py x: 2 contributors (all time) y: 93 lines of code dev/update_arrow_deps.py x: 2 contributors (all time) y: 65 lines of code dev/update_ballista_versions.py x: 2 contributors (all time) y: 70 lines of code dev/update_datafusion_versions.py x: 2 contributors (all time) y: 80 lines of code src/dataframe.rs x: 2 contributors (all time) y: 135 lines of code src/expression.rs x: 2 contributors (all time) y: 95 lines of code src/datatype.rs x: 1 contributors (all time) y: 17 lines of code ballista/functions.py x: 1 contributors (all time) y: 3 lines of code ci/conda_env_archery.yml x: 1 contributors (all time) y: 13 lines of code ci/conda_env_cpp.yml x: 1 contributors (all time) y: 25 lines of code ci/conda_env_gandiva.yml x: 1 contributors (all time) y: 2 lines of code ci/conda_env_python.yml x: 1 contributors (all time) y: 14 lines of code ci/conda_env_r.yml x: 1 contributors (all time) y: 19 lines of code ci/conda_env_sphinx.yml x: 1 contributors (all time) y: 5 lines of code ci/etc/hdfs-site.xml x: 1 contributors (all time) y: 36 lines of code ci/vcpkg/x64-linux-static-debug.cmake x: 1 contributors (all time) y: 21 lines of code ci/vcpkg/x64-windows-static-md-debug.cmake x: 1 contributors (all time) y: 20 lines of code dev/create_license.py x: 1 contributors (all time) y: 27 lines of code src/errors.rs x: 1 contributors (all time) y: 81 lines of code
350.0
lines of code
  min: 2.0
  average: 56.05
  25th percentile: 20.0
  median: 27.0
  75th percentile: 70.5
  max: 350.0
0 2.0
contributors (all time)
min: 1.0 | average: 1.37 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 2.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 5 points

Cargo.toml x: 1 commits (90d) y: 55 lines of code src/executor.rs x: 1 commits (90d) y: 52 lines of code src/scheduler.rs x: 1 commits (90d) y: 46 lines of code src/udaf.rs x: 1 commits (90d) y: 104 lines of code src/udf.rs x: 1 commits (90d) y: 61 lines of code
104.0
lines of code
  min: 46.0
  average: 63.6
  25th percentile: 49.0
  median: 55.0
  75th percentile: 82.5
  max: 104.0
0 1.0
commits (90d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Contributors (90 days): 5 points

Cargo.toml x: 1 contributors (90d) y: 55 lines of code src/executor.rs x: 1 contributors (90d) y: 52 lines of code src/scheduler.rs x: 1 contributors (90d) y: 46 lines of code src/udaf.rs x: 1 contributors (90d) y: 104 lines of code src/udf.rs x: 1 contributors (90d) y: 61 lines of code
104.0
lines of code
  min: 46.0
  average: 63.6
  25th percentile: 49.0
  median: 55.0
  75th percentile: 82.5
  max: 104.0
0 1.0
contributors (90d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0