apache / datafusion-python
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
7% | 28% | 20% | 15% | 28%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py24% | 14% | 45% | 2% | 12%
rs0% | 38% | 11% | 19% | 30%
toml0% | 0% | 0% | 70% | 29%
sql0% | 0% | 0% | 17% | 82%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
python31% | 18% | 36% | 0% | 13%
src0% | 38% | 11% | 19% | 30%
benchmarks0% | 0% | 47% | 8% | 44%
ROOT0% | 0% | 0% | 69% | 30%
dev0% | 0% | 0% | 63% | 36%
Longest Files (Top 50)
File# lines# units
functions.py
in python/datafusion
1099 243
929 28
expr.rs
in src
756 19
data_type.rs
in src/common
731 7
719 25
expr.py
in python/datafusion
653 143
619 52
groupby-datafusion.py
in benchmarks/db-benchmark
474 2
context.py
in python/datafusion
423 69
statement.rs
in src/expr
370 30
udf.py
in python/datafusion
294 34
dataframe.py
in python/datafusion
289 58
html_formatter.py
in python/datafusion
278 32
bool_expr.rs
in src/expr
264 20
join-datafusion.py
in benchmarks/db-benchmark
251 1
schema.rs
in src/common
246 11
window.rs
in src/expr
239 10
store.rs
in src
200 5
logical.rs
in src/sql
190 8
173 6
like.rs
in src/expr
151 24
udaf.rs
in src
148 11
join.rs
in src/expr
144 21
143 2
138 6
138 6
138 -
create_tables.sql
in benchmarks/tpch
133 -
aggregate.rs
in src/expr
115 11
112 3
table_scan.rs
in src/expr
106 11
generate-changelog.py
in dev/release
106 3
literal.rs
in src/expr
105 3
102 9
lib.rs
in src
94 2
create_index.rs
in src/expr
90 6
repartition.rs
in src/expr
89 12
projection.rs
in src/expr
81 9
80 11
io.py
in python/datafusion
80 4
udf.rs
in src
78 5
__init__.py
in python/datafusion
76 4
75 11
74 7
explain.rs
in src/expr
74 10
drop_view.rs
in src/expr
67 10
errors.rs
in src
66 5
sort.rs
in src/expr
66 9
65 6
65 6
Files With Most Units (Top 50)
File# lines# units
functions.py
in python/datafusion
1099 243
expr.py
in python/datafusion
653 143
context.py
in python/datafusion
423 69
dataframe.py
in python/datafusion
289 58
619 52
udf.py
in python/datafusion
294 34
html_formatter.py
in python/datafusion
278 32
statement.rs
in src/expr
370 30
929 28
719 25
like.rs
in src/expr
151 24
join.rs
in src/expr
144 21
bool_expr.rs
in src/expr
264 20
expr.rs
in src
756 19
plan.py
in python/datafusion
50 18
repartition.rs
in src/expr
89 12
schema.rs
in src/common
246 11
udaf.rs
in src
148 11
table_scan.rs
in src/expr
106 11
aggregate.rs
in src/expr
115 11
80 11
75 11
explain.rs
in src/expr
74 10
window.rs
in src/expr
239 10
drop_view.rs
in src/expr
67 10
create_view.rs
in src/expr
59 10
drop_function.rs
in src/expr
62 10
62 10
102 9
projection.rs
in src/expr
81 9
drop_table.rs
in src/expr
55 9
55 9
sort.rs
in src/expr
66 9
catalog.py
in python/datafusion
28 9
unparser.py
in python/datafusion
32 9
substrait.py
in python/datafusion
60 9
logical.rs
in src/sql
190 8
unnest.rs
in src/expr
52 8
51 8
union.rs
in src/expr
56 8
analyze.rs
in src/expr
51 8
filter.rs
in src/expr
53 8
between.rs
in src/expr
50 8
sort_expr.rs
in src/expr
62 8
cast.rs
in src/expr
51 8
74 7
data_type.rs
in src/common
731 7
51 7
58 7
distinct.rs
in src/expr
62 7
Files With Long Lines (Top 9)

There are 9 files with lines longer than 120 characters. In total, there are 28 long lines.

File# lines# units# long lines
719 25 13
join-datafusion.py
in benchmarks/db-benchmark
251 1 5
store.rs
in src
200 5 2
expr.rs
in src
756 19 2
lib.rs
in src
94 2 2
groupby-datafusion.py
in benchmarks/db-benchmark
474 2 1
929 28 1
143 2 1
138 6 1
Correlations

File Size vs. Commits (all time): 139 points

python/datafusion/html_formatter.py x: 2 commits (all time) y: 278 lines of code src/dataframe.rs x: 62 commits (all time) y: 619 lines of code python/datafusion/common.py x: 7 commits (all time) y: 36 lines of code python/datafusion/expr.py x: 28 commits (all time) y: 653 lines of code src/common.rs x: 8 commits (all time) y: 6 lines of code src/common/schema.rs x: 7 commits (all time) y: 246 lines of code src/expr.rs x: 64 commits (all time) y: 756 lines of code src/expr/copy_to.rs x: 1 commits (all time) y: 34 lines of code src/expr/create_catalog.rs x: 1 commits (all time) y: 65 lines of code src/expr/create_external_table.rs x: 1 commits (all time) y: 138 lines of code src/expr/create_index.rs x: 1 commits (all time) y: 90 lines of code src/expr/describe_table.rs x: 1 commits (all time) y: 58 lines of code src/expr/dml.rs x: 1 commits (all time) y: 26 lines of code src/expr/drop_catalog_schema.rs x: 1 commits (all time) y: 80 lines of code src/expr/drop_function.rs x: 1 commits (all time) y: 62 lines of code src/expr/recursive_query.rs x: 1 commits (all time) y: 75 lines of code src/expr/statement.rs x: 1 commits (all time) y: 370 lines of code src/sql/logical.rs x: 22 commits (all time) y: 190 lines of code python/datafusion/__init__.py x: 17 commits (all time) y: 76 lines of code python/datafusion/col.py x: 1 commits (all time) y: 11 lines of code python/datafusion/functions.py x: 33 commits (all time) y: 1099 lines of code src/functions.rs x: 62 commits (all time) y: 719 lines of code Cargo.toml x: 85 commits (all time) y: 57 lines of code src/dataset_exec.rs x: 20 commits (all time) y: 173 lines of code src/lib.rs x: 30 commits (all time) y: 94 lines of code src/unparser/dialect.rs x: 1 commits (all time) y: 43 lines of code src/unparser/mod.rs x: 1 commits (all time) y: 37 lines of code src/expr/aggregate.rs x: 9 commits (all time) y: 115 lines of code src/expr/aggregate_expr.rs x: 5 commits (all time) y: 51 lines of code src/expr/window.rs x: 10 commits (all time) y: 239 lines of code src/utils.rs x: 12 commits (all time) y: 63 lines of code benchmarks/db-benchmark/groupby-datafusion.py x: 7 commits (all time) y: 474 lines of code benchmarks/db-benchmark/join-datafusion.py x: 7 commits (all time) y: 251 lines of code benchmarks/tpch/tpch.py x: 6 commits (all time) y: 52 lines of code dev/release/generate-changelog.py x: 9 commits (all time) y: 106 lines of code pyproject.toml x: 28 commits (all time) y: 138 lines of code python/datafusion/catalog.py x: 5 commits (all time) y: 28 lines of code python/datafusion/context.py x: 24 commits (all time) y: 423 lines of code python/datafusion/dataframe.py x: 25 commits (all time) y: 289 lines of code python/datafusion/udf.py x: 13 commits (all time) y: 294 lines of code python/datafusion/io.py x: 4 commits (all time) y: 80 lines of code src/context.rs x: 69 commits (all time) y: 929 lines of code dev/release/check-rat-report.py x: 3 commits (all time) y: 34 lines of code python/datafusion/input/__init__.py x: 3 commits (all time) y: 4 lines of code python/datafusion/input/base.py x: 3 commits (all time) y: 8 lines of code python/datafusion/input/location.py x: 6 commits (all time) y: 44 lines of code python/datafusion/object_store.py x: 5 commits (all time) y: 7 lines of code python/datafusion/plan.py x: 4 commits (all time) y: 50 lines of code python/datafusion/substrait.py x: 8 commits (all time) y: 60 lines of code src/config.rs x: 7 commits (all time) y: 25 lines of code src/dataset.rs x: 11 commits (all time) y: 22 lines of code src/errors.rs x: 8 commits (all time) y: 66 lines of code src/expr/create_memory_table.rs x: 3 commits (all time) y: 62 lines of code src/expr/create_view.rs x: 4 commits (all time) y: 59 lines of code src/expr/distinct.rs x: 4 commits (all time) y: 62 lines of code src/expr/drop_table.rs x: 3 commits (all time) y: 55 lines of code src/expr/explain.rs x: 3 commits (all time) y: 74 lines of code src/expr/extension.rs x: 3 commits (all time) y: 25 lines of code src/expr/filter.rs x: 5 commits (all time) y: 53 lines of code src/expr/join.rs x: 5 commits (all time) y: 144 lines of code src/expr/limit.rs x: 6 commits (all time) y: 50 lines of code src/expr/logical_node.rs x: 4 commits (all time) y: 5 lines of code src/expr/projection.rs x: 9 commits (all time) y: 81 lines of code src/expr/repartition.rs x: 3 commits (all time) y: 89 lines of code src/expr/sort.rs x: 7 commits (all time) y: 66 lines of code src/expr/subquery_alias.rs x: 4 commits (all time) y: 55 lines of code src/expr/table_scan.rs x: 10 commits (all time) y: 106 lines of code src/physical_plan.rs x: 11 commits (all time) y: 46 lines of code src/pyarrow_filter_expression.rs x: 11 commits (all time) y: 143 lines of code src/pyarrow_util.rs x: 2 commits (all time) y: 33 lines of code src/substrait.rs x: 14 commits (all time) y: 112 lines of code src/udaf.rs x: 23 commits (all time) y: 148 lines of code src/udf.rs x: 18 commits (all time) y: 78 lines of code src/udwf.rs x: 6 commits (all time) y: 43 lines of code src/catalog.rs x: 10 commits (all time) y: 102 lines of code src/common/data_type.rs x: 27 commits (all time) y: 731 lines of code src/record_batch.rs x: 5 commits (all time) y: 74 lines of code src/sql/exceptions.rs x: 2 commits (all time) y: 11 lines of code src/store.rs x: 5 commits (all time) y: 200 lines of code benchmarks/tpch/create_tables.sql x: 2 commits (all time) y: 133 lines of code src/expr/alias.rs x: 3 commits (all time) y: 42 lines of code src/expr/between.rs x: 2 commits (all time) y: 50 lines of code src/expr/bool_expr.rs x: 2 commits (all time) y: 264 lines of code src/expr/grouping_set.rs x: 2 commits (all time) y: 17 lines of code src/expr/indexed_field.rs x: 3 commits (all time) y: 46 lines of code src/expr/like.rs x: 2 commits (all time) y: 151 lines of code src/expr/scalar_subquery.rs x: 2 commits (all time) y: 24 lines of code src/expr/signature.rs x: 5 commits (all time) y: 19 lines of code src/expr/unnest_expr.rs x: 2 commits (all time) y: 41 lines of code benchmarks/tpch/queries/q1.sql x: 1 commits (all time) y: 21 lines of code benchmarks/tpch/queries/q14.sql x: 1 commits (all time) y: 13 lines of code benchmarks/tpch/queries/q3.sql x: 1 commits (all time) y: 22 lines of code build.rs x: 1 commits (all time) y: 3 lines of code
1099.0
lines of code
  min: 2.0
  average: 109.7
  25th percentile: 29.0
  median: 50.0
  75th percentile: 94.0
  max: 1099.0
0 85.0
commits (all time)
min: 1.0 | average: 7.77 | 25th percentile: 1.0 | median: 3.0 | 75th percentile: 7.0 | max: 85.0

File Size vs. Contributors (all time): 139 points

python/datafusion/html_formatter.py x: 1 contributors (all time) y: 278 lines of code src/dataframe.rs x: 18 contributors (all time) y: 619 lines of code python/datafusion/common.py x: 3 contributors (all time) y: 36 lines of code python/datafusion/expr.py x: 9 contributors (all time) y: 653 lines of code src/common.rs x: 4 contributors (all time) y: 6 lines of code src/common/schema.rs x: 4 contributors (all time) y: 246 lines of code src/expr.rs x: 10 contributors (all time) y: 756 lines of code src/expr/copy_to.rs x: 1 contributors (all time) y: 34 lines of code src/expr/create_catalog.rs x: 1 contributors (all time) y: 65 lines of code src/expr/create_external_table.rs x: 1 contributors (all time) y: 138 lines of code src/expr/create_index.rs x: 1 contributors (all time) y: 90 lines of code src/expr/describe_table.rs x: 1 contributors (all time) y: 58 lines of code src/expr/dml.rs x: 1 contributors (all time) y: 26 lines of code src/expr/drop_catalog_schema.rs x: 1 contributors (all time) y: 80 lines of code src/expr/drop_function.rs x: 1 contributors (all time) y: 62 lines of code src/expr/recursive_query.rs x: 1 contributors (all time) y: 75 lines of code src/expr/statement.rs x: 1 contributors (all time) y: 370 lines of code src/sql/logical.rs x: 7 contributors (all time) y: 190 lines of code python/datafusion/__init__.py x: 7 contributors (all time) y: 76 lines of code python/datafusion/col.py x: 1 contributors (all time) y: 11 lines of code python/datafusion/functions.py x: 9 contributors (all time) y: 1099 lines of code src/functions.rs x: 20 contributors (all time) y: 719 lines of code Cargo.toml x: 15 contributors (all time) y: 57 lines of code src/dataset_exec.rs x: 7 contributors (all time) y: 173 lines of code src/lib.rs x: 13 contributors (all time) y: 94 lines of code src/unparser/dialect.rs x: 1 contributors (all time) y: 43 lines of code src/unparser/mod.rs x: 1 contributors (all time) y: 37 lines of code src/expr/aggregate.rs x: 6 contributors (all time) y: 115 lines of code src/expr/aggregate_expr.rs x: 4 contributors (all time) y: 51 lines of code src/expr/window.rs x: 5 contributors (all time) y: 239 lines of code src/utils.rs x: 8 contributors (all time) y: 63 lines of code benchmarks/db-benchmark/groupby-datafusion.py x: 3 contributors (all time) y: 474 lines of code benchmarks/db-benchmark/join-datafusion.py x: 3 contributors (all time) y: 251 lines of code benchmarks/tpch/tpch.py x: 3 contributors (all time) y: 52 lines of code dev/release/generate-changelog.py x: 4 contributors (all time) y: 106 lines of code pyproject.toml x: 14 contributors (all time) y: 138 lines of code python/datafusion/catalog.py x: 3 contributors (all time) y: 28 lines of code python/datafusion/context.py x: 9 contributors (all time) y: 423 lines of code python/datafusion/dataframe.py x: 6 contributors (all time) y: 289 lines of code python/datafusion/udf.py x: 6 contributors (all time) y: 294 lines of code python/datafusion/io.py x: 3 contributors (all time) y: 80 lines of code src/context.rs x: 23 contributors (all time) y: 929 lines of code dev/release/check-rat-report.py x: 3 contributors (all time) y: 34 lines of code python/datafusion/input/__init__.py x: 2 contributors (all time) y: 4 lines of code python/datafusion/input/base.py x: 2 contributors (all time) y: 8 lines of code python/datafusion/input/location.py x: 4 contributors (all time) y: 44 lines of code python/datafusion/object_store.py x: 3 contributors (all time) y: 7 lines of code python/datafusion/plan.py x: 1 contributors (all time) y: 50 lines of code python/datafusion/record_batch.py x: 2 contributors (all time) y: 26 lines of code python/datafusion/substrait.py x: 2 contributors (all time) y: 60 lines of code src/config.rs x: 7 contributors (all time) y: 25 lines of code src/dataset.rs x: 5 contributors (all time) y: 22 lines of code src/errors.rs x: 5 contributors (all time) y: 66 lines of code src/expr/analyze.rs x: 3 contributors (all time) y: 51 lines of code src/expr/create_memory_table.rs x: 3 contributors (all time) y: 62 lines of code src/expr/create_view.rs x: 3 contributors (all time) y: 59 lines of code src/expr/distinct.rs x: 4 contributors (all time) y: 62 lines of code src/expr/explain.rs x: 3 contributors (all time) y: 74 lines of code src/expr/extension.rs x: 3 contributors (all time) y: 25 lines of code src/expr/filter.rs x: 4 contributors (all time) y: 53 lines of code src/expr/join.rs x: 4 contributors (all time) y: 144 lines of code src/expr/limit.rs x: 5 contributors (all time) y: 50 lines of code src/expr/literal.rs x: 7 contributors (all time) y: 105 lines of code src/expr/projection.rs x: 4 contributors (all time) y: 81 lines of code src/expr/repartition.rs x: 3 contributors (all time) y: 89 lines of code src/expr/sort.rs x: 4 contributors (all time) y: 66 lines of code src/expr/table_scan.rs x: 5 contributors (all time) y: 106 lines of code src/physical_plan.rs x: 5 contributors (all time) y: 46 lines of code src/pyarrow_filter_expression.rs x: 6 contributors (all time) y: 143 lines of code src/pyarrow_util.rs x: 2 contributors (all time) y: 33 lines of code src/substrait.rs x: 6 contributors (all time) y: 112 lines of code src/udaf.rs x: 9 contributors (all time) y: 148 lines of code src/udf.rs x: 7 contributors (all time) y: 78 lines of code src/udwf.rs x: 3 contributors (all time) y: 43 lines of code src/catalog.rs x: 6 contributors (all time) y: 102 lines of code src/common/data_type.rs x: 6 contributors (all time) y: 731 lines of code src/record_batch.rs x: 4 contributors (all time) y: 74 lines of code src/sql/exceptions.rs x: 2 contributors (all time) y: 11 lines of code src/store.rs x: 5 contributors (all time) y: 200 lines of code benchmarks/tpch/create_tables.sql x: 2 contributors (all time) y: 133 lines of code src/expr/sort_expr.rs x: 2 contributors (all time) y: 62 lines of code src/expr/between.rs x: 2 contributors (all time) y: 50 lines of code src/expr/bool_expr.rs x: 2 contributors (all time) y: 264 lines of code src/expr/grouping_set.rs x: 2 contributors (all time) y: 17 lines of code src/expr/indexed_field.rs x: 3 contributors (all time) y: 46 lines of code src/expr/like.rs x: 2 contributors (all time) y: 151 lines of code src/expr/scalar_subquery.rs x: 2 contributors (all time) y: 24 lines of code src/expr/signature.rs x: 4 contributors (all time) y: 19 lines of code src/expr/unnest_expr.rs x: 2 contributors (all time) y: 41 lines of code benchmarks/tpch/queries/q1.sql x: 1 contributors (all time) y: 21 lines of code benchmarks/tpch/queries/q14.sql x: 1 contributors (all time) y: 13 lines of code benchmarks/tpch/queries/q3.sql x: 1 contributors (all time) y: 22 lines of code build.rs x: 1 contributors (all time) y: 3 lines of code
1099.0
lines of code
  min: 2.0
  average: 109.7
  25th percentile: 29.0
  median: 50.0
  75th percentile: 94.0
  max: 1099.0
0 23.0
contributors (all time)
min: 1.0 | average: 3.58 | 25th percentile: 1.0 | median: 3.0 | 75th percentile: 4.0 | max: 23.0

File Size vs. Commits (30 days): 28 points

python/datafusion/html_formatter.py x: 2 commits (30d) y: 278 lines of code src/dataframe.rs x: 3 commits (30d) y: 619 lines of code python/datafusion/common.py x: 1 commits (30d) y: 36 lines of code python/datafusion/expr.py x: 4 commits (30d) y: 653 lines of code src/common.rs x: 1 commits (30d) y: 6 lines of code src/common/schema.rs x: 1 commits (30d) y: 246 lines of code src/expr.rs x: 3 commits (30d) y: 756 lines of code src/expr/copy_to.rs x: 1 commits (30d) y: 34 lines of code src/expr/create_catalog.rs x: 1 commits (30d) y: 65 lines of code src/expr/create_external_table.rs x: 1 commits (30d) y: 138 lines of code src/expr/create_index.rs x: 1 commits (30d) y: 90 lines of code src/expr/describe_table.rs x: 1 commits (30d) y: 58 lines of code src/expr/dml.rs x: 1 commits (30d) y: 26 lines of code src/expr/drop_catalog_schema.rs x: 1 commits (30d) y: 80 lines of code src/expr/drop_function.rs x: 1 commits (30d) y: 62 lines of code src/expr/recursive_query.rs x: 1 commits (30d) y: 75 lines of code src/expr/statement.rs x: 1 commits (30d) y: 370 lines of code src/sql/logical.rs x: 1 commits (30d) y: 190 lines of code python/datafusion/__init__.py x: 2 commits (30d) y: 76 lines of code python/datafusion/col.py x: 1 commits (30d) y: 11 lines of code python/datafusion/functions.py x: 1 commits (30d) y: 1099 lines of code src/functions.rs x: 2 commits (30d) y: 719 lines of code src/dataset_exec.rs x: 1 commits (30d) y: 173 lines of code
1099.0
lines of code
  min: 6.0
  average: 221.93
  25th percentile: 57.25
  median: 78.0
  75th percentile: 270.0
  max: 1099.0
0 4.0
commits (30d)
min: 1.0 | average: 1.36 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 4.0

File Size vs. Contributors (30 days): 28 points

python/datafusion/html_formatter.py x: 1 contributors (30d) y: 278 lines of code src/dataframe.rs x: 2 contributors (30d) y: 619 lines of code python/datafusion/common.py x: 1 contributors (30d) y: 36 lines of code python/datafusion/expr.py x: 2 contributors (30d) y: 653 lines of code src/common.rs x: 1 contributors (30d) y: 6 lines of code src/common/schema.rs x: 1 contributors (30d) y: 246 lines of code src/expr.rs x: 2 contributors (30d) y: 756 lines of code src/expr/copy_to.rs x: 1 contributors (30d) y: 34 lines of code src/expr/create_catalog.rs x: 1 contributors (30d) y: 65 lines of code src/expr/create_external_table.rs x: 1 contributors (30d) y: 138 lines of code src/expr/create_index.rs x: 1 contributors (30d) y: 90 lines of code src/expr/describe_table.rs x: 1 contributors (30d) y: 58 lines of code src/expr/dml.rs x: 1 contributors (30d) y: 26 lines of code src/expr/drop_catalog_schema.rs x: 1 contributors (30d) y: 80 lines of code src/expr/drop_function.rs x: 1 contributors (30d) y: 62 lines of code src/expr/recursive_query.rs x: 1 contributors (30d) y: 75 lines of code src/expr/statement.rs x: 1 contributors (30d) y: 370 lines of code src/sql/logical.rs x: 1 contributors (30d) y: 190 lines of code python/datafusion/__init__.py x: 2 contributors (30d) y: 76 lines of code python/datafusion/col.py x: 1 contributors (30d) y: 11 lines of code python/datafusion/functions.py x: 1 contributors (30d) y: 1099 lines of code src/functions.rs x: 2 contributors (30d) y: 719 lines of code src/dataset_exec.rs x: 1 contributors (30d) y: 173 lines of code
1099.0
lines of code
  min: 6.0
  average: 221.93
  25th percentile: 57.25
  median: 78.0
  75th percentile: 270.0
  max: 1099.0
0 2.0
contributors (30d)
min: 1.0 | average: 1.18 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 2.0

File Size vs. Commits (90 days): 86 points

python/datafusion/html_formatter.py x: 2 commits (90d) y: 278 lines of code src/dataframe.rs x: 7 commits (90d) y: 619 lines of code python/datafusion/common.py x: 2 commits (90d) y: 36 lines of code python/datafusion/expr.py x: 9 commits (90d) y: 653 lines of code src/common.rs x: 1 commits (90d) y: 6 lines of code src/common/schema.rs x: 1 commits (90d) y: 246 lines of code src/expr.rs x: 6 commits (90d) y: 756 lines of code src/expr/copy_to.rs x: 1 commits (90d) y: 34 lines of code src/expr/create_catalog.rs x: 1 commits (90d) y: 65 lines of code src/expr/create_external_table.rs x: 1 commits (90d) y: 138 lines of code src/expr/create_index.rs x: 1 commits (90d) y: 90 lines of code src/expr/describe_table.rs x: 1 commits (90d) y: 58 lines of code src/expr/dml.rs x: 1 commits (90d) y: 26 lines of code src/expr/drop_catalog_schema.rs x: 1 commits (90d) y: 80 lines of code src/expr/drop_function.rs x: 1 commits (90d) y: 62 lines of code src/expr/recursive_query.rs x: 1 commits (90d) y: 75 lines of code src/expr/statement.rs x: 1 commits (90d) y: 370 lines of code src/sql/logical.rs x: 2 commits (90d) y: 190 lines of code python/datafusion/__init__.py x: 6 commits (90d) y: 76 lines of code python/datafusion/col.py x: 1 commits (90d) y: 11 lines of code python/datafusion/functions.py x: 6 commits (90d) y: 1099 lines of code src/functions.rs x: 5 commits (90d) y: 719 lines of code Cargo.toml x: 7 commits (90d) y: 57 lines of code src/dataset_exec.rs x: 2 commits (90d) y: 173 lines of code src/lib.rs x: 2 commits (90d) y: 94 lines of code src/unparser/dialect.rs x: 1 commits (90d) y: 43 lines of code src/unparser/mod.rs x: 1 commits (90d) y: 37 lines of code src/expr/aggregate.rs x: 2 commits (90d) y: 115 lines of code src/expr/aggregate_expr.rs x: 1 commits (90d) y: 51 lines of code src/expr/window.rs x: 2 commits (90d) y: 239 lines of code src/utils.rs x: 2 commits (90d) y: 63 lines of code benchmarks/db-benchmark/groupby-datafusion.py x: 1 commits (90d) y: 474 lines of code benchmarks/db-benchmark/join-datafusion.py x: 1 commits (90d) y: 251 lines of code benchmarks/tpch/tpch.py x: 2 commits (90d) y: 52 lines of code dev/release/generate-changelog.py x: 2 commits (90d) y: 106 lines of code pyproject.toml x: 6 commits (90d) y: 138 lines of code python/datafusion/catalog.py x: 2 commits (90d) y: 28 lines of code python/datafusion/context.py x: 6 commits (90d) y: 423 lines of code python/datafusion/dataframe.py x: 5 commits (90d) y: 289 lines of code python/datafusion/udf.py x: 4 commits (90d) y: 294 lines of code python/datafusion/io.py x: 4 commits (90d) y: 80 lines of code src/context.rs x: 4 commits (90d) y: 929 lines of code python/datafusion/input/__init__.py x: 1 commits (90d) y: 4 lines of code python/datafusion/input/location.py x: 2 commits (90d) y: 44 lines of code python/datafusion/substrait.py x: 2 commits (90d) y: 60 lines of code src/config.rs x: 1 commits (90d) y: 25 lines of code src/expr/drop_table.rs x: 1 commits (90d) y: 55 lines of code src/expr/join.rs x: 1 commits (90d) y: 144 lines of code src/expr/literal.rs x: 1 commits (90d) y: 105 lines of code src/physical_plan.rs x: 1 commits (90d) y: 46 lines of code src/substrait.rs x: 1 commits (90d) y: 112 lines of code src/udaf.rs x: 1 commits (90d) y: 148 lines of code
1099.0
lines of code
  min: 4.0
  average: 142.44
  25th percentile: 43.75
  median: 64.0
  75th percentile: 138.0
  max: 1099.0
0 9.0
commits (90d)
min: 1.0 | average: 1.87 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 9.0

File Size vs. Contributors (90 days): 86 points

python/datafusion/html_formatter.py x: 1 contributors (90d) y: 278 lines of code src/dataframe.rs x: 3 contributors (90d) y: 619 lines of code python/datafusion/common.py x: 2 contributors (90d) y: 36 lines of code python/datafusion/expr.py x: 4 contributors (90d) y: 653 lines of code src/common.rs x: 1 contributors (90d) y: 6 lines of code src/common/schema.rs x: 1 contributors (90d) y: 246 lines of code src/expr.rs x: 4 contributors (90d) y: 756 lines of code src/expr/copy_to.rs x: 1 contributors (90d) y: 34 lines of code src/expr/create_catalog.rs x: 1 contributors (90d) y: 65 lines of code src/expr/create_external_table.rs x: 1 contributors (90d) y: 138 lines of code src/expr/create_index.rs x: 1 contributors (90d) y: 90 lines of code src/expr/describe_table.rs x: 1 contributors (90d) y: 58 lines of code src/expr/dml.rs x: 1 contributors (90d) y: 26 lines of code src/expr/drop_catalog_schema.rs x: 1 contributors (90d) y: 80 lines of code src/expr/drop_function.rs x: 1 contributors (90d) y: 62 lines of code src/expr/recursive_query.rs x: 1 contributors (90d) y: 75 lines of code src/expr/statement.rs x: 1 contributors (90d) y: 370 lines of code src/sql/logical.rs x: 2 contributors (90d) y: 190 lines of code python/datafusion/__init__.py x: 6 contributors (90d) y: 76 lines of code python/datafusion/col.py x: 1 contributors (90d) y: 11 lines of code python/datafusion/functions.py x: 4 contributors (90d) y: 1099 lines of code src/functions.rs x: 3 contributors (90d) y: 719 lines of code Cargo.toml x: 2 contributors (90d) y: 57 lines of code src/dataset_exec.rs x: 2 contributors (90d) y: 173 lines of code src/lib.rs x: 2 contributors (90d) y: 94 lines of code src/unparser/dialect.rs x: 1 contributors (90d) y: 43 lines of code src/unparser/mod.rs x: 1 contributors (90d) y: 37 lines of code src/expr/aggregate.rs x: 2 contributors (90d) y: 115 lines of code src/expr/aggregate_expr.rs x: 1 contributors (90d) y: 51 lines of code src/expr/window.rs x: 2 contributors (90d) y: 239 lines of code src/utils.rs x: 2 contributors (90d) y: 63 lines of code benchmarks/db-benchmark/groupby-datafusion.py x: 1 contributors (90d) y: 474 lines of code benchmarks/db-benchmark/join-datafusion.py x: 1 contributors (90d) y: 251 lines of code benchmarks/tpch/tpch.py x: 2 contributors (90d) y: 52 lines of code dev/release/generate-changelog.py x: 2 contributors (90d) y: 106 lines of code pyproject.toml x: 4 contributors (90d) y: 138 lines of code python/datafusion/catalog.py x: 2 contributors (90d) y: 28 lines of code python/datafusion/context.py x: 5 contributors (90d) y: 423 lines of code python/datafusion/dataframe.py x: 4 contributors (90d) y: 289 lines of code python/datafusion/udf.py x: 4 contributors (90d) y: 294 lines of code python/datafusion/io.py x: 3 contributors (90d) y: 80 lines of code src/context.rs x: 3 contributors (90d) y: 929 lines of code python/datafusion/input/__init__.py x: 1 contributors (90d) y: 4 lines of code python/datafusion/input/location.py x: 2 contributors (90d) y: 44 lines of code src/config.rs x: 1 contributors (90d) y: 25 lines of code src/expr/drop_table.rs x: 1 contributors (90d) y: 55 lines of code src/expr/join.rs x: 1 contributors (90d) y: 144 lines of code src/expr/literal.rs x: 1 contributors (90d) y: 105 lines of code src/physical_plan.rs x: 1 contributors (90d) y: 46 lines of code src/substrait.rs x: 1 contributors (90d) y: 112 lines of code src/udaf.rs x: 1 contributors (90d) y: 148 lines of code
1099.0
lines of code
  min: 4.0
  average: 142.44
  25th percentile: 43.75
  median: 64.0
  75th percentile: 138.0
  max: 1099.0
0 6.0
contributors (90d)
min: 1.0 | average: 1.55 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 6.0