apache / arrow-datafusion-python
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 116 files with 9,107 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 4 files changed 21-50 times (1,530 lines of code)
    • 14 files changed 6-20 times (2,302 lines of code)
    • 98 files changed 1-5 times (5,275 lines of code)
0% | 0% | 16% | 25% | 57%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 116 files with 9,107 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 1 file changed by 11-25 contributors (624 lines of code)
    • 5 files changed by 6-10 contributors (890 lines of code)
    • 35 files changed by 2-5 contributors (3,832 lines of code)
    • 75 files changed by 1 contributor (3,761 lines of code)
0% | 6% | 9% | 42% | 41%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
rs, py, sql, md, sh, txt, rst, yaml, svg, toml, yml, gitignore, json, html, css, in, dockerfile, dockerignore, bat, gitmodules
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
rs0% | 0% | 21% | 30% | 47%
toml0% | 0% | 50% | 49% | 0%
py0% | 0% | 0% | 11% | 88%
sql0% | 0% | 0% | 0% | 100%
yaml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
src0% | 0% | 21% | 30% | 47%
ROOT0% | 0% | 40% | 39% | 19%
datafusion0% | 0% | 0% | 34% | 65%
benchmarks0% | 0% | 0% | 0% | 100%
dev0% | 0% | 0% | 0% | 100%
conda0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 50)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
Cargo.toml
in root
60 - 2022-07-22 2023-07-25 38 8 agrove@apache.org andygrove73@gmail.com
624 55 2022-07-22 2023-07-03 30 12 agrove@apache.org andygrove73@gmail.com
289 38 2022-07-22 2023-07-24 22 9 agrove@apache.org 417981+cpcloud@users.norepl...
expr.rs
in src
557 18 2023-02-15 2023-08-05 21 4 jdye64@gmail.com mesejoleon@gmail.com
lib.rs
in src
76 1 2022-07-22 2023-04-29 14 6 agrove@apache.org jdye64@gmail.com
__init__.py
in datafusion
172 8 2022-07-22 2023-03-15 13 5 agrove@apache.org jdye64@gmail.com
udaf.rs
in src
105 9 2022-07-22 2023-07-03 12 4 agrove@apache.org andygrove73@gmail.com
59 - 2022-07-22 2023-05-10 11 7 agrove@apache.org jimexist@users.noreply.gith...
406 12 2022-07-22 2023-08-05 11 6 agrove@apache.org mesejoleon@gmail.com
udf.rs
in src
64 4 2022-07-22 2023-07-03 9 4 agrove@apache.org andygrove73@gmail.com
logical.rs
in src/sql
91 8 2023-02-13 2023-07-03 9 2 jdye64@gmail.com jdye64@gmail.com
data_type.rs
in src/common
641 3 2023-02-13 2023-07-24 9 2 jdye64@gmail.com jdye64@gmail.com
222 10 2022-07-26 2023-07-22 8 3 84413234+kylebrooks-8451@us... andygrove73@gmail.com
projection.rs
in src/expr
84 10 2023-02-16 2023-07-22 7 2 jdye64@gmail.com andygrove73@gmail.com
table_scan.rs
in src/expr
109 12 2023-02-15 2023-07-22 7 2 jdye64@gmail.com jdye64@gmail.com
errors.rs
in src
63 5 2022-07-26 2023-04-26 6 3 84413234+kylebrooks-8451@us... 84413234+kylebrooks-8451@us...
100 9 2022-07-22 2023-07-03 6 4 agrove@apache.org andygrove73@gmail.com
110 3 2023-01-20 2023-06-11 6 3 jdye64@gmail.com andygrove73@gmail.com
48 3 2023-01-27 2023-07-03 5 2 andygrove73@gmail.com andygrove73@gmail.com
78 4 2022-07-26 2023-07-03 5 2 84413234+kylebrooks-8451@us... andygrove73@gmail.com
90 2 2022-07-26 2023-05-23 5 2 84413234+kylebrooks-8451@us... andygrove73@gmail.com
cudf.py
in datafusion
53 10 2023-02-22 2023-05-30 4 1 jdye64@gmail.com jdye64@gmail.com
pandas.py
in datafusion
54 10 2023-02-20 2023-05-30 4 2 andygrove73@gmail.com jdye64@gmail.com
df_field.rs
in src/common
58 9 2023-02-13 2023-06-11 4 2 jdye64@gmail.com andygrove73@gmail.com
polars.py
in datafusion
71 10 2023-02-20 2023-05-30 4 2 andygrove73@gmail.com jdye64@gmail.com
context.py
in datafusion
74 11 2023-05-27 2023-07-22 4 1 jdye64@gmail.com jdye64@gmail.com
literal.rs
in src/expr
119 6 2023-02-20 2023-04-25 4 3 andygrove73@gmail.com andygrove73@gmail.com
logical_node.rs
in src/expr
6 - 2023-02-16 2023-03-10 3 1 jdye64@gmail.com jdye64@gmail.com
common.rs
in src
20 - 2023-02-15 2023-05-30 3 1 jdye64@gmail.com jdye64@gmail.com
signature.rs
in src/expr
20 - 2023-02-24 2023-03-01 3 2 jdye64@gmail.com 14581281+iajoiner@users.nor...
datafusion-dev.yaml
in conda/environments
24 - 2023-01-18 2023-05-10 3 2 jdye64@gmail.com jimexist@users.noreply.gith...
26 - 2022-07-22 2023-05-10 3 3 agrove@apache.org jimexist@users.noreply.gith...
utils.rs
in src
31 - 2022-07-22 2023-04-25 3 3 agrove@apache.org 84413234+kylebrooks-8451@us...
meta.yaml
in conda/recipes
35 - 2023-01-18 2023-05-10 3 2 jdye64@gmail.com jimexist@users.noreply.gith...
analyze.rs
in src/expr
54 9 2023-02-22 2023-03-10 3 1 jdye64@gmail.com jdye64@gmail.com
filter.rs
in src/expr
56 9 2023-02-20 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
limit.rs
in src/expr
59 10 2023-02-19 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
sort.rs
in src/expr
66 9 2023-02-19 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
config.rs
in src
69 3 2022-09-28 2023-02-04 3 3 me@francis.run 10134699+simicd@users.norep...
aggregate.rs
in src/expr
76 10 2023-02-19 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
store.rs
in src
179 4 2022-11-08 2023-04-27 3 3 wseaton@users.noreply.githu... 84413234+kylebrooks-8451@us...
join-datafusion.py
in benchmarks/db-benchmark
266 1 2023-05-04 2023-05-08 3 1 andygrove73@gmail.com andygrove73@gmail.com
groupby-datafusion.py
in benchmarks/db-benchmark
478 2 2023-05-04 2023-05-08 3 1 andygrove73@gmail.com andygrove73@gmail.com
column.rs
in src/expr
32 5 2023-02-20 2023-03-30 2 1 andygrove73@gmail.com andygrove73@gmail.com
df_schema.rs
in src/common
34 4 2023-02-15 2023-02-23 2 1 jdye64@gmail.com jdye64@gmail.com
41 4 2023-02-24 2023-06-11 2 2 jdye64@gmail.com andygrove73@gmail.com
41 3 2023-02-16 2023-07-03 2 1 andygrove73@gmail.com andygrove73@gmail.com
location.py
in datafusion/input
45 2 2023-07-22 2023-07-24 2 1 jdye64@gmail.com jdye64@gmail.com
subquery.rs
in src/expr
51 8 2023-02-24 2023-07-03 2 1 jdye64@gmail.com jdye64@gmail.com
54 9 2023-02-23 2023-03-10 2 1 jdye64@gmail.com jdye64@gmail.com
Files With Most Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
624 55 2022-07-22 2023-07-03 30 12 agrove@apache.org andygrove73@gmail.com
289 38 2022-07-22 2023-07-24 22 9 agrove@apache.org 417981+cpcloud@users.norepl...
Cargo.toml
in root
60 - 2022-07-22 2023-07-25 38 8 agrove@apache.org andygrove73@gmail.com
59 - 2022-07-22 2023-05-10 11 7 agrove@apache.org jimexist@users.noreply.gith...
lib.rs
in src
76 1 2022-07-22 2023-04-29 14 6 agrove@apache.org jdye64@gmail.com
406 12 2022-07-22 2023-08-05 11 6 agrove@apache.org mesejoleon@gmail.com
__init__.py
in datafusion
172 8 2022-07-22 2023-03-15 13 5 agrove@apache.org jdye64@gmail.com
expr.rs
in src
557 18 2023-02-15 2023-08-05 21 4 jdye64@gmail.com mesejoleon@gmail.com
udaf.rs
in src
105 9 2022-07-22 2023-07-03 12 4 agrove@apache.org andygrove73@gmail.com
udf.rs
in src
64 4 2022-07-22 2023-07-03 9 4 agrove@apache.org andygrove73@gmail.com
100 9 2022-07-22 2023-07-03 6 4 agrove@apache.org andygrove73@gmail.com
222 10 2022-07-26 2023-07-22 8 3 84413234+kylebrooks-8451@us... andygrove73@gmail.com
110 3 2023-01-20 2023-06-11 6 3 jdye64@gmail.com andygrove73@gmail.com
errors.rs
in src
63 5 2022-07-26 2023-04-26 6 3 84413234+kylebrooks-8451@us... 84413234+kylebrooks-8451@us...
literal.rs
in src/expr
119 6 2023-02-20 2023-04-25 4 3 andygrove73@gmail.com andygrove73@gmail.com
26 - 2022-07-22 2023-05-10 3 3 agrove@apache.org jimexist@users.noreply.gith...
config.rs
in src
69 3 2022-09-28 2023-02-04 3 3 me@francis.run 10134699+simicd@users.norep...
store.rs
in src
179 4 2022-11-08 2023-04-27 3 3 wseaton@users.noreply.githu... 84413234+kylebrooks-8451@us...
utils.rs
in src
31 - 2022-07-22 2023-04-25 3 3 agrove@apache.org 84413234+kylebrooks-8451@us...
data_type.rs
in src/common
641 3 2023-02-13 2023-07-24 9 2 jdye64@gmail.com jdye64@gmail.com
logical.rs
in src/sql
91 8 2023-02-13 2023-07-03 9 2 jdye64@gmail.com jdye64@gmail.com
table_scan.rs
in src/expr
109 12 2023-02-15 2023-07-22 7 2 jdye64@gmail.com jdye64@gmail.com
projection.rs
in src/expr
84 10 2023-02-16 2023-07-22 7 2 jdye64@gmail.com andygrove73@gmail.com
90 2 2022-07-26 2023-05-23 5 2 84413234+kylebrooks-8451@us... andygrove73@gmail.com
78 4 2022-07-26 2023-07-03 5 2 84413234+kylebrooks-8451@us... andygrove73@gmail.com
48 3 2023-01-27 2023-07-03 5 2 andygrove73@gmail.com andygrove73@gmail.com
pandas.py
in datafusion
54 10 2023-02-20 2023-05-30 4 2 andygrove73@gmail.com jdye64@gmail.com
polars.py
in datafusion
71 10 2023-02-20 2023-05-30 4 2 andygrove73@gmail.com jdye64@gmail.com
df_field.rs
in src/common
58 9 2023-02-13 2023-06-11 4 2 jdye64@gmail.com andygrove73@gmail.com
datafusion-dev.yaml
in conda/environments
24 - 2023-01-18 2023-05-10 3 2 jdye64@gmail.com jimexist@users.noreply.gith...
meta.yaml
in conda/recipes
35 - 2023-01-18 2023-05-10 3 2 jdye64@gmail.com jimexist@users.noreply.gith...
limit.rs
in src/expr
59 10 2023-02-19 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
sort.rs
in src/expr
66 9 2023-02-19 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
aggregate.rs
in src/expr
76 10 2023-02-19 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
signature.rs
in src/expr
20 - 2023-02-24 2023-03-01 3 2 jdye64@gmail.com 14581281+iajoiner@users.nor...
filter.rs
in src/expr
56 9 2023-02-20 2023-03-10 3 2 andygrove73@gmail.com jdye64@gmail.com
join.rs
in src/expr
135 20 2023-03-01 2023-03-10 2 2 14581281+iajoiner@users.nor... jdye64@gmail.com
union.rs
in src/expr
59 9 2023-02-28 2023-03-10 2 2 14581281+iajoiner@users.nor... jdye64@gmail.com
cross_join.rs
in src/expr
62 10 2023-03-01 2023-03-10 2 2 14581281+iajoiner@users.nor... jdye64@gmail.com
41 4 2023-02-24 2023-06-11 2 2 jdye64@gmail.com andygrove73@gmail.com
58 10 2023-03-13 2023-04-25 2 2 jdye64@gmail.com andygrove73@gmail.com
cudf.py
in datafusion
53 10 2023-02-22 2023-05-30 4 1 jdye64@gmail.com jdye64@gmail.com
context.py
in datafusion
74 11 2023-05-27 2023-07-22 4 1 jdye64@gmail.com jdye64@gmail.com
join-datafusion.py
in benchmarks/db-benchmark
266 1 2023-05-04 2023-05-08 3 1 andygrove73@gmail.com andygrove73@gmail.com
groupby-datafusion.py
in benchmarks/db-benchmark
478 2 2023-05-04 2023-05-08 3 1 andygrove73@gmail.com andygrove73@gmail.com
analyze.rs
in src/expr
54 9 2023-02-22 2023-03-10 3 1 jdye64@gmail.com jdye64@gmail.com
logical_node.rs
in src/expr
6 - 2023-02-16 2023-03-10 3 1 jdye64@gmail.com jdye64@gmail.com
common.rs
in src
20 - 2023-02-15 2023-05-30 3 1 jdye64@gmail.com jdye64@gmail.com
location.py
in datafusion/input
45 2 2023-07-22 2023-07-24 2 1 jdye64@gmail.com jdye64@gmail.com
generate-changelog.py
in dev/release
77 3 2023-05-23 2023-06-11 2 1 andygrove73@gmail.com andygrove73@gmail.com
Files With Least Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
functions.py
in datafusion
3 1
groupby-datafusion.py
in benchmarks/db-benchmark
478 2 2023-05-04 2023-05-08 3 1 andygrove73@gmail.com andygrove73@gmail.com
join-datafusion.py
in benchmarks/db-benchmark
266 1 2023-05-04 2023-05-08 3 1 andygrove73@gmail.com andygrove73@gmail.com
bool_expr.rs
in src/expr
264 20 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
schema.rs
in src/common
174 7 2023-05-30 2023-05-31 2 1 jdye64@gmail.com jdye64@gmail.com
like.rs
in src/expr
151 24 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
create_tables.sql
in benchmarks/tpch
109 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
repartition.rs
in src/expr
92 13 2023-03-15 2023-03-15 1 1 jdye64@gmail.com jdye64@gmail.com
generate-changelog.py
in dev/release
77 3 2023-05-23 2023-06-11 2 1 andygrove73@gmail.com andygrove73@gmail.com
explain.rs
in src/expr
77 11 2023-03-10 2023-03-10 1 1 jdye64@gmail.com jdye64@gmail.com
context.py
in datafusion
74 11 2023-05-27 2023-07-22 4 1 jdye64@gmail.com jdye64@gmail.com
create_view.rs
in src/expr
71 12 2023-03-13 2023-07-03 2 1 jdye64@gmail.com jdye64@gmail.com
65 11 2023-03-13 2023-03-13 1 1 jdye64@gmail.com jdye64@gmail.com
drop_table.rs
in src/expr
58 10 2023-03-15 2023-03-15 1 1 jdye64@gmail.com jdye64@gmail.com
tpch.py
in benchmarks/tpch
57 1 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
analyze.rs
in src/expr
54 9 2023-02-22 2023-03-10 3 1 jdye64@gmail.com jdye64@gmail.com
54 9 2023-02-23 2023-03-10 2 1 jdye64@gmail.com jdye64@gmail.com
cudf.py
in datafusion
53 10 2023-02-22 2023-05-30 4 1 jdye64@gmail.com jdye64@gmail.com
cast.rs
in src/expr
51 8 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
subquery.rs
in src/expr
51 8 2023-02-24 2023-07-03 2 1 jdye64@gmail.com jdye64@gmail.com
distinct.rs
in src/expr
50 8 2023-03-14 2023-03-14 1 1 jdye64@gmail.com jdye64@gmail.com
between.rs
in src/expr
50 8 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
location.py
in datafusion/input
45 2 2023-07-22 2023-07-24 2 1 jdye64@gmail.com jdye64@gmail.com
44 7 2023-02-20 2023-02-20 1 1 andygrove73@gmail.com andygrove73@gmail.com
q2.sql
in benchmarks/tpch/queries
43 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
indexed_field.rs
in src/expr
43 6 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
41 3 2023-02-16 2023-07-03 2 1 andygrove73@gmail.com andygrove73@gmail.com
alias.rs
in src/expr
41 4 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
q7.sql
in benchmarks/tpch/queries
39 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
q21.sql
in benchmarks/tpch/queries
39 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
q22.sql
in benchmarks/tpch/queries
37 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
q20.sql
in benchmarks/tpch/queries
37 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
q8.sql
in benchmarks/tpch/queries
37 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
check-rat-report.py
in dev/release
36 - 2022-10-25 2022-10-25 1 1 me@francis.run me@francis.run
q19.sql
in benchmarks/tpch/queries
35 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
case.rs
in src/expr
34 5 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
df_schema.rs
in src/common
34 4 2023-02-15 2023-02-23 2 1 jdye64@gmail.com jdye64@gmail.com
binary_expr.rs
in src/expr
33 6 2023-02-20 2023-02-20 1 1 andygrove73@gmail.com andygrove73@gmail.com
function.rs
in src/common
33 - 2023-05-30 2023-05-30 1 1 jdye64@gmail.com jdye64@gmail.com
q18.sql
in benchmarks/tpch/queries
32 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
q9.sql
in benchmarks/tpch/queries
32 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
column.rs
in src/expr
32 5 2023-02-20 2023-03-30 2 1 andygrove73@gmail.com andygrove73@gmail.com
q10.sql
in benchmarks/tpch/queries
31 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
q15.sql
in benchmarks/tpch/queries
31 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
in_subquery.rs
in src/expr
31 3 2023-02-24 2023-02-24 1 1 jdye64@gmail.com jdye64@gmail.com
in_list.rs
in src/expr
31 3 2023-02-24 2023-02-24 1 1 jdye64@gmail.com jdye64@gmail.com
31 5 2023-08-05 2023-08-05 1 1 mesejoleon@gmail.com mesejoleon@gmail.com
q16.sql
in benchmarks/tpch/queries
30 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
29 3 2023-02-23 2023-02-23 1 1 jdye64@gmail.com jdye64@gmail.com
q12.sql
in benchmarks/tpch/queries
28 - 2023-05-03 2023-05-03 1 1 andygrove73@gmail.com andygrove73@gmail.com
Correlations

File Size vs. Number of Changes: 116 points

src/expr.rs x: 557 lines of code y: 21 # changes src/expr/conditional_expr.rs x: 31 lines of code y: 1 # changes src/functions.rs x: 406 lines of code y: 11 # changes Cargo.toml x: 60 lines of code y: 38 # changes src/dataframe.rs x: 289 lines of code y: 22 # changes datafusion/input/location.py x: 45 lines of code y: 2 # changes src/common/data_type.rs x: 641 lines of code y: 9 # changes src/dataset_exec.rs x: 222 lines of code y: 8 # changes src/expr/projection.rs x: 84 lines of code y: 7 # changes datafusion/context.py x: 74 lines of code y: 4 # changes datafusion/input/__init__.py x: 4 lines of code y: 1 # changes datafusion/input/base.py x: 14 lines of code y: 1 # changes src/expr/table_scan.rs x: 109 lines of code y: 7 # changes src/catalog.rs x: 100 lines of code y: 6 # changes src/context.rs x: 624 lines of code y: 30 # changes src/dataset.rs x: 78 lines of code y: 5 # changes src/physical_plan.rs x: 48 lines of code y: 5 # changes src/record_batch.rs x: 41 lines of code y: 2 # changes src/udaf.rs x: 105 lines of code y: 12 # changes src/udf.rs x: 64 lines of code y: 9 # changes src/expr/create_view.rs x: 71 lines of code y: 2 # changes src/expr/subquery.rs x: 51 lines of code y: 2 # changes src/sql/logical.rs x: 91 lines of code y: 9 # changes dev/release/generate-changelog.py x: 77 lines of code y: 2 # changes src/common/df_field.rs x: 58 lines of code y: 4 # changes src/substrait.rs x: 110 lines of code y: 6 # changes src/common/schema.rs x: 174 lines of code y: 2 # changes datafusion/cudf.py x: 53 lines of code y: 4 # changes datafusion/pandas.py x: 54 lines of code y: 4 # changes datafusion/polars.py x: 71 lines of code y: 4 # changes src/common.rs x: 20 lines of code y: 3 # changes src/common/function.rs x: 33 lines of code y: 1 # changes src/pyarrow_filter_expression.rs x: 90 lines of code y: 5 # changes conda/environments/datafusion-dev.yaml x: 24 lines of code y: 3 # changes conda/recipes/meta.yaml x: 35 lines of code y: 3 # changes pyproject.toml x: 59 lines of code y: 11 # changes requirements.in x: 26 lines of code y: 3 # changes benchmarks/db-benchmark/groupby-datafusion.py x: 478 lines of code y: 3 # changes benchmarks/db-benchmark/join-datafusion.py x: 266 lines of code y: 3 # changes benchmarks/tpch/create_tables.sql x: 109 lines of code y: 1 # changes benchmarks/tpch/queries/q1.sql x: 21 lines of code y: 1 # changes benchmarks/tpch/queries/q11.sql x: 27 lines of code y: 1 # changes benchmarks/tpch/queries/q12.sql x: 28 lines of code y: 1 # changes benchmarks/tpch/queries/q13.sql x: 20 lines of code y: 1 # changes benchmarks/tpch/queries/q14.sql x: 13 lines of code y: 1 # changes benchmarks/tpch/queries/q16.sql x: 30 lines of code y: 1 # changes benchmarks/tpch/queries/q17.sql x: 17 lines of code y: 1 # changes benchmarks/tpch/queries/q18.sql x: 32 lines of code y: 1 # changes benchmarks/tpch/queries/q19.sql x: 35 lines of code y: 1 # changes benchmarks/tpch/queries/q2.sql x: 43 lines of code y: 1 # changes benchmarks/tpch/queries/q20.sql x: 37 lines of code y: 1 # changes benchmarks/tpch/queries/q21.sql x: 39 lines of code y: 1 # changes benchmarks/tpch/queries/q3.sql x: 22 lines of code y: 1 # changes benchmarks/tpch/queries/q5.sql x: 24 lines of code y: 1 # changes benchmarks/tpch/queries/q6.sql x: 9 lines of code y: 1 # changes benchmarks/tpch/tpch.py x: 57 lines of code y: 1 # changes src/lib.rs x: 76 lines of code y: 14 # changes src/store.rs x: 179 lines of code y: 3 # changes src/errors.rs x: 63 lines of code y: 6 # changes src/utils.rs x: 31 lines of code y: 3 # changes src/expr/literal.rs x: 119 lines of code y: 4 # changes src/expr/subquery_alias.rs x: 58 lines of code y: 2 # changes build.rs x: 3 lines of code y: 1 # changes src/expr/column.rs x: 32 lines of code y: 2 # changes datafusion/__init__.py x: 172 lines of code y: 13 # changes src/expr/repartition.rs x: 92 lines of code y: 1 # changes src/expr/drop_table.rs x: 58 lines of code y: 1 # changes src/expr/distinct.rs x: 50 lines of code y: 1 # changes src/expr/create_memory_table.rs x: 65 lines of code y: 1 # changes src/expr/aggregate.rs x: 76 lines of code y: 3 # changes src/expr/analyze.rs x: 54 lines of code y: 3 # changes src/expr/cross_join.rs x: 62 lines of code y: 2 # changes src/expr/empty_relation.rs x: 54 lines of code y: 2 # changes src/expr/explain.rs x: 77 lines of code y: 1 # changes src/expr/filter.rs x: 56 lines of code y: 3 # changes src/expr/join.rs x: 135 lines of code y: 2 # changes src/expr/limit.rs x: 59 lines of code y: 3 # changes src/expr/logical_node.rs x: 6 lines of code y: 3 # changes src/expr/sort.rs x: 66 lines of code y: 3 # changes src/expr/placeholder.rs x: 26 lines of code y: 1 # changes src/expr/case.rs x: 34 lines of code y: 1 # changes src/expr/cast.rs x: 51 lines of code y: 1 # changes src/expr/bool_expr.rs x: 264 lines of code y: 1 # changes src/expr/like.rs x: 151 lines of code y: 1 # changes src/expr/alias.rs x: 41 lines of code y: 1 # changes src/common/df_schema.rs x: 34 lines of code y: 2 # changes src/expr/aggregate_expr.rs x: 44 lines of code y: 1 # changes src/sql.rs x: 2 lines of code y: 1 # changes src/sql/exceptions.rs x: 16 lines of code y: 1 # changes src/config.rs x: 69 lines of code y: 3 # changes
38.0
# changes
  min: 1.0
  average: 3.56
  25th percentile: 1.0
  median: 1.0
  75th percentile: 3.0
  max: 38.0
0 641.0
lines of code
min: 2.0 | average: 78.51 | 25th percentile: 28.0 | median: 43.5 | 75th percentile: 75.5 | max: 641.0

Number of Contributors vs. Number of Changes: 116 points

src/expr.rs x: 4 # contributors y: 21 # changes src/expr/conditional_expr.rs x: 1 # contributors y: 1 # changes src/functions.rs x: 6 # contributors y: 11 # changes Cargo.toml x: 8 # contributors y: 38 # changes src/dataframe.rs x: 9 # contributors y: 22 # changes datafusion/input/location.py x: 1 # contributors y: 2 # changes src/common/data_type.rs x: 2 # contributors y: 9 # changes src/dataset_exec.rs x: 3 # contributors y: 8 # changes src/expr/projection.rs x: 2 # contributors y: 7 # changes datafusion/context.py x: 1 # contributors y: 4 # changes src/catalog.rs x: 4 # contributors y: 6 # changes src/context.rs x: 12 # contributors y: 30 # changes src/dataset.rs x: 2 # contributors y: 5 # changes src/udaf.rs x: 4 # contributors y: 12 # changes src/udf.rs x: 4 # contributors y: 9 # changes src/common/df_field.rs x: 2 # contributors y: 4 # changes src/expr/scalar_function.rs x: 2 # contributors y: 2 # changes src/substrait.rs x: 3 # contributors y: 6 # changes src/common.rs x: 1 # contributors y: 3 # changes conda/environments/datafusion-dev.yaml x: 2 # contributors y: 3 # changes pyproject.toml x: 7 # contributors y: 11 # changes requirements.in x: 3 # contributors y: 3 # changes src/lib.rs x: 6 # contributors y: 14 # changes src/expr/literal.rs x: 3 # contributors y: 4 # changes datafusion/__init__.py x: 5 # contributors y: 13 # changes
38.0
# changes
  min: 1.0
  average: 3.56
  25th percentile: 1.0
  median: 1.0
  75th percentile: 3.0
  max: 38.0
0 12.0
# contributors
min: 1.0 | average: 1.83 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 12.0

Number of Contributors vs. File Size: 116 points

src/expr.rs x: 4 # contributors y: 557 lines of code src/expr/conditional_expr.rs x: 1 # contributors y: 31 lines of code src/functions.rs x: 6 # contributors y: 406 lines of code Cargo.toml x: 8 # contributors y: 60 lines of code src/dataframe.rs x: 9 # contributors y: 289 lines of code datafusion/input/location.py x: 1 # contributors y: 45 lines of code src/common/data_type.rs x: 2 # contributors y: 641 lines of code src/dataset_exec.rs x: 3 # contributors y: 222 lines of code src/expr/projection.rs x: 2 # contributors y: 84 lines of code datafusion/context.py x: 1 # contributors y: 74 lines of code datafusion/input/__init__.py x: 1 # contributors y: 4 lines of code datafusion/input/base.py x: 1 # contributors y: 14 lines of code src/expr/table_scan.rs x: 2 # contributors y: 109 lines of code src/catalog.rs x: 4 # contributors y: 100 lines of code src/context.rs x: 12 # contributors y: 624 lines of code src/dataset.rs x: 2 # contributors y: 78 lines of code src/physical_plan.rs x: 2 # contributors y: 48 lines of code src/record_batch.rs x: 1 # contributors y: 41 lines of code src/udaf.rs x: 4 # contributors y: 105 lines of code src/udf.rs x: 4 # contributors y: 64 lines of code src/expr/create_view.rs x: 1 # contributors y: 71 lines of code src/expr/subquery.rs x: 1 # contributors y: 51 lines of code src/sql/logical.rs x: 2 # contributors y: 91 lines of code dev/release/generate-changelog.py x: 1 # contributors y: 77 lines of code src/common/df_field.rs x: 2 # contributors y: 58 lines of code src/expr/scalar_function.rs x: 2 # contributors y: 41 lines of code src/substrait.rs x: 3 # contributors y: 110 lines of code src/common/schema.rs x: 1 # contributors y: 174 lines of code datafusion/cudf.py x: 1 # contributors y: 53 lines of code datafusion/pandas.py x: 2 # contributors y: 54 lines of code datafusion/polars.py x: 2 # contributors y: 71 lines of code src/common.rs x: 1 # contributors y: 20 lines of code src/common/function.rs x: 1 # contributors y: 33 lines of code src/pyarrow_filter_expression.rs x: 2 # contributors y: 90 lines of code conda/environments/datafusion-dev.yaml x: 2 # contributors y: 24 lines of code conda/recipes/meta.yaml x: 2 # contributors y: 35 lines of code pyproject.toml x: 7 # contributors y: 59 lines of code requirements.in x: 3 # contributors y: 26 lines of code benchmarks/db-benchmark/groupby-datafusion.py x: 1 # contributors y: 478 lines of code benchmarks/db-benchmark/join-datafusion.py x: 1 # contributors y: 266 lines of code benchmarks/tpch/create_tables.sql x: 1 # contributors y: 109 lines of code benchmarks/tpch/queries/q1.sql x: 1 # contributors y: 21 lines of code benchmarks/tpch/queries/q11.sql x: 1 # contributors y: 27 lines of code benchmarks/tpch/queries/q12.sql x: 1 # contributors y: 28 lines of code benchmarks/tpch/queries/q17.sql x: 1 # contributors y: 17 lines of code benchmarks/tpch/queries/q20.sql x: 1 # contributors y: 37 lines of code benchmarks/tpch/queries/q21.sql x: 1 # contributors y: 39 lines of code benchmarks/tpch/queries/q5.sql x: 1 # contributors y: 24 lines of code benchmarks/tpch/queries/q6.sql x: 1 # contributors y: 9 lines of code benchmarks/tpch/tpch.py x: 1 # contributors y: 57 lines of code src/lib.rs x: 6 # contributors y: 76 lines of code src/store.rs x: 3 # contributors y: 179 lines of code src/errors.rs x: 3 # contributors y: 63 lines of code src/utils.rs x: 3 # contributors y: 31 lines of code src/expr/literal.rs x: 3 # contributors y: 119 lines of code datafusion/__init__.py x: 5 # contributors y: 172 lines of code src/expr/repartition.rs x: 1 # contributors y: 92 lines of code src/expr/distinct.rs x: 1 # contributors y: 50 lines of code src/expr/create_memory_table.rs x: 1 # contributors y: 65 lines of code src/expr/cross_join.rs x: 2 # contributors y: 62 lines of code src/expr/join.rs x: 2 # contributors y: 135 lines of code src/expr/limit.rs x: 2 # contributors y: 59 lines of code src/expr/logical_node.rs x: 1 # contributors y: 6 lines of code src/expr/sort.rs x: 2 # contributors y: 66 lines of code src/expr/signature.rs x: 2 # contributors y: 20 lines of code src/expr/bool_expr.rs x: 1 # contributors y: 264 lines of code src/expr/like.rs x: 1 # contributors y: 151 lines of code src/sql.rs x: 1 # contributors y: 2 lines of code src/config.rs x: 3 # contributors y: 69 lines of code
641.0
lines of code
  min: 2.0
  average: 78.51
  25th percentile: 28.0
  median: 43.5
  75th percentile: 75.5
  max: 641.0
0 12.0
# contributors
min: 1.0 | average: 1.83 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 12.0