apache / gravitino-playground
File Change Frequency

File change frequency (churn) shows how often each file is updated, measured as the number of days with at least one commit that touches the file.
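The counting rule above (days with at least one commit per file) can be sketched as follows. This is a minimal illustration, not the report tool's implementation: the function name `change_days_per_file`, the `@` date marker, and the sample paths are hypothetical, and the input is assumed to come from `git log --name-only --date=short --pretty=format:@%ad`.

```python
from collections import defaultdict


def change_days_per_file(log_text):
    """Count, per file, the distinct days on which at least one commit touched it.

    Assumed input: text from
        git log --name-only --date=short --pretty=format:@%ad
    so each commit starts with a line '@YYYY-MM-DD' followed by file paths.
    """
    days = defaultdict(set)
    current_day = None
    for line in log_text.splitlines():
        line = line.strip()
        if not line:
            continue  # blank separator between commits
        if line.startswith("@"):
            current_day = line[1:]  # start of a new commit
        elif current_day is not None:
            days[line].add(current_day)  # a set ignores same-day repeats
    return {path: len(day_set) for path, day_set in days.items()}


# Hypothetical log excerpt: two commits on the same day count once.
sample = """@2025-01-26
init/a.ipynb
init/b.sql

@2025-01-26
init/a.ipynb

@2024-11-08
init/a.ipynb
"""

print(change_days_per_file(sample))
```

Two commits to the same file on one day contribute a single change day, which is why the "# changes (days)" column can be lower than the raw commit count.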

Overview
File Change Frequency Overall
  • There are 12 files with 2,624 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 0 files changed 21-50 times (0 lines of code)
    • 2 files changed 6-20 times (539 lines of code)
    • 10 files changed 1-5 times (2,085 lines of code)
Share of lines of code by number of changes: 101+: 0% | 51-100: 0% | 21-50: 0% | 6-20: 20% | 1-5: 79%
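The bucket percentages can be reproduced from the per-file figures listed later in this report. The sketch below assumes that each share is the bucket's lines of code divided by the total, truncated to a whole percent; the truncation is an inference from the 20% and 79% values (539/2,624 and 2,085/2,624), not something the report states. The names `FILES`, `BUCKETS`, and `bucket_shares` are illustrative.

```python
# (lines of code, number of change days) for the 12 files in this report
FILES = [(340, 11), (199, 8), (77, 5), (11, 4), (315, 3), (839, 3),
         (755, 2), (7, 1), (10, 1), (11, 1), (22, 1), (38, 1)]

# (label, inclusive lower bound, inclusive upper bound)
BUCKETS = [("101+", 101, float("inf")), ("51-100", 51, 100),
           ("21-50", 21, 50), ("6-20", 6, 20), ("1-5", 1, 5)]


def bucket_shares(files):
    """Return the truncated percentage of lines of code in each change bucket."""
    total = sum(loc for loc, _ in files)
    shares = {}
    for label, low, high in BUCKETS:
        bucket_loc = sum(loc for loc, changes in files if low <= changes <= high)
        shares[label] = 100 * bucket_loc // total  # assumed: truncate, not round
    return shares


print(bucket_shares(FILES))
```

Because each share is truncated, the percentages sum to 99% rather than 100%.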

Contributors Count Frequency Overall
  • There are 12 files with 2,624 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 0 files changed by 11-25 contributors (0 lines of code)
    • 1 file changed by 6-10 contributors (340 lines of code)
    • 6 files changed by 2-5 contributors (2,196 lines of code)
    • 5 files changed by 1 contributor (88 lines of code)
Share of lines of code by number of contributors: 26+: 0% | 11-25: 0% | 6-10: 12% | 2-5: 83% | 1: 3%

File Change Frequency per File Extension
sh, yaml, ipynb, sql, json, xml, ini, gitignore, txt, md
File Change Frequency per Extension (share of lines of code by number of recorded file updates)
Extension | 101+ | 51-100 | 21-50 | 6-20 | 1-5
ipynb | 0% | 0% | 0% | 33% | 66%
sql | 0% | 0% | 0% | 0% | 100%
xml | 0% | 0% | 0% | 0% | 100%
yaml | 0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary (file change frequency, share of lines of code by number of recorded file updates)
Component | 101+ | 51-100 | 21-50 | 6-20 | 1-5
init | 0% | 0% | 0% | 20% | 79%
ROOT | 0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 12)


File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
init/jupyter/gravitino-fileset-example.ipynb | 340 | - | 2024-05-17 | 2025-01-26 | 11 | 6 | xun@datastrato.com | 154112360+danhuawang@users....
init/jupyter/gravitino-spark-trino-example.ipynb | 199 | - | 2024-09-29 | 2025-01-26 | 8 | 4 | xiaojing@datastrato.com | 154112360+danhuawang@users....
init/trino/init.sql | 77 | - | 2023-12-11 | 2024-11-08 | 5 | 5 | 148952220+qqqttt123@users.n... | e850506@gmail.com
init/mysql/init.sql | 11 | - | 2023-12-08 | 2024-07-11 | 4 | 3 | 148952220+qqqttt123@users.n... | sai.sai.shao@gmail.com
init/jupyter/gravitino-trino-example.ipynb | 315 | - | 2024-05-28 | 2024-12-16 | 3 | 2 | guoxudong.dev@gmail.com | e850506@gmail.com
init/postgres/init.sql | 839 | - | 2023-12-11 | 2024-07-04 | 3 | 3 | 148952220+qqqttt123@users.n... | justin@datastrato.com
init/jupyter/authorization/gravitino-access-control-example.ipynb | 755 | - | 2024-11-20 | 2025-01-26 | 2 | 2 | roryqi@apache.org | 154112360+danhuawang@users....
init/grafana/datasources.yaml | 7 | - | 2024-11-06 | 2024-11-06 | 1 | 1 | oren.ccl@gmail.com | oren.ccl@gmail.com
init/grafana/dashboards/dashboards.yaml | 10 | - | 2024-11-06 | 2024-11-06 | 1 | 1 | oren.ccl@gmail.com | oren.ccl@gmail.com
docker-enable-ranger-hive-override.yaml | 11 | - | 2024-11-21 | 2024-11-21 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
init/jupyter/ranger-spark-security.xml | 22 | - | 2024-11-20 | 2024-11-20 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
init/hive/core-site.xml | 38 | - | 2024-11-20 | 2024-11-20 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
Files With Most Contributors (Top 12)
Based on the number of unique email addresses found in commits.


File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
init/jupyter/gravitino-fileset-example.ipynb | 340 | - | 2024-05-17 | 2025-01-26 | 11 | 6 | xun@datastrato.com | 154112360+danhuawang@users....
init/trino/init.sql | 77 | - | 2023-12-11 | 2024-11-08 | 5 | 5 | 148952220+qqqttt123@users.n... | e850506@gmail.com
init/jupyter/gravitino-spark-trino-example.ipynb | 199 | - | 2024-09-29 | 2025-01-26 | 8 | 4 | xiaojing@datastrato.com | 154112360+danhuawang@users....
init/mysql/init.sql | 11 | - | 2023-12-08 | 2024-07-11 | 4 | 3 | 148952220+qqqttt123@users.n... | sai.sai.shao@gmail.com
init/postgres/init.sql | 839 | - | 2023-12-11 | 2024-07-04 | 3 | 3 | 148952220+qqqttt123@users.n... | justin@datastrato.com
init/jupyter/gravitino-trino-example.ipynb | 315 | - | 2024-05-28 | 2024-12-16 | 3 | 2 | guoxudong.dev@gmail.com | e850506@gmail.com
init/jupyter/authorization/gravitino-access-control-example.ipynb | 755 | - | 2024-11-20 | 2025-01-26 | 2 | 2 | roryqi@apache.org | 154112360+danhuawang@users....
init/grafana/datasources.yaml | 7 | - | 2024-11-06 | 2024-11-06 | 1 | 1 | oren.ccl@gmail.com | oren.ccl@gmail.com
init/grafana/dashboards/dashboards.yaml | 10 | - | 2024-11-06 | 2024-11-06 | 1 | 1 | oren.ccl@gmail.com | oren.ccl@gmail.com
init/hive/core-site.xml | 38 | - | 2024-11-20 | 2024-11-20 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
init/jupyter/ranger-spark-security.xml | 22 | - | 2024-11-20 | 2024-11-20 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
docker-enable-ranger-hive-override.yaml | 11 | - | 2024-11-21 | 2024-11-21 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
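The contributor counts in the table above are based on unique email addresses found in commits. A minimal sketch of that counting rule follows; the function name `contributors_per_file`, the input shape, and the sample addresses are hypothetical, and normalising case and whitespace is an assumption rather than documented tool behaviour.

```python
from collections import defaultdict


def contributors_per_file(commits):
    """Count unique author emails per file.

    commits: iterable of (author_email, [file paths]) pairs (assumed shape).
    """
    emails = defaultdict(set)
    for email, paths in commits:
        key = email.strip().lower()  # assumption: compare emails case-insensitively
        for path in paths:
            emails[path].add(key)
    return {path: len(addresses) for path, addresses in emails.items()}


# Hypothetical commits: one author appears twice with different casing.
sample_commits = [
    ("alice@example.com", ["init/a.ipynb", "init/b.sql"]),
    ("Alice@example.com", ["init/a.ipynb"]),
    ("bob@example.com", ["init/a.ipynb"]),
]

print(contributors_per_file(sample_commits))
```

One consequence of counting by email address is that a person who commits under two different addresses is counted as two contributors.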
Files With Fewest Contributors (Top 12)
Based on the number of unique email addresses found in commits.


File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
init/hive/core-site.xml | 38 | - | 2024-11-20 | 2024-11-20 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
init/jupyter/ranger-spark-security.xml | 22 | - | 2024-11-20 | 2024-11-20 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
docker-enable-ranger-hive-override.yaml | 11 | - | 2024-11-21 | 2024-11-21 | 1 | 1 | roryqi@apache.org | roryqi@apache.org
init/grafana/dashboards/dashboards.yaml | 10 | - | 2024-11-06 | 2024-11-06 | 1 | 1 | oren.ccl@gmail.com | oren.ccl@gmail.com
init/grafana/datasources.yaml | 7 | - | 2024-11-06 | 2024-11-06 | 1 | 1 | oren.ccl@gmail.com | oren.ccl@gmail.com
init/jupyter/authorization/gravitino-access-control-example.ipynb | 755 | - | 2024-11-20 | 2025-01-26 | 2 | 2 | roryqi@apache.org | 154112360+danhuawang@users....
init/jupyter/gravitino-trino-example.ipynb | 315 | - | 2024-05-28 | 2024-12-16 | 3 | 2 | guoxudong.dev@gmail.com | e850506@gmail.com
init/postgres/init.sql | 839 | - | 2023-12-11 | 2024-07-04 | 3 | 3 | 148952220+qqqttt123@users.n... | justin@datastrato.com
init/mysql/init.sql | 11 | - | 2023-12-08 | 2024-07-11 | 4 | 3 | 148952220+qqqttt123@users.n... | sai.sai.shao@gmail.com
init/jupyter/gravitino-spark-trino-example.ipynb | 199 | - | 2024-09-29 | 2025-01-26 | 8 | 4 | xiaojing@datastrato.com | 154112360+danhuawang@users....
init/trino/init.sql | 77 | - | 2023-12-11 | 2024-11-08 | 5 | 5 | 148952220+qqqttt123@users.n... | e850506@gmail.com
init/jupyter/gravitino-fileset-example.ipynb | 340 | - | 2024-05-17 | 2025-01-26 | 11 | 6 | xun@datastrato.com | 154112360+danhuawang@users....
Correlations

File Size vs. Number of Changes: 12 points

Points (x: lines of code, y: # changes):
  • init/jupyter/authorization/gravitino-access-control-example.ipynb: x = 755, y = 2
  • init/jupyter/gravitino-fileset-example.ipynb: x = 340, y = 11
  • init/jupyter/gravitino-spark-trino-example.ipynb: x = 199, y = 8
  • init/jupyter/gravitino-trino-example.ipynb: x = 315, y = 3
  • docker-enable-ranger-hive-override.yaml: x = 11, y = 1
  • init/hive/core-site.xml: x = 38, y = 1
  • init/jupyter/ranger-spark-security.xml: x = 22, y = 1
  • init/trino/init.sql: x = 77, y = 5
  • init/grafana/dashboards/dashboards.yaml: x = 10, y = 1
  • init/grafana/datasources.yaml: x = 7, y = 1
  • init/mysql/init.sql: x = 11, y = 4
  • init/postgres/init.sql: x = 839, y = 3
# changes (y axis): min: 1.0 | average: 3.42 | 25th percentile: 1.0 | median: 2.5 | 75th percentile: 4.75 | max: 11.0
lines of code (x axis): min: 7.0 | average: 218.67 | 25th percentile: 11.0 | median: 57.5 | 75th percentile: 333.75 | max: 839.0
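The summary statistics above can be recomputed from the 12 data points. The report does not say which percentile method it uses. The sketch below assumes the "exclusive" interpolation offered by Python's `statistics.quantiles`, which matches the reported values; an inclusive method would give 4.25 rather than 4.75 for the 75th percentile of # changes. The helper name `summarize` is illustrative.

```python
import statistics

# Per-file values taken from the points listed in this report
changes = [2, 11, 8, 3, 1, 1, 1, 5, 1, 1, 4, 3]
lines_of_code = [755, 340, 199, 315, 11, 38, 22, 77, 10, 7, 11, 839]


def summarize(values):
    """Min, mean, quartiles, and max, using the 'exclusive' quantile method."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="exclusive")
    return {
        "min": min(values),
        "average": round(statistics.mean(values), 2),
        "25th percentile": q1,
        "median": median,
        "75th percentile": q3,
        "max": max(values),
    }


print(summarize(changes))
print(summarize(lines_of_code))
```

The large gap between the median (57.5) and the average (218.67) for lines of code reflects a skewed distribution: a few large notebooks and one large SQL script dominate the total.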

Number of Contributors vs. Number of Changes: 12 points

Points (x: # contributors, y: # changes):
  • init/jupyter/authorization/gravitino-access-control-example.ipynb: x = 2, y = 2
  • init/jupyter/gravitino-fileset-example.ipynb: x = 6, y = 11
  • init/jupyter/gravitino-spark-trino-example.ipynb: x = 4, y = 8
  • init/jupyter/gravitino-trino-example.ipynb: x = 2, y = 3
  • docker-enable-ranger-hive-override.yaml: x = 1, y = 1
  • init/trino/init.sql: x = 5, y = 5
  • init/mysql/init.sql: x = 3, y = 4
  • init/postgres/init.sql: x = 3, y = 3
# changes (y axis): min: 1.0 | average: 3.42 | 25th percentile: 1.0 | median: 2.5 | 75th percentile: 4.75 | max: 11.0
# contributors (x axis): min: 1.0 | average: 2.5 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 3.75 | max: 6.0
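The report shows this relationship only as a scatter plot and does not state a correlation coefficient. As an illustration, the sketch below computes a Pearson coefficient from the per-file contributor and change counts given in the file tables of this report (all 12 files, including those with one contributor and one change). The function name `pearson` is illustrative, and Pearson is only one possible measure here.

```python
import math

# (# contributors, # changes) per file, from the file tables in this report
points = [(6, 11), (4, 8), (5, 5), (3, 4), (2, 3), (3, 3),
          (2, 2), (1, 1), (1, 1), (1, 1), (1, 1), (1, 1)]


def pearson(pairs):
    """Pearson correlation coefficient for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    var_x = sum((x - mean_x) ** 2 for x, _ in pairs)
    var_y = sum((y - mean_y) ** 2 for _, y in pairs)
    return cov / math.sqrt(var_x * var_y)


r = pearson(points)
print(round(r, 2))
```

For this data the coefficient comes out at roughly 0.93, so files touched by more contributors also tend to have more change days. With only 12 files this should be read as descriptive rather than conclusive.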

Number of Contributors vs. File Size: 12 points

Points (x: # contributors, y: lines of code):
  • init/jupyter/authorization/gravitino-access-control-example.ipynb: x = 2, y = 755
  • init/jupyter/gravitino-fileset-example.ipynb: x = 6, y = 340
  • init/jupyter/gravitino-spark-trino-example.ipynb: x = 4, y = 199
  • init/jupyter/gravitino-trino-example.ipynb: x = 2, y = 315
  • docker-enable-ranger-hive-override.yaml: x = 1, y = 11
  • init/hive/core-site.xml: x = 1, y = 38
  • init/jupyter/ranger-spark-security.xml: x = 1, y = 22
  • init/trino/init.sql: x = 5, y = 77
  • init/grafana/datasources.yaml: x = 1, y = 7
  • init/mysql/init.sql: x = 3, y = 11
  • init/postgres/init.sql: x = 3, y = 839
lines of code (y axis): min: 7.0 | average: 218.67 | 25th percentile: 11.0 | median: 57.5 | 75th percentile: 333.75 | max: 839.0
# contributors (x axis): min: 1.0 | average: 2.5 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 3.75 | max: 6.0