apache / parquet-format
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 9 files with 1,262 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 1 file changed 21-50 times (304 lines of code)
    • 0 files changed 6-20 times (0 lines of code)
    • 8 files changed 1-5 times (958 lines of code)
0% | 0% | 24% | 0% | 75%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 9 files with 1,262 lines of code.
    • 1 file changed by more than 25 contributors (304 lines of code)
    • 0 files changed by 11-25 contributors (0 lines of code)
    • 0 files changed by 6-10 contributors (0 lines of code)
    • 8 files changed by 2-5 contributors (958 lines of code)
    • 0 files changed by 1 contributor (0 lines of code)
24% | 0% | 0% | 75% | 0%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
md, java, sh, svg, txt, yml, xml, gitignore, thrift, json, py
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
thrift0% | 0% | 100% | 0% | 0%
java0% | 0% | 0% | 0% | 100%
py0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
src0% | 0% | 30% | 0% | 69%
dev0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 9)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
parquet.thrift
in src/main/thrift
304 - 2016-01-29 2023-05-12 43 28 lgoujon@twitter.com m1brobbel@gmail.com
Util.java
in src/main/java/org/apache/parquet/format
170 22 2014-12-19 2023-02-07 5 4 blue@apache.org fokko@apache.org
275 12 2014-08-05 2018-01-09 5 4 rblue@cloudera.com zi@cloudera.com
InterningProtocol.java
in src/main/java/org/apache/parquet/format
161 48 2014-12-19 2023-02-07 4 3 blue@apache.org fokko@apache.org
FieldConsumer.java
in src/main/java/org/apache/parquet/format/event
7 - 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
LogicalTypes.java
in src/main/java/org/apache/parquet/format
31 1 2017-10-10 2018-09-24 2 2 blue@apache.org gabor@apache.org
EventBasedThriftReader.java
in src/main/java/org/apache/parquet/format/event
56 7 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
Consumers.java
in src/main/java/org/apache/parquet/format/event
120 15 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
TypedConsumer.java
in src/main/java/org/apache/parquet/format/event
138 17 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
Files With Most Contributors (Top 9)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
parquet.thrift
in src/main/thrift
304 - 2016-01-29 2023-05-12 43 28 lgoujon@twitter.com m1brobbel@gmail.com
275 12 2014-08-05 2018-01-09 5 4 rblue@cloudera.com zi@cloudera.com
Util.java
in src/main/java/org/apache/parquet/format
170 22 2014-12-19 2023-02-07 5 4 blue@apache.org fokko@apache.org
InterningProtocol.java
in src/main/java/org/apache/parquet/format
161 48 2014-12-19 2023-02-07 4 3 blue@apache.org fokko@apache.org
TypedConsumer.java
in src/main/java/org/apache/parquet/format/event
138 17 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
EventBasedThriftReader.java
in src/main/java/org/apache/parquet/format/event
56 7 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
FieldConsumer.java
in src/main/java/org/apache/parquet/format/event
7 - 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
Consumers.java
in src/main/java/org/apache/parquet/format/event
120 15 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
LogicalTypes.java
in src/main/java/org/apache/parquet/format
31 1 2017-10-10 2018-09-24 2 2 blue@apache.org gabor@apache.org
Files With Least Contributors (Top 9)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
TypedConsumer.java
in src/main/java/org/apache/parquet/format/event
138 17 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
Consumers.java
in src/main/java/org/apache/parquet/format/event
120 15 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
EventBasedThriftReader.java
in src/main/java/org/apache/parquet/format/event
56 7 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
LogicalTypes.java
in src/main/java/org/apache/parquet/format
31 1 2017-10-10 2018-09-24 2 2 blue@apache.org gabor@apache.org
FieldConsumer.java
in src/main/java/org/apache/parquet/format/event
7 - 2014-12-19 2018-09-24 2 2 blue@apache.org gabor@apache.org
InterningProtocol.java
in src/main/java/org/apache/parquet/format
161 48 2014-12-19 2023-02-07 4 3 blue@apache.org fokko@apache.org
275 12 2014-08-05 2018-01-09 5 4 rblue@cloudera.com zi@cloudera.com
Util.java
in src/main/java/org/apache/parquet/format
170 22 2014-12-19 2023-02-07 5 4 blue@apache.org fokko@apache.org
parquet.thrift
in src/main/thrift
304 - 2016-01-29 2023-05-12 43 28 lgoujon@twitter.com m1brobbel@gmail.com
Correlations

File Size vs. Number of Changes: 9 points

src/main/thrift/parquet.thrift x: 304 lines of code y: 43 # changes src/main/java/org/apache/parquet/format/InterningProtocol.java x: 161 lines of code y: 4 # changes src/main/java/org/apache/parquet/format/Util.java x: 170 lines of code y: 5 # changes src/main/java/org/apache/parquet/format/LogicalTypes.java x: 31 lines of code y: 2 # changes src/main/java/org/apache/parquet/format/event/Consumers.java x: 120 lines of code y: 2 # changes src/main/java/org/apache/parquet/format/event/EventBasedThriftReader.java x: 56 lines of code y: 2 # changes src/main/java/org/apache/parquet/format/event/FieldConsumer.java x: 7 lines of code y: 2 # changes src/main/java/org/apache/parquet/format/event/TypedConsumer.java x: 138 lines of code y: 2 # changes dev/merge_parquet_pr.py x: 275 lines of code y: 5 # changes
43.0
# changes
  min: 2.0
  average: 7.44
  25th percentile: 2.0
  median: 2.0
  75th percentile: 5.0
  max: 43.0
0 304.0
lines of code
min: 7.0 | average: 140.22 | 25th percentile: 43.5 | median: 138.0 | 75th percentile: 222.5 | max: 304.0

Number of Contributors vs. Number of Changes: 9 points

src/main/thrift/parquet.thrift x: 28 # contributors y: 43 # changes src/main/java/org/apache/parquet/format/InterningProtocol.java x: 3 # contributors y: 4 # changes src/main/java/org/apache/parquet/format/Util.java x: 4 # contributors y: 5 # changes src/main/java/org/apache/parquet/format/LogicalTypes.java x: 2 # contributors y: 2 # changes
43.0
# changes
  min: 2.0
  average: 7.44
  25th percentile: 2.0
  median: 2.0
  75th percentile: 5.0
  max: 43.0
0 28.0
# contributors
min: 2.0 | average: 5.44 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 4.0 | max: 28.0

Number of Contributors vs. File Size: 9 points

src/main/thrift/parquet.thrift x: 28 # contributors y: 304 lines of code src/main/java/org/apache/parquet/format/InterningProtocol.java x: 3 # contributors y: 161 lines of code src/main/java/org/apache/parquet/format/Util.java x: 4 # contributors y: 170 lines of code src/main/java/org/apache/parquet/format/LogicalTypes.java x: 2 # contributors y: 31 lines of code src/main/java/org/apache/parquet/format/event/Consumers.java x: 2 # contributors y: 120 lines of code src/main/java/org/apache/parquet/format/event/EventBasedThriftReader.java x: 2 # contributors y: 56 lines of code src/main/java/org/apache/parquet/format/event/FieldConsumer.java x: 2 # contributors y: 7 lines of code src/main/java/org/apache/parquet/format/event/TypedConsumer.java x: 2 # contributors y: 138 lines of code dev/merge_parquet_pr.py x: 4 # contributors y: 275 lines of code
304.0
lines of code
  min: 7.0
  average: 140.22
  25th percentile: 43.5
  median: 138.0
  75th percentile: 222.5
  max: 304.0
0 28.0
# contributors
min: 2.0 | average: 5.44 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 4.0 | max: 28.0