apache / parquet-cpp
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 113 files with 25,710 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 3 files changed 51-100 times (3,396 lines of code)
    • 7 files changed 21-50 times (3,081 lines of code)
    • 50 files changed 6-20 times (11,296 lines of code)
    • 53 files changed 1-5 times (7,937 lines of code)
0% | 13% | 11% | 43% | 30%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 113 files with 25,710 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 5 files changed by 11-25 contributors (4,516 lines of code)
    • 25 files changed by 6-10 contributors (7,688 lines of code)
    • 63 files changed by 2-5 contributors (9,343 lines of code)
    • 20 files changed by 1 contributor (4,163 lines of code)
0% | 17% | 29% | 36% | 16%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
cc, h, cmake, txt, sh, md, py, in, gitignore, bat, yml, clang-format, thrift, json, pl, clang-tidy
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
cc0% | 19% | 15% | 48% | 16%
cmake0% | 22% | 0% | 39% | 37%
h0% | 0% | 14% | 57% | 27%
thrift0% | 0% | 0% | 100% | 0%
yml0% | 0% | 0% | 100% | 0%
py0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
pl0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
src0% | 14% | 15% | 50% | 19%
cmake_modules0% | 22% | 0% | 38% | 38%
benchmarks0% | 0% | 0% | 100% | 0%
tools0% | 0% | 0% | 42% | 57%
ROOT0% | 0% | 0% | 100% | 0%
build-support0% | 0% | 0% | 0% | 100%
dev0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 50)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
arrow-reader-writer-test.cc
in src/parquet/arrow
1820 43 2016-09-18 2018-08-23 67 17 uwelk@xhochy.com uwe.korn@blue-yonder.com
370 - 2016-12-30 2018-06-14 65 15 wes.mckinney@twosigma.com thamht01188@gmail.com
reader.cc
in src/parquet/arrow
1206 80 2016-09-18 2018-08-23 56 12 uwelk@xhochy.com uwe.korn@blue-yonder.com
writer.cc
in src/parquet/arrow
910 53 2016-09-18 2018-08-20 49 14 uwelk@xhochy.com antoine@python.org
types.h
in src/parquet
210 7 2016-01-28 2018-08-17 30 14 wes@cloudera.com wesm+git@apache.org
schema.cc
in src/parquet/arrow
602 22 2016-09-18 2018-02-20 30 8 uwelk@xhochy.com wes.mckinney@twosigma.com
reader-test.cc
in src/parquet
231 8 2016-01-26 2018-03-28 29 8 wes@cloudera.com antoine@python.org
arrow-schema-test.cc
in src/parquet/arrow
579 5 2016-09-18 2018-07-30 28 9 uwelk@xhochy.com wesm+git@apache.org
test-util.h
in src/parquet/arrow
390 13 2016-09-18 2018-08-25 27 10 uwelk@xhochy.com deepak.majeti@hpe.com
test-common.h
in src/parquet/util
159 14 2016-02-02 2018-06-28 21 9 wes@cloudera.com deepak.majeti@hpe.com
reader.h
in src/parquet/arrow
108 - 2016-09-18 2018-08-23 20 7 uwelk@xhochy.com uwe.korn@blue-yonder.com
196 8 2016-09-26 2018-08-23 18 7 uwelk@xhochy.com uwe.korn@blue-yonder.com
encoding-internal.h
in src/parquet
729 66 2017-02-03 2018-08-01 18 8 wes.mckinney@twosigma.com antoine@python.org
memory.h
in src/parquet/util
224 10 2016-12-30 2018-07-30 17 7 wes.mckinney@twosigma.com wesm+git@apache.org
statistics-test.cc
in src/parquet
599 24 2017-06-26 2018-08-01 16 7 wes.mckinney@twosigma.com antoine@python.org
FindArrow.cmake
in cmake_modules
108 - 2016-09-18 2018-02-15 15 8 uwelk@xhochy.com uwelk@xhochy.com
encoding.h
in src/parquet
87 9 2017-02-03 2018-07-30 14 9 wes.mckinney@twosigma.com wesm+git@apache.org
memory.cc
in src/parquet/util
384 26 2016-12-30 2018-07-30 14 6 wes.mckinney@twosigma.com wesm+git@apache.org
column_writer-test.cc
in src/parquet
584 19 2017-06-26 2018-08-25 14 6 wes.mckinney@twosigma.com deepak.majeti@hpe.com
schema.cc
in src/parquet
591 54 2017-01-26 2018-07-24 14 8 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
column_writer.cc
in src/parquet
658 41 2017-06-26 2018-08-25 14 5 wes.mckinney@twosigma.com deepak.majeti@hpe.com
decode_benchmark.cc
in benchmarks
365 18 2016-09-05 2018-08-17 13 4 deepak.majeti@hpe.com wesm+git@apache.org
writer.h
in src/parquet/arrow
133 6 2016-09-18 2018-07-30 12 7 uwelk@xhochy.com wesm+git@apache.org
column_reader.cc
in src/parquet
333 12 2016-01-28 2018-08-01 12 6 wes@cloudera.com antoine@python.org
schema.h
in src/parquet/arrow
46 - 2016-09-18 2017-11-20 11 6 uwelk@xhochy.com cpcloud@gmail.com
exception.h
in src/parquet
47 - 2016-01-27 2018-02-21 11 5 wes@cloudera.com uwelk@xhochy.com
FindSnappy.cmake
in cmake_modules
88 - 2014-06-01 2017-05-16 11 7 nong@cloudera.com risuhin.max@gmail.com
types-test.cc
in src/parquet
108 - 2016-02-29 2018-08-01 11 7 deepak.majeti@hpe.com uwe.korn@blue-yonder.com
encoding-test.cc
in src/parquet
211 15 2017-02-03 2018-08-01 11 6 wes.mckinney@twosigma.com antoine@python.org
FindThrift.cmake
in cmake_modules
91 - 2014-05-13 2017-05-22 10 6 nong@cloudera.com rip.nsk@gmail.com
types.cc
in src/parquet
263 9 2016-05-05 2018-08-01 10 6 deepak.majeti@hpe.com uwe.korn@blue-yonder.com
column_reader.h
in src/parquet
310 11 2016-01-28 2018-07-23 10 6 wes@cloudera.com antoine@python.org
schema-test.cc
in src/parquet
534 26 2017-01-26 2018-04-23 10 6 wes.mckinney@twosigma.com joshua.storck@twosigma.com
logging.h
in src/parquet/util
4 - 2016-01-09 2017-06-27 9 5 wes@cloudera.com wes.mckinney@twosigma.com
public-api-test.cc
in src/parquet
26 1 2016-02-16 2017-12-13 9 3 wes@cloudera.com wes.mckinney@twosigma.com
FindBrotli.cmake
in cmake_modules
107 - 2016-11-26 2017-06-11 9 4 uwelk@xhochy.com risuhin.max@gmail.com
encoding-benchmark.cc
in src/parquet
113 8 2017-02-03 2018-07-30 9 4 wes.mckinney@twosigma.com wesm+git@apache.org
schema.h
in src/parquet
235 32 2017-01-26 2018-07-24 9 5 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
memory-test.cc
in src/parquet/util
248 1 2016-12-30 2018-07-30 9 4 wes.mckinney@twosigma.com wesm+git@apache.org
file_reader.cc
in src/parquet
261 18 2017-12-12 2018-07-23 9 5 wes.mckinney@twosigma.com antoine@python.org
43 - 2017-05-02 2018-05-31 8 3 risuhin.max@gmail.com antoine@python.org
FindZLIB.cmake
in cmake_modules
99 - 2016-02-13 2017-05-16 8 5 wes@cloudera.com risuhin.max@gmail.com
column_writer.h
in src/parquet
167 8 2017-06-26 2018-08-25 8 3 wes.mckinney@twosigma.com deepak.majeti@hpe.com
metadata.cc
in src/parquet
696 95 2017-12-12 2018-08-15 8 4 wes.mckinney@twosigma.com deepak.majeti@hpe.com
reader.h
in src/parquet/api
11 - 2016-03-02 2017-12-12 7 3 wesm@apache.org wes.mckinney@twosigma.com
stopwatch.h
in src/parquet/util
24 2 2016-01-09 2017-05-02 7 4 wes@cloudera.com risuhin.max@gmail.com
comparison.h
in src/parquet/util
126 12 2016-10-03 2018-02-20 7 4 artem.tarasov@embl.de wes.mckinney@twosigma.com
186 8 2017-06-26 2018-07-23 7 4 wes.mckinney@twosigma.com antoine@python.org
statistics.cc
in src/parquet
251 23 2017-06-26 2018-07-30 7 4 wes.mckinney@twosigma.com wesm+git@apache.org
test-util.h
in src/parquet
341 26 2017-06-26 2018-07-23 7 3 wes.mckinney@twosigma.com antoine@python.org
Files With Most Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
arrow-reader-writer-test.cc
in src/parquet/arrow
1820 43 2016-09-18 2018-08-23 67 17 uwelk@xhochy.com uwe.korn@blue-yonder.com
370 - 2016-12-30 2018-06-14 65 15 wes.mckinney@twosigma.com thamht01188@gmail.com
writer.cc
in src/parquet/arrow
910 53 2016-09-18 2018-08-20 49 14 uwelk@xhochy.com antoine@python.org
types.h
in src/parquet
210 7 2016-01-28 2018-08-17 30 14 wes@cloudera.com wesm+git@apache.org
reader.cc
in src/parquet/arrow
1206 80 2016-09-18 2018-08-23 56 12 uwelk@xhochy.com uwe.korn@blue-yonder.com
test-util.h
in src/parquet/arrow
390 13 2016-09-18 2018-08-25 27 10 uwelk@xhochy.com deepak.majeti@hpe.com
arrow-schema-test.cc
in src/parquet/arrow
579 5 2016-09-18 2018-07-30 28 9 uwelk@xhochy.com wesm+git@apache.org
test-common.h
in src/parquet/util
159 14 2016-02-02 2018-06-28 21 9 wes@cloudera.com deepak.majeti@hpe.com
encoding.h
in src/parquet
87 9 2017-02-03 2018-07-30 14 9 wes.mckinney@twosigma.com wesm+git@apache.org
schema.cc
in src/parquet/arrow
602 22 2016-09-18 2018-02-20 30 8 uwelk@xhochy.com wes.mckinney@twosigma.com
reader-test.cc
in src/parquet
231 8 2016-01-26 2018-03-28 29 8 wes@cloudera.com antoine@python.org
encoding-internal.h
in src/parquet
729 66 2017-02-03 2018-08-01 18 8 wes.mckinney@twosigma.com antoine@python.org
FindArrow.cmake
in cmake_modules
108 - 2016-09-18 2018-02-15 15 8 uwelk@xhochy.com uwelk@xhochy.com
schema.cc
in src/parquet
591 54 2017-01-26 2018-07-24 14 8 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
reader.h
in src/parquet/arrow
108 - 2016-09-18 2018-08-23 20 7 uwelk@xhochy.com uwe.korn@blue-yonder.com
196 8 2016-09-26 2018-08-23 18 7 uwelk@xhochy.com uwe.korn@blue-yonder.com
memory.h
in src/parquet/util
224 10 2016-12-30 2018-07-30 17 7 wes.mckinney@twosigma.com wesm+git@apache.org
statistics-test.cc
in src/parquet
599 24 2017-06-26 2018-08-01 16 7 wes.mckinney@twosigma.com antoine@python.org
writer.h
in src/parquet/arrow
133 6 2016-09-18 2018-07-30 12 7 uwelk@xhochy.com wesm+git@apache.org
FindSnappy.cmake
in cmake_modules
88 - 2014-06-01 2017-05-16 11 7 nong@cloudera.com risuhin.max@gmail.com
types-test.cc
in src/parquet
108 - 2016-02-29 2018-08-01 11 7 deepak.majeti@hpe.com uwe.korn@blue-yonder.com
memory.cc
in src/parquet/util
384 26 2016-12-30 2018-07-30 14 6 wes.mckinney@twosigma.com wesm+git@apache.org
column_writer-test.cc
in src/parquet
584 19 2017-06-26 2018-08-25 14 6 wes.mckinney@twosigma.com deepak.majeti@hpe.com
column_reader.cc
in src/parquet
333 12 2016-01-28 2018-08-01 12 6 wes@cloudera.com antoine@python.org
encoding-test.cc
in src/parquet
211 15 2017-02-03 2018-08-01 11 6 wes.mckinney@twosigma.com antoine@python.org
schema.h
in src/parquet/arrow
46 - 2016-09-18 2017-11-20 11 6 uwelk@xhochy.com cpcloud@gmail.com
FindThrift.cmake
in cmake_modules
91 - 2014-05-13 2017-05-22 10 6 nong@cloudera.com rip.nsk@gmail.com
schema-test.cc
in src/parquet
534 26 2017-01-26 2018-04-23 10 6 wes.mckinney@twosigma.com joshua.storck@twosigma.com
types.cc
in src/parquet
263 9 2016-05-05 2018-08-01 10 6 deepak.majeti@hpe.com uwe.korn@blue-yonder.com
column_reader.h
in src/parquet
310 11 2016-01-28 2018-07-23 10 6 wes@cloudera.com antoine@python.org
column_writer.cc
in src/parquet
658 41 2017-06-26 2018-08-25 14 5 wes.mckinney@twosigma.com deepak.majeti@hpe.com
exception.h
in src/parquet
47 - 2016-01-27 2018-02-21 11 5 wes@cloudera.com uwelk@xhochy.com
logging.h
in src/parquet/util
4 - 2016-01-09 2017-06-27 9 5 wes@cloudera.com wes.mckinney@twosigma.com
schema.h
in src/parquet
235 32 2017-01-26 2018-07-24 9 5 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
file_reader.cc
in src/parquet
261 18 2017-12-12 2018-07-23 9 5 wes.mckinney@twosigma.com antoine@python.org
FindZLIB.cmake
in cmake_modules
99 - 2016-02-13 2017-05-16 8 5 wes@cloudera.com risuhin.max@gmail.com
FindGTest.cmake
in cmake_modules
88 - 2016-01-26 2017-05-16 6 5 wes@cloudera.com risuhin.max@gmail.com
CompilerInfo.cmake
in cmake_modules
59 - 2016-06-29 2017-12-13 6 5 wesm@apache.org wes.mckinney@twosigma.com
parquet.thrift
in src/parquet
273 - 2017-01-29 2018-07-31 6 5 wes.mckinney@twosigma.com gidon@il.ibm.com
decode_benchmark.cc
in benchmarks
365 18 2016-09-05 2018-08-17 13 4 deepak.majeti@hpe.com wesm+git@apache.org
FindBrotli.cmake
in cmake_modules
107 - 2016-11-26 2017-06-11 9 4 uwelk@xhochy.com risuhin.max@gmail.com
memory-test.cc
in src/parquet/util
248 1 2016-12-30 2018-07-30 9 4 wes.mckinney@twosigma.com wesm+git@apache.org
encoding-benchmark.cc
in src/parquet
113 8 2017-02-03 2018-07-30 9 4 wes.mckinney@twosigma.com wesm+git@apache.org
metadata.cc
in src/parquet
696 95 2017-12-12 2018-08-15 8 4 wes.mckinney@twosigma.com deepak.majeti@hpe.com
statistics.cc
in src/parquet
251 23 2017-06-26 2018-07-30 7 4 wes.mckinney@twosigma.com wesm+git@apache.org
comparison.h
in src/parquet/util
126 12 2016-10-03 2018-02-20 7 4 artem.tarasov@embl.de wes.mckinney@twosigma.com
stopwatch.h
in src/parquet/util
24 2 2016-01-09 2017-05-02 7 4 wes@cloudera.com risuhin.max@gmail.com
186 8 2017-06-26 2018-07-23 7 4 wes.mckinney@twosigma.com antoine@python.org
thrift.h
in src/parquet
101 12 2017-01-29 2018-02-21 6 4 wes.mckinney@twosigma.com deepak.majeti@hpe.com
comparison-test.cc
in src/parquet/util
168 2 2016-10-03 2017-09-21 6 4 artem.tarasov@embl.de deepak.majeti@hpe.com
Files With Least Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
cpplint.py
in build-support
2985 161 2016-01-09 2016-01-09 1 1 wes@cloudera.com wes@cloudera.com
crypto.cc
in src/parquet/util
255 9 2018-08-05 2018-08-05 1 1 gidon@il.ibm.com gidon@il.ibm.com
murmur3.cc
in src/parquet
136 14 2018-08-15 2018-08-15 1 1 cjjnjust@gmail.com cjjnjust@gmail.com
bloom_filter.cc
in src/parquet
117 7 2018-08-15 2018-08-15 1 1 cjjnjust@gmail.com cjjnjust@gmail.com
column_page.h
in src/parquet
106 18 2017-06-26 2017-12-12 3 1 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
bloom_filter.h
in src/parquet
86 10 2018-08-15 2018-08-15 1 1 cjjnjust@gmail.com cjjnjust@gmail.com
FindGBenchmark.cmake
in cmake_modules
82 - 2016-05-02 2016-05-02 1 1 uwelk@xhochy.com uwelk@xhochy.com
file_reader.h
in src/parquet
72 2 2017-12-12 2017-12-12 1 1 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
column_scanner.cc
in src/parquet
65 2 2017-06-26 2017-07-31 2 1 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
comparison.cc
in src/parquet/util
53 1 2017-09-11 2018-01-24 2 1 deepak.majeti@hpe.com deepak.majeti@hpe.com
run_clang_format.py
in build-support
50 - 2017-12-11 2017-12-11 1 1 uwe.korn@blue-yonder.com uwe.korn@blue-yonder.com
schema-internal.h
in src/parquet
40 1 2017-01-26 2017-01-29 2 1 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
murmur3.h
in src/parquet
22 - 2018-08-15 2018-08-15 1 1 cjjnjust@gmail.com cjjnjust@gmail.com
collect_coverage.py
in build-support
19 2 2016-02-09 2016-02-09 1 1 wes@cloudera.com wes@cloudera.com
crypto.h
in src/parquet/util
19 - 2018-08-05 2018-08-05 1 1 gidon@il.ibm.com gidon@il.ibm.com
hasher.h
in src/parquet
18 - 2018-08-15 2018-08-15 1 1 cjjnjust@gmail.com cjjnjust@gmail.com
15 1 2016-09-05 2016-09-09 2 1 deepak.majeti@hpe.com deepak.majeti@hpe.com
windows_compatibility.h
in src/parquet/util
13 - 2017-05-02 2017-05-02 1 1 risuhin.max@gmail.com risuhin.max@gmail.com
benchmark_main.cc
in src/parquet/util
6 1 2016-05-02 2016-05-02 1 1 uwelk@xhochy.com uwelk@xhochy.com
in
parquet_version.h.in
in src/parquet
4 - 2017-06-11 2017-09-11 2 1 deepak.majeti@hpe.com deepak.majeti@hpe.com
properties.h
in src/parquet
320 33 2017-06-26 2018-05-21 4 2 wes.mckinney@twosigma.com deepak.majeti@hpe.com
239 11 2014-10-28 2016-01-13 2 2 nong@cloudera.com nongli@gmail.com
207 7 2017-12-12 2018-04-23 2 2 wes.mckinney@twosigma.com joshua.storck@twosigma.com
column_scanner.h
in src/parquet
191 16 2017-06-26 2018-07-23 3 2 wes.mckinney@twosigma.com antoine@python.org
187 7 2017-06-26 2018-04-23 4 2 wes.mckinney@twosigma.com joshua.storck@twosigma.com
metadata-test.cc
in src/parquet
170 - 2017-12-12 2018-01-24 2 2 wes.mckinney@twosigma.com deepak.majeti@hpe.com
bloom_filter-test.cc
in src/parquet
149 1 2018-08-15 2018-08-17 2 2 cjjnjust@gmail.com antoine@python.org
test-specialization.h
in src/parquet
124 13 2017-06-26 2018-05-21 5 2 wes.mckinney@twosigma.com deepak.majeti@hpe.com
BuildUtils.cmake
in cmake_modules
97 - 2017-05-02 2018-07-14 2 2 risuhin.max@gmail.com uwe.korn@blue-yonder.com
record_reader.h
in src/parquet/arrow
47 - 2017-09-20 2018-07-23 3 2 wes.mckinney@twosigma.com antoine@python.org
stacktrace_addr2line.pl
in build-support
41 1 2016-01-26 2016-07-12 3 2 wes@cloudera.com wesm@apache.org
properties-test.cc
in src/parquet
37 - 2017-06-26 2017-12-12 4 2 wes.mckinney@twosigma.com wes.mckinney@twosigma.com
buffer-builder.h
in src/parquet/util
28 4 2016-02-26 2017-07-31 5 2 wesm@apache.org wes.mckinney@twosigma.com
in
27 - 2017-10-12 2017-10-17 2 2 trink@acm.org lv@cloudera.com
clean-all.cmake
in cmake_modules
27 - 2016-01-09 2016-10-12 2 2 wes@cloudera.com uwelk@xhochy.com
in
parquet.pc.in
in src/parquet
26 - 2017-03-30 2017-05-15 3 2 kou@clear-code.com uwe@apache.org
printer.h
in src/parquet
24 1 2017-12-12 2018-08-17 2 2 wes.mckinney@twosigma.com jacek.pliszka@gmail.com
writer.h
in src/parquet/api
8 - 2016-05-09 2017-12-12 3 2 uwelk@xhochy.com wes.mckinney@twosigma.com
SnappyConfig.h
in cmake_modules
6 - 2017-05-02 2017-10-20 2 2 risuhin.max@gmail.com uwe.korn@blue-yonder.com
schema.h
in src/parquet/api
4 - 2016-03-02 2017-01-26 3 2 wesm@apache.org wes.mckinney@twosigma.com
test-util.h
in src/parquet
341 26 2017-06-26 2018-07-23 7 3 wes.mckinney@twosigma.com antoine@python.org
column_reader-test.cc
in src/parquet
310 7 2017-06-26 2018-07-23 5 3 wes.mckinney@twosigma.com antoine@python.org
SetupCxxFlags.cmake
in cmake_modules
220 - 2017-12-13 2018-08-17 3 3 wes.mckinney@twosigma.com wesm+git@apache.org
printer.cc
in src/parquet
194 2 2017-12-12 2018-08-17 3 3 wes.mckinney@twosigma.com jacek.pliszka@gmail.com
statistics.h
in src/parquet
176 16 2017-06-26 2018-07-23 6 3 wes.mckinney@twosigma.com antoine@python.org
column_writer.h
in src/parquet
167 8 2017-06-26 2018-08-25 8 3 wes.mckinney@twosigma.com deepak.majeti@hpe.com
67 - 2018-02-15 2018-08-17 2 3 uwelk@xhochy.com wesm+git@apache.org
54 1 2016-09-05 2018-08-17 5 3 deepak.majeti@hpe.com jacek.pliszka@gmail.com
51 1 2016-09-05 2017-12-13 6 3 deepak.majeti@hpe.com wes.mckinney@twosigma.com
schema-util.h
in src/parquet/util
51 4 2017-06-22 2018-03-23 6 3 itai.in@gmail.com antoine@python.org
Correlations

File Size vs. Number of Changes: 113 points

src/parquet/arrow/test-util.h x: 390 lines of code y: 27 # changes src/parquet/column_writer-test.cc x: 584 lines of code y: 14 # changes src/parquet/column_writer.cc x: 658 lines of code y: 14 # changes src/parquet/column_writer.h x: 167 lines of code y: 8 # changes src/parquet/file-serialize-test.cc x: 231 lines of code y: 5 # changes src/parquet/file_writer.cc x: 302 lines of code y: 5 # changes src/parquet/file_writer.h x: 96 lines of code y: 5 # changes src/parquet/arrow/arrow-reader-writer-benchmark.cc x: 196 lines of code y: 18 # changes src/parquet/arrow/arrow-reader-writer-test.cc x: 1820 lines of code y: 67 # changes src/parquet/arrow/reader.cc x: 1206 lines of code y: 56 # changes src/parquet/arrow/reader.h x: 108 lines of code y: 20 # changes src/parquet/arrow/writer.cc x: 910 lines of code y: 49 # changes src/parquet/printer.cc x: 194 lines of code y: 3 # changes src/parquet/printer.h x: 24 lines of code y: 2 # changes tools/parquet_reader.cc x: 54 lines of code y: 5 # changes src/parquet/bloom_filter-test.cc x: 149 lines of code y: 2 # changes cmake_modules/ArrowExternalProject.cmake x: 67 lines of code y: 2 # changes cmake_modules/SetupCxxFlags.cmake x: 220 lines of code y: 3 # changes benchmarks/decode_benchmark.cc x: 365 lines of code y: 13 # changes cmake_modules/FindClangTools.cmake x: 111 lines of code y: 5 # changes src/parquet/arrow/record_reader.cc x: 557 lines of code y: 6 # changes src/parquet/types.h x: 210 lines of code y: 30 # changes src/parquet/metadata.cc x: 696 lines of code y: 8 # changes src/parquet/bloom_filter.cc x: 117 lines of code y: 1 # changes src/parquet/bloom_filter.h x: 86 lines of code y: 1 # changes src/parquet/hasher.h x: 18 lines of code y: 1 # changes src/parquet/murmur3.cc x: 136 lines of code y: 1 # changes src/parquet/murmur3.h x: 22 lines of code y: 1 # changes src/parquet/util/crypto.cc x: 255 lines of code y: 1 # changes src/parquet/column_reader.cc x: 333 lines of code y: 12 # changes src/parquet/encoding-internal.h x: 729 lines of code y: 18 # changes src/parquet/encoding-test.cc x: 211 lines of code y: 11 # changes src/parquet/statistics-test.cc x: 599 lines of code y: 16 # changes src/parquet/types-test.cc x: 108 lines of code y: 11 # changes src/parquet/types.cc x: 263 lines of code y: 10 # changes src/parquet/util/macros.h x: 37 lines of code y: 6 # changes src/parquet/parquet.thrift x: 273 lines of code y: 6 # changes src/parquet/arrow/arrow-schema-test.cc x: 579 lines of code y: 28 # changes src/parquet/arrow/writer.h x: 133 lines of code y: 12 # changes src/parquet/encoding-benchmark.cc x: 113 lines of code y: 9 # changes src/parquet/encoding.h x: 87 lines of code y: 14 # changes src/parquet/metadata.h x: 166 lines of code y: 5 # changes src/parquet/statistics.cc x: 251 lines of code y: 7 # changes src/parquet/util/memory-test.cc x: 248 lines of code y: 9 # changes src/parquet/util/memory.cc x: 384 lines of code y: 14 # changes src/parquet/util/memory.h x: 224 lines of code y: 17 # changes src/parquet/schema.cc x: 591 lines of code y: 14 # changes src/parquet/schema.h x: 235 lines of code y: 9 # changes src/parquet/arrow/record_reader.h x: 47 lines of code y: 3 # changes src/parquet/column-io-benchmark.cc x: 186 lines of code y: 7 # changes src/parquet/column_reader-test.cc x: 310 lines of code y: 5 # changes src/parquet/column_reader.h x: 310 lines of code y: 10 # changes src/parquet/column_scanner.h x: 191 lines of code y: 3 # changes src/parquet/file_reader.cc x: 261 lines of code y: 9 # changes src/parquet/statistics.h x: 176 lines of code y: 6 # changes src/parquet/test-util.h x: 341 lines of code y: 7 # changes cmake_modules/BuildUtils.cmake x: 97 lines of code y: 2 # changes src/parquet/util/test-common.h x: 159 lines of code y: 21 # changes cmake_modules/ThirdpartyToolchain.cmake x: 370 lines of code y: 65 # changes appveyor.yml x: 43 lines of code y: 8 # changes src/parquet/properties.h x: 320 lines of code y: 4 # changes src/parquet/test-specialization.h x: 124 lines of code y: 5 # changes src/parquet/column_scanner-test.cc x: 187 lines of code y: 4 # changes src/parquet/file-deserialize-test.cc x: 207 lines of code y: 2 # changes src/parquet/schema-test.cc x: 534 lines of code y: 10 # changes src/parquet/reader-test.cc x: 231 lines of code y: 29 # changes src/parquet/util/schema-util.h x: 51 lines of code y: 6 # changes src/parquet/thrift.h x: 101 lines of code y: 6 # changes src/parquet/exception.cc x: 28 lines of code y: 5 # changes src/parquet/exception.h x: 47 lines of code y: 11 # changes src/parquet/arrow/schema.cc x: 602 lines of code y: 30 # changes src/parquet/util/comparison.h x: 126 lines of code y: 7 # changes cmake_modules/FindArrow.cmake x: 108 lines of code y: 15 # changes src/parquet/metadata-test.cc x: 170 lines of code y: 2 # changes src/parquet/util/comparison.cc x: 53 lines of code y: 2 # changes cmake_modules/CompilerInfo.cmake x: 59 lines of code y: 6 # changes src/parquet/public-api-test.cc x: 26 lines of code y: 9 # changes src/parquet/api/reader.h x: 11 lines of code y: 7 # changes src/parquet/api/writer.h x: 8 lines of code y: 3 # changes src/parquet/column_page.h x: 106 lines of code y: 3 # changes src/parquet/file_reader.h x: 72 lines of code y: 1 # changes src/parquet/properties-test.cc x: 37 lines of code y: 4 # changes build-support/run_clang_format.py x: 50 lines of code y: 1 # changes cmake_modules/SnappyConfig.h x: 6 lines of code y: 2 # changes cmake_modules/parquet-cppConfig.cmake.in x: 27 lines of code y: 2 # changes src/parquet/util/comparison-test.cc x: 168 lines of code y: 6 # changes src/parquet/parquet_version.h.in x: 4 lines of code y: 2 # changes src/parquet/util/logging.h x: 4 lines of code y: 9 # changes src/parquet/util/visibility.h x: 31 lines of code y: 4 # changes cmake_modules/FindBrotli.cmake x: 107 lines of code y: 9 # changes cmake_modules/FindThrift.cmake x: 91 lines of code y: 10 # changes cmake_modules/FindGTest.cmake x: 88 lines of code y: 6 # changes cmake_modules/FindSnappy.cmake x: 88 lines of code y: 11 # changes cmake_modules/FindZLIB.cmake x: 99 lines of code y: 8 # changes src/parquet/parquet.pc.in x: 26 lines of code y: 3 # changes src/parquet/util/stopwatch.h x: 24 lines of code y: 7 # changes src/parquet/util/windows_compatibility.h x: 13 lines of code y: 1 # changes src/parquet/schema-internal.h x: 40 lines of code y: 2 # changes src/parquet/api/schema.h x: 4 lines of code y: 3 # changes src/parquet/api/io.h x: 5 lines of code y: 4 # changes tools/parquet-dump-schema.cc x: 15 lines of code y: 2 # changes build-support/stacktrace_addr2line.pl x: 41 lines of code y: 3 # changes src/parquet/util/benchmark_main.cc x: 6 lines of code y: 1 # changes dev/merge_parquet_pr.py x: 239 lines of code y: 2 # changes build-support/cpplint.py x: 2985 lines of code y: 1 # changes
67.0
# changes
  min: 1.0
  average: 9.26
  25th percentile: 2.0
  median: 6.0
  75th percentile: 11.0
  max: 67.0
0 2985.0
lines of code
min: 4.0 | average: 227.52 | 25th percentile: 44.5 | median: 113.0 | 75th percentile: 253.0 | max: 2985.0

Number of Contributors vs. Number of Changes: 113 points

src/parquet/arrow/test-util.h x: 10 # contributors y: 27 # changes src/parquet/column_writer-test.cc x: 6 # contributors y: 14 # changes src/parquet/column_writer.cc x: 5 # contributors y: 14 # changes src/parquet/column_writer.h x: 3 # contributors y: 8 # changes src/parquet/file-serialize-test.cc x: 4 # contributors y: 5 # changes src/parquet/arrow/arrow-reader-writer-benchmark.cc x: 7 # contributors y: 18 # changes src/parquet/arrow/arrow-reader-writer-test.cc x: 17 # contributors y: 67 # changes src/parquet/arrow/reader.cc x: 12 # contributors y: 56 # changes src/parquet/arrow/reader.h x: 7 # contributors y: 20 # changes src/parquet/arrow/writer.cc x: 14 # contributors y: 49 # changes src/parquet/printer.cc x: 3 # contributors y: 3 # changes src/parquet/printer.h x: 2 # contributors y: 2 # changes tools/parquet_reader.cc x: 3 # contributors y: 5 # changes cmake_modules/ArrowExternalProject.cmake x: 3 # contributors y: 2 # changes benchmarks/decode_benchmark.cc x: 4 # contributors y: 13 # changes src/parquet/arrow/record_reader.cc x: 4 # contributors y: 6 # changes src/parquet/types.h x: 14 # contributors y: 30 # changes src/parquet/metadata.cc x: 4 # contributors y: 8 # changes src/parquet/bloom_filter.cc x: 1 # contributors y: 1 # changes src/parquet/column_reader.cc x: 6 # contributors y: 12 # changes src/parquet/encoding-internal.h x: 8 # contributors y: 18 # changes src/parquet/encoding-test.cc x: 6 # contributors y: 11 # changes src/parquet/statistics-test.cc x: 7 # contributors y: 16 # changes src/parquet/types-test.cc x: 7 # contributors y: 11 # changes src/parquet/types.cc x: 6 # contributors y: 10 # changes src/parquet/parquet.thrift x: 5 # contributors y: 6 # changes src/parquet/arrow/arrow-schema-test.cc x: 9 # contributors y: 28 # changes src/parquet/arrow/writer.h x: 7 # contributors y: 12 # changes src/parquet/encoding-benchmark.cc x: 4 # contributors y: 9 # changes src/parquet/encoding.h x: 9 # contributors y: 14 # changes src/parquet/statistics.cc x: 4 # contributors y: 7 # changes src/parquet/util/memory.h x: 7 # contributors y: 17 # changes src/parquet/schema.cc x: 8 # contributors y: 14 # changes src/parquet/schema.h x: 5 # contributors y: 9 # changes src/parquet/arrow/record_reader.h x: 2 # contributors y: 3 # changes src/parquet/statistics.h x: 3 # contributors y: 6 # changes src/parquet/test-util.h x: 3 # contributors y: 7 # changes src/parquet/util/test-common.h x: 9 # contributors y: 21 # changes cmake_modules/ThirdpartyToolchain.cmake x: 15 # contributors y: 65 # changes src/parquet/properties.h x: 2 # contributors y: 4 # changes src/parquet/test-specialization.h x: 2 # contributors y: 5 # changes src/parquet/reader-test.cc x: 8 # contributors y: 29 # changes src/parquet/exception.h x: 5 # contributors y: 11 # changes src/parquet/arrow/schema.cc x: 8 # contributors y: 30 # changes cmake_modules/FindArrow.cmake x: 8 # contributors y: 15 # changes src/parquet/util/comparison.cc x: 1 # contributors y: 2 # changes src/parquet/public-api-test.cc x: 3 # contributors y: 9 # changes src/parquet/column_page.h x: 1 # contributors y: 3 # changes src/parquet/util/visibility.h x: 3 # contributors y: 4 # changes cmake_modules/FindZLIB.cmake x: 5 # contributors y: 8 # changes
67.0
# changes
  min: 1.0
  average: 9.26
  25th percentile: 2.0
  median: 6.0
  75th percentile: 11.0
  max: 67.0
0 17.0
# contributors
min: 1.0 | average: 4.22 | 25th percentile: 2.0 | median: 4.0 | 75th percentile: 6.0 | max: 17.0

Number of Contributors vs. File Size: 113 points

src/parquet/arrow/test-util.h x: 10 # contributors y: 390 lines of code src/parquet/column_writer-test.cc x: 6 # contributors y: 584 lines of code src/parquet/column_writer.cc x: 5 # contributors y: 658 lines of code src/parquet/column_writer.h x: 3 # contributors y: 167 lines of code src/parquet/file-serialize-test.cc x: 4 # contributors y: 231 lines of code src/parquet/file_writer.cc x: 4 # contributors y: 302 lines of code src/parquet/file_writer.h x: 4 # contributors y: 96 lines of code src/parquet/arrow/arrow-reader-writer-benchmark.cc x: 7 # contributors y: 196 lines of code src/parquet/arrow/arrow-reader-writer-test.cc x: 17 # contributors y: 1820 lines of code src/parquet/arrow/reader.cc x: 12 # contributors y: 1206 lines of code src/parquet/arrow/reader.h x: 7 # contributors y: 108 lines of code src/parquet/arrow/writer.cc x: 14 # contributors y: 910 lines of code src/parquet/printer.cc x: 3 # contributors y: 194 lines of code src/parquet/printer.h x: 2 # contributors y: 24 lines of code tools/parquet_reader.cc x: 3 # contributors y: 54 lines of code src/parquet/bloom_filter-test.cc x: 2 # contributors y: 149 lines of code cmake_modules/ArrowExternalProject.cmake x: 3 # contributors y: 67 lines of code cmake_modules/SetupCxxFlags.cmake x: 3 # contributors y: 220 lines of code benchmarks/decode_benchmark.cc x: 4 # contributors y: 365 lines of code cmake_modules/FindClangTools.cmake x: 4 # contributors y: 111 lines of code src/parquet/arrow/record_reader.cc x: 4 # contributors y: 557 lines of code src/parquet/types.h x: 14 # contributors y: 210 lines of code src/parquet/metadata.cc x: 4 # contributors y: 696 lines of code src/parquet/bloom_filter.cc x: 1 # contributors y: 117 lines of code src/parquet/bloom_filter.h x: 1 # contributors y: 86 lines of code src/parquet/hasher.h x: 1 # contributors y: 18 lines of code src/parquet/murmur3.cc x: 1 # contributors y: 136 lines of code src/parquet/util/crypto.cc x: 1 # contributors y: 255 lines of code src/parquet/column_reader.cc x: 6 # contributors y: 333 lines of code src/parquet/encoding-internal.h x: 8 # contributors y: 729 lines of code src/parquet/encoding-test.cc x: 6 # contributors y: 211 lines of code src/parquet/statistics-test.cc x: 7 # contributors y: 599 lines of code src/parquet/types.cc x: 6 # contributors y: 263 lines of code src/parquet/util/macros.h x: 4 # contributors y: 37 lines of code src/parquet/parquet.thrift x: 5 # contributors y: 273 lines of code src/parquet/arrow/arrow-schema-test.cc x: 9 # contributors y: 579 lines of code src/parquet/arrow/writer.h x: 7 # contributors y: 133 lines of code src/parquet/encoding.h x: 9 # contributors y: 87 lines of code src/parquet/metadata.h x: 4 # contributors y: 166 lines of code src/parquet/statistics.cc x: 4 # contributors y: 251 lines of code src/parquet/util/memory.cc x: 6 # contributors y: 384 lines of code src/parquet/util/memory.h x: 7 # contributors y: 224 lines of code src/parquet/schema.cc x: 8 # contributors y: 591 lines of code src/parquet/schema.h x: 5 # contributors y: 235 lines of code src/parquet/arrow/record_reader.h x: 2 # contributors y: 47 lines of code src/parquet/column-io-benchmark.cc x: 4 # contributors y: 186 lines of code src/parquet/column_reader-test.cc x: 3 # contributors y: 310 lines of code src/parquet/column_reader.h x: 6 # contributors y: 310 lines of code src/parquet/column_scanner.h x: 2 # contributors y: 191 lines of code src/parquet/file_reader.cc x: 5 # contributors y: 261 lines of code src/parquet/test-util.h x: 3 # contributors y: 341 lines of code cmake_modules/BuildUtils.cmake x: 2 # contributors y: 97 lines of code src/parquet/util/test-common.h x: 9 # contributors y: 159 lines of code cmake_modules/ThirdpartyToolchain.cmake x: 15 # contributors y: 370 lines of code appveyor.yml x: 3 # contributors y: 43 lines of code src/parquet/properties.h x: 2 # contributors y: 320 lines of code src/parquet/test-specialization.h x: 2 # contributors y: 124 lines of code src/parquet/column_scanner-test.cc x: 2 # contributors y: 187 lines of code src/parquet/file-deserialize-test.cc x: 2 # contributors y: 207 lines of code src/parquet/schema-test.cc x: 6 # contributors y: 534 lines of code src/parquet/reader-test.cc x: 8 # contributors y: 231 lines of code src/parquet/exception.cc x: 3 # contributors y: 28 lines of code src/parquet/exception.h x: 5 # contributors y: 47 lines of code src/parquet/arrow/schema.cc x: 8 # contributors y: 602 lines of code src/parquet/util/comparison.h x: 4 # contributors y: 126 lines of code cmake_modules/FindArrow.cmake x: 8 # contributors y: 108 lines of code src/parquet/metadata-test.cc x: 2 # contributors y: 170 lines of code src/parquet/util/comparison.cc x: 1 # contributors y: 53 lines of code cmake_modules/CompilerInfo.cmake x: 5 # contributors y: 59 lines of code src/parquet/api/reader.h x: 3 # contributors y: 11 lines of code src/parquet/api/writer.h x: 2 # contributors y: 8 lines of code src/parquet/file_reader.h x: 1 # contributors y: 72 lines of code src/parquet/arrow/schema.h x: 6 # contributors y: 46 lines of code src/parquet/parquet_version.h.in x: 1 # contributors y: 4 lines of code src/parquet/column_scanner.cc x: 1 # contributors y: 65 lines of code src/parquet/util/logging.h x: 5 # contributors y: 4 lines of code cmake_modules/FindThrift.cmake x: 6 # contributors y: 91 lines of code cmake_modules/FindGTest.cmake x: 5 # contributors y: 88 lines of code cmake_modules/FindSnappy.cmake x: 7 # contributors y: 88 lines of code cmake_modules/FindZLIB.cmake x: 5 # contributors y: 99 lines of code src/parquet/util/stopwatch.h x: 4 # contributors y: 24 lines of code src/parquet/schema-internal.h x: 1 # contributors y: 40 lines of code dev/merge_parquet_pr.py x: 2 # contributors y: 239 lines of code build-support/cpplint.py x: 1 # contributors y: 2985 lines of code
2985.0
lines of code
  min: 4.0
  average: 227.52
  25th percentile: 44.5
  median: 113.0
  75th percentile: 253.0
  max: 2985.0
0 17.0
# contributors
min: 1.0 | average: 4.22 | 25th percentile: 2.0 | median: 4.0 | 75th percentile: 6.0 | max: 17.0