apache / parquet-cpp
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
23% | 27% | 27% | 14% | 8%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
cc19% | 41% | 25% | 11% | 2%
py90% | 0% | 7% | 0% | 2%
h0% | 14% | 39% | 30% | 16%
cmake0% | 0% | 36% | 20% | 43%
thrift0% | 0% | 100% | 0% | 0%
in0% | 0% | 0% | 0% | 100%
yml0% | 0% | 0% | 0% | 100%
pl0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src14% | 34% | 28% | 16% | 5%
build-support96% | 0% | 0% | 0% | 3%
cmake_modules0% | 0% | 35% | 19% | 44%
benchmarks0% | 0% | 100% | 0% | 0%
dev0% | 0% | 100% | 0% | 0%
tools0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
cpplint.py
in build-support
2985 161
arrow-reader-writer-test.cc
in src/parquet/arrow
1820 43
reader.cc
in src/parquet/arrow
1206 80
writer.cc
in src/parquet/arrow
910 53
encoding-internal.h
in src/parquet
729 66
metadata.cc
in src/parquet
696 95
column_writer.cc
in src/parquet
658 41
schema.cc
in src/parquet/arrow
602 22
statistics-test.cc
in src/parquet
599 24
schema.cc
in src/parquet
591 54
column_writer-test.cc
in src/parquet
584 19
arrow-schema-test.cc
in src/parquet/arrow
579 5
record_reader.cc
in src/parquet/arrow
557 47
schema-test.cc
in src/parquet
534 26
test-util.h
in src/parquet/arrow
390 13
memory.cc
in src/parquet/util
384 26
370 -
decode_benchmark.cc
in benchmarks
365 18
test-util.h
in src/parquet
341 26
column_reader.cc
in src/parquet
333 12
properties.h
in src/parquet
320 33
column_reader.h
in src/parquet
310 11
column_reader-test.cc
in src/parquet
310 7
file_writer.cc
in src/parquet
302 29
parquet.thrift
in src/parquet
273 -
types.cc
in src/parquet
263 9
file_reader.cc
in src/parquet
261 18
crypto.cc
in src/parquet/util
255 9
statistics.cc
in src/parquet
251 23
memory-test.cc
in src/parquet/util
248 1
239 11
schema.h
in src/parquet
235 32
reader-test.cc
in src/parquet
231 8
231 6
memory.h
in src/parquet/util
224 10
SetupCxxFlags.cmake
in cmake_modules
220 -
encoding-test.cc
in src/parquet
211 15
types.h
in src/parquet
210 7
207 7
196 8
printer.cc
in src/parquet
194 2
column_scanner.h
in src/parquet
191 16
187 7
186 8
statistics.h
in src/parquet
176 16
metadata-test.cc
in src/parquet
170 -
comparison-test.cc
in src/parquet/util
168 2
column_writer.h
in src/parquet
167 8
metadata.h
in src/parquet
166 -
test-common.h
in src/parquet/util
159 14
Files With Most Units (Top 50)
File# lines# units
cpplint.py
in build-support
2985 161
metadata.cc
in src/parquet
696 95
reader.cc
in src/parquet/arrow
1206 80
encoding-internal.h
in src/parquet
729 66
schema.cc
in src/parquet
591 54
writer.cc
in src/parquet/arrow
910 53
record_reader.cc
in src/parquet/arrow
557 47
arrow-reader-writer-test.cc
in src/parquet/arrow
1820 43
column_writer.cc
in src/parquet
658 41
properties.h
in src/parquet
320 33
schema.h
in src/parquet
235 32
file_writer.cc
in src/parquet
302 29
memory.cc
in src/parquet/util
384 26
test-util.h
in src/parquet
341 26
schema-test.cc
in src/parquet
534 26
statistics-test.cc
in src/parquet
599 24
statistics.cc
in src/parquet
251 23
schema.cc
in src/parquet/arrow
602 22
column_writer-test.cc
in src/parquet
584 19
decode_benchmark.cc
in benchmarks
365 18
file_reader.cc
in src/parquet
261 18
column_page.h
in src/parquet
106 18
statistics.h
in src/parquet
176 16
column_scanner.h
in src/parquet
191 16
encoding-test.cc
in src/parquet
211 15
test-common.h
in src/parquet/util
159 14
murmur3.cc
in src/parquet
136 14
test-specialization.h
in src/parquet
124 13
test-util.h
in src/parquet/arrow
390 13
column_reader.cc
in src/parquet
333 12
thrift.h
in src/parquet
101 12
comparison.h
in src/parquet/util
126 12
239 11
column_reader.h
in src/parquet
310 11
bloom_filter.h
in src/parquet
86 10
memory.h
in src/parquet/util
224 10
crypto.cc
in src/parquet/util
255 9
encoding.h
in src/parquet
87 9
types.cc
in src/parquet
263 9
column_writer.h
in src/parquet
167 8
reader-test.cc
in src/parquet
231 8
196 8
186 8
encoding-benchmark.cc
in src/parquet
113 8
207 7
types.h
in src/parquet
210 7
187 7
column_reader-test.cc
in src/parquet
310 7
bloom_filter.cc
in src/parquet
117 7
writer.h
in src/parquet/arrow
133 6
Files With Long Lines (Top 8)

There are 8 files with lines longer than 120 characters. In total, there are 25 long lines.

File# lines# units# long lines
reader-test.cc
in src/parquet
231 8 11
FindBrotli.cmake
in cmake_modules
107 - 6
370 - 2
FindZLIB.cmake
in cmake_modules
99 - 2
BuildUtils.cmake
in cmake_modules
97 - 1
FindGTest.cmake
in cmake_modules
88 - 1
FindArrow.cmake
in cmake_modules
108 - 1
FindSnappy.cmake
in cmake_modules
88 - 1
Correlations

File Size vs. Commits (all time): 113 points

src/parquet/arrow/test-util.h x: 29 commits (all time) y: 390 lines of code src/parquet/column_writer-test.cc x: 14 commits (all time) y: 584 lines of code src/parquet/column_writer.cc x: 14 commits (all time) y: 658 lines of code src/parquet/column_writer.h x: 8 commits (all time) y: 167 lines of code src/parquet/file-serialize-test.cc x: 5 commits (all time) y: 231 lines of code src/parquet/file_writer.cc x: 5 commits (all time) y: 302 lines of code src/parquet/file_writer.h x: 5 commits (all time) y: 96 lines of code src/parquet/arrow/arrow-reader-writer-benchmark.cc x: 18 commits (all time) y: 196 lines of code src/parquet/arrow/arrow-reader-writer-test.cc x: 76 commits (all time) y: 1820 lines of code src/parquet/arrow/reader.cc x: 61 commits (all time) y: 1206 lines of code src/parquet/arrow/reader.h x: 20 commits (all time) y: 108 lines of code src/parquet/arrow/writer.cc x: 51 commits (all time) y: 910 lines of code src/parquet/printer.cc x: 3 commits (all time) y: 194 lines of code src/parquet/printer.h x: 2 commits (all time) y: 24 lines of code tools/parquet_reader.cc x: 5 commits (all time) y: 54 lines of code src/parquet/bloom_filter-test.cc x: 3 commits (all time) y: 149 lines of code cmake_modules/ArrowExternalProject.cmake x: 3 commits (all time) y: 67 lines of code cmake_modules/SetupCxxFlags.cmake x: 4 commits (all time) y: 220 lines of code benchmarks/decode_benchmark.cc x: 13 commits (all time) y: 365 lines of code cmake_modules/FindClangTools.cmake x: 5 commits (all time) y: 111 lines of code src/parquet/arrow/record_reader.cc x: 6 commits (all time) y: 557 lines of code src/parquet/types.h x: 30 commits (all time) y: 210 lines of code src/parquet/metadata.cc x: 8 commits (all time) y: 696 lines of code src/parquet/bloom_filter.cc x: 1 commits (all time) y: 117 lines of code src/parquet/bloom_filter.h x: 1 commits (all time) y: 86 lines of code src/parquet/hasher.h x: 1 commits (all time) y: 18 lines of code src/parquet/murmur3.cc x: 1 commits (all time) y: 136 lines of code src/parquet/util/crypto.cc x: 1 commits (all time) y: 255 lines of code src/parquet/column_reader.cc x: 14 commits (all time) y: 333 lines of code src/parquet/encoding-internal.h x: 18 commits (all time) y: 729 lines of code src/parquet/encoding-test.cc x: 11 commits (all time) y: 211 lines of code src/parquet/statistics-test.cc x: 16 commits (all time) y: 599 lines of code src/parquet/types-test.cc x: 11 commits (all time) y: 108 lines of code src/parquet/types.cc x: 10 commits (all time) y: 263 lines of code src/parquet/util/macros.h x: 6 commits (all time) y: 37 lines of code src/parquet/parquet.thrift x: 6 commits (all time) y: 273 lines of code src/parquet/arrow/arrow-schema-test.cc x: 28 commits (all time) y: 579 lines of code src/parquet/arrow/writer.h x: 12 commits (all time) y: 133 lines of code src/parquet/encoding-benchmark.cc x: 9 commits (all time) y: 113 lines of code src/parquet/encoding.h x: 14 commits (all time) y: 87 lines of code src/parquet/metadata.h x: 5 commits (all time) y: 166 lines of code src/parquet/statistics.cc x: 7 commits (all time) y: 251 lines of code src/parquet/util/memory-test.cc x: 9 commits (all time) y: 248 lines of code src/parquet/util/memory.cc x: 14 commits (all time) y: 384 lines of code src/parquet/util/memory.h x: 17 commits (all time) y: 224 lines of code src/parquet/schema.cc x: 14 commits (all time) y: 591 lines of code src/parquet/schema.h x: 9 commits (all time) y: 235 lines of code src/parquet/arrow/record_reader.h x: 3 commits (all time) y: 47 lines of code src/parquet/column-io-benchmark.cc x: 7 commits (all time) y: 186 lines of code src/parquet/column_reader-test.cc x: 5 commits (all time) y: 310 lines of code src/parquet/column_reader.h x: 11 commits (all time) y: 310 lines of code src/parquet/file_reader.cc x: 9 commits (all time) y: 261 lines of code src/parquet/statistics.h x: 6 commits (all time) y: 176 lines of code src/parquet/test-util.h x: 7 commits (all time) y: 341 lines of code cmake_modules/BuildUtils.cmake x: 2 commits (all time) y: 97 lines of code src/parquet/util/test-common.h x: 21 commits (all time) y: 159 lines of code cmake_modules/ThirdpartyToolchain.cmake x: 80 commits (all time) y: 370 lines of code appveyor.yml x: 8 commits (all time) y: 43 lines of code src/parquet/properties.h x: 4 commits (all time) y: 320 lines of code src/parquet/test-specialization.h x: 5 commits (all time) y: 124 lines of code src/parquet/column_scanner-test.cc x: 4 commits (all time) y: 187 lines of code src/parquet/file-deserialize-test.cc x: 2 commits (all time) y: 207 lines of code src/parquet/schema-test.cc x: 10 commits (all time) y: 534 lines of code src/parquet/reader-test.cc x: 31 commits (all time) y: 231 lines of code src/parquet/util/schema-util.h x: 6 commits (all time) y: 51 lines of code src/parquet/thrift.h x: 6 commits (all time) y: 101 lines of code src/parquet/exception.cc x: 5 commits (all time) y: 28 lines of code src/parquet/exception.h x: 11 commits (all time) y: 47 lines of code src/parquet/arrow/schema.cc x: 31 commits (all time) y: 602 lines of code src/parquet/util/comparison.h x: 7 commits (all time) y: 126 lines of code cmake_modules/FindArrow.cmake x: 15 commits (all time) y: 108 lines of code src/parquet/metadata-test.cc x: 2 commits (all time) y: 170 lines of code src/parquet/util/comparison.cc x: 2 commits (all time) y: 53 lines of code cmake_modules/CompilerInfo.cmake x: 6 commits (all time) y: 59 lines of code src/parquet/public-api-test.cc x: 9 commits (all time) y: 26 lines of code src/parquet/api/reader.h x: 7 commits (all time) y: 11 lines of code src/parquet/api/writer.h x: 3 commits (all time) y: 8 lines of code src/parquet/column_page.h x: 3 commits (all time) y: 106 lines of code src/parquet/file_reader.h x: 1 commits (all time) y: 72 lines of code src/parquet/properties-test.cc x: 4 commits (all time) y: 37 lines of code build-support/run_clang_format.py x: 1 commits (all time) y: 50 lines of code cmake_modules/SnappyConfig.h x: 2 commits (all time) y: 6 lines of code src/parquet/column_scanner.cc x: 2 commits (all time) y: 65 lines of code src/parquet/util/logging.h x: 9 commits (all time) y: 4 lines of code src/parquet/util/visibility.h x: 4 commits (all time) y: 31 lines of code cmake_modules/FindThrift.cmake x: 10 commits (all time) y: 91 lines of code cmake_modules/FindGTest.cmake x: 6 commits (all time) y: 88 lines of code cmake_modules/FindSnappy.cmake x: 12 commits (all time) y: 88 lines of code cmake_modules/FindZLIB.cmake x: 8 commits (all time) y: 99 lines of code src/parquet/parquet.pc.in x: 3 commits (all time) y: 26 lines of code src/parquet/util/stopwatch.h x: 7 commits (all time) y: 24 lines of code src/parquet/schema-internal.h x: 2 commits (all time) y: 40 lines of code src/parquet/api/io.h x: 4 commits (all time) y: 5 lines of code tools/parquet-dump-schema.cc x: 2 commits (all time) y: 15 lines of code src/parquet/util/benchmark_main.cc x: 1 commits (all time) y: 6 lines of code dev/merge_parquet_pr.py x: 2 commits (all time) y: 239 lines of code build-support/cpplint.py x: 1 commits (all time) y: 2985 lines of code
2985.0
lines of code
  min: 4.0
  average: 227.52
  25th percentile: 44.5
  median: 113.0
  75th percentile: 253.0
  max: 2985.0
0 80.0
commits (all time)
min: 1.0 | average: 9.64 | 25th percentile: 3.0 | median: 6.0 | 75th percentile: 11.0 | max: 80.0

File Size vs. Contributors (all time): 113 points

src/parquet/arrow/test-util.h x: 10 contributors (all time) y: 390 lines of code src/parquet/column_writer-test.cc x: 6 contributors (all time) y: 584 lines of code src/parquet/column_writer.cc x: 5 contributors (all time) y: 658 lines of code src/parquet/column_writer.h x: 3 contributors (all time) y: 167 lines of code src/parquet/file-serialize-test.cc x: 4 contributors (all time) y: 231 lines of code src/parquet/file_writer.cc x: 4 contributors (all time) y: 302 lines of code src/parquet/file_writer.h x: 4 contributors (all time) y: 96 lines of code src/parquet/arrow/arrow-reader-writer-benchmark.cc x: 7 contributors (all time) y: 196 lines of code src/parquet/arrow/arrow-reader-writer-test.cc x: 17 contributors (all time) y: 1820 lines of code src/parquet/arrow/reader.cc x: 12 contributors (all time) y: 1206 lines of code src/parquet/arrow/reader.h x: 7 contributors (all time) y: 108 lines of code src/parquet/arrow/writer.cc x: 14 contributors (all time) y: 910 lines of code src/parquet/printer.cc x: 3 contributors (all time) y: 194 lines of code src/parquet/printer.h x: 2 contributors (all time) y: 24 lines of code tools/parquet_reader.cc x: 3 contributors (all time) y: 54 lines of code src/parquet/bloom_filter-test.cc x: 2 contributors (all time) y: 149 lines of code cmake_modules/ArrowExternalProject.cmake x: 3 contributors (all time) y: 67 lines of code cmake_modules/SetupCxxFlags.cmake x: 3 contributors (all time) y: 220 lines of code benchmarks/decode_benchmark.cc x: 4 contributors (all time) y: 365 lines of code cmake_modules/FindClangTools.cmake x: 4 contributors (all time) y: 111 lines of code src/parquet/arrow/record_reader.cc x: 4 contributors (all time) y: 557 lines of code src/parquet/types.h x: 14 contributors (all time) y: 210 lines of code src/parquet/metadata.cc x: 4 contributors (all time) y: 696 lines of code src/parquet/bloom_filter.cc x: 1 contributors (all time) y: 117 lines of code src/parquet/bloom_filter.h x: 1 contributors (all time) y: 86 lines of code src/parquet/hasher.h x: 1 contributors (all time) y: 18 lines of code src/parquet/murmur3.cc x: 1 contributors (all time) y: 136 lines of code src/parquet/util/crypto.cc x: 1 contributors (all time) y: 255 lines of code src/parquet/column_reader.cc x: 6 contributors (all time) y: 333 lines of code src/parquet/encoding-internal.h x: 8 contributors (all time) y: 729 lines of code src/parquet/encoding-test.cc x: 6 contributors (all time) y: 211 lines of code src/parquet/statistics-test.cc x: 7 contributors (all time) y: 599 lines of code src/parquet/types.cc x: 6 contributors (all time) y: 263 lines of code src/parquet/util/macros.h x: 4 contributors (all time) y: 37 lines of code src/parquet/parquet.thrift x: 5 contributors (all time) y: 273 lines of code src/parquet/arrow/arrow-schema-test.cc x: 9 contributors (all time) y: 579 lines of code src/parquet/arrow/writer.h x: 7 contributors (all time) y: 133 lines of code src/parquet/encoding.h x: 9 contributors (all time) y: 87 lines of code src/parquet/metadata.h x: 4 contributors (all time) y: 166 lines of code src/parquet/statistics.cc x: 4 contributors (all time) y: 251 lines of code src/parquet/util/memory.cc x: 6 contributors (all time) y: 384 lines of code src/parquet/util/memory.h x: 7 contributors (all time) y: 224 lines of code src/parquet/schema.cc x: 8 contributors (all time) y: 591 lines of code src/parquet/schema.h x: 5 contributors (all time) y: 235 lines of code src/parquet/arrow/record_reader.h x: 2 contributors (all time) y: 47 lines of code src/parquet/column-io-benchmark.cc x: 4 contributors (all time) y: 186 lines of code src/parquet/column_reader-test.cc x: 3 contributors (all time) y: 310 lines of code src/parquet/column_reader.h x: 6 contributors (all time) y: 310 lines of code src/parquet/column_scanner.h x: 2 contributors (all time) y: 191 lines of code src/parquet/file_reader.cc x: 5 contributors (all time) y: 261 lines of code src/parquet/test-util.h x: 3 contributors (all time) y: 341 lines of code cmake_modules/BuildUtils.cmake x: 2 contributors (all time) y: 97 lines of code src/parquet/util/test-common.h x: 9 contributors (all time) y: 159 lines of code cmake_modules/ThirdpartyToolchain.cmake x: 15 contributors (all time) y: 370 lines of code appveyor.yml x: 3 contributors (all time) y: 43 lines of code src/parquet/properties.h x: 2 contributors (all time) y: 320 lines of code src/parquet/test-specialization.h x: 2 contributors (all time) y: 124 lines of code src/parquet/column_scanner-test.cc x: 2 contributors (all time) y: 187 lines of code src/parquet/file-deserialize-test.cc x: 2 contributors (all time) y: 207 lines of code src/parquet/schema-test.cc x: 6 contributors (all time) y: 534 lines of code src/parquet/reader-test.cc x: 8 contributors (all time) y: 231 lines of code src/parquet/exception.cc x: 3 contributors (all time) y: 28 lines of code src/parquet/exception.h x: 5 contributors (all time) y: 47 lines of code src/parquet/arrow/schema.cc x: 8 contributors (all time) y: 602 lines of code src/parquet/util/comparison.h x: 4 contributors (all time) y: 126 lines of code cmake_modules/FindArrow.cmake x: 8 contributors (all time) y: 108 lines of code src/parquet/metadata-test.cc x: 2 contributors (all time) y: 170 lines of code src/parquet/util/comparison.cc x: 1 contributors (all time) y: 53 lines of code cmake_modules/CompilerInfo.cmake x: 5 contributors (all time) y: 59 lines of code src/parquet/api/reader.h x: 3 contributors (all time) y: 11 lines of code src/parquet/api/writer.h x: 2 contributors (all time) y: 8 lines of code src/parquet/file_reader.h x: 1 contributors (all time) y: 72 lines of code src/parquet/arrow/schema.h x: 6 contributors (all time) y: 46 lines of code src/parquet/parquet_version.h.in x: 1 contributors (all time) y: 4 lines of code src/parquet/column_scanner.cc x: 1 contributors (all time) y: 65 lines of code src/parquet/util/logging.h x: 5 contributors (all time) y: 4 lines of code cmake_modules/FindThrift.cmake x: 6 contributors (all time) y: 91 lines of code cmake_modules/FindGTest.cmake x: 5 contributors (all time) y: 88 lines of code cmake_modules/FindSnappy.cmake x: 7 contributors (all time) y: 88 lines of code cmake_modules/FindZLIB.cmake x: 5 contributors (all time) y: 99 lines of code src/parquet/util/stopwatch.h x: 4 contributors (all time) y: 24 lines of code src/parquet/schema-internal.h x: 1 contributors (all time) y: 40 lines of code dev/merge_parquet_pr.py x: 2 contributors (all time) y: 239 lines of code build-support/cpplint.py x: 1 contributors (all time) y: 2985 lines of code
2985.0
lines of code
  min: 4.0
  average: 227.52
  25th percentile: 44.5
  median: 113.0
  75th percentile: 253.0
  max: 2985.0
0 17.0
contributors (all time)
min: 1.0 | average: 4.22 | 25th percentile: 2.0 | median: 4.0 | 75th percentile: 6.0 | max: 17.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".