apache / hudi-rs
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
11% | 17% | 46% | 11% | 12%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
rs12% | 19% | 49% | 9% | 8%
pyi0% | 0% | 0% | 100% | 0%
toml0% | 0% | 0% | 23% | 76%
py0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
crates13% | 14% | 51% | 10% | 10%
python0% | 62% | 0% | 14% | 22%
ROOT0% | 0% | 0% | 54% | 45%
Longest Files (Top 50)
File# lines# units
mod.rs
in crates/core/src/table
1306 30
arrow_array_reader.rs
in crates/core/src/avro_to_arrow
877 22
internal.rs
in python/src
541 30
selector.rs
in crates/core/src/timeline
540 15
lib.rs
in crates/datafusion/src
477 11
builder.rs
in crates/core/src/file_group
422 18
mod.rs
in crates/core/src/timeline
364 -
record_merger.rs
in crates/core/src/merge
340 9
reader.rs
in crates/core/src/file_group/log_file
323 13
instant.rs
in crates/core/src/timeline
319 21
builder.rs
in crates/core/src/table
315 12
partition.rs
in crates/core/src/table
311 14
log_block.rs
in crates/core/src/file_group/log_file
310 11
reader.rs
in crates/core/src/file_group
291 6
filter.rs
in crates/core/src/expr
282 8
schema.rs
in crates/core/src/avro_to_arrow
257 4
mod.rs
in crates/core/src/file_group
244 6
expr.rs
in crates/datafusion/src/util
229 8
fs_view.rs
in crates/core/src/table
210 -
listing.rs
in crates/core/src/table
204 1
content.rs
in crates/core/src/file_group/log_file
204 4
file_slice.rs
in crates/core/src/file_group
198 6
table.rs
in crates/core/src/config
192 5
log_format.rs
in crates/core/src/file_group/log_file
156 8
pyi
_internal.pyi
in python/hudi
127 38
arrow.rs
in crates/core/src/util
125 7
mod.rs
in crates/core/src/file_group/log_file
122 3
cliff.toml
in root
114 -
util.rs
in crates/core/src/storage
105 3
mod.rs
in crates/core/src/expr
104 4
Cargo.toml
in crates/core
78 -
Cargo.toml
in root
75 -
mod.rs
in crates/core/src/config
72 4
base_file.rs
in crates/core/src/file_group
70 1
69 -
scanner.rs
in crates/core/src/file_group/log_file
68 -
error.rs
in crates/core/src
61 -
util.rs
in crates/core/src/config
59 -
avro.rs
in crates/core/src/file_group/log_file
54 1
mod.rs
in crates/core/src/storage
53 -
Cargo.toml
in crates/datafusion
52 -
meta_field.rs
in crates/core/src/metadata
48 1
read.rs
in crates/core/src/config
46 1
Cargo.toml
in python
45 -
reader.rs
in crates/core/src/storage
45 2
builder.py
in python/hudi/table
42 9
mod.rs
in crates/core/src/merge
41 1
error.rs
in crates/core/src/storage
39 -
internal.rs
in crates/core/src/config
38 1
error.rs
in crates/core/src/config
37 -
Files With Most Units (Top 36)
File# lines# units
pyi
_internal.pyi
in python/hudi
127 38
internal.rs
in python/src
541 30
mod.rs
in crates/core/src/table
1306 30
arrow_array_reader.rs
in crates/core/src/avro_to_arrow
877 22
instant.rs
in crates/core/src/timeline
319 21
builder.rs
in crates/core/src/file_group
422 18
selector.rs
in crates/core/src/timeline
540 15
partition.rs
in crates/core/src/table
311 14
reader.rs
in crates/core/src/file_group/log_file
323 13
builder.rs
in crates/core/src/table
315 12
lib.rs
in crates/datafusion/src
477 11
log_block.rs
in crates/core/src/file_group/log_file
310 11
builder.py
in python/hudi/table
42 9
record_merger.rs
in crates/core/src/merge
340 9
expr.rs
in crates/datafusion/src/util
229 8
log_format.rs
in crates/core/src/file_group/log_file
156 8
filter.rs
in crates/core/src/expr
282 8
arrow.rs
in crates/core/src/util
125 7
file_slice.rs
in crates/core/src/file_group
198 6
reader.rs
in crates/core/src/file_group
291 6
mod.rs
in crates/core/src/file_group
244 6
table.rs
in crates/core/src/config
192 5
mod.rs
in crates/core/src/config
72 4
content.rs
in crates/core/src/file_group/log_file
204 4
mod.rs
in crates/core/src/expr
104 4
schema.rs
in crates/core/src/avro_to_arrow
257 4
mod.rs
in crates/core/src/file_group/log_file
122 3
util.rs
in crates/core/src/storage
105 3
reader.rs
in crates/core/src/storage
45 2
meta_field.rs
in crates/core/src/metadata
48 1
listing.rs
in crates/core/src/table
204 1
read.rs
in crates/core/src/config
46 1
internal.rs
in crates/core/src/config
38 1
avro.rs
in crates/core/src/file_group/log_file
54 1
base_file.rs
in crates/core/src/file_group
70 1
mod.rs
in crates/core/src/merge
41 1
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 6 long lines.

File# lines# units# long lines
builder.rs
in crates/core/src/file_group
422 18 2
arrow_array_reader.rs
in crates/core/src/avro_to_arrow
877 22 2
util.rs
in crates/core/src/config
59 - 1
mod.rs
in crates/core/src/file_group
244 6 1
Correlations

File Size vs. Commits (all time): 62 points

crates/core/src/config/util.rs x: 2 commits (all time) y: 59 lines of code crates/core/src/expr/filter.rs x: 5 commits (all time) y: 282 lines of code crates/core/src/file_group/mod.rs x: 27 commits (all time) y: 244 lines of code crates/core/src/file_group/reader.rs x: 13 commits (all time) y: 291 lines of code crates/core/src/table/mod.rs x: 63 commits (all time) y: 1306 lines of code crates/core/src/util/mod.rs x: 4 commits (all time) y: 19 lines of code crates/datafusion/src/lib.rs x: 38 commits (all time) y: 477 lines of code python/src/internal.rs x: 23 commits (all time) y: 541 lines of code crates/core/src/timeline/instant.rs x: 5 commits (all time) y: 319 lines of code crates/core/src/timeline/mod.rs x: 18 commits (all time) y: 364 lines of code crates/core/src/timeline/selector.rs x: 9 commits (all time) y: 540 lines of code python/hudi/__init__.py x: 11 commits (all time) y: 17 lines of code python/hudi/_internal.pyi x: 24 commits (all time) y: 127 lines of code python/src/lib.rs x: 14 commits (all time) y: 23 lines of code Cargo.toml x: 39 commits (all time) y: 75 lines of code crates/core/Cargo.toml x: 22 commits (all time) y: 78 lines of code crates/core/src/avro_to_arrow/arrow_array_reader.rs x: 1 commits (all time) y: 877 lines of code crates/core/src/avro_to_arrow/mod.rs x: 1 commits (all time) y: 12 lines of code crates/core/src/avro_to_arrow/schema.rs x: 1 commits (all time) y: 257 lines of code crates/core/src/error.rs x: 14 commits (all time) y: 61 lines of code crates/core/src/file_group/log_file/avro.rs x: 1 commits (all time) y: 54 lines of code crates/core/src/file_group/log_file/content.rs x: 1 commits (all time) y: 204 lines of code crates/core/src/file_group/log_file/log_block.rs x: 5 commits (all time) y: 310 lines of code crates/core/src/file_group/log_file/mod.rs x: 5 commits (all time) y: 122 lines of code crates/core/src/lib.rs x: 22 commits (all time) y: 30 lines of code python/pyproject.toml x: 9 commits (all time) y: 69 lines of code crates/core/src/config/table.rs x: 15 commits (all time) y: 192 lines of code python/Cargo.toml x: 16 commits (all time) y: 45 lines of code rust-toolchain.toml x: 5 commits (all time) y: 20 lines of code crates/datafusion/Cargo.toml x: 20 commits (all time) y: 52 lines of code crates/hudi/Cargo.toml x: 8 commits (all time) y: 36 lines of code crates/core/src/file_group/base_file.rs x: 5 commits (all time) y: 70 lines of code crates/core/src/file_group/builder.rs x: 5 commits (all time) y: 422 lines of code crates/core/src/file_group/file_slice.rs x: 4 commits (all time) y: 198 lines of code crates/core/src/config/read.rs x: 12 commits (all time) y: 46 lines of code crates/core/src/config/mod.rs x: 12 commits (all time) y: 72 lines of code crates/core/src/table/fs_view.rs x: 29 commits (all time) y: 210 lines of code crates/datafusion/src/util/expr.rs x: 3 commits (all time) y: 229 lines of code crates/core/src/file_group/log_file/scanner.rs x: 1 commits (all time) y: 68 lines of code crates/core/src/table/listing.rs x: 2 commits (all time) y: 204 lines of code crates/core/src/table/partition.rs x: 12 commits (all time) y: 311 lines of code crates/core/src/merge/mod.rs x: 2 commits (all time) y: 41 lines of code crates/core/src/merge/record_merger.rs x: 3 commits (all time) y: 340 lines of code crates/core/src/table/builder.rs x: 9 commits (all time) y: 315 lines of code crates/core/src/config/internal.rs x: 6 commits (all time) y: 38 lines of code crates/core/src/metadata/meta_field.rs x: 2 commits (all time) y: 48 lines of code crates/core/src/metadata/mod.rs x: 2 commits (all time) y: 27 lines of code crates/core/src/storage/mod.rs x: 22 commits (all time) y: 53 lines of code crates/core/src/util/arrow.rs x: 1 commits (all time) y: 125 lines of code crates/core/src/file_group/log_file/log_format.rs x: 1 commits (all time) y: 156 lines of code crates/core/src/storage/reader.rs x: 1 commits (all time) y: 45 lines of code crates/core/src/storage/file_metadata.rs x: 3 commits (all time) y: 37 lines of code crates/core/src/storage/util.rs x: 2 commits (all time) y: 105 lines of code crates/core/src/expr/mod.rs x: 1 commits (all time) y: 104 lines of code crates/datafusion/src/util/mod.rs x: 1 commits (all time) y: 19 lines of code crates/core/src/config/error.rs x: 1 commits (all time) y: 37 lines of code cliff.toml x: 2 commits (all time) y: 114 lines of code python/hudi/table/builder.py x: 2 commits (all time) y: 42 lines of code python/hudi/table/__init__.py x: 1 commits (all time) y: 1 lines of code crates/hudi/src/lib.rs x: 3 commits (all time) y: 21 lines of code
1306.0
lines of code
  min: 1.0
  average: 176.82
  25th percentile: 40.5
  median: 76.5
  75th percentile: 263.25
  max: 1306.0
0 63.0
commits (all time)
min: 1.0 | average: 9.65 | 25th percentile: 2.0 | median: 5.0 | 75th percentile: 14.0 | max: 63.0

File Size vs. Contributors (all time): 62 points

crates/core/src/config/util.rs x: 1 contributors (all time) y: 59 lines of code crates/core/src/expr/filter.rs x: 2 contributors (all time) y: 282 lines of code crates/core/src/file_group/mod.rs x: 3 contributors (all time) y: 244 lines of code crates/core/src/file_group/reader.rs x: 2 contributors (all time) y: 291 lines of code crates/core/src/table/mod.rs x: 8 contributors (all time) y: 1306 lines of code crates/core/src/util/mod.rs x: 2 contributors (all time) y: 19 lines of code crates/datafusion/src/lib.rs x: 6 contributors (all time) y: 477 lines of code python/src/internal.rs x: 5 contributors (all time) y: 541 lines of code crates/core/src/timeline/instant.rs x: 1 contributors (all time) y: 319 lines of code crates/core/src/timeline/mod.rs x: 1 contributors (all time) y: 364 lines of code crates/core/src/timeline/selector.rs x: 1 contributors (all time) y: 540 lines of code python/hudi/__init__.py x: 3 contributors (all time) y: 17 lines of code python/hudi/_internal.pyi x: 3 contributors (all time) y: 127 lines of code python/src/lib.rs x: 2 contributors (all time) y: 23 lines of code Cargo.toml x: 6 contributors (all time) y: 75 lines of code crates/core/Cargo.toml x: 5 contributors (all time) y: 78 lines of code crates/core/src/avro_to_arrow/arrow_array_reader.rs x: 1 contributors (all time) y: 877 lines of code crates/core/src/avro_to_arrow/mod.rs x: 1 contributors (all time) y: 12 lines of code crates/core/src/avro_to_arrow/schema.rs x: 1 contributors (all time) y: 257 lines of code crates/core/src/error.rs x: 2 contributors (all time) y: 61 lines of code crates/core/src/file_group/log_file/avro.rs x: 1 contributors (all time) y: 54 lines of code crates/core/src/file_group/log_file/content.rs x: 1 contributors (all time) y: 204 lines of code crates/core/src/file_group/log_file/log_block.rs x: 1 contributors (all time) y: 310 lines of code crates/core/src/file_group/log_file/mod.rs x: 1 contributors (all time) y: 122 lines of code crates/core/src/lib.rs x: 5 contributors (all time) y: 30 lines of code python/pyproject.toml x: 3 contributors (all time) y: 69 lines of code crates/core/src/config/table.rs x: 4 contributors (all time) y: 192 lines of code python/Cargo.toml x: 4 contributors (all time) y: 45 lines of code crates/datafusion/Cargo.toml x: 3 contributors (all time) y: 52 lines of code crates/hudi/Cargo.toml x: 1 contributors (all time) y: 36 lines of code crates/core/src/file_group/base_file.rs x: 1 contributors (all time) y: 70 lines of code crates/core/src/file_group/builder.rs x: 1 contributors (all time) y: 422 lines of code crates/core/src/file_group/file_slice.rs x: 1 contributors (all time) y: 198 lines of code crates/core/src/config/mod.rs x: 4 contributors (all time) y: 72 lines of code crates/core/src/table/fs_view.rs x: 6 contributors (all time) y: 210 lines of code crates/datafusion/src/util/expr.rs x: 2 contributors (all time) y: 229 lines of code crates/core/src/table/partition.rs x: 5 contributors (all time) y: 311 lines of code crates/core/src/merge/record_merger.rs x: 1 contributors (all time) y: 340 lines of code crates/core/src/table/builder.rs x: 3 contributors (all time) y: 315 lines of code crates/core/src/config/internal.rs x: 4 contributors (all time) y: 38 lines of code crates/core/src/metadata/meta_field.rs x: 1 contributors (all time) y: 48 lines of code crates/core/src/metadata/mod.rs x: 1 contributors (all time) y: 27 lines of code crates/core/src/storage/mod.rs x: 6 contributors (all time) y: 53 lines of code crates/core/src/util/arrow.rs x: 1 contributors (all time) y: 125 lines of code crates/core/src/file_group/log_file/log_format.rs x: 1 contributors (all time) y: 156 lines of code crates/core/src/storage/reader.rs x: 1 contributors (all time) y: 45 lines of code crates/core/src/storage/util.rs x: 1 contributors (all time) y: 105 lines of code crates/datafusion/src/util/mod.rs x: 1 contributors (all time) y: 19 lines of code cliff.toml x: 1 contributors (all time) y: 114 lines of code python/hudi/table/builder.py x: 2 contributors (all time) y: 42 lines of code python/hudi/table/__init__.py x: 1 contributors (all time) y: 1 lines of code crates/hudi/src/lib.rs x: 1 contributors (all time) y: 21 lines of code
1306.0
lines of code
  min: 1.0
  average: 176.82
  25th percentile: 40.5
  median: 76.5
  75th percentile: 263.25
  max: 1306.0
0 8.0
contributors (all time)
min: 1.0 | average: 2.27 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 3.0 | max: 8.0

File Size vs. Commits (30 days): 14 points

crates/core/src/config/util.rs x: 1 commits (30d) y: 59 lines of code crates/core/src/expr/filter.rs x: 1 commits (30d) y: 282 lines of code crates/core/src/file_group/mod.rs x: 1 commits (30d) y: 244 lines of code crates/core/src/file_group/reader.rs x: 1 commits (30d) y: 291 lines of code crates/core/src/table/mod.rs x: 2 commits (30d) y: 1306 lines of code crates/core/src/util/mod.rs x: 1 commits (30d) y: 19 lines of code crates/datafusion/src/lib.rs x: 1 commits (30d) y: 477 lines of code python/src/internal.rs x: 2 commits (30d) y: 541 lines of code crates/core/src/timeline/instant.rs x: 1 commits (30d) y: 319 lines of code crates/core/src/timeline/mod.rs x: 1 commits (30d) y: 364 lines of code crates/core/src/timeline/selector.rs x: 1 commits (30d) y: 540 lines of code python/hudi/_internal.pyi x: 1 commits (30d) y: 127 lines of code python/src/lib.rs x: 1 commits (30d) y: 23 lines of code
1306.0
lines of code
  min: 17.0
  average: 329.21
  25th percentile: 50.0
  median: 286.5
  75th percentile: 492.75
  max: 1306.0
0 2.0
commits (30d)
min: 1.0 | average: 1.14 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 2.0

File Size vs. Contributors (30 days): 14 points

crates/core/src/config/util.rs x: 1 contributors (30d) y: 59 lines of code crates/core/src/expr/filter.rs x: 1 contributors (30d) y: 282 lines of code crates/core/src/file_group/mod.rs x: 1 contributors (30d) y: 244 lines of code crates/core/src/file_group/reader.rs x: 1 contributors (30d) y: 291 lines of code crates/core/src/table/mod.rs x: 1 contributors (30d) y: 1306 lines of code crates/core/src/util/mod.rs x: 1 contributors (30d) y: 19 lines of code crates/datafusion/src/lib.rs x: 1 contributors (30d) y: 477 lines of code python/src/internal.rs x: 1 contributors (30d) y: 541 lines of code crates/core/src/timeline/instant.rs x: 1 contributors (30d) y: 319 lines of code crates/core/src/timeline/mod.rs x: 1 contributors (30d) y: 364 lines of code python/hudi/_internal.pyi x: 1 contributors (30d) y: 127 lines of code python/src/lib.rs x: 1 contributors (30d) y: 23 lines of code
1306.0
lines of code
  min: 17.0
  average: 329.21
  25th percentile: 50.0
  median: 286.5
  75th percentile: 492.75
  max: 1306.0
0 1.0
contributors (30d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Commits (90 days): 30 points

crates/core/src/config/util.rs x: 1 commits (90d) y: 59 lines of code crates/core/src/expr/filter.rs x: 1 commits (90d) y: 282 lines of code crates/core/src/file_group/mod.rs x: 1 commits (90d) y: 244 lines of code crates/core/src/file_group/reader.rs x: 1 commits (90d) y: 291 lines of code crates/core/src/table/mod.rs x: 4 commits (90d) y: 1306 lines of code crates/core/src/util/mod.rs x: 1 commits (90d) y: 19 lines of code crates/datafusion/src/lib.rs x: 2 commits (90d) y: 477 lines of code python/src/internal.rs x: 2 commits (90d) y: 541 lines of code crates/core/src/timeline/instant.rs x: 2 commits (90d) y: 319 lines of code crates/core/src/timeline/mod.rs x: 1 commits (90d) y: 364 lines of code crates/core/src/timeline/selector.rs x: 1 commits (90d) y: 540 lines of code python/hudi/_internal.pyi x: 1 commits (90d) y: 127 lines of code python/src/lib.rs x: 1 commits (90d) y: 23 lines of code Cargo.toml x: 3 commits (90d) y: 75 lines of code crates/core/Cargo.toml x: 1 commits (90d) y: 78 lines of code crates/core/src/avro_to_arrow/arrow_array_reader.rs x: 1 commits (90d) y: 877 lines of code crates/core/src/avro_to_arrow/mod.rs x: 1 commits (90d) y: 12 lines of code crates/core/src/avro_to_arrow/schema.rs x: 1 commits (90d) y: 257 lines of code crates/core/src/file_group/log_file/avro.rs x: 1 commits (90d) y: 54 lines of code crates/core/src/file_group/log_file/content.rs x: 1 commits (90d) y: 204 lines of code crates/core/src/file_group/log_file/log_block.rs x: 1 commits (90d) y: 310 lines of code crates/core/src/file_group/log_file/mod.rs x: 1 commits (90d) y: 122 lines of code crates/core/src/file_group/log_file/reader.rs x: 1 commits (90d) y: 323 lines of code crates/core/src/lib.rs x: 1 commits (90d) y: 30 lines of code python/pyproject.toml x: 2 commits (90d) y: 69 lines of code crates/core/src/config/table.rs x: 1 commits (90d) y: 192 lines of code python/Cargo.toml x: 1 commits (90d) y: 45 lines of code
1306.0
lines of code
  min: 12.0
  average: 244.6
  25th percentile: 51.75
  median: 159.5
  75th percentile: 320.0
  max: 1306.0
0 4.0
commits (90d)
min: 1.0 | average: 1.3 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 4.0

File Size vs. Contributors (90 days): 30 points

crates/core/src/config/util.rs x: 1 contributors (90d) y: 59 lines of code crates/core/src/expr/filter.rs x: 1 contributors (90d) y: 282 lines of code crates/core/src/file_group/mod.rs x: 1 contributors (90d) y: 244 lines of code crates/core/src/file_group/reader.rs x: 1 contributors (90d) y: 291 lines of code crates/core/src/table/mod.rs x: 1 contributors (90d) y: 1306 lines of code crates/core/src/util/mod.rs x: 1 contributors (90d) y: 19 lines of code crates/datafusion/src/lib.rs x: 1 contributors (90d) y: 477 lines of code python/src/internal.rs x: 1 contributors (90d) y: 541 lines of code crates/core/src/timeline/instant.rs x: 1 contributors (90d) y: 319 lines of code crates/core/src/timeline/mod.rs x: 1 contributors (90d) y: 364 lines of code python/hudi/_internal.pyi x: 1 contributors (90d) y: 127 lines of code python/src/lib.rs x: 1 contributors (90d) y: 23 lines of code Cargo.toml x: 1 contributors (90d) y: 75 lines of code crates/core/Cargo.toml x: 1 contributors (90d) y: 78 lines of code crates/core/src/avro_to_arrow/arrow_array_reader.rs x: 1 contributors (90d) y: 877 lines of code crates/core/src/avro_to_arrow/mod.rs x: 1 contributors (90d) y: 12 lines of code crates/core/src/avro_to_arrow/schema.rs x: 1 contributors (90d) y: 257 lines of code crates/core/src/file_group/log_file/avro.rs x: 1 contributors (90d) y: 54 lines of code crates/core/src/file_group/log_file/content.rs x: 1 contributors (90d) y: 204 lines of code crates/core/src/file_group/log_file/log_block.rs x: 1 contributors (90d) y: 310 lines of code crates/core/src/file_group/log_file/mod.rs x: 1 contributors (90d) y: 122 lines of code crates/core/src/lib.rs x: 1 contributors (90d) y: 30 lines of code python/pyproject.toml x: 1 contributors (90d) y: 69 lines of code crates/core/src/config/table.rs x: 1 contributors (90d) y: 192 lines of code python/Cargo.toml x: 1 contributors (90d) y: 45 lines of code
1306.0
lines of code
  min: 12.0
  average: 244.6
  25th percentile: 51.75
  median: 159.5
  75th percentile: 320.0
  max: 1306.0
0 1.0
contributors (90d)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0