apache / hudi
Components

An overview of source code logical components.

Intro

A logical decomposition is a representation of how the main source code is organized, in which each file is assigned to exactly one logical component.

Logical Decompositions Overview

The analyzed system has 1 logical decomposition:

Logical Decomposition #1: PRIMARY

The decomposition is based on the folder structure at level 1 (relative to the source code root).
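As an illustration of a level-1 folder decomposition, the following minimal sketch maps a repository-relative path to a component name. The function names and the sample paths are hypothetical, and treating top-level files as belonging to a "ROOT" component is an assumption inferred from the "ROOT" entry in this report, not a documented rule of the analysis tool.

```python
from collections import Counter
from pathlib import PurePosixPath


def component_of(path: str) -> str:
    """Return the level-1 folder of a repo-relative path.

    Files that sit directly in the repository root have no folder,
    so they are grouped under "ROOT" (an assumption, see lead-in).
    """
    parts = PurePosixPath(path).parts
    return parts[0] if len(parts) > 1 else "ROOT"


def files_per_component(paths):
    """Count how many files fall into each level-1 component."""
    return Counter(component_of(p) for p in paths)


# Hypothetical paths, used only to exercise the mapping.
sample_paths = [
    "hudi-common/src/main/java/Foo.java",
    "hudi-common/src/main/java/Bar.java",
    "hudi-client/src/main/java/Baz.java",
    "pom.xml",
]
sample_counts = files_per_component(sample_paths)
```

Because every path has exactly one first segment (or none), each file lands in exactly one component, which is the property the decomposition relies on.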

Component Sizes (Lines of Code)
The "primary" logical decomposition has 22 components.
  • 2,673 files, 273,271 LOC (100.0% vs. main code).
  • "hudi-common" is the biggest, containing 23.4% of LOC.
  • "style" is the smallest, containing 0.04% of LOC.
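The two percentages above can be reproduced from the LOC figures in this report. The sketch below is only a worked calculation. The helper name `loc_share` is hypothetical, and rounding to one or two decimals is an assumption chosen to match the figures as printed.

```python
TOTAL_LOC = 273_271  # total main-code LOC stated in this report


def loc_share(component_loc: int, total_loc: int = TOTAL_LOC, digits: int = 1) -> float:
    """Return a component's share of total LOC as a rounded percentage."""
    return round(100 * component_loc / total_loc, digits)


biggest_share = loc_share(63_946)            # hudi-common
smallest_share = loc_share(99, digits=2)     # style
```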


hudi-common: 63946 LOC (23%), 634 files
hudi-client: 60534 LOC (22%), 684 files
hudi-spark-datasource: 45847 LOC (16%), 311 files
hudi-flink-datasource: 37885 LOC (13%), 352 files
hudi-utilities: 20045 LOC (7%), 195 files
hudi-hadoop-common: 6833 LOC (2%), 49 files
hudi-hadoop-mr: 6516 LOC (2%), 62 files
hudi-sync: 6156 LOC (2%), 60 files
hudi-integ-test: 5745 LOC (2%), 91 files
hudi-cli: 5740 LOC (2%), 41 files
hudi-io: 3746 LOC (1%), 57 files
hudi-aws: 2307 LOC (<1%), 18 files
hudi-timeline-service: 1958 LOC (<1%), 15 files
hudi-kafka-connect: 1901 LOC (<1%), 22 files
hudi-platform-service: 1875 LOC (<1%), 29 files
hudi-gcp: 902 LOC (<1%), 6 files
scripts: 350 LOC (<1%), 2 files
packaging: 290 LOC (<1%), 25 files
hudi-examples: 275 LOC (<1%), 11 files
ROOT: 217 LOC (<1%), 1 file
docker: 104 LOC (<1%), 6 files
style: 99 LOC (<1%), 2 files

Component Commits
Components ordered by number of commits
Total Commits per Component
Counts cover all commits; a single commit may include files from multiple components.
hudi-client: 1562 commits (24%)
hudi-common: 1455 commits (23%)
hudi-spark-datasource: 962 commits (15%)
hudi-utilities: 721 commits (11%)
hudi-flink-datasource: 503 commits (7%)
hudi-cli: 253 commits (4%)
hudi-hadoop-mr: 251 commits (3%)
hudi-sync: 181 commits (2%)
hudi-integ-test: 136 commits (2%)
hudi-timeline-service: 87 commits (1%)
hudi-aws: 79 commits (1%)
hudi-hadoop-common: 66 commits (1%)
hudi-io: 61 commits (<1%)
hudi-kafka-connect: 41 commits (<1%)
ROOT: 35 commits (<1%)
hudi-gcp: 31 commits (<1%)
packaging: 25 commits (<1%)
hudi-platform-service: 21 commits (<1%)
style: 14 commits (<1%)
scripts: 8 commits (<1%)
docker: 5 commits (<1%)
hudi-examples: 4 commits (<1%)

Yearly File Updates Trend per Component
The number of file changes in commits
Years (columns, newest first): 2025 2024 2023 2022 2021 2020 2019 2018
Each row lists only its non-empty cells, in column order.
hudi-client: 680, 1281, 1480, 1611, 1244, 363
hudi-common: 522, 1240, 939, 972, 539, 482, 444
hudi-spark-datasource: 331, 869, 841, 833, 183, 14
hudi-utilities: 113, 466, 569, 256, 190, 292, 184
hudi-flink-datasource: 163, 366, 529, 565
hudi-hadoop-mr: 47, 154, 81, 125, 82, 122, 82
hudi-hadoop-common: 29, 140
hudi-cli: 20, 133, 56, 144, 55, 160, 130
hudi-io: 29, 110
hudi-sync: 19, 100, 97, 152, 65, 18
hudi-integ-test: 20, 62, 49, 107, 119, 108, 49
hudi-timeline-service: 20, 43, 43, 23, 29, 9, 20
hudi-aws: 9, 49, 35, 20, 9
hudi-kafka-connect: 9, 10, 15, 17, 65
hudi-platform-service: 4, 11, 68
hudi-gcp: 1, 13, 26, 8
ROOT: 2, 6, 6, 6, 4, 9, 2
packaging: 3, 4, 9, 9, 2, 2, 6
hudi-examples: 3, 3, 5
scripts: 3, 3, 1, 1
style: 1, 1, 2, 5, 2, 1, 2
docker: 4, 9, 2


Dependencies between components in same commits (past 180 days)
The number on the lines shows the number of shared commits.

hudi-client -- hudi-client: 118
hudi-client -- hudi-common: 106
hudi-client -- hudi-io: 13
hudi-client -- hudi-aws: 6
hudi-common -- hudi-common: 104
hudi-common -- hudi-flink-datasource: 26
hudi-spark-datasource -- hudi-client: 54
hudi-spark-datasource -- hudi-common: 54
hudi-spark-datasource -- hudi-spark-datasource: 69
hudi-spark-datasource -- hudi-flink-datasource: 16
hudi-spark-datasource -- hudi-utilities: 22
hudi-spark-datasource -- hudi-hadoop-mr: 17
hudi-spark-datasource -- hudi-hadoop-common: 6
hudi-spark-datasource -- hudi-io: 7
hudi-spark-datasource -- hudi-integ-test: 8
hudi-spark-datasource -- hudi-cli: 7
hudi-spark-datasource -- hudi-kafka-connect: 7
hudi-flink-datasource -- hudi-client: 31
hudi-flink-datasource -- hudi-flink-datasource: 23
hudi-utilities -- hudi-client: 30
hudi-utilities -- hudi-common: 24
hudi-utilities -- hudi-flink-datasource: 8
hudi-utilities -- hudi-utilities: 25
hudi-utilities -- hudi-hadoop-mr: 10
hudi-utilities -- hudi-integ-test: 7
hudi-hadoop-mr -- hudi-client: 24
hudi-hadoop-mr -- hudi-common: 26
hudi-hadoop-mr -- hudi-flink-datasource: 12
hudi-hadoop-mr -- hudi-hadoop-mr: 11
hudi-hadoop-common -- hudi-client: 12
hudi-hadoop-common -- hudi-common: 13
hudi-hadoop-common -- hudi-flink-datasource: 7
hudi-hadoop-common -- hudi-hadoop-common: 7
hudi-io -- hudi-common: 9
hudi-io -- hudi-utilities: 5
hudi-integ-test -- hudi-client: 8
hudi-integ-test -- hudi-common: 7
hudi-integ-test -- hudi-flink-datasource: 6
hudi-integ-test -- hudi-hadoop-mr: 5
hudi-integ-test -- hudi-integ-test: 6
hudi-cli -- hudi-client: 8
hudi-cli -- hudi-common: 8
hudi-cli -- hudi-utilities: 6
hudi-kafka-connect -- hudi-client: 7
hudi-kafka-connect -- hudi-common: 7
hudi-kafka-connect -- hudi-flink-datasource: 7
hudi-kafka-connect -- hudi-utilities: 6
hudi-kafka-connect -- hudi-hadoop-mr: 7
hudi-timeline-service -- hudi-client: 6
hudi-sync -- hudi-sync: 6
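
A shared-commit count between two components can be approximated by checking, for each commit, which components its changed files belong to and incrementing a counter for every pair that appears together. The sketch below makes several assumptions: the input is a hypothetical list of commits, each given as a list of changed file paths; components are taken as the level-1 folder; and only pairs of distinct components are counted. How the report derives the same-component entries (for example hudi-client with hudi-client) is not stated, so they are not modeled here.

```python
from collections import Counter
from itertools import combinations


def component_of(path: str) -> str:
    """Level-1 folder of a repo-relative path, or "ROOT" for top-level files."""
    head, sep, _ = path.partition("/")
    return head if sep else "ROOT"


def shared_commit_counts(commits):
    """Count, per unordered component pair, the commits touching both components."""
    counts = Counter()
    for changed_files in commits:
        # Each component is counted once per commit, however many files changed.
        components = sorted({component_of(f) for f in changed_files})
        for pair in combinations(components, 2):
            counts[pair] += 1
    return counts


# Hypothetical commit history, used only to exercise the counting.
sample_commits = [
    ["hudi-client/A.java", "hudi-common/B.java"],
    ["hudi-client/C.java", "hudi-common/D.java", "hudi-io/E.java"],
    ["hudi-io/F.java"],
]
sample_pairs = shared_commit_counts(sample_commits)
```

Sorting the component names before pairing keeps each pair in one canonical order, so ("hudi-client", "hudi-common") and its reverse are never counted separately.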


2025-05-07 16:57