huggingface / xet-core
Components

An overview of source code logical components.

Intro

Logical decomposition is a representation of the organization of the main source code, where every and each file is put in exactly one logical component.

Logical Decompositions Overview

Analyzed system has 1 logical decomposition:

Logical Decomposition #1: PRIMARY

The decompositions is based on the folder structure at level 1 (relative to the source code root).

Bubble Chart | Tree Map
Component Sizes (Lines of Code)
The "primary" logical decomposition has 19 components.
  • 177 files, 26,181 LOC (100.0% vs. main code).
  • "mdb_shard" is biggest, containing 19.16% of LOC.
  • "error_printer" is smallest, containing 0.46% of LOC.


mdb_shard5017 LOC (19%) 19 files
cas_client3361 LOC (12%) 13 files
cas_object3177 LOC (12%) 15 files
merkledb3073 LOC (11%) 20 files
data1767 LOC (6%) 19 files
chunk_cache1442 LOC (5%) 7 files
utils1440 LOC (5%) 12 files
progress_tracking1381 LOC (5%) 8 files
deduplication1348 LOC (5%) 11 files
hf_xet1053 LOC (4%) 9 files
hf_xet_wasm812 LOC (3%) 14 files
file_utils674 LOC (2%) 5 files
merklehash390 LOC (1%) 3 files
cas_types287 LOC (1%) 4 files
chunk_cache_bench264 LOC (1%) 5 files
parutils241 LOC (<1%) 3 files
xet_threadpool192 LOC (<1%) 5 files
ROOT142 LOC (<1%) 3 files
error_printer120 LOC (<1%) 2 files
Component Commits
Components ordered by number of commits
Total Commits per Component
All commits, some commits may include files from multiple components.
data96 commits (32%)
cas_client94 commits (31%)
hf_xet62 commits (20%)
mdb_shard53 commits (17%)
cas_object50 commits (16%)
utils34 commits (11%)
chunk_cache30 commits (10%)
ROOT29 commits (9%)
cas_types28 commits (9%)
deduplication16 commits (5%)
file_utils14 commits (4%)
merkledb13 commits (4%)
merklehash12 commits (4%)
xet_threadpool12 commits (4%)
progress_tracking8 commits (2%)
chunk_cache_bench8 commits (2%)
parutils5 commits (1%)
error_printer4 commits (1%)
hf_xet_wasm1 commits (<1%)
Yearly File Updates Trend per Components
The number of file changes in commits
animated commit history: all time cumulative | 12 months window
2025 2024
data
186
59
cas_client
138
85
mdb_shard
87
94
cas_object
80
49
hf_xet
64
43
utils
36
34
chunk_cache
30
34
merkledb
20
42
deduplication
41
cas_types
11
29
progress_tracking
30
ROOT
15
14
file_utils
14
13
chunk_cache_bench
6
16
xet_threadpool
19
merklehash
9
8
hf_xet_wasm
14
parutils
3
5
error_printer
1
5


Dependencies between components in same commits (past 180 days)
The number on the lines shows the number of shared commits.
See detailed temporal dependencies report...

G data data data--data 44 cas_client cas_client data--cas_client 32 mdb_shard mdb_shard data--mdb_shard 19 cas_object cas_object data--cas_object 15 utils utils data--utils 10 ROOT ROOT data--ROOT 10 cas_client--cas_client 34 cas_client--ROOT 9 mdb_shard--cas_client 12 mdb_shard--mdb_shard 22 mdb_shard--cas_object 7 deduplication deduplication mdb_shard--deduplication 7 mdb_shard--utils 9 mdb_shard--ROOT 7 merkledb merkledb mdb_shard--merkledb 5 hf_xet hf_xet hf_xet--data 18 hf_xet--cas_client 12 hf_xet--hf_xet 15 hf_xet--deduplication 6 hf_xet--utils 5 hf_xet--ROOT 7 cas_object--cas_client 14 cas_object--cas_object 17 cas_object--deduplication 6 cas_object--utils 8 cas_object--ROOT 5 cas_object--merkledb 6 deduplication--data 13 deduplication--cas_client 8 deduplication--deduplication 9 deduplication--ROOT 5 utils--cas_client 13 utils--utils 11 utils--ROOT 7 utils--merkledb 5 chunk_cache chunk_cache chunk_cache--data 6 chunk_cache--cas_client 11 chunk_cache--cas_object 5 chunk_cache--utils 7 chunk_cache--chunk_cache 10 xet_threadpool xet_threadpool xet_threadpool--data 11 xet_threadpool--cas_client 9 xet_threadpool--hf_xet 6 xet_threadpool--utils 5 progress_tracking progress_tracking progress_tracking--data 8 progress_tracking--hf_xet 6 progress_tracking--progress_tracking 5 file_utils file_utils file_utils--cas_client 7 file_utils--chunk_cache 5 cas_types cas_types cas_types--cas_client 6


2025-06-30 09:10