huggingface / tokenizers
Components

An overview of source code logical components.

Intro

Logical decomposition is a representation of the organization of the main source code, where every and each file is put in exactly one logical component.

Logical Decompositions Overview

Analyzed system has 1 logical decomposition:

Logical Decomposition #1: PRIMARY

The decompositions is based on the folder structure at level 1 (relative to the source code root).

Bubble Chart | Tree Map
Component Sizes (Lines of Code)
The "primary" logical decomposition has 2 components.
  • 146 files, 29,051 LOC (100.0% vs. main code).
  • "tokenizers" is biggest, containing 57.53% of LOC.
  • "bindings" is smallest, containing 42.47% of LOC.


tokenizers16714 LOC (57%) 77 files
bindings12337 LOC (42%) 69 files
Component Commits
Components ordered by number of commits
Total Commits per Component
All commits, some commits may include files from multiple components.
tokenizers787 commits (42%)
bindings590 commits (32%)
Yearly File Updates Trend per Components
The number of file changes in commits
animated commit history: all time cumulative | 12 months window
2025 2024 2023 2022 2021 2020 2019
tokenizers
119
110
104
215
76
1124
186
bindings
44
167
128
95
82
736
114


Dependencies between components in same commits (past 180 days)
The number on the lines shows the number of shared commits.
See detailed temporal dependencies report...

G tokenizers tokenizers tokenizers--tokenizers 13 bindings bindings tokenizers--bindings 10 bindings--bindings 12


2025-06-30 08:57