openai / tiktoken
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 41% | 35% | 22%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
rs0% | 0% | 71% | 28% | 0%
py0% | 0% | 25% | 43% | 30%
toml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src0% | 0% | 71% | 28% | 0%
tiktoken0% | 0% | 33% | 40% | 25%
tiktoken_ext0% | 0% | 0% | 100% | 0%
ROOT0% | 0% | 0% | 0% | 100%
scripts0% | 0% | 0% | 0% | 100%
Longest Files (Top 15)
File# lines# units
lib.rs
in src
393 12
core.py
in tiktoken
205 28
py.rs
in src
156 11
_educational.py
in tiktoken
134 11
load.py
in tiktoken
114 7
openai_public.py
in tiktoken_ext
108 6
model.py
in tiktoken
77 2
registry.py
in tiktoken
70 4
redact.py
in scripts
52 3
36 -
benchmark.py
in scripts
28 1
Cargo.toml
in root
23 -
setup.py
in root
16 -
in
8 -
__init__.py
in tiktoken
6 -
Files With Most Units (Top 10)
File# lines# units
core.py
in tiktoken
205 28
lib.rs
in src
393 12
_educational.py
in tiktoken
134 11
py.rs
in src
156 11
load.py
in tiktoken
114 7
openai_public.py
in tiktoken_ext
108 6
registry.py
in tiktoken
70 4
redact.py
in scripts
52 3
model.py
in tiktoken
77 2
benchmark.py
in scripts
28 1
Files With Long Lines (Top 0)

There are 0 files with lines longer than 120 characters. In total, there are 0 long lines.

File# lines# units# long lines
Correlations

File Size vs. Commits (all time): 14 points

src/lib.rs x: 12 commits (all time) y: 393 lines of code src/py.rs x: 2 commits (all time) y: 156 lines of code pyproject.toml x: 23 commits (all time) y: 36 lines of code Cargo.toml x: 16 commits (all time) y: 23 lines of code tiktoken/__init__.py x: 4 commits (all time) y: 6 lines of code tiktoken/core.py x: 12 commits (all time) y: 205 lines of code tiktoken/load.py x: 11 commits (all time) y: 114 lines of code tiktoken/model.py x: 13 commits (all time) y: 77 lines of code setup.py x: 4 commits (all time) y: 16 lines of code tiktoken/_educational.py x: 4 commits (all time) y: 134 lines of code tiktoken/registry.py x: 3 commits (all time) y: 70 lines of code tiktoken_ext/openai_public.py x: 11 commits (all time) y: 108 lines of code scripts/redact.py x: 2 commits (all time) y: 52 lines of code MANIFEST.in x: 2 commits (all time) y: 8 lines of code
393.0
lines of code
  min: 6.0
  average: 99.86
  25th percentile: 21.25
  median: 73.5
  75th percentile: 139.5
  max: 393.0
0 23.0
commits (all time)
min: 2.0 | average: 8.5 | 25th percentile: 2.75 | median: 7.5 | 75th percentile: 12.25 | max: 23.0

File Size vs. Contributors (all time): 14 points

src/lib.rs x: 5 contributors (all time) y: 393 lines of code src/py.rs x: 1 contributors (all time) y: 156 lines of code pyproject.toml x: 2 contributors (all time) y: 36 lines of code Cargo.toml x: 3 contributors (all time) y: 23 lines of code tiktoken/__init__.py x: 1 contributors (all time) y: 6 lines of code tiktoken/core.py x: 5 contributors (all time) y: 205 lines of code tiktoken/load.py x: 3 contributors (all time) y: 114 lines of code tiktoken/model.py x: 5 contributors (all time) y: 77 lines of code setup.py x: 2 contributors (all time) y: 16 lines of code tiktoken/_educational.py x: 2 contributors (all time) y: 134 lines of code tiktoken/registry.py x: 1 contributors (all time) y: 70 lines of code tiktoken_ext/openai_public.py x: 6 contributors (all time) y: 108 lines of code scripts/redact.py x: 1 contributors (all time) y: 52 lines of code MANIFEST.in x: 1 contributors (all time) y: 8 lines of code
393.0
lines of code
  min: 6.0
  average: 99.86
  25th percentile: 21.25
  median: 73.5
  75th percentile: 139.5
  max: 393.0
0 6.0
contributors (all time)
min: 1.0 | average: 2.71 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 5.0 | max: 6.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 9 points

src/lib.rs x: 2 commits (90d) y: 393 lines of code src/py.rs x: 2 commits (90d) y: 156 lines of code pyproject.toml x: 4 commits (90d) y: 36 lines of code Cargo.toml x: 2 commits (90d) y: 23 lines of code tiktoken/__init__.py x: 1 commits (90d) y: 6 lines of code tiktoken/core.py x: 1 commits (90d) y: 205 lines of code tiktoken/load.py x: 2 commits (90d) y: 114 lines of code tiktoken/model.py x: 1 commits (90d) y: 77 lines of code setup.py x: 1 commits (90d) y: 16 lines of code
393.0
lines of code
  min: 6.0
  average: 114.0
  25th percentile: 19.5
  median: 77.0
  75th percentile: 180.5
  max: 393.0
0 4.0
commits (90d)
min: 1.0 | average: 1.78 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 2.0 | max: 4.0

File Size vs. Contributors (90 days): 9 points

src/lib.rs x: 1 contributors (90d) y: 393 lines of code src/py.rs x: 1 contributors (90d) y: 156 lines of code pyproject.toml x: 2 contributors (90d) y: 36 lines of code Cargo.toml x: 2 contributors (90d) y: 23 lines of code tiktoken/__init__.py x: 1 contributors (90d) y: 6 lines of code tiktoken/core.py x: 1 contributors (90d) y: 205 lines of code tiktoken/load.py x: 1 contributors (90d) y: 114 lines of code tiktoken/model.py x: 1 contributors (90d) y: 77 lines of code setup.py x: 1 contributors (90d) y: 16 lines of code
393.0
lines of code
  min: 6.0
  average: 114.0
  25th percentile: 19.5
  median: 77.0
  75th percentile: 180.5
  max: 393.0
0 2.0
contributors (90d)
min: 1.0 | average: 1.22 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.5 | max: 2.0