spotify / voyager
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
24% | 29% | 25% | 6% | 13%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
h32% | 15% | 30% | 6% | 14%
cpp0% | 100% | 0% | 0% | 0%
java0% | 0% | 48% | 30% | 21%
py0% | 0% | 0% | 0% | 100%
toml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
python26% | 31% | 24% | 5% | 12%
cpp34% | 16% | 32% | 3% | 12%
java0% | 52% | 16% | 16% | 14%
benchmarks0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
hnswalg.h
in python/src/cpp/src
1197 26
hnswalg.h
in cpp/src
1197 26
bindings.cpp
in python/src
878 7
782 43
TypedIndex.h
in python/src/cpp/src
567 38
TypedIndex.h
in cpp/src
567 38
E4M3.h
in python/src/cpp/src
354 3
E4M3.h
in cpp/src
354 3
InnerProduct.h
in python/src/cpp/src/Spaces
284 13
InnerProduct.h
in cpp/src/Spaces
284 13
array_utils.h
in python/src/cpp/src
255 12
array_utils.h
in cpp/src
255 12
StringIndex.java
in java/src/main/java/com/spotify/voyager/jni
244 25
Euclidean.h
in python/src/cpp/src/Spaces
211 11
Euclidean.h
in cpp/src/Spaces
211 11
Index.java
in java/src/main/java/com/spotify/voyager/jni
152 14
PythonInputStream.h
in python/src
142 8
StreamUtils.h
in python/src/cpp/src
132 18
StreamUtils.h
in cpp/src
132 18
102 8
Metadata.h
in python/src/cpp/src
90 14
Metadata.h
in cpp/src
90 14
std_utils.h
in python/src/cpp/src
78 2
std_utils.h
in cpp/src
78 2
hnswlib.h
in python/src/cpp/src
75 2
hnswlib.h
in cpp/src
75 2
visited_list_pool.h
in python/src/cpp/src
62 2
62 2
Index.h
in python/src/cpp/src
58 1
Index.h
in cpp/src
58 1
TinyJson.java
in java/src/main/java/com/spotify/voyager/jni/utils
57 2
index_query.py
in benchmarks
57 5
55 3
PythonFileLike.h
in python/src
52 8
51 2
Enums.h
in python/src/cpp/src
44 2
44 -
Enums.h
in cpp/src
44 2
JniLibExtractor.java
in java/src/main/java/com/spotify/voyager/jni/utils
41 2
40 3
index_creation.py
in benchmarks
36 2
Space.h
in python/src/cpp/src/Spaces
14 1
Space.h
in cpp/src/Spaces
14 1
typenames.h
in python
12 10
RecallException.java
in java/src/main/java/com/spotify/voyager/jni/exception
9 2
in
MANIFEST.in
in python
4 -
package-info.java
in java/src/main/java/com/spotify/voyager/jni
1 -
package-info.java
in java/src/main/java/com/spotify/voyager
1 -
__init__.py
in benchmarks
1 -
__init__.py
in python
1 -
Files With Most Units (Top 44)
File# lines# units
782 43
TypedIndex.h
in python/src/cpp/src
567 38
TypedIndex.h
in cpp/src
567 38
hnswalg.h
in python/src/cpp/src
1197 26
hnswalg.h
in cpp/src
1197 26
StringIndex.java
in java/src/main/java/com/spotify/voyager/jni
244 25
StreamUtils.h
in python/src/cpp/src
132 18
StreamUtils.h
in cpp/src
132 18
Index.java
in java/src/main/java/com/spotify/voyager/jni
152 14
Metadata.h
in python/src/cpp/src
90 14
Metadata.h
in cpp/src
90 14
InnerProduct.h
in python/src/cpp/src/Spaces
284 13
InnerProduct.h
in cpp/src/Spaces
284 13
array_utils.h
in python/src/cpp/src
255 12
array_utils.h
in cpp/src
255 12
Euclidean.h
in python/src/cpp/src/Spaces
211 11
Euclidean.h
in cpp/src/Spaces
211 11
typenames.h
in python
12 10
102 8
PythonFileLike.h
in python/src
52 8
PythonInputStream.h
in python/src
142 8
bindings.cpp
in python/src
878 7
index_query.py
in benchmarks
57 5
55 3
40 3
E4M3.h
in python/src/cpp/src
354 3
E4M3.h
in cpp/src
354 3
TinyJson.java
in java/src/main/java/com/spotify/voyager/jni/utils
57 2
JniLibExtractor.java
in java/src/main/java/com/spotify/voyager/jni/utils
41 2
RecallException.java
in java/src/main/java/com/spotify/voyager/jni/exception
9 2
51 2
index_creation.py
in benchmarks
36 2
hnswlib.h
in python/src/cpp/src
75 2
visited_list_pool.h
in python/src/cpp/src
62 2
std_utils.h
in python/src/cpp/src
78 2
Enums.h
in python/src/cpp/src
44 2
hnswlib.h
in cpp/src
75 2
62 2
std_utils.h
in cpp/src
78 2
Enums.h
in cpp/src
44 2
Space.h
in python/src/cpp/src/Spaces
14 1
Index.h
in python/src/cpp/src
58 1
Space.h
in cpp/src/Spaces
14 1
Index.h
in cpp/src
58 1
Files With Long Lines (Top 1)

There is only one file with lines longer than 120 characters. In total, there are 2 long lines.

File# lines# units# long lines
bindings.cpp
in python/src
878 7 2
Correlations

File Size vs. Commits (all time): 37 points

python/MANIFEST.in x: 2 commits (all time) y: 4 lines of code python/pyproject.toml x: 2 commits (all time) y: 44 lines of code python/src/PythonFileLike.h x: 3 commits (all time) y: 52 lines of code python/src/PythonInputStream.h x: 4 commits (all time) y: 142 lines of code python/src/PythonOutputStream.h x: 3 commits (all time) y: 40 lines of code python/src/bindings.cpp x: 1 commits (all time) y: 878 lines of code python/src/voyager/__init__.py x: 1 commits (all time) y: 1 lines of code java/com_spotify_voyager_jni_Index.cpp x: 11 commits (all time) y: 782 lines of code java/src/main/java/com/spotify/voyager/jni/Index.java x: 8 commits (all time) y: 152 lines of code cpp/src/TypedIndex.h x: 3 commits (all time) y: 567 lines of code java/src/main/java/com/spotify/voyager/jni/exception/RecallException.java x: 1 commits (all time) y: 9 lines of code cpp/src/Index.h x: 2 commits (all time) y: 58 lines of code cpp/src/array_utils.h x: 2 commits (all time) y: 255 lines of code cpp/src/E4M3.h x: 1 commits (all time) y: 354 lines of code cpp/src/Enums.h x: 1 commits (all time) y: 44 lines of code cpp/src/Metadata.h x: 1 commits (all time) y: 90 lines of code cpp/src/Spaces/Euclidean.h x: 1 commits (all time) y: 211 lines of code cpp/src/Spaces/InnerProduct.h x: 1 commits (all time) y: 284 lines of code cpp/src/Spaces/Space.h x: 1 commits (all time) y: 14 lines of code cpp/src/StreamUtils.h x: 1 commits (all time) y: 132 lines of code cpp/src/hnswalg.h x: 1 commits (all time) y: 1197 lines of code cpp/src/hnswlib.h x: 1 commits (all time) y: 75 lines of code cpp/src/std_utils.h x: 1 commits (all time) y: 78 lines of code cpp/src/visited_list_pool.h x: 1 commits (all time) y: 62 lines of code java/JavaInputStream.h x: 3 commits (all time) y: 102 lines of code java/JavaOutputStream.h x: 2 commits (all time) y: 55 lines of code python/typenames.h x: 2 commits (all time) y: 12 lines of code benchmarks/index_creation.py x: 2 commits (all time) y: 36 lines of code java/src/main/java/com/spotify/voyager/jni/StringIndex.java x: 5 commits (all time) y: 244 lines of code java/src/main/java/com/spotify/voyager/jni/utils/JniLibExtractor.java x: 1 commits (all time) y: 41 lines of code java/src/main/java/com/spotify/voyager/jni/utils/TinyJson.java x: 1 commits (all time) y: 57 lines of code java/thread_pool.h x: 1 commits (all time) y: 51 lines of code
1197.0
lines of code
  min: 1.0
  average: 167.14
  25th percentile: 25.0
  median: 57.0
  75th percentile: 181.5
  max: 1197.0
0 11.0
commits (all time)
min: 1.0 | average: 2.08 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 11.0

File Size vs. Contributors (all time): 37 points

python/MANIFEST.in x: 1 contributors (all time) y: 4 lines of code python/pyproject.toml x: 1 contributors (all time) y: 44 lines of code python/src/PythonFileLike.h x: 2 contributors (all time) y: 52 lines of code python/src/PythonInputStream.h x: 2 contributors (all time) y: 142 lines of code python/src/PythonOutputStream.h x: 2 contributors (all time) y: 40 lines of code python/src/bindings.cpp x: 1 contributors (all time) y: 878 lines of code java/com_spotify_voyager_jni_Index.cpp x: 5 contributors (all time) y: 782 lines of code java/src/main/java/com/spotify/voyager/jni/Index.java x: 3 contributors (all time) y: 152 lines of code cpp/src/TypedIndex.h x: 2 contributors (all time) y: 567 lines of code java/src/main/java/com/spotify/voyager/jni/exception/RecallException.java x: 1 contributors (all time) y: 9 lines of code cpp/src/Index.h x: 2 contributors (all time) y: 58 lines of code cpp/src/array_utils.h x: 2 contributors (all time) y: 255 lines of code cpp/src/E4M3.h x: 1 contributors (all time) y: 354 lines of code cpp/src/Metadata.h x: 1 contributors (all time) y: 90 lines of code cpp/src/Spaces/Euclidean.h x: 1 contributors (all time) y: 211 lines of code cpp/src/Spaces/InnerProduct.h x: 1 contributors (all time) y: 284 lines of code cpp/src/Spaces/Space.h x: 1 contributors (all time) y: 14 lines of code cpp/src/StreamUtils.h x: 1 contributors (all time) y: 132 lines of code cpp/src/hnswalg.h x: 1 contributors (all time) y: 1197 lines of code cpp/src/hnswlib.h x: 1 contributors (all time) y: 75 lines of code cpp/src/std_utils.h x: 1 contributors (all time) y: 78 lines of code cpp/src/visited_list_pool.h x: 1 contributors (all time) y: 62 lines of code java/JavaInputStream.h x: 2 contributors (all time) y: 102 lines of code python/typenames.h x: 2 contributors (all time) y: 12 lines of code benchmarks/index_creation.py x: 1 contributors (all time) y: 36 lines of code benchmarks/index_query.py x: 1 contributors (all time) y: 57 lines of code java/src/main/java/com/spotify/voyager/jni/StringIndex.java x: 3 contributors (all time) y: 244 lines of code java/src/main/java/com/spotify/voyager/jni/utils/JniLibExtractor.java x: 1 contributors (all time) y: 41 lines of code java/thread_pool.h x: 1 contributors (all time) y: 51 lines of code
1197.0
lines of code
  min: 1.0
  average: 167.14
  25th percentile: 25.0
  median: 57.0
  75th percentile: 181.5
  max: 1197.0
0 5.0
contributors (all time)
min: 1.0 | average: 1.46 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 2.0 | max: 5.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".