openai / chatgpt-retrieval-plugin
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 47% | 33% | 18%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 48% | 34% | 17%
toml0% | 0% | 0% | 0% | 100%
yaml0% | 0% | 0% | 0% | 100%
jsonl0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
datastore0% | 0% | 62% | 26% | 11%
scripts0% | 0% | 0% | 98% | 1%
server0% | 0% | 0% | 100% | 0%
local_server0% | 0% | 0% | 90% | 9%
services0% | 0% | 0% | 0% | 100%
models0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
Longest Files (Top 36)
File# lines# units
milvus_datastore.py
in datastore/providers
413 7
azuresearch_datastore.py
in datastore/providers
327 4
weaviate_datastore.py
in datastore/providers
307 7
elasticsearch_datastore.py
in datastore/providers
277 8
qdrant_datastore.py
in datastore/providers
236 8
analyticdb_datastore.py
in datastore/providers
219 10
azurecosmosdb_datastore.py
in datastore/providers
208 3
chroma_datastore.py
in datastore/providers
199 4
pinecone_datastore.py
in datastore/providers
194 3
mongodb_atlas_datastore.py
in datastore/providers
165 5
llama_datastore.py
in datastore/providers
160 7
main.py
in server
130 2
pgvector_datastore.py
in datastore/providers
129 2
main.py
in local_server
118 1
process_zip.py
in scripts/process_zip
109 -
process_json.py
in scripts/process_json
106 -
process_jsonl.py
in scripts/process_jsonl
104 -
postgres_datastore.py
in datastore/providers
100 3
chunks.py
in services
91 3
file.py
in services
82 2
supabase_datastore.py
in datastore/providers
64 2
factory.py
in datastore
62 -
57 -
datastore.py
in datastore
53 -
zilliz_datastore.py
in datastore/providers
46 3
models.py
in models
44 -
redis_datastore.py
in datastore/providers
40 -
openai.py
in services
38 2
28 1
api.py
in models
22 -
pii_detection.py
in services
17 1
openapi.yaml
in local_server
12 -
date.py
in services
9 1
jsonl
example.jsonl
in scripts/process_jsonl
6 -
__init__.py
in datastore
1 -
__init__.py
in datastore/providers
1 -
Files With Most Units (Top 23)
File# lines# units
analyticdb_datastore.py
in datastore/providers
219 10
elasticsearch_datastore.py
in datastore/providers
277 8
qdrant_datastore.py
in datastore/providers
236 8
weaviate_datastore.py
in datastore/providers
307 7
milvus_datastore.py
in datastore/providers
413 7
llama_datastore.py
in datastore/providers
160 7
mongodb_atlas_datastore.py
in datastore/providers
165 5
chroma_datastore.py
in datastore/providers
199 4
azuresearch_datastore.py
in datastore/providers
327 4
chunks.py
in services
91 3
postgres_datastore.py
in datastore/providers
100 3
azurecosmosdb_datastore.py
in datastore/providers
208 3
pinecone_datastore.py
in datastore/providers
194 3
zilliz_datastore.py
in datastore/providers
46 3
file.py
in services
82 2
openai.py
in services
38 2
pgvector_datastore.py
in datastore/providers
129 2
supabase_datastore.py
in datastore/providers
64 2
main.py
in server
130 2
date.py
in services
9 1
pii_detection.py
in services
17 1
28 1
main.py
in local_server
118 1
Files With Long Lines (Top 9)

There are 9 files with lines longer than 120 characters. In total, there are 11 long lines.

File# lines# units# long lines
jsonl
example.jsonl
in scripts/process_jsonl
6 - 3
postgres_datastore.py
in datastore/providers
100 3 1
elasticsearch_datastore.py
in datastore/providers
277 8 1
zilliz_datastore.py
in datastore/providers
46 3 1
azuresearch_datastore.py
in datastore/providers
327 4 1
mongodb_atlas_datastore.py
in datastore/providers
165 5 1
factory.py
in datastore
62 - 1
openapi.yaml
in local_server
12 - 1
main.py
in server
130 2 1
Correlations

File Size vs. Commits (all time): 36 points

datastore/datastore.py x: 2 commits (all time) y: 53 lines of code datastore/factory.py x: 12 commits (all time) y: 62 lines of code datastore/providers/mongodb_atlas_datastore.py x: 1 commits (all time) y: 165 lines of code pyproject.toml x: 18 commits (all time) y: 57 lines of code services/openai.py x: 5 commits (all time) y: 38 lines of code datastore/providers/analyticdb_datastore.py x: 3 commits (all time) y: 219 lines of code datastore/providers/azurecosmosdb_datastore.py x: 2 commits (all time) y: 208 lines of code datastore/providers/azuresearch_datastore.py x: 4 commits (all time) y: 327 lines of code datastore/providers/elasticsearch_datastore.py x: 2 commits (all time) y: 277 lines of code datastore/providers/llama_datastore.py x: 2 commits (all time) y: 160 lines of code datastore/providers/milvus_datastore.py x: 11 commits (all time) y: 413 lines of code datastore/providers/pinecone_datastore.py x: 6 commits (all time) y: 194 lines of code datastore/providers/qdrant_datastore.py x: 4 commits (all time) y: 236 lines of code datastore/providers/redis_datastore.py x: 12 commits (all time) y: 40 lines of code datastore/providers/zilliz_datastore.py x: 8 commits (all time) y: 46 lines of code local_server/openapi.yaml x: 2 commits (all time) y: 12 lines of code services/chunks.py x: 7 commits (all time) y: 91 lines of code services/extract_metadata.py x: 4 commits (all time) y: 28 lines of code services/pii_detection.py x: 3 commits (all time) y: 17 lines of code datastore/providers/pgvector_datastore.py x: 2 commits (all time) y: 129 lines of code datastore/providers/weaviate_datastore.py x: 8 commits (all time) y: 307 lines of code local_server/main.py x: 3 commits (all time) y: 118 lines of code scripts/process_json/process_json.py x: 5 commits (all time) y: 106 lines of code scripts/process_jsonl/process_jsonl.py x: 5 commits (all time) y: 104 lines of code scripts/process_zip/process_zip.py x: 4 commits (all time) y: 109 lines of code server/main.py x: 7 commits (all time) y: 130 lines of code services/date.py x: 4 commits (all time) y: 9 lines of code services/file.py x: 9 commits (all time) y: 82 lines of code datastore/providers/postgres_datastore.py x: 1 commits (all time) y: 100 lines of code datastore/providers/supabase_datastore.py x: 1 commits (all time) y: 64 lines of code datastore/providers/chroma_datastore.py x: 2 commits (all time) y: 199 lines of code datastore/providers/__init__.py x: 1 commits (all time) y: 1 lines of code models/api.py x: 2 commits (all time) y: 22 lines of code models/models.py x: 1 commits (all time) y: 44 lines of code scripts/process_jsonl/example.jsonl x: 1 commits (all time) y: 6 lines of code
413.0
lines of code
  min: 1.0
  average: 115.94
  25th percentile: 38.5
  median: 95.5
  75th percentile: 186.75
  max: 413.0
0 18.0
commits (all time)
min: 1.0 | average: 4.58 | 25th percentile: 2.0 | median: 3.5 | 75th percentile: 6.75 | max: 18.0

File Size vs. Contributors (all time): 36 points

datastore/datastore.py x: 2 contributors (all time) y: 53 lines of code datastore/factory.py x: 11 contributors (all time) y: 62 lines of code datastore/providers/mongodb_atlas_datastore.py x: 1 contributors (all time) y: 165 lines of code pyproject.toml x: 13 contributors (all time) y: 57 lines of code services/openai.py x: 4 contributors (all time) y: 38 lines of code datastore/providers/analyticdb_datastore.py x: 3 contributors (all time) y: 219 lines of code datastore/providers/azurecosmosdb_datastore.py x: 2 contributors (all time) y: 208 lines of code datastore/providers/azuresearch_datastore.py x: 3 contributors (all time) y: 327 lines of code datastore/providers/elasticsearch_datastore.py x: 2 contributors (all time) y: 277 lines of code datastore/providers/llama_datastore.py x: 2 contributors (all time) y: 160 lines of code datastore/providers/milvus_datastore.py x: 7 contributors (all time) y: 413 lines of code datastore/providers/pinecone_datastore.py x: 4 contributors (all time) y: 194 lines of code datastore/providers/qdrant_datastore.py x: 2 contributors (all time) y: 236 lines of code datastore/providers/redis_datastore.py x: 7 contributors (all time) y: 40 lines of code datastore/providers/zilliz_datastore.py x: 6 contributors (all time) y: 46 lines of code local_server/openapi.yaml x: 2 contributors (all time) y: 12 lines of code services/chunks.py x: 3 contributors (all time) y: 91 lines of code services/extract_metadata.py x: 3 contributors (all time) y: 28 lines of code services/pii_detection.py x: 2 contributors (all time) y: 17 lines of code datastore/providers/pgvector_datastore.py x: 2 contributors (all time) y: 129 lines of code datastore/providers/weaviate_datastore.py x: 5 contributors (all time) y: 307 lines of code local_server/main.py x: 3 contributors (all time) y: 118 lines of code scripts/process_json/process_json.py x: 4 contributors (all time) y: 106 lines of code scripts/process_jsonl/process_jsonl.py x: 4 contributors (all time) y: 104 lines of code scripts/process_zip/process_zip.py x: 4 contributors (all time) y: 109 lines of code server/main.py x: 4 contributors (all time) y: 130 lines of code services/date.py x: 3 contributors (all time) y: 9 lines of code services/file.py x: 5 contributors (all time) y: 82 lines of code datastore/providers/postgres_datastore.py x: 1 contributors (all time) y: 100 lines of code datastore/providers/supabase_datastore.py x: 1 contributors (all time) y: 64 lines of code datastore/providers/chroma_datastore.py x: 2 contributors (all time) y: 199 lines of code datastore/providers/__init__.py x: 1 contributors (all time) y: 1 lines of code models/api.py x: 1 contributors (all time) y: 22 lines of code models/models.py x: 1 contributors (all time) y: 44 lines of code scripts/process_jsonl/example.jsonl x: 1 contributors (all time) y: 6 lines of code
413.0
lines of code
  min: 1.0
  average: 115.94
  25th percentile: 38.5
  median: 95.5
  75th percentile: 186.75
  max: 413.0
0 13.0
contributors (all time)
min: 1.0 | average: 3.39 | 25th percentile: 2.0 | median: 3.0 | 75th percentile: 4.0 | max: 13.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".