GoogleCloudPlatform / genai-product-catalog
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
64% | 18% | 7% | 2% | 8%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
ipynb71% | 20% | 8% | 0% | 0%
py0% | 0% | 0% | 21% | 78%
bzl0% | 0% | 0% | 0% | 100%
toml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
yaml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
experiments64% | 18% | 7% | 2% | 8%
Longest Files (Top 42)
File# lines# units
0_EDA_flipkart_dataset.ipynb
in experiments/legacy/notebooks
2189 -
data_prep_with_restricts.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
2153 -
NB4_create_streaming_index_with_filters.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
1451 -
NB3_create_streaming_index.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
1264 -
NB2_create_batch_index_with_filters.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
860 -
VectoreSearch_Cleanup.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
580 -
NB1_create_batch_index.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
547 -
1_generate_embeddings.ipynb
in experiments/legacy/notebooks
420 -
2_create_vector_db.ipynb
in experiments/legacy/notebooks
387 -
attributes.py
in experiments/google/cloud/ml/applied/attributes
115 6
category.py
in experiments/google/cloud/ml/applied/categories
108 5
attributes.py
in experiments/legacy/backend
92 6
category.py
in experiments/legacy/backend
88 5
embeddings.py
in experiments/google/cloud/ml/applied/embeddings
76 4
image_to_text.py
in experiments/google/cloud/ml/applied/images
74 8
search.py
in experiments/google/cloud/ml/applied/embeddings
71 4
setup.py
in experiments
70 -
embeddings.py
in experiments/legacy/backend
68 4
copy_file_groups.bzl
in experiments/conf
56 -
domain_model.py
in experiments/google/cloud/ml/applied/model
49 4
api.py
in experiments/legacy/backend
44 3
app.toml
in experiments/conf
35 -
nearest_neighbors.py
in experiments/google/cloud/ml/applied/knn
35 1
utils.py
in experiments/google/cloud/ml/applied/utils
31 4
nearest_neighbors.py
in experiments/legacy/backend
27 1
in
requirements.in
in experiments/conf
27 -
config.py
in experiments/legacy/backend
26 -
config.py
in experiments/google/cloud/ml/applied
23 1
marketing.py
in experiments/legacy/backend
14 1
utils.py
in experiments/legacy/backend
11 2
marketing.py
in experiments/google/cloud/ml/applied/marketing
11 1
app.yaml
in experiments/legacy/backend
3 -
__init__.py
in experiments/google/cloud/ml/applied
2 -
__init__.py
in experiments/google/cloud/ml/applied/marketing
1 -
__init__.py
in experiments/google/cloud/ml/applied/model
1 -
__init__.py
in experiments/google/cloud/ml/applied/utils
1 -
__init__.py
in experiments/google/cloud/ml/applied/categories
1 -
__init__.py
in experiments/google/cloud/ml/applied/knn
1 -
gapic_version.py
in experiments/google/cloud/ml/applied
1 -
__init__.py
in experiments/google/cloud/ml/applied/attributes
1 -
__init__.py
in experiments/google/cloud/ml/applied/embeddings
1 -
__init__.py
in experiments/google/cloud/ml/applied/images
1 -
Files With Most Units (Top 17)
File# lines# units
image_to_text.py
in experiments/google/cloud/ml/applied/images
74 8
attributes.py
in experiments/legacy/backend
92 6
attributes.py
in experiments/google/cloud/ml/applied/attributes
115 6
category.py
in experiments/legacy/backend
88 5
category.py
in experiments/google/cloud/ml/applied/categories
108 5
embeddings.py
in experiments/legacy/backend
68 4
domain_model.py
in experiments/google/cloud/ml/applied/model
49 4
utils.py
in experiments/google/cloud/ml/applied/utils
31 4
search.py
in experiments/google/cloud/ml/applied/embeddings
71 4
embeddings.py
in experiments/google/cloud/ml/applied/embeddings
76 4
api.py
in experiments/legacy/backend
44 3
utils.py
in experiments/legacy/backend
11 2
marketing.py
in experiments/legacy/backend
14 1
nearest_neighbors.py
in experiments/legacy/backend
27 1
marketing.py
in experiments/google/cloud/ml/applied/marketing
11 1
nearest_neighbors.py
in experiments/google/cloud/ml/applied/knn
35 1
config.py
in experiments/google/cloud/ml/applied
23 1
Files With Long Lines (Top 12)

There are 12 files with lines longer than 120 characters. In total, there are 261 long lines.

File# lines# units# long lines
data_prep_with_restricts.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
2153 - 49
NB3_create_streaming_index.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
1264 - 48
NB4_create_streaming_index_with_filters.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
1451 - 48
NB1_create_batch_index.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
547 - 45
NB2_create_batch_index_with_filters.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
860 - 33
0_EDA_flipkart_dataset.ipynb
in experiments/legacy/notebooks
2189 - 13
1_generate_embeddings.ipynb
in experiments/legacy/notebooks
420 - 12
2_create_vector_db.ipynb
in experiments/legacy/notebooks
387 - 7
VectoreSearch_Cleanup.ipynb
in experiments/legacy/notebooks/vectorSearchIndexUpdate
580 - 3
category.py
in experiments/legacy/backend
88 5 1
app.toml
in experiments/conf
35 - 1
setup.py
in experiments
70 - 1
Correlations

File Size vs. Commits (all time): 42 points

experiments/conf/app.toml x: 1 commits (all time) y: 35 lines of code experiments/conf/copy_file_groups.bzl x: 1 commits (all time) y: 56 lines of code experiments/conf/requirements.in x: 1 commits (all time) y: 27 lines of code experiments/google/cloud/ml/applied/__init__.py x: 1 commits (all time) y: 2 lines of code experiments/google/cloud/ml/applied/attributes/attributes.py x: 1 commits (all time) y: 115 lines of code experiments/google/cloud/ml/applied/categories/category.py x: 1 commits (all time) y: 108 lines of code experiments/google/cloud/ml/applied/config.py x: 1 commits (all time) y: 23 lines of code experiments/google/cloud/ml/applied/embeddings/embeddings.py x: 1 commits (all time) y: 76 lines of code experiments/google/cloud/ml/applied/marketing/marketing.py x: 1 commits (all time) y: 11 lines of code experiments/google/cloud/ml/applied/model/domain_model.py x: 1 commits (all time) y: 49 lines of code experiments/legacy/backend/attributes.py x: 1 commits (all time) y: 92 lines of code experiments/legacy/backend/embeddings.py x: 1 commits (all time) y: 68 lines of code experiments/legacy/notebooks/0_EDA_flipkart_dataset.ipynb x: 1 commits (all time) y: 2189 lines of code experiments/legacy/notebooks/1_generate_embeddings.ipynb x: 1 commits (all time) y: 420 lines of code experiments/legacy/notebooks/2_create_vector_db.ipynb x: 1 commits (all time) y: 387 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB1_create_batch_index.ipynb x: 1 commits (all time) y: 547 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB2_create_batch_index_with_filters.ipynb x: 1 commits (all time) y: 860 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB3_create_streaming_index.ipynb x: 1 commits (all time) y: 1264 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB4_create_streaming_index_with_filters.ipynb x: 1 commits (all time) y: 1451 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/VectoreSearch_Cleanup.ipynb x: 1 commits (all time) y: 580 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/data_prep_with_restricts.ipynb x: 1 commits (all time) y: 2153 lines of code
2189.0
lines of code
  min: 1.0
  average: 262.29
  25th percentile: 2.75
  median: 39.5
  75th percentile: 109.75
  max: 2189.0
0 1.0
commits (all time)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Contributors (all time): 42 points

experiments/conf/app.toml x: 1 contributors (all time) y: 35 lines of code experiments/conf/copy_file_groups.bzl x: 1 contributors (all time) y: 56 lines of code experiments/conf/requirements.in x: 1 contributors (all time) y: 27 lines of code experiments/google/cloud/ml/applied/__init__.py x: 1 contributors (all time) y: 2 lines of code experiments/google/cloud/ml/applied/attributes/attributes.py x: 1 contributors (all time) y: 115 lines of code experiments/google/cloud/ml/applied/categories/category.py x: 1 contributors (all time) y: 108 lines of code experiments/google/cloud/ml/applied/config.py x: 1 contributors (all time) y: 23 lines of code experiments/google/cloud/ml/applied/embeddings/embeddings.py x: 1 contributors (all time) y: 76 lines of code experiments/google/cloud/ml/applied/marketing/marketing.py x: 1 contributors (all time) y: 11 lines of code experiments/google/cloud/ml/applied/model/domain_model.py x: 1 contributors (all time) y: 49 lines of code experiments/legacy/backend/attributes.py x: 1 contributors (all time) y: 92 lines of code experiments/legacy/backend/embeddings.py x: 1 contributors (all time) y: 68 lines of code experiments/legacy/notebooks/0_EDA_flipkart_dataset.ipynb x: 1 contributors (all time) y: 2189 lines of code experiments/legacy/notebooks/1_generate_embeddings.ipynb x: 1 contributors (all time) y: 420 lines of code experiments/legacy/notebooks/2_create_vector_db.ipynb x: 1 contributors (all time) y: 387 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB1_create_batch_index.ipynb x: 1 contributors (all time) y: 547 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB2_create_batch_index_with_filters.ipynb x: 1 contributors (all time) y: 860 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB3_create_streaming_index.ipynb x: 1 contributors (all time) y: 1264 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/NB4_create_streaming_index_with_filters.ipynb x: 1 contributors (all time) y: 1451 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/VectoreSearch_Cleanup.ipynb x: 1 contributors (all time) y: 580 lines of code experiments/legacy/notebooks/vectorSearchIndexUpdate/data_prep_with_restricts.ipynb x: 1 contributors (all time) y: 2153 lines of code
2189.0
lines of code
  min: 1.0
  average: 262.29
  25th percentile: 2.75
  median: 39.5
  75th percentile: 109.75
  max: 2189.0
0 1.0
contributors (all time)
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".