GoogleCloudPlatform / gcs-connector-for-pytorch
File Size

The distribution of file sizes (measured in lines of code).

File Size Overall
Lines per file | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
All files | 0% | 0% | 20% | 37% | 42%


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 0% | 27% | 48% | 24%
yaml | 0% | 0% | 0% | 0% | 100%
cfg | 0% | 0% | 0% | 0% | 100%
toml | 0% | 0% | 0% | 0% | 100%
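
The percentages in these tables can be reproduced from the per-file line counts. The sketch below assumes that each bucket's share is weighted by lines of code (not by file count) and that percentages are truncated to whole numbers; neither is stated in the report, but both are consistent with the py row. The names `BUCKETS`, `PY_FILE_SIZES`, `bucket_for`, and `loc_share_by_bucket` are hypothetical, and the sizes are the .py line counts from the "Longest Files" table.

```python
# Size buckets in the order the report lists them: (label, lower bound, upper bound).
BUCKETS = [
    ("1001+", 1001, None),
    ("501-1000", 501, 1000),
    ("201-500", 201, 500),
    ("101-200", 101, 200),
    ("1-100", 1, 100),
]

# Line counts of the 18 .py files, copied from the "Longest Files" table.
PY_FILE_SIZES = [226, 201, 142, 141, 138, 129, 109, 105, 97,
                 97, 78, 37, 31, 19, 16, 8, 1, 1]


def bucket_for(loc: int) -> str:
    """Return the label of the size bucket that a file with `loc` lines falls into."""
    for label, low, high in BUCKETS:
        if loc >= low and (high is None or loc <= high):
            return label
    raise ValueError(f"line count must be at least 1, got {loc}")


def loc_share_by_bucket(sizes: list[int]) -> dict[str, float]:
    """Return, per bucket, the percentage of all lines that live in files of that size."""
    totals = {label: 0 for label, _, _ in BUCKETS}
    for loc in sizes:
        totals[bucket_for(loc)] += loc
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {label: 0.0 for label in totals}
    return {label: 100.0 * lines / grand_total for label, lines in totals.items()}


if __name__ == "__main__":
    # Assumption: the report truncates percentages, so int() is used here.
    for label, share in loc_share_by_bucket(PY_FILE_SIZES).items():
        print(f"{label}: {int(share)}%")
```

Under these assumptions the py files yield 0% | 0% | 27% | 48% | 24%, which agrees with the py row above.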
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
dataflux_pytorch | 0% | 0% | 23% | 41% | 35%
kokoro | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 31)
File | Folder | # lines | # units
standalone_dataloader.py | dataflux_pytorch/benchmark/standalone_dataloader | 226 | 8
train.py | dataflux_pytorch/benchmark/checkpointing/multinode | 201 | 6
multipart.py | dataflux_pytorch/multipart_upload | 142 | 4
train.py | dataflux_pytorch/benchmark/checkpointing/singlenode | 141 | 10
dataflux_mapstyle_dataset.py | dataflux_pytorch | 138 | 9
dataflux_iterable_dataset.py | dataflux_pytorch | 129 | 5
gcs_filesystem.py | dataflux_pytorch/lightning | 109 | 11
train_async_save.py | dataflux_pytorch/benchmark/checkpointing/multinode | 105 | 4
benchmark.py | dataflux_pytorch/benchmark/checkpointing/simulated | 97 | 4
llama2.py | dataflux_pytorch/benchmark/checkpointing/simulated | 97 | 4
deployment.yaml | dataflux_pytorch/benchmark/standalone_dataloader | 91 | -
benchmark-deploy.yaml | dataflux_pytorch/benchmark/checkpointing/multinode | 84 | -
benchmark-deployment.yaml | dataflux_pytorch/benchmark/checkpointing/simulated | 83 | -
dataflux_lightning_checkpoint.py | dataflux_pytorch/lightning | 78 | 7
[name missing] (cfg) | [missing] | 51 | -
[name missing] (cfg) | [missing] | 46 | -
pypi.cfg | kokoro | 46 | -
dataflux_checkpoint.py | dataflux_pytorch | 37 | 5
[name missing] | [missing] | 36 | -
path_utils.py | dataflux_pytorch/lightning | 31 | 2
setup.py | root | 19 | -
_helper.py | dataflux_pytorch | 16 | 1
gcsfuse-pv.yaml | dataflux_pytorch/benchmark/checkpointing/multinode | 15 | -
gcsfuse-pvc.yaml | dataflux_pytorch/benchmark/checkpointing/multinode | 13 | -
__init__.py | dataflux_pytorch/lightning | 8 | -
[name missing] | [missing] | 6 | -
[name missing] | [missing] | 6 | -
[name missing] (cfg) | [missing] | 6 | -
presubmit.cfg | kokoro | 1 | -
__init__.py | dataflux_pytorch/multipart_upload | 1 | -
__init__.py | dataflux_pytorch | 1 | -
Files With Most Units (Top 14)
File | Folder | # lines | # units
gcs_filesystem.py | dataflux_pytorch/lightning | 109 | 11
train.py | dataflux_pytorch/benchmark/checkpointing/singlenode | 141 | 10
dataflux_mapstyle_dataset.py | dataflux_pytorch | 138 | 9
standalone_dataloader.py | dataflux_pytorch/benchmark/standalone_dataloader | 226 | 8
dataflux_lightning_checkpoint.py | dataflux_pytorch/lightning | 78 | 7
train.py | dataflux_pytorch/benchmark/checkpointing/multinode | 201 | 6
dataflux_iterable_dataset.py | dataflux_pytorch | 129 | 5
dataflux_checkpoint.py | dataflux_pytorch | 37 | 5
multipart.py | dataflux_pytorch/multipart_upload | 142 | 4
benchmark.py | dataflux_pytorch/benchmark/checkpointing/simulated | 97 | 4
llama2.py | dataflux_pytorch/benchmark/checkpointing/simulated | 97 | 4
train_async_save.py | dataflux_pytorch/benchmark/checkpointing/multinode | 105 | 4
path_utils.py | dataflux_pytorch/lightning | 31 | 2
_helper.py | dataflux_pytorch | 16 | 1
Files With Long Lines (Top 8)

There are 8 files with lines longer than 120 characters. In total, there are 16 long lines.

File | Folder | # lines | # units | # long lines
standalone_dataloader.py | dataflux_pytorch/benchmark/standalone_dataloader | 226 | 8 | 4
dataflux_mapstyle_dataset.py | dataflux_pytorch | 138 | 9 | 2
dataflux_iterable_dataset.py | dataflux_pytorch | 129 | 5 | 2
benchmark.py | dataflux_pytorch/benchmark/checkpointing/simulated | 97 | 4 | 2
llama2.py | dataflux_pytorch/benchmark/checkpointing/simulated | 97 | 4 | 2
deployment.yaml | dataflux_pytorch/benchmark/standalone_dataloader | 91 | - | 2
_helper.py | dataflux_pytorch | 16 | 1 | 1
gcs_filesystem.py | dataflux_pytorch/lightning | 109 | 11 | 1
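
A long-line count like the one in this section can be approximated with a few lines of Python. This is a minimal sketch, assuming a line is "long" when it has more than 120 characters excluding the line terminator; how the report treats tabs or trailing whitespace is not stated. The function names `count_long_lines` and `summarize_long_lines` are hypothetical.

```python
def count_long_lines(text: str, limit: int = 120) -> int:
    """Count the lines in `text` that are strictly longer than `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def summarize_long_lines(sources: dict[str, str], limit: int = 120) -> tuple[int, int]:
    """Return (number of files with at least one long line, total long lines).

    `sources` maps a file name to that file's full text.
    """
    per_file = {name: count_long_lines(text, limit) for name, text in sources.items()}
    flagged = [count for count in per_file.values() if count > 0]
    return len(flagged), sum(flagged)


if __name__ == "__main__":
    # Tiny synthetic input: one file with two long lines, one file with none.
    demo = {
        "a.py": "x = 1\n" + "y" * 121 + "\n" + "z" * 200 + "\n",
        "b.py": "short line\n" + "w" * 120 + "\n",  # exactly 120 is not long
    }
    files, lines = summarize_long_lines(demo)
    print(f"{files} file(s) with long lines, {lines} long line(s) in total")
```

Applied to the table above, the per-file counts (4, 2, 2, 2, 2, 2, 1, 1) sum to the 16 long lines reported for the 8 files.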
Correlations

File Size vs. Commits (all time): 31 points

File | commits (all time) | lines of code
dataflux_pytorch/multipart_upload/multipart.py | 3 | 142
pyproject.toml | 8 | 36
dataflux_pytorch/benchmark/checkpointing/simulated/benchmark.py | 2 | 97
dataflux_pytorch/benchmark/checkpointing/simulated/llama2.py | 1 | 97
setup.py | 7 | 19
dataflux_pytorch/benchmark/checkpointing/singlenode/train.py | 7 | 141
dataflux_pytorch/benchmark/checkpointing/simulated/benchmark-deployment.yaml | 1 | 83
dataflux_pytorch/benchmark/checkpointing/multinode/train_async_save.py | 1 | 105
dataflux_pytorch/benchmark/checkpointing/multinode/train.py | 12 | 201
dataflux_pytorch/lightning/gcs_filesystem.py | 8 | 109
dataflux_pytorch/dataflux_mapstyle_dataset.py | 15 | 138
dataflux_pytorch/benchmark/checkpointing/multinode/benchmark-deploy.yaml | 5 | 84
dataflux_pytorch/benchmark/checkpointing/multinode/gcsfuse-pv.yaml | 1 | 15
dataflux_pytorch/benchmark/checkpointing/multinode/gcsfuse-pvc.yaml | 1 | 13
dataflux_pytorch/dataflux_checkpoint.py | 6 | 37
kokoro/presubmit_mac_x86.cfg | 1 | 6
dataflux_pytorch/lightning/__init__.py | 5 | 8
dataflux_pytorch/lightning/dataflux_lightning_checkpoint.py | 13 | 78
kokoro/presubmit.cfg | 3 | 1
kokoro/presubmit_base.cfg | 1 | 51
dataflux_pytorch/multipart_upload/__init__.py | 1 | 1
dataflux_pytorch/lightning/path_utils.py | 1 | 31
dataflux_pytorch/_helper.py | 2 | 16
dataflux_pytorch/dataflux_iterable_dataset.py | 10 | 129
dataflux_pytorch/benchmark/standalone_dataloader/deployment.yaml | 1 | 91
dataflux_pytorch/benchmark/standalone_dataloader/standalone_dataloader.py | 2 | 226
kokoro/pypi_presubmit.cfg | 1 | 46

lines of code
min: 1.0 | average: 66.45 | 25th percentile: 13.0 | median: 46.0 | 75th percentile: 105.0 | max: 226.0
commits (all time)
min: 1.0 | average: 3.97 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 7.0 | max: 15.0
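
The summary line for commits can be approximated with Python's standard `statistics` module. This is a sketch under stated assumptions: the report does not say which percentile method it uses, so the "inclusive" method of `statistics.quantiles` is a guess, and `COMMITS` holds only the 27 commit counts listed above, while the report covers 31 points. The average computed here will therefore differ from the reported 3.97. The names `COMMITS` and `summarize` are hypothetical.

```python
import statistics

# Commit counts of the 27 data points listed above, in the order they appear.
COMMITS = [3, 8, 2, 1, 7, 7, 1, 1, 12, 8, 15, 5, 1, 1,
           6, 1, 5, 13, 3, 1, 1, 1, 2, 10, 1, 2, 1]


def summarize(values: list[int]) -> dict[str, float]:
    """Return min, average, quartiles, and max in the layout the report uses."""
    # Assumption: linear interpolation that includes both end points.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "min": float(min(values)),
        "average": statistics.fmean(values),
        "25th percentile": q1,
        "median": median,
        "75th percentile": q3,
        "max": float(max(values)),
    }


if __name__ == "__main__":
    summary = summarize(COMMITS)
    print(" | ".join(f"{name}: {value:.2f}" for name, value in summary.items()))
```

The same function applies unchanged to the lines-of-code or contributor columns.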

File Size vs. Contributors (all time): 31 points

File | contributors (all time) | lines of code
dataflux_pytorch/multipart_upload/multipart.py | 2 | 142
pyproject.toml | 3 | 36
dataflux_pytorch/benchmark/checkpointing/simulated/benchmark.py | 1 | 97
setup.py | 5 | 19
dataflux_pytorch/benchmark/checkpointing/singlenode/train.py | 3 | 141
dataflux_pytorch/benchmark/checkpointing/simulated/benchmark-deployment.yaml | 1 | 83
dataflux_pytorch/benchmark/checkpointing/multinode/train_async_save.py | 1 | 105
dataflux_pytorch/benchmark/checkpointing/multinode/train.py | 3 | 201
dataflux_pytorch/lightning/gcs_filesystem.py | 4 | 109
dataflux_pytorch/dataflux_mapstyle_dataset.py | 7 | 138
dataflux_pytorch/benchmark/checkpointing/multinode/benchmark-deploy.yaml | 3 | 84
dataflux_pytorch/benchmark/checkpointing/multinode/gcsfuse-pv.yaml | 1 | 15
dataflux_pytorch/benchmark/checkpointing/multinode/gcsfuse-pvc.yaml | 1 | 13
dataflux_pytorch/dataflux_checkpoint.py | 3 | 37
kokoro/presubmit_mac_x86.cfg | 1 | 6
dataflux_pytorch/lightning/__init__.py | 4 | 8
dataflux_pytorch/lightning/dataflux_lightning_checkpoint.py | 5 | 78
kokoro/presubmit.cfg | 1 | 1
kokoro/presubmit_base.cfg | 1 | 51
dataflux_pytorch/lightning/path_utils.py | 1 | 31
dataflux_pytorch/_helper.py | 2 | 16
dataflux_pytorch/dataflux_iterable_dataset.py | 5 | 129
dataflux_pytorch/benchmark/standalone_dataloader/deployment.yaml | 1 | 91
dataflux_pytorch/benchmark/standalone_dataloader/standalone_dataloader.py | 1 | 226
kokoro/pypi_presubmit.cfg | 1 | 46

lines of code
min: 1.0 | average: 66.45 | 25th percentile: 13.0 | median: 46.0 | 75th percentile: 105.0 | max: 226.0
contributors (all time)
min: 1.0 | average: 2.16 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 3.0 | max: 7.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".