GoogleCloudPlatform / public-datasets-pipelines
Cloud-native, data onboarding architecture for Google Cloud Datasets
GitHub Repo 
105K
lines of main code
994 files
2.2K
lines of test code
7 files
52K
lines of other code
269 files
4y
age
1,470 days
45%
main code touched
1 year (48K LOC)
4%
new main code
1 year (5K LOC)
46K
py
40K
yaml
18K
tf
JINJA2
0.4K
jinja2
0.05K
toml

5

49

34

212

130

2

5

3

16

13

2025 2024 2023 2022 2021

generated by sokrates.dev (configuration) on 2025-05-04