GoogleCloudPlatform / dataproc-templates
File Change Frequency

File change frequency (churn) shows the distribution of file updates, where one update is a day with at least one commit to the file.
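The "days with at least one commit" definition can be sketched as follows. This is a minimal illustration, not the report generator's actual code: the `change_days` helper and the (path, date) input shape are assumptions, and the sample history is hypothetical.

```python
from collections import defaultdict
from datetime import date


def change_days(commits):
    """Count, per file, the distinct days on which the file was committed.

    `commits` is an iterable of (file_path, commit_date) pairs. This input
    shape is an assumption made for the sketch.
    """
    days = defaultdict(set)
    for path, day in commits:
        days[path].add(day)  # several commits on one day collapse into one entry
    return {path: len(seen) for path, seen in days.items()}


# Hypothetical history: two commits to main.py on the same day count once.
commits = [
    ("python/main.py", date(2022, 4, 5)),
    ("python/main.py", date(2022, 4, 5)),
    ("python/main.py", date(2022, 4, 6)),
    ("python/setup.py", date(2022, 4, 5)),
]
print(change_days(commits))
```

Counting days rather than commits keeps a burst of small commits on one day from inflating a file's churn.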

Overview
File Change Frequency Overall
  • There are 172 files with 22,266 lines of code.
    • 2 files changed more than 100 times (1,226 lines of code)
    • 4 files changed 51-100 times (373 lines of code)
    • 52 files changed 21-50 times (12,041 lines of code)
    • 71 files changed 6-20 times (6,471 lines of code)
    • 43 files changed 1-5 times (2,155 lines of code)
Share of lines of code per bucket (101+ | 51-100 | 21-50 | 6-20 | 1-5): 5% | 1% | 54% | 29% | 9%
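The percentages can be checked against the lines-of-code figures in the list above. The sketch below recomputes each bucket's share of the 22,266 lines; truncating to whole percent (rather than rounding) is an inference from the numbers, not something the report states.

```python
TOTAL_LOC = 22_266

# Lines of code per change-frequency bucket, copied from the overview list.
loc_by_bucket = {
    "101+": 1_226,
    "51-100": 373,
    "21-50": 12_041,
    "6-20": 6_471,
    "1-5": 2_155,
}

# The buckets partition the code base, so their sizes add up to the total.
assert sum(loc_by_bucket.values()) == TOTAL_LOC


def share(loc, total):
    """Percentage of `total`, truncated to a whole number (assumed convention)."""
    return loc * 100 // total


shares = {bucket: share(loc, TOTAL_LOC) for bucket, loc in loc_by_bucket.items()}
print(" | ".join(f"{pct}%" for pct in shares.values()))  # 5% | 1% | 54% | 29% | 9%
```

Because each share is truncated, the printed values sum to 98% rather than 100%.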

Contributors Count Frequency Overall
  • There are 172 files with 22,266 lines of code.
    • 8 files changed by more than 25 contributors (2,699 lines of code)
    • 69 files changed by 11-25 contributors (13,184 lines of code)
    • 42 files changed by 6-10 contributors (3,364 lines of code)
    • 39 files changed by 2-5 contributors (1,871 lines of code)
    • 14 files changed by 1 contributor (1,148 lines of code)
Share of lines of code per bucket (26+ | 11-25 | 6-10 | 2-5 | 1): 12% | 59% | 15% | 8% | 5%

File Change Frequency per File Extension
py, java, md, yaml, ipynb, sh, xml, txt, cfg, gitignore, json, in, ini, properties
File Change Frequency per Extension
The number of recorded file updates:
Extension | 101+ | 51-100 | 21-50 | 6-20 | 1-5
py | 12% | 2% | 29% | 41% | 14%
java | 5% | 3% | 42% | 32% | 15%
ipynb | 0% | 0% | 85% | 14% | 0%
xml | 0% | 0% | 0% | 100% | 0%
yaml | 0% | 0% | 0% | 94% | 5%
in | 0% | 0% | 0% | 0% | 100%
cfg | 0% | 0% | 0% | 0% | 100%
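A per-extension breakdown like this one can be derived by assigning each file to a change-frequency bucket and summing lines of code per extension. The sketch below is an illustration only: `bucket_for` and `loc_by_extension_and_bucket` are hypothetical helper names, and the sample rows are a small subset of the files listed in this report.

```python
from collections import defaultdict
from pathlib import PurePosixPath

# Lower bound of each bucket, highest first, using the report's bucket labels.
BUCKETS = [(101, "101+"), (51, "51-100"), (21, "21-50"), (6, "6-20"), (1, "1-5")]


def bucket_for(changes):
    """Return the change-frequency bucket label for a number of changes."""
    for lower, label in BUCKETS:
        if changes >= lower:
            return label
    raise ValueError("a tracked file has at least one change")


def loc_by_extension_and_bucket(files):
    """Sum lines of code per (extension, bucket) for (path, loc, changes) rows."""
    totals = defaultdict(lambda: defaultdict(int))
    for path, loc, changes in files:
        ext = PurePosixPath(path).suffix.lstrip(".")
        totals[ext][bucket_for(changes)] += loc
    return {ext: dict(buckets) for ext, buckets in totals.items()}


# (path, lines of code, changes) for a few files from this report.
files = [
    ("python/dataproc_templates/util/template_constants.py", 848, 124),
    ("python/main.py", 98, 83),
    ("python/setup.py", 58, 22),
    ("python/version.py", 1, 23),
    ("cloudbuild.yaml", 63, 13),
]
print(loc_by_extension_and_bucket(files))
```

Dividing each bucket total by the extension's overall line count would then give percentage rows of the kind shown above.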
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates:
Component | 101+ | 51-100 | 21-50 | 6-20 | 1-5
python | 16% | 2% | 34% | 31% | 15%
java | 5% | 3% | 42% | 33% | 15%
notebooks | 0% | 0% | 74% | 23% | 2%
airflow | 0% | 0% | 0% | 100% | 0%
ROOT | 0% | 0% | 0% | 100% | 0%
Most Frequently Changed Files (Top 50)

File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
template_constants.py
in python/dataproc_templates/util
848 4 2022-04-05 2025-03-04 124 44 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
TemplateConstants.java
in java/src/main/java/com/google/cloud/dataproc/templates/util
378 - 2022-04-18 2024-10-08 122 39 nilofr@google.com 57837394+rajc242@users.nore...
DataProcTemplate.java
in java/src/main/java/com/google/cloud/dataproc/templates/main
177 6 2022-04-18 2024-10-26 94 32 nilofr@google.com 106058340+hhasija@users.nor...
BaseTemplate.java
in java/src/main/java/com/google/cloud/dataproc/templates
52 1 2022-04-18 2024-10-26 83 28 nilofr@google.com 106058340+hhasija@users.nor...
main.py
in python
98 2 2022-04-05 2025-03-04 83 37 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
template_name.py
in python/dataproc_templates
46 2 2022-04-05 2025-03-04 78 37 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
jdbc_to_gcs.py
in python/dataproc_templates/jdbc
219 2 2022-06-29 2024-06-02 50 27 naveenkm@google.com 149653140+itsabhisharma23@u...
OracleToBigQuery_notebook.ipynb
in notebooks/oracle2bq
881 - 2022-10-04 2024-09-03 48 26 shivamsomani@google.com 57837394+rajc242@users.nore...
OracleToSpanner_notebook.ipynb
in notebooks/oracle2spanner
894 - 2022-10-14 2024-09-03 46 25 surjitsh@google.com 57837394+rajc242@users.nore...
MySqlToSpanner_notebook.ipynb
in notebooks/mysql2spanner
903 - 2022-10-01 2024-09-03 46 23 agarwalsh@google.com 57837394+rajc242@users.nore...
JDBCToBigQuery.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
123 4 2022-04-18 2024-10-08 44 21 nilofr@google.com 57837394+rajc242@users.nore...
DataplexGCStoBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/dataplex
380 14 2022-04-18 2024-10-08 44 25 nilofr@google.com 57837394+rajc242@users.nore...
JDBCToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
64 4 2022-04-18 2023-05-11 37 18 nilofr@google.com 106058340+hhasija@users.nor...
HiveToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/hive
86 3 2022-04-18 2023-05-11 36 22 nilofr@google.com 106058340+hhasija@users.nor...
mssql-to-bigquery-notebook.ipynb
in notebooks/mssql2bq
911 - 2022-10-24 2024-09-03 36 23 varunikagupta@google.com 57837394+rajc242@users.nore...
hive_to_gcs.py
in python/dataproc_templates/hive
108 2 2022-05-04 2023-04-25 35 24 surjitsh@google.com 105878445+vanshaj-bhatia@us...
GCStoBigquery.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
130 3 2022-04-18 2023-05-11 34 22 nilofr@google.com 106058340+hhasija@users.nor...
mssql-to-postgres-notebook.ipynb
in notebooks/mssql2postgresql
919 - 2022-10-01 2024-09-03 34 20 agarwalsh@google.com 57837394+rajc242@users.nore...
HiveToBigquery_notebook.ipynb
in notebooks/hive2bq
932 - 2022-10-01 2024-09-03 32 21 agarwalsh@google.com 57837394+rajc242@users.nore...
BigQueryToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/bigquery
92 3 2022-04-18 2024-09-05 31 23 nilofr@google.com 123699534+edwinobgoogle@use...
SpannerToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
68 4 2022-04-18 2024-09-24 30 17 nilofr@google.com 57837394+rajc242@users.nore...
HiveToBigQuery.java
in java/src/main/java/com/google/cloud/dataproc/templates/hive
79 3 2022-04-18 2024-10-08 30 19 nilofr@google.com 57837394+rajc242@users.nore...
gcs_to_bigtable.py
in python/dataproc_templates/gcs
98 2 2022-05-27 2024-12-02 30 16 nilofr@google.com 57837394+rajc242@users.nore...
JDBCToGCSConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
206 24 2022-07-25 2023-03-16 30 18 vanshajbhatia@google.com moukhtar@google.com
gcs_to_bigquery.py
in python/dataproc_templates/gcs
93 2 2022-04-05 2023-04-25 29 17 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
gcs_to_jdbc.py
in python/dataproc_templates/gcs
113 2 2022-06-18 2023-05-11 29 19 hhasija@google.com vanshajbhatia@google.com
notebook_constants.py
in notebooks/parameterize_script/util
115 - 2023-05-03 2023-07-31 29 9 tanyawarrier@google.com 105878445+vanshaj-bhatia@us...
KafkaToBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/kafka
152 4 2022-06-17 2024-10-08 28 17 vanshajbhatia@google.com 57837394+rajc242@users.nore...
jdbc_to_jdbc.py
in python/dataproc_templates/jdbc
227 2 2022-06-27 2024-06-02 28 21 naveenkm@google.com 149653140+itsabhisharma23@u...
CassandraToBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
61 4 2022-09-19 2024-10-08 27 16 anishks@google.com 57837394+rajc242@users.nore...
GCStoGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
93 3 2022-07-28 2023-05-11 27 15 hhasija@google.com 106058340+hhasija@users.nor...
GCSToJDBC.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
96 6 2022-05-25 2023-05-11 27 14 hhasija@google.com 106058340+hhasija@users.nor...
snowflake_to_gcs.py
in python/dataproc_templates/snowflake
212 5 2022-08-19 2023-05-11 27 19 varunikagupta@google.com vanshajbhatia@google.com
CassandraToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
59 4 2022-09-13 2023-05-26 26 16 anishks@google.com poojabasker@google.com
S3ToBigQuery.java
in java/src/main/java/com/google/cloud/dataproc/templates/s3
128 3 2022-04-18 2023-05-11 26 19 nilofr@google.com 106058340+hhasija@users.nor...
jdbc_to_bigquery.py
in python/dataproc_templates/jdbc
171 2 2022-08-26 2024-06-02 26 19 rafaelsilva@posteo.net 149653140+itsabhisharma23@u...
run_notebook.py
in notebooks
27 1 2023-05-05 2023-07-12 25 10 nilofr@google.com neil@thirdchimp.net
RedshiftToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
56 3 2022-09-21 2023-05-23 25 16 hhasija@google.com neil@thirdchimp.net
bigquery_to_gcs.py
in python/dataproc_templates/bigquery
86 2 2022-04-07 2024-01-18 24 18 surjitsh@google.com 149017703+shubhampathakk@us...
JDBCToSpanner.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
99 5 2022-10-05 2024-09-24 24 17 hhasija@google.com 57837394+rajc242@users.nore...
gcs_to_mongo.py
in python/dataproc_templates/gcs
102 2 2022-07-12 2023-05-11 24 16 hhasija@google.com vanshajbhatia@google.com
PubSubToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/pubsub
184 6 2022-05-03 2023-07-31 24 18 jigeorge@google.com 105878445+vanshaj-bhatia@us...
version.py
in python
1 - 2023-01-18 2024-09-05 23 14 117453385+ankuljain09@users... agarwalsh@google.com
script_name.py
in notebooks/parameterize_script
19 2 2023-05-03 2023-07-12 23 9 tanyawarrier@google.com neil@thirdchimp.net
SnowflakeToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/snowflake
56 4 2022-07-30 2023-05-11 23 14 vanshajbhatia@google.com 106058340+hhasija@users.nor...
HbaseToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/hbase
73 3 2022-06-29 2023-05-11 23 13 anishks@google.com 106058340+hhasija@users.nor...
GCSToSpanner.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
83 5 2022-04-18 2024-11-03 23 16 nilofr@google.com 106058340+hhasija@users.nor...
GCSToJDBCConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
120 13 2022-05-25 2023-03-16 23 14 hhasija@google.com moukhtar@google.com
setup.py
in python
58 3 2022-04-05 2024-10-10 22 17 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
hbase_to_gcs.py
in python/dataproc_templates/hbase
76 2 2022-06-14 2023-04-25 22 15 surjitsh@google.com 105878445+vanshaj-bhatia@us...
Files With Most Contributors (Top 50)
Based on the number of unique email addresses found in commits.
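Counting contributors by unique email address can be sketched as below. The `contributors_per_file` helper, the (path, email) input shape, and the example addresses are all hypothetical. One consequence of this method is that a person who commits under two different addresses is counted as two contributors.

```python
from collections import defaultdict


def contributors_per_file(commits):
    """Count distinct author email addresses per file.

    `commits` is an iterable of (file_path, author_email) pairs. This input
    shape is an assumption made for the sketch.
    """
    emails = defaultdict(set)
    for path, email in commits:
        emails[path].add(email)
    return {path: len(seen) for path, seen in emails.items()}


# Hypothetical history: the same person appears under a work address and a
# no-reply address, so main.py is reported with three contributors.
commits = [
    ("python/main.py", "alice@example.com"),
    ("python/main.py", "12345+alice@users.noreply.example.com"),
    ("python/main.py", "bob@example.com"),
    ("python/setup.py", "bob@example.com"),
    ("python/setup.py", "bob@example.com"),
]
print(contributors_per_file(commits))
```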

File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
template_constants.py
in python/dataproc_templates/util
848 4 2022-04-05 2025-03-04 124 44 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
TemplateConstants.java
in java/src/main/java/com/google/cloud/dataproc/templates/util
378 - 2022-04-18 2024-10-08 122 39 nilofr@google.com 57837394+rajc242@users.nore...
main.py
in python
98 2 2022-04-05 2025-03-04 83 37 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
template_name.py
in python/dataproc_templates
46 2 2022-04-05 2025-03-04 78 37 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
DataProcTemplate.java
in java/src/main/java/com/google/cloud/dataproc/templates/main
177 6 2022-04-18 2024-10-26 94 32 nilofr@google.com 106058340+hhasija@users.nor...
BaseTemplate.java
in java/src/main/java/com/google/cloud/dataproc/templates
52 1 2022-04-18 2024-10-26 83 28 nilofr@google.com 106058340+hhasija@users.nor...
jdbc_to_gcs.py
in python/dataproc_templates/jdbc
219 2 2022-06-29 2024-06-02 50 27 naveenkm@google.com 149653140+itsabhisharma23@u...
OracleToBigQuery_notebook.ipynb
in notebooks/oracle2bq
881 - 2022-10-04 2024-09-03 48 26 shivamsomani@google.com 57837394+rajc242@users.nore...
OracleToSpanner_notebook.ipynb
in notebooks/oracle2spanner
894 - 2022-10-14 2024-09-03 46 25 surjitsh@google.com 57837394+rajc242@users.nore...
DataplexGCStoBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/dataplex
380 14 2022-04-18 2024-10-08 44 25 nilofr@google.com 57837394+rajc242@users.nore...
hive_to_gcs.py
in python/dataproc_templates/hive
108 2 2022-05-04 2023-04-25 35 24 surjitsh@google.com 105878445+vanshaj-bhatia@us...
MySqlToSpanner_notebook.ipynb
in notebooks/mysql2spanner
903 - 2022-10-01 2024-09-03 46 23 agarwalsh@google.com 57837394+rajc242@users.nore...
mssql-to-bigquery-notebook.ipynb
in notebooks/mssql2bq
911 - 2022-10-24 2024-09-03 36 23 varunikagupta@google.com 57837394+rajc242@users.nore...
BigQueryToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/bigquery
92 3 2022-04-18 2024-09-05 31 23 nilofr@google.com 123699534+edwinobgoogle@use...
HiveToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/hive
86 3 2022-04-18 2023-05-11 36 22 nilofr@google.com 106058340+hhasija@users.nor...
GCStoBigquery.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
130 3 2022-04-18 2023-05-11 34 22 nilofr@google.com 106058340+hhasija@users.nor...
JDBCToBigQuery.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
123 4 2022-04-18 2024-10-08 44 21 nilofr@google.com 57837394+rajc242@users.nore...
HiveToBigquery_notebook.ipynb
in notebooks/hive2bq
932 - 2022-10-01 2024-09-03 32 21 agarwalsh@google.com 57837394+rajc242@users.nore...
jdbc_to_jdbc.py
in python/dataproc_templates/jdbc
227 2 2022-06-27 2024-06-02 28 21 naveenkm@google.com 149653140+itsabhisharma23@u...
mssql-to-postgres-notebook.ipynb
in notebooks/mssql2postgresql
919 - 2022-10-01 2024-09-03 34 20 agarwalsh@google.com 57837394+rajc242@users.nore...
HiveToBigQuery.java
in java/src/main/java/com/google/cloud/dataproc/templates/hive
79 3 2022-04-18 2024-10-08 30 19 nilofr@google.com 57837394+rajc242@users.nore...
gcs_to_jdbc.py
in python/dataproc_templates/gcs
113 2 2022-06-18 2023-05-11 29 19 hhasija@google.com vanshajbhatia@google.com
snowflake_to_gcs.py
in python/dataproc_templates/snowflake
212 5 2022-08-19 2023-05-11 27 19 varunikagupta@google.com vanshajbhatia@google.com
S3ToBigQuery.java
in java/src/main/java/com/google/cloud/dataproc/templates/s3
128 3 2022-04-18 2023-05-11 26 19 nilofr@google.com 106058340+hhasija@users.nor...
jdbc_to_bigquery.py
in python/dataproc_templates/jdbc
171 2 2022-08-26 2024-06-02 26 19 rafaelsilva@posteo.net 149653140+itsabhisharma23@u...
JDBCToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
64 4 2022-04-18 2023-05-11 37 18 nilofr@google.com 106058340+hhasija@users.nor...
JDBCToGCSConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
206 24 2022-07-25 2023-03-16 30 18 vanshajbhatia@google.com moukhtar@google.com
PubSubToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/pubsub
184 6 2022-05-03 2023-07-31 24 18 jigeorge@google.com 105878445+vanshaj-bhatia@us...
bigquery_to_gcs.py
in python/dataproc_templates/bigquery
86 2 2022-04-07 2024-01-18 24 18 surjitsh@google.com 149017703+shubhampathakk@us...
PubSubToBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/pubsub
186 4 2022-04-18 2023-05-11 22 18 nilofr@google.com 106058340+hhasija@users.nor...
SpannerToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
68 4 2022-04-18 2024-09-24 30 17 nilofr@google.com 57837394+rajc242@users.nore...
gcs_to_bigquery.py
in python/dataproc_templates/gcs
93 2 2022-04-05 2023-04-25 29 17 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
KafkaToBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/kafka
152 4 2022-06-17 2024-10-08 28 17 vanshajbhatia@google.com 57837394+rajc242@users.nore...
JDBCToSpanner.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
99 5 2022-10-05 2024-09-24 24 17 hhasija@google.com 57837394+rajc242@users.nore...
setup.py
in python
58 3 2022-04-05 2024-10-10 22 17 ppaglilla@google.com 105878445+vanshaj-bhatia@us...
vertex_pipeline_pyspark.ipynb
in notebooks/generic_notebook
554 - 2022-10-01 2024-09-03 22 17 agarwalsh@google.com 57837394+rajc242@users.nore...
KafkaToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/kafka
102 3 2022-07-19 2023-05-11 21 17 98432795+nikhil6790@users.n... 106058340+hhasija@users.nor...
JDBCToSpannerConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/jdbc
191 26 2022-10-05 2024-09-24 21 17 hhasija@google.com 57837394+rajc242@users.nore...
redshift_to_gcs.py
in python/dataproc_templates/redshift
131 2 2022-08-29 2023-05-11 21 17 shivamsomani@google.com vanshajbhatia@google.com
hive_ddl_extractor.py
in python/dataproc_templates/hive/util
63 2 2023-01-17 2023-03-30 19 17 shubu@google.com hemr@google.com
gcs_to_bigtable.py
in python/dataproc_templates/gcs
98 2 2022-05-27 2024-12-02 30 16 nilofr@google.com 57837394+rajc242@users.nore...
CassandraToBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
61 4 2022-09-19 2024-10-08 27 16 anishks@google.com 57837394+rajc242@users.nore...
CassandraToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
59 4 2022-09-13 2023-05-26 26 16 anishks@google.com poojabasker@google.com
RedshiftToGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
56 3 2022-09-21 2023-05-23 25 16 hhasija@google.com neil@thirdchimp.net
gcs_to_mongo.py
in python/dataproc_templates/gcs
102 2 2022-07-12 2023-05-11 24 16 hhasija@google.com vanshajbhatia@google.com
GCSToSpanner.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
83 5 2022-04-18 2024-11-03 23 16 nilofr@google.com 106058340+hhasija@users.nor...
GCStoGCS.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
93 3 2022-07-28 2023-05-11 27 15 hhasija@google.com 106058340+hhasija@users.nor...
hbase_to_gcs.py
in python/dataproc_templates/hbase
76 2 2022-06-14 2023-04-25 22 15 surjitsh@google.com 105878445+vanshaj-bhatia@us...
GCSToJDBC.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
96 6 2022-05-25 2023-05-11 27 14 hhasija@google.com 106058340+hhasija@users.nor...
GCSToJDBCConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
120 13 2022-05-25 2023-03-16 23 14 hhasija@google.com moukhtar@google.com
Files With Fewest Contributors (Top 50)
Based on the number of unique email addresses found in commits.

File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
elasticsearch_to_bigtable.py
in python/dataproc_templates/elasticsearch
144 2 2024-06-24 2024-11-09 3 1 163002505+rohilla-anuj@user... 163002505+rohilla-anuj@user...
elasticsearch_to_bq.py
in python/dataproc_templates/elasticsearch
141 2 2024-06-24 2024-11-20 3 1 163002505+rohilla-anuj@user... 163002505+rohilla-anuj@user...
elasticsearch_to_gcs.py
in python/dataproc_templates/elasticsearch
141 2 2024-06-24 2024-11-09 2 1 163002505+rohilla-anuj@user... 163002505+rohilla-anuj@user...
bigquery_to_memorystore.py
in python/dataproc_templates/bigquery
130 2 2025-03-04 2025-03-04 1 1 105878445+vanshaj-bhatia@us... 105878445+vanshaj-bhatia@us...
GCStoBigTableConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/gcs
107 6 2024-09-24 2024-09-24 1 1 57837394+rajc242@users.nore... 57837394+rajc242@users.nore...
MongoToBQConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
105 3 2024-09-16 2024-09-16 1 1 57837394+rajc242@users.nore... 57837394+rajc242@users.nore...
BigQueryToJDBCConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/bigquery
98 12 2024-10-26 2024-10-26 1 1 106058340+hhasija@users.nor... 106058340+hhasija@users.nor...
elasticsearch_transformations.py
in python/dataproc_templates/util
96 6 2024-06-24 2024-06-24 1 1 163002505+rohilla-anuj@user... 163002505+rohilla-anuj@user...
BigQueryToJDBC.java
in java/src/main/java/com/google/cloud/dataproc/templates/bigquery
56 5 2024-10-26 2024-10-26 1 1 106058340+hhasija@users.nor... 106058340+hhasija@users.nor...
MongoToBQ.java
in java/src/main/java/com/google/cloud/dataproc/templates/databases
56 4 2024-09-16 2024-09-16 1 1 57837394+rajc242@users.nore... 57837394+rajc242@users.nore...
SpannerPostgresJDBCDialect.java
in java/src/main/java/com/google/cloud/dataproc/dialects
52 3 2024-09-24 2024-09-24 1 1 57837394+rajc242@users.nore... 57837394+rajc242@users.nore...
secret_manager.py
in python/dataproc_templates/util
19 2 2024-06-02 2024-06-02 1 1 149653140+itsabhisharma23@u... 149653140+itsabhisharma23@u...
__init__.py
in python/dataproc_templates/hive
2 - 2022-05-03 2022-05-11 3 1 surjitsh@google.com surjitsh@google.com
__init__.py
in python/dataproc_templates/elasticsearch
1 - 2024-06-24 2024-06-24 1 1 163002505+rohilla-anuj@user... 163002505+rohilla-anuj@user...
jdbc_input_manager_interface.py
in notebooks/util/jdbc
235 27 2023-03-10 2023-07-31 7 2 neil@thirdchimp.net 105878445+vanshaj-bhatia@us...
KafkaToBQDstream.java
in java/src/main/java/com/google/cloud/dataproc/templates/kafka
164 3 2023-12-18 2024-01-02 5 2 chakreshpatel@google.com chakresh84@gmail.com
KafkaToGCSDstream.java
in java/src/main/java/com/google/cloud/dataproc/templates/kafka
161 3 2023-12-22 2024-01-08 5 2 chakreshpatel@google.com 105878445+vanshaj-bhatia@us...
oracle_input_manager.py
in notebooks/util/jdbc/engines
161 9 2023-03-10 2023-07-31 7 2 neil@thirdchimp.net 105878445+vanshaj-bhatia@us...
mysql_input_manager.py
in notebooks/util/jdbc/engines
139 8 2023-03-16 2023-07-31 5 2 neil@thirdchimp.net 105878445+vanshaj-bhatia@us...
ValidationUtil.java
in java/src/main/java/com/google/cloud/dataproc/templates/util
59 3 2022-04-18 2022-04-26 2 2 nilofr@google.com agarwalsh@google.com
PropertyUtil.java
in java/src/main/java/com/google/cloud/dataproc/templates/util
47 4 2022-04-18 2022-04-26 2 2 nilofr@google.com agarwalsh@google.com
InputConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/config
37 7 2022-04-18 2022-04-26 2 2 nilofr@google.com agarwalsh@google.com
notebook_functions.py
in notebooks/util
25 3 2023-03-03 2023-07-31 5 2 neil@thirdchimp.net 105878445+vanshaj-bhatia@us...
jdbc_input_manager.py
in notebooks/util/jdbc
18 1 2023-03-10 2023-07-31 4 2 neil@thirdchimp.net 105878445+vanshaj-bhatia@us...
QueryConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/config
16 3 2022-04-18 2022-04-26 2 2 nilofr@google.com agarwalsh@google.com
__init__.py
in python/dataproc_templates
2 - 2022-04-05 2022-04-26 2 2 ppaglilla@google.com agarwalsh@google.com
__init__.py
in notebooks/util/jdbc
1 - 2023-03-10 2023-07-31 3 2 neil@thirdchimp.net 105878445+vanshaj-bhatia@us...
__init__.py
in notebooks/util/jdbc/engines
1 - 2023-03-10 2023-07-31 3 2 neil@thirdchimp.net 105878445+vanshaj-bhatia@us...
GeneralTemplateConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/config
106 9 2022-04-18 2024-10-08 3 3 nilofr@google.com 57837394+rajc242@users.nore...
HiveToBigquery_parameterize_script.py
in notebooks/hive2bq
83 2 2023-05-13 2023-07-12 9 3 tanyawarrier@google.com neil@thirdchimp.net
OutputConfig.java
in java/src/main/java/com/google/cloud/dataproc/templates/config
47 9 2022-04-18 2024-10-08 3 3 nilofr@google.com 57837394+rajc242@users.nore...
TemplateUtil.java
in java/src/main/java/com/google/cloud/dataproc/templates/util
22 1 2022-04-18 2024-10-08 3 3 nilofr@google.com 57837394+rajc242@users.nore...
base_template.py
in python/dataproc_templates
18 4 2022-04-05 2022-04-26 5 3 ppaglilla@google.com ppaglilla@google.com
__init__.py
in python/dataproc_templates/bigquery
2 - 2022-04-07 2025-03-04 3 3 surjitsh@google.com 105878445+vanshaj-bhatia@us...
__init__.py
in notebooks/hive2bq
1 - 2023-05-13 2023-07-12 4 3 tanyawarrier@google.com neil@thirdchimp.net
mongo_to_bq.py
in python/dataproc_templates/mongo
98 2 2023-10-30 2023-11-27 4 4 satpreet@google.com 105878445+vanshaj-bhatia@us...
__init__.py
in notebooks/parameterize_script
2 - 2023-05-03 2023-05-23 4 4 tanyawarrier@google.com neil@thirdchimp.net
__init__.py
in python/dataproc_templates/mongo
1 - 2022-08-03 2023-01-03 5 4 hhasija@google.com vanshajbhatia@google.com
__init__.py
in notebooks/oracle2bq
1 - 2023-05-12 2023-07-04 5 4 tanyawarrier@google.com vaibsinghal@google.com
pubsublite_to_bigtable.py
in python/dataproc_templates/pubsublite
175 4 2023-03-14 2023-05-11 10 5 tanyawarrier@google.com vanshajbhatia@google.com
110 2 2023-05-12 2023-07-31 12 5 tanyawarrier@google.com 105878445+vanshaj-bhatia@us...
SpannerJdbcDialect.java
in java/src/main/java/com/google/cloud/dataproc/dialects
66 4 2022-04-18 2023-01-03 6 5 nilofr@google.com vanshajbhatia@google.com
hbase-site.xml
in python/dataproc_templates/hbase
28 - 2022-06-15 2023-01-03 6 5 surjitsh@google.com vanshajbhatia@google.com
MANIFEST.in
in python
18 - 2023-01-18 2023-03-16 5 5 117453385+ankuljain09@users... moukhtar@google.com
DataprocTemplateException.java
in java/src/main/java/com/google/cloud/dataproc/templates/util
12 3 2022-04-18 2023-01-03 5 5 nilofr@google.com vanshajbhatia@google.com
4 - 2023-05-24 2023-06-12 4 5 105909213+shubhamgoogle@use... vaibsinghal@google.com
__init__.py
in python/dataproc_templates/jdbc
3 - 2022-06-27 2024-06-02 6 5 naveenkm@google.com 149653140+itsabhisharma23@u...
__init__.py
in python/dataproc_templates/util
2 - 2022-04-05 2023-01-03 6 5 ppaglilla@google.com vanshajbhatia@google.com
setup.cfg
in python
2 - 2023-01-18 2023-03-16 5 5 117453385+ankuljain09@users... moukhtar@google.com
__init__.py
in python/dataproc_templates/hbase
1 - 2022-06-14 2023-01-03 6 5 surjitsh@google.com vanshajbhatia@google.com
Correlations

File Size vs. Number of Changes: 172 points

[Scatter plot: file size (x, lines of code) vs. number of changes (y, days with at least one commit), one point per file.]
# changes python/dataproc_templates/cassandra/cassandra_to_bigquery.py x: 115 lines of code y: 20 # changes python/dataproc_templates/cassandra/cassandra_to_gcs.py x: 125 lines of code y: 17 # changes python/dataproc_templates/gcs/gcs_to_bigquery.py x: 93 lines of code y: 29 # changes python/dataproc_templates/gcs/gcs_to_gcs.py x: 137 lines of code y: 19 # changes python/dataproc_templates/gcs/text_to_bigquery.py x: 106 lines of code y: 20 # changes python/dataproc_templates/hbase/hbase_to_gcs.py x: 76 lines of code y: 22 # changes python/dataproc_templates/hive/hive_to_gcs.py x: 108 lines of code y: 35 # changes python/dataproc_templates/azure/__init__.py x: 1 lines of code y: 7 # changes python/dataproc_templates/hive/util/hive_ddl_extractor.py x: 63 lines of code y: 19 # changes python/dataproc_templates/kafka/__init__.py x: 2 lines of code y: 8 # changes java/src/main/java/com/google/cloud/dataproc/templates/databases/CassandraToBqConfig.java x: 108 lines of code y: 13 # changes java/src/main/java/com/google/cloud/dataproc/templates/databases/CassandraToGCSConfig.java x: 116 lines of code y: 15 # changes java/src/main/java/com/google/cloud/dataproc/templates/databases/RedshiftToGCSConfig.java x: 90 lines of code y: 11 # changes java/src/main/java/com/google/cloud/dataproc/templates/gcs/GCSToJDBCConfig.java x: 120 lines of code y: 23 # changes java/src/main/java/com/google/cloud/dataproc/templates/jdbc/JDBCToGCSConfig.java x: 206 lines of code y: 30 # changes java/src/main/java/com/google/cloud/dataproc/templates/snowflake/SnowflakeToGCSConfig.java x: 162 lines of code y: 18 # changes python/MANIFEST.in x: 18 lines of code y: 5 # changes airflow/dags/submit_pyspark_dataproc_template.py x: 47 lines of code y: 6 # changes airflow/dags/submit_spark_dataproc_template.py x: 50 lines of code y: 6 # changes java/src/main/java/com/google/cloud/dataproc/dialects/SpannerJdbcDialect.java x: 66 lines of code y: 6 # changes 
java/src/main/java/com/google/cloud/dataproc/templates/util/DataprocTemplateException.java x: 12 lines of code y: 5 # changes java/src/main/java/com/google/cloud/dataproc/templates/util/ReadSchemaUtil.java x: 22 lines of code y: 6 # changes java/src/main/resources/logback-test.xml x: 30 lines of code y: 9 # changes python/dataproc_templates/hbase/hbase-site.xml x: 28 lines of code y: 6 # changes python/dataproc_templates/hive/hive_to_bigquery.py x: 105 lines of code y: 10 # changes python/dataproc_templates/util/tracking.py x: 16 lines of code y: 10 # changes java/src/main/java/com/google/cloud/dataproc/templates/config/InputConfig.java x: 37 lines of code y: 2 # changes java/src/main/java/com/google/cloud/dataproc/templates/config/QueryConfig.java x: 16 lines of code y: 2 # changes java/src/main/java/com/google/cloud/dataproc/templates/util/PropertyUtil.java x: 47 lines of code y: 2 # changes java/src/main/java/com/google/cloud/dataproc/templates/util/ValidationUtil.java x: 59 lines of code y: 2 # changes python/dataproc_templates/__init__.py x: 2 lines of code y: 2 # changes
124.0
# changes
  min: 1.0
  average: 17.55
  25th percentile: 5.25
  median: 12.0
  75th percentile: 23.0
  max: 124.0
0 1139.0
lines of code
min: 1.0 | average: 129.45 | 25th percentile: 27.0 | median: 86.5 | 75th percentile: 129.75 | max: 1139.0

Number of Contributors vs. Number of Changes: 172 points

[Scatter plot, one point per file. x axis: # contributors; y axis: # changes.]
# changes: min: 1.0 | average: 17.55 | 25th percentile: 5.25 | median: 12.0 | 75th percentile: 23.0 | max: 124.0
# contributors: min: 1.0 | average: 10.81 | 25th percentile: 5.0 | median: 9.0 | 75th percentile: 16.0 | max: 44.0

Number of Contributors vs. File Size: 172 points

[Scatter plot, one point per file. x axis: # contributors; y axis: lines of code.]
lines of code: min: 1.0 | average: 129.45 | 25th percentile: 27.0 | median: 86.5 | 75th percentile: 129.75 | max: 1139.0
# contributors: min: 1.0 | average: 10.81 | 25th percentile: 5.0 | median: 9.0 | 75th percentile: 16.0 | max: 44.0