GoogleCloudPlatform / document-ai-samples
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 284 files with 74,552 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 0 files changed 21-50 times (0 lines of code)
    • 32 files changed 6-20 times (3,710 lines of code)
    • 252 files changed 1-5 times (70,842 lines of code)
0% | 0% | 0% | 4% | 95%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 284 files with 74,552 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 0 files changed by 11-25 contributors (0 lines of code)
    • 7 files changed by 6-10 contributors (1,366 lines of code)
    • 130 files changed by 2-5 contributors (13,521 lines of code)
    • 147 files changed by 1 contributor (59,665 lines of code)
0% | 0% | 1% | 18% | 80%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
ipynb, md, py, json, txt, ts, js, sh, yaml, html, css, gitignore, java, dockerignore, xml, tf, mod, scss, tfvars, go, toml, gs, htmlhintrc, cs
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
py0% | 0% | 0% | 28% | 71%
ts0% | 0% | 0% | 38% | 61%
html0% | 0% | 0% | 35% | 64%
ipynb0% | 0% | 0% | 0% | 100%
js0% | 0% | 0% | 0% | 100%
yaml0% | 0% | 0% | 0% | 100%
java0% | 0% | 0% | 0% | 100%
css0% | 0% | 0% | 0% | 100%
tf0% | 0% | 0% | 0% | 100%
gs0% | 0% | 0% | 0% | 100%
go0% | 0% | 0% | 0% | 100%
cs0% | 0% | 0% | 0% | 100%
scss0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
bq-connector0% | 0% | 0% | 82% | 17%
tax-processing-pipeline-python0% | 0% | 0% | 99% | <1%
community0% | 0% | 0% | 33% | 66%
web-app-demo0% | 0% | 0% | 35% | 64%
ROOT0% | 0% | 0% | 93% | 6%
fraud-detection-python0% | 0% | 0% | 100% | 0%
pdf-splitter-python0% | 0% | 0% | 100% | 0%
extract-tables0% | 0% | 0% | 100% | 0%
incubator-tools0% | 0% | 0% | 0% | 100%
document_ai_warehouse0% | 0% | 0% | 0% | 100%
web-app-pix2info-python0% | 0% | 0% | 0% | 100%
document-processing-workflows0% | 0% | 0% | 0% | 100%
classify-split-extract-workflow0% | 0% | 0% | 0% | 100%
uptraining_docai_processor_using_python0% | 0% | 0% | 0% | 100%
document-json-explorer0% | 0% | 0% | 0% | 100%
paper_summarization0% | 0% | 0% | 0% | 100%
toolbox-batch-processing0% | 0% | 0% | 0% | 100%
hitl-custom-review0% | 0% | 0% | 0% | 100%
watermark-remover0% | 0% | 0% | 0% | 100%
sql-pdf-python0% | 0% | 0% | 0% | 100%
pdf-embedded-text0% | 0% | 0% | 0% | 100%
cx-content-moderation0% | 0% | 0% | 0% | 100%
apps-script-google-drive0% | 0% | 0% | 0% | 100%
filter-hitl-language0% | 0% | 0% | 0% | 100%
extract-languages0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 50)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
DocAIBQConnector.py
in bq-connector/docai_bq_connector/connector
280 4 2022-12-08 2023-02-28 17 6 mark.servidio@gmail.com 13262395+holtskinner@users....
Processor.py
in bq-connector/docai_bq_connector/doc_ai_processing
199 9 2022-12-08 2023-02-28 16 6 mark.servidio@gmail.com 13262395+holtskinner@users....
tax_pipeline.py
in tax-processing-pipeline-python
157 4 2022-03-14 2023-11-01 15 5 holtskinner@google.com 13262395+holtskinner@users....
BqDocumentMapper.py
in bq-connector/docai_bq_connector/connector
318 13 2022-12-08 2023-02-28 14 6 mark.servidio@gmail.com 13262395+holtskinner@users....
main.py
in web-app-demo/Backend
58 3 2022-04-04 2023-02-03 13 6 gall.zahavii@gmail.com holtskinner@google.com
helper.py
in web-app-demo/Backend/api
56 3 2022-03-14 2023-02-03 12 5 38544478+galz10@users.norep... holtskinner@google.com
BqMetadataMapper.py
in bq-connector/docai_bq_connector/connector
75 9 2022-12-09 2023-06-13 12 6 mservidio@google.com 13262395+holtskinner@users....
docai.py
in community/identity-form-autofiller-python/src
207 20 2022-05-18 2023-11-01 12 4 picardparis@users.noreply.g... 13262395+holtskinner@users....
main.py
in bq-connector
239 1 2022-12-08 2023-02-03 12 5 mark.servidio@gmail.com holtskinner@google.com
online_processing.py
in community/codelabs/docai-ocr
19 - 2022-03-30 2023-06-23 11 5 holtskinner@google.com 13262395+holtskinner@users....
batch_processing.py
in community/codelabs/docai-ocr
97 1 2022-03-30 2023-06-13 11 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in tax-processing-pipeline-python
83 7 2022-03-14 2023-11-01 10 5 holtskinner@google.com 13262395+holtskinner@users....
__init__.py
in bq-connector/docai_bq_connector
2 - 2022-12-08 2023-02-03 9 5 mark.servidio@gmail.com holtskinner@google.com
classification.py
in community/codelabs/docai-specialized-processors
44 1 2022-03-30 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
extraction.py
in community/codelabs/docai-specialized-processors
53 1 2022-03-30 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in fraud-detection-python/cloud-functions/geocode-addresses
57 3 2022-03-15 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
docai_pipeline.py
in tax-processing-pipeline-python
59 2 2022-03-14 2023-11-01 9 5 holtskinner@google.com 13262395+holtskinner@users....
app.component.ts
in web-app-demo/Frontend/src/app
92 3 2022-03-14 2023-06-23 9 4 38544478+galz10@users.norep... 13262395+holtskinner@users....
main.py
in fraud-detection-python/cloud-functions/process-invoices
182 7 2022-03-15 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
firestore_utils.py
in tax-processing-pipeline-python
16 3 2022-03-14 2023-11-01 8 5 holtskinner@google.com 13262395+holtskinner@users....
setup.py
in tax-processing-pipeline-python
76 2 2022-03-14 2023-11-01 8 4 holtskinner@google.com 13262395+holtskinner@users....
consts.py
in tax-processing-pipeline-python
85 - 2022-03-14 2023-02-03 8 3 holtskinner@google.com holtskinner@google.com
docai_utils.py
in tax-processing-pipeline-python
91 8 2022-03-14 2023-11-01 8 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in community/identity-form-autofiller-python/src
104 12 2022-05-18 2023-02-03 8 4 picardparis@users.noreply.g... holtskinner@google.com
main.py
in pdf-splitter-python
139 5 2022-03-09 2023-02-03 8 6 matthewayne@users.noreply.g... holtskinner@google.com
index.html
in tax-processing-pipeline-python/templates
255 - 2022-03-14 2023-11-01 8 4 holtskinner@google.com 13262395+holtskinner@users....
general_utils.py
in tax-processing-pipeline-python
9 3 2022-03-14 2023-11-01 7 4 holtskinner@google.com 13262395+holtskinner@users....
form_parser.py
in community/codelabs/docai-form-parser
55 2 2022-03-30 2023-06-23 7 5 holtskinner@google.com 13262395+holtskinner@users....
processor-selection.component.ts
in web-app-demo/Frontend/src/app/components/processor-selection
225 6 2022-03-14 2022-05-16 7 3 38544478+galz10@users.norep... 38544478+galz10@users.norep...
noxfile.py
in root
297 13 2022-04-01 2024-07-02 7 6 11586922+kweinmeister@users... 13262395+holtskinner@users....
DocumentState.py
in bq-connector/docai_bq_connector/doc_ai_processing
15 3 2022-12-14 2023-01-06 6 3 deboraelkin@google.com 13262395+holtskinner@users....
main.py
in extract-tables
66 3 2022-05-06 2023-06-23 6 4 holtskinner@google.com 13262395+holtskinner@users....
config.yaml
in tax-processing-pipeline-python
6 - 2022-03-30 2023-06-23 5 4 holtskinner@google.com 13262395+holtskinner@users....
ProcessedDocument.py
in bq-connector/docai_bq_connector/doc_ai_processing
9 1 2022-12-08 2023-01-10 5 3 mark.servidio@gmail.com mservidio@google.com
gcs_utils.py
in filter-hitl-language
22 4 2022-05-02 2023-02-03 5 2 holtskinner@google.com holtskinner@google.com
docai_utils.py
in filter-hitl-language
41 2 2022-05-02 2023-02-03 5 2 holtskinner@google.com holtskinner@google.com
StorageManager.py
in bq-connector/docai_bq_connector/bigquery
84 7 2022-12-08 2023-01-06 5 4 mark.servidio@gmail.com deboraelkin@google.com
main.py
in community/expense-parser-python/cloud-functions
106 6 2022-03-18 2023-02-03 5 3 jiya.zhang.98@gmail.com holtskinner@google.com
docai_schemas.py
in community/identity-form-autofiller-python/src
162 - 2022-08-04 2023-02-03 5 3 13262395+holtskinner@users.... holtskinner@google.com
utilities.py
in incubator-tools/best-practices/utilities
439 22 2023-10-20 2024-02-02 5 3 moonatdeepak@gmail.com 13262395+holtskinner@users....
pdf_util.py
in bq-connector/docai_bq_connector/helper
7 1 2022-12-08 2023-08-03 4 4 mark.servidio@gmail.com 13262395+holtskinner@users....
main.py
in filter-hitl-language
8 - 2022-05-02 2023-02-03 4 2 holtskinner@google.com holtskinner@google.com
custom-theme.scss
in web-app-demo/Frontend/src
15 - 2022-03-14 2023-06-23 4 3 38544478+galz10@users.norep... 13262395+holtskinner@users....
owlbot.py
in root
20 - 2022-04-01 2023-06-13 4 5 11586922+kweinmeister@users... 13262395+holtskinner@users....
main.py
in sql-pdf-python/src/cloud-functions/create_docai
46 3 2022-12-06 2023-02-03 4 4 wahi80aws@gmail.com holtskinner@google.com
extract_languages.py
in extract-languages
52 1 2022-04-29 2023-02-03 4 2 holtskinner@google.com holtskinner@google.com
table_parsing.py
in community/codelabs/docai-form-parser
70 3 2022-09-12 2023-06-23 4 2 holtskinner@google.com 13262395+holtskinner@users....
main.py
in sql-pdf-python/src/cloud-functions/process_docai
88 3 2022-12-06 2023-02-03 4 4 wahi80aws@gmail.com holtskinner@google.com
main.py
in pdf-embedded-text
130 10 2023-01-17 2024-03-07 4 2 13262395+holtskinner@users.... 13262395+holtskinner@users....
main.py
in community/pdf-annotator-python
133 4 2022-03-10 2023-02-03 4 5 matthewayne@users.noreply.g... holtskinner@google.com
Files With Most Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
DocAIBQConnector.py
in bq-connector/docai_bq_connector/connector
280 4 2022-12-08 2023-02-28 17 6 mark.servidio@gmail.com 13262395+holtskinner@users....
Processor.py
in bq-connector/docai_bq_connector/doc_ai_processing
199 9 2022-12-08 2023-02-28 16 6 mark.servidio@gmail.com 13262395+holtskinner@users....
BqDocumentMapper.py
in bq-connector/docai_bq_connector/connector
318 13 2022-12-08 2023-02-28 14 6 mark.servidio@gmail.com 13262395+holtskinner@users....
main.py
in web-app-demo/Backend
58 3 2022-04-04 2023-02-03 13 6 gall.zahavii@gmail.com holtskinner@google.com
BqMetadataMapper.py
in bq-connector/docai_bq_connector/connector
75 9 2022-12-09 2023-06-13 12 6 mservidio@google.com 13262395+holtskinner@users....
main.py
in pdf-splitter-python
139 5 2022-03-09 2023-02-03 8 6 matthewayne@users.noreply.g... holtskinner@google.com
noxfile.py
in root
297 13 2022-04-01 2024-07-02 7 6 11586922+kweinmeister@users... 13262395+holtskinner@users....
tax_pipeline.py
in tax-processing-pipeline-python
157 4 2022-03-14 2023-11-01 15 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in bq-connector
239 1 2022-12-08 2023-02-03 12 5 mark.servidio@gmail.com holtskinner@google.com
helper.py
in web-app-demo/Backend/api
56 3 2022-03-14 2023-02-03 12 5 38544478+galz10@users.norep... holtskinner@google.com
batch_processing.py
in community/codelabs/docai-ocr
97 1 2022-03-30 2023-06-13 11 5 holtskinner@google.com 13262395+holtskinner@users....
online_processing.py
in community/codelabs/docai-ocr
19 - 2022-03-30 2023-06-23 11 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in tax-processing-pipeline-python
83 7 2022-03-14 2023-11-01 10 5 holtskinner@google.com 13262395+holtskinner@users....
__init__.py
in bq-connector/docai_bq_connector
2 - 2022-12-08 2023-02-03 9 5 mark.servidio@gmail.com holtskinner@google.com
docai_pipeline.py
in tax-processing-pipeline-python
59 2 2022-03-14 2023-11-01 9 5 holtskinner@google.com 13262395+holtskinner@users....
classification.py
in community/codelabs/docai-specialized-processors
44 1 2022-03-30 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
extraction.py
in community/codelabs/docai-specialized-processors
53 1 2022-03-30 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in fraud-detection-python/cloud-functions/process-invoices
182 7 2022-03-15 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in fraud-detection-python/cloud-functions/geocode-addresses
57 3 2022-03-15 2023-06-23 9 5 holtskinner@google.com 13262395+holtskinner@users....
firestore_utils.py
in tax-processing-pipeline-python
16 3 2022-03-14 2023-11-01 8 5 holtskinner@google.com 13262395+holtskinner@users....
docai_utils.py
in tax-processing-pipeline-python
91 8 2022-03-14 2023-11-01 8 5 holtskinner@google.com 13262395+holtskinner@users....
form_parser.py
in community/codelabs/docai-form-parser
55 2 2022-03-30 2023-06-23 7 5 holtskinner@google.com 13262395+holtskinner@users....
main.py
in community/pdf-annotator-python
133 4 2022-03-10 2023-02-03 4 5 matthewayne@users.noreply.g... holtskinner@google.com
owlbot.py
in root
20 - 2022-04-01 2023-06-13 4 5 11586922+kweinmeister@users... 13262395+holtskinner@users....
docai.py
in community/identity-form-autofiller-python/src
207 20 2022-05-18 2023-11-01 12 4 picardparis@users.noreply.g... 13262395+holtskinner@users....
app.component.ts
in web-app-demo/Frontend/src/app
92 3 2022-03-14 2023-06-23 9 4 38544478+galz10@users.norep... 13262395+holtskinner@users....
index.html
in tax-processing-pipeline-python/templates
255 - 2022-03-14 2023-11-01 8 4 holtskinner@google.com 13262395+holtskinner@users....
setup.py
in tax-processing-pipeline-python
76 2 2022-03-14 2023-11-01 8 4 holtskinner@google.com 13262395+holtskinner@users....
main.py
in community/identity-form-autofiller-python/src
104 12 2022-05-18 2023-02-03 8 4 picardparis@users.noreply.g... holtskinner@google.com
general_utils.py
in tax-processing-pipeline-python
9 3 2022-03-14 2023-11-01 7 4 holtskinner@google.com 13262395+holtskinner@users....
main.py
in extract-tables
66 3 2022-05-06 2023-06-23 6 4 holtskinner@google.com 13262395+holtskinner@users....
StorageManager.py
in bq-connector/docai_bq_connector/bigquery
84 7 2022-12-08 2023-01-06 5 4 mark.servidio@gmail.com deboraelkin@google.com
config.yaml
in tax-processing-pipeline-python
6 - 2022-03-30 2023-06-23 5 4 holtskinner@google.com 13262395+holtskinner@users....
pdf_util.py
in bq-connector/docai_bq_connector/helper
7 1 2022-12-08 2023-08-03 4 4 mark.servidio@gmail.com 13262395+holtskinner@users....
main.py
in sql-pdf-python/src/cloud-functions/create_docai
46 3 2022-12-06 2023-02-03 4 4 wahi80aws@gmail.com holtskinner@google.com
main.py
in sql-pdf-python/src/cloud-functions/process_docai
88 3 2022-12-06 2023-02-03 4 4 wahi80aws@gmail.com holtskinner@google.com
consts.py
in tax-processing-pipeline-python
85 - 2022-03-14 2023-02-03 8 3 holtskinner@google.com holtskinner@google.com
processor-selection.component.ts
in web-app-demo/Frontend/src/app/components/processor-selection
225 6 2022-03-14 2022-05-16 7 3 38544478+galz10@users.norep... 38544478+galz10@users.norep...
DocumentState.py
in bq-connector/docai_bq_connector/doc_ai_processing
15 3 2022-12-14 2023-01-06 6 3 deboraelkin@google.com 13262395+holtskinner@users....
ProcessedDocument.py
in bq-connector/docai_bq_connector/doc_ai_processing
9 1 2022-12-08 2023-01-10 5 3 mark.servidio@gmail.com mservidio@google.com
docai_schemas.py
in community/identity-form-autofiller-python/src
162 - 2022-08-04 2023-02-03 5 3 13262395+holtskinner@users.... holtskinner@google.com
main.py
in community/expense-parser-python/cloud-functions
106 6 2022-03-18 2023-02-03 5 3 jiya.zhang.98@gmail.com holtskinner@google.com
utilities.py
in incubator-tools/best-practices/utilities
439 22 2023-10-20 2024-02-02 5 3 moonatdeepak@gmail.com 13262395+holtskinner@users....
custom-theme.scss
in web-app-demo/Frontend/src
15 - 2022-03-14 2023-06-23 4 3 38544478+galz10@users.norep... 13262395+holtskinner@users....
DocumentField.py
in bq-connector/docai_bq_connector/doc_ai_processing
47 7 2022-12-08 2023-01-05 3 3 mark.servidio@gmail.com deboraelkin@google.com
DocReferenceException.py
in bq-connector/docai_bq_connector/exception
4 - 2022-12-20 2023-01-06 3 3 deboraelkin@google.com 13262395+holtskinner@users....
gcs_util.py
in bq-connector/docai_bq_connector/helper
21 2 2022-12-08 2023-01-05 3 3 mark.servidio@gmail.com deboraelkin@google.com
ConversionError.py
in bq-connector/docai_bq_connector/connector
39 4 2022-12-08 2023-01-06 3 3 mark.servidio@gmail.com mservidio@google.com
index.html
in cx-content-moderation/public
42 - 2023-03-24 2023-06-23 3 3 ghchinoy@google.com 13262395+holtskinner@users....
main.go
in cx-content-moderation
86 5 2023-03-24 2024-03-14 3 3 ghchinoy@google.com holtskinner@google.com
Files With Least Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
bank_statement_post_processing_tool.ipynb
in incubator-tools/bank_statement_post_processing_tool
2841 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
dw_processing.ipynb
in document_ai_warehouse/document_ai_warehouse_processing_python
1283 - 2023-01-18 2023-02-03 2 1 holtskinner@google.com holtskinner@google.com
bank_statements_line_items_improver_and_missing_items_finder.ipynb
in incubator-tools/bank_statements_line_items_improver_and_missing_items_finder
1149 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
line_items_improver_post_processing.ipynb
in incubator-tools/line_item_improver
1108 - 2023-12-08 2023-12-08 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
docai_processor_migration.ipynb
in incubator-tools/docai_processor_migration
1089 - 2023-12-08 2023-12-08 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
ocr_upgrade_tool_using_enterprise_ocr.ipynb
in incubator-tools/ocr_upgrade_tool_using_enterprise_ocr
1078 - 2025-03-03 2025-03-03 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
ocr_upgradation_tool.ipynb
in incubator-tools/ocr_upgradation_tool
1055 - 2024-10-14 2024-10-14 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
docai_uptraining.ipynb
in uptraining_docai_processor_using_python
1009 - 2024-02-15 2024-02-15 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
reverse_annotation_tool.ipynb
in incubator-tools/reverse_annotation_tool
955 - 2024-05-17 2024-05-17 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
specific_format_line_items_tagging.ipynb
in incubator-tools/specific_format_line_items_tagging
948 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
cmek_docai_processor.ipynb
in incubator-tools/cmek_docai_processor
907 - 2024-02-27 2024-02-27 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
child_entity_tag_using_header.ipynb
in incubator-tools/child_entity_tag_using_header
872 - 2023-12-08 2023-12-08 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
cs_decision_matrix_automation.ipynb
in incubator-tools/cs_decision_matrix_automation
794 - 2024-08-13 2024-08-13 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
docai_processor_visual_assessment.ipynb
in incubator-tools/docai_processor_visual_assessment_tool
782 - 2024-01-24 2024-01-24 1 1 135305956+sana-google@users... 135305956+sana-google@users...
importing_processor_and_evaluating_with_alternate_test_sets.ipynb
in incubator-tools/importing_processor_and_evaluating_with_alternate_test_sets
766 - 2024-01-24 2024-01-24 1 1 135305956+sana-google@users... 135305956+sana-google@users...
pii_synthetic_redaction_tool.ipynb
in incubator-tools/pii_synthetic_redaction_tool
759 - 2024-02-27 2024-02-27 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
date_entities_annotation.ipynb
in incubator-tools/date_entities_annotation_tool
755 - 2023-12-08 2023-12-08 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
categorizing_bank_statement_transactions_by_account_number.ipynb
in incubator-tools/categorizing_bank_statement_transactions_by_account_number
750 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
Label_Section_Headers.ipynb
in incubator-tools/Label_Section_Headers
738 - 2024-10-31 2024-10-31 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
paper_summarization.ipynb
in paper_summarization
737 - 2022-09-06 2023-02-03 2 1 holtskinner@google.com holtskinner@google.com
line_item_comparision.ipynb
in incubator-tools/line_item_comparision
714 - 2023-12-08 2023-12-08 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
export_import_document_schema_gemini.ipynb
in incubator-tools/export_import_document_schema_gemini
693 - 2024-10-31 2024-10-31 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
amount_rectification_from_words_to_numbers.ipynb
in incubator-tools/amount_rectification_from_words_to_numbers
688 - 2024-01-24 2024-01-24 1 1 135305956+sana-google@users... 135305956+sana-google@users...
scripts.js
in web-app-pix2info-python/src/frontend
666 93 2023-02-20 2023-02-21 2 1 picardparis@users.noreply.g... picardparis@users.noreply.g...
entity_label_restructuring_tool.ipynb
in incubator-tools/entity_label_restructuring_tool
660 - 2024-10-31 2024-10-31 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
document_level_accuracy.ipynb
in incubator-tools/document_level_accuracy
655 - 2024-01-02 2024-01-02 1 1 135305956+sana-google@users... 135305956+sana-google@users...
combine_two_processor_output.ipynb
in incubator-tools/combine_two_processors_output
652 - 2024-05-17 2024-05-17 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
Table_Spanning_Page_Merge_Script.ipynb
in incubator-tools/advance_table_line_enhancement
631 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
docai_parser_result_merger.ipynb
in incubator-tools/best-practices/parser_result_merger
613 - 2023-10-20 2023-10-20 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
identifying_poor_performing_docs.ipynb
in incubator-tools/best-practices/identifying_poor_performing_docs
601 - 2023-10-20 2023-12-08 2 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
formparser_table_to_entity_converter_tool.ipynb
in incubator-tools/formparser_table_to_entity_converter_tool
590 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
paragraph_separation.ipynb
in incubator-tools/paragraph_separation
578 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
map_ocr_style_information_to_cde_entities.ipynb
in incubator-tools/map_ocr_style_information_to_cde_entities
562 - 2025-03-03 2025-03-03 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
document-schema-from-form-parser-output.ipynb
in incubator-tools/document-schema-from-form-parser-output
542 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
language_detection.ipynb
in incubator-tools/language_detection
540 - 2024-01-02 2024-01-02 1 1 135305956+sana-google@users... 135305956+sana-google@users...
combine_address_line.ipynb
in incubator-tools/combine_address_line
530 - 2024-01-24 2024-01-24 1 1 135305956+sana-google@users... 135305956+sana-google@users...
signature_detection.ipynb
in incubator-tools/signature_detection
523 - 2025-03-03 2025-03-03 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
ocr_based_document_section_splitter.ipynb
in incubator-tools/ocr_based_document_section_splitter
512 - 2024-03-12 2024-03-12 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
watermarks_and_line_removal.ipynb
in incubator-tools/watermarks_and_line_removal
511 - 2024-08-13 2024-08-13 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
document_ai_warehouse.ipynb
in document_ai_warehouse/document-ai-warehouse-java-samples
503 - 2023-01-23 2023-01-23 1 1 53488138+kolban-google@user... 53488138+kolban-google@user...
pre_and_post_hitl_visualization.ipynb
in incubator-tools/best-practices/pre_post_hitl_visualization
500 - 2023-10-20 2023-12-08 2 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
parsed_json_split_address.ipynb
in incubator-tools/parsed_json_split_address
484 - 2023-12-08 2023-12-08 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
old_ocr_to_new_ocr_conversion.ipynb
in incubator-tools/old_ocr_to_new_ocr_conversion
484 - 2024-02-27 2024-02-27 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
extending_entity_bounding_box.ipynb
in incubator-tools/extending_entity_bounding_boxes
476 - 2024-01-24 2024-01-24 1 1 135305956+sana-google@users... 135305956+sana-google@users...
parse_table_into_chunks.ipynb
in incubator-tools/parse_table_into_chunks
472 - 2024-08-13 2024-08-13 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
466 - 2023-09-12 2024-07-02 3 1 13262395+holtskinner@users.... 13262395+holtskinner@users....
cds_dataset_creator.ipynb
in incubator-tools/cds_dataset_creator
461 - 2024-01-24 2024-01-24 1 1 135305956+sana-google@users... 135305956+sana-google@users...
Signature_Detection_by_Reading_Pixels.ipynb
in incubator-tools/Signature_Detection_by_Reading_Pixels
460 - 2024-10-31 2024-10-31 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
hitl_rejected_documents_tracking.ipynb
in incubator-tools/best-practices/hitl_rejected_documents_tracking
446 - 2023-10-20 2023-12-08 2 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
lineitem_improver_using_column_data.ipynb
in incubator-tools/lineitem_improver_using_column_data
442 - 2025-03-03 2025-03-03 1 1 moonatdeepak@gmail.com moonatdeepak@gmail.com
Correlations

File Size vs. Number of Changes: 284 points

incubator-tools/character_box_removal/character_box_removal.ipynb x: 413 lines of code y: 1 # changes incubator-tools/divide_pdf_to_high_quality_images/divide_pdf_to_high_quality_images.ipynb x: 263 lines of code y: 1 # changes incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/__init__.py x: 1 lines of code y: 1 # changes incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/main.py x: 251 lines of code y: 1 # changes incubator-tools/docai_document_processing_pipeline/src/process_batch_cf/main.py x: 168 lines of code y: 1 # changes incubator-tools/image_segmentation/image_segmentation.ipynb x: 366 lines of code y: 1 # changes incubator-tools/lineitem_improver_crosspage/lineitem_improver_crosspage.ipynb x: 438 lines of code y: 1 # changes incubator-tools/lineitem_improver_using_column_data/lineitem_improver_using_column_data.ipynb x: 442 lines of code y: 1 # changes incubator-tools/map_ocr_style_information_to_cde_entities/map_ocr_style_information_to_cde_entities.ipynb x: 562 lines of code y: 1 # changes incubator-tools/ocr_upgrade_tool_using_enterprise_ocr/ocr_upgrade_tool_using_enterprise_ocr.ipynb x: 1078 lines of code y: 1 # changes incubator-tools/signature_detection/signature_detection.ipynb x: 523 lines of code y: 1 # changes incubator-tools/Detecting_language_of_text_within_entities/Detecting_language_of_text_within_entities.ipynb x: 293 lines of code y: 1 # changes incubator-tools/Label_Section_Headers/Label_Section_Headers.ipynb x: 738 lines of code y: 1 # changes incubator-tools/PDF_Table_Identification/PDF_Table_Identification.ipynb x: 377 lines of code y: 1 # changes incubator-tools/Signature_Detection_by_Reading_Pixels/Signature_Detection_by_Reading_Pixels.ipynb x: 460 lines of code y: 1 # changes incubator-tools/Text_ordering_by_bounding_box_coordinates/Text_ordering_by_bounding_box_coordinates.ipynb x: 312 lines of code y: 1 # changes incubator-tools/add_vertices_to_entities/add_vertices_to_entities.ipynb x: 389 lines of code y: 1 # changes incubator-tools/convert_automl_response_to_documentai_format/convert_automl_response_to_documentai_format.ipynb x: 434 lines of code y: 1 # changes incubator-tools/entity_label_restructuring_tool/entity_label_restructuring_tool.ipynb x: 660 lines of code y: 1 # changes incubator-tools/export_import_document_schema_gemini/export_import_document_schema_gemini.ipynb x: 693 lines of code y: 1 # changes incubator-tools/image_pixel_downsizer/image_pixel_downsizer.ipynb x: 267 lines of code y: 1 # changes incubator-tools/set_field_description_via_api/set_field_description_via_api.ipynb x: 318 lines of code y: 1 # changes incubator-tools/tag_col_number_to_ocr_paragraphs/tag_col_number_to_ocr_paragraphs.ipynb x: 370 lines of code y: 1 # changes incubator-tools/ocr_upgradation_tool/ocr_upgradation_tool.ipynb x: 1055 lines of code y: 1 # changes incubator-tools/Entity_data_extraction_from_json/Entity_data_extraction_from_json.ipynb x: 357 lines of code y: 1 # changes incubator-tools/Extracting_Embedded_links_in_PDF/Extracting_Embedded_links_in_PDF.ipynb x: 331 lines of code y: 1 # changes incubator-tools/cdc_document_type_entity_addition/cdc_document_type_entity_addition.ipynb x: 234 lines of code y: 1 # changes incubator-tools/cs_decision_matrix_automation/cs_decision_matrix_automation.ipynb x: 794 lines of code y: 1 # changes incubator-tools/enhance_checkbox/code.ipynb x: 392 lines of code y: 1 # changes incubator-tools/parse_table_into_chunks/parse_table_into_chunks.ipynb x: 472 lines of code y: 1 # changes incubator-tools/parsing_documentai_ json_outputs_with_jq/Parsing Document AI JSON Outputs with JQ.ipynb x: 222 lines of code y: 1 # changes incubator-tools/swap_ocr_confusion_characters/swap_ocr_confusion_characters.ipynb x: 241 lines of code y: 1 # changes incubator-tools/watermarks_and_line_removal/watermarks_and_line_removal.ipynb x: 511 lines of code y: 1 # changes classify-split-extract-workflow/classify-extract.yaml x: 215 lines of code y: 1 # changes classify-split-extract-workflow/classify-job/bq_mlops.py x: 47 lines of code y: 1 # changes classify-split-extract-workflow/classify-job/config.py x: 210 lines of code y: 1 # changes classify-split-extract-workflow/classify-job/docai_helper.py x: 32 lines of code y: 1 # changes classify-split-extract-workflow/classify-job/gcs_helper.py x: 112 lines of code y: 1 # changes classify-split-extract-workflow/classify-job/main.py x: 42 lines of code y: 1 # changes classify-split-extract-workflow/classify-job/split_and_classify.py x: 289 lines of code y: 1 # changes classify-split-extract-workflow/classify-job/utils.py x: 84 lines of code y: 1 # changes noxfile.py x: 297 lines of code y: 7 # changes toolbox-batch-processing/documentai-toolbox-batch-entity-extraction.ipynb x: 466 lines of code y: 3 # changes incubator-tools/DocAI_Json_to_Canonical_Json_Conversion/Docai_json_to_canonical_json_conversion.ipynb x: 405 lines of code y: 1 # changes incubator-tools/Export_import_document_schema_from_processor/Export_import_document_schema_from_processor.ipynb x: 287 lines of code y: 1 # changes incubator-tools/combine_two_processors_output/combine_two_processor_output.ipynb x: 652 lines of code y: 1 # changes incubator-tools/normalize_date_value_19xx_to_20xx/normalize_date_value_19xx_to_20xx.ipynb x: 275 lines of code y: 1 # changes incubator-tools/reverse_annotation_tool/reverse_annotation_tool.ipynb x: 955 lines of code y: 1 # changes incubator-tools/signature-detection-technique/signature_detection_technique.ipynb x: 298 lines of code y: 1 # changes cx-content-moderation/main.go x: 86 lines of code y: 3 # changes incubator-tools/Reference_architecture_asynchronous/auto_deploy_v8/CFScript/main.py x: 141 lines of code y: 2 # changes incubator-tools/advance_table_line_enhancement/tool_helper_functions.py x: 836 lines of code y: 2 # changes incubator-tools/backmapping_entities_from_parser_output_to_original_language/backmap_utils.py x: 886 lines of code y: 2 # changes incubator-tools/synonyms_based_splitter_document_labeling/synonyms_based_splitter_document_labeling.ipynb x: 869 lines of code y: 2 # changes incubator-tools/advance_table_line_enhancement/Table_Spanning_Page_Merge_Script.ipynb x: 631 lines of code y: 1 # changes incubator-tools/advance_table_line_enhancement/fp_tables_to_csv.ipynb x: 314 lines of code y: 1 # changes incubator-tools/advance_table_line_enhancement/line_enhancement_basic_flow.ipynb x: 422 lines of code y: 1 # changes incubator-tools/bank_statement_post_processing_tool/bank_statement_post_processing_tool.ipynb x: 2841 lines of code y: 1 # changes incubator-tools/bank_statements_line_items_improver_and_missing_items_finder/bank_statements_line_items_improver_and_missing_items_finder.ipynb x: 1149 lines of code y: 1 # changes incubator-tools/categorizing_bank_statement_transactions_by_account_number/categorizing_bank_statement_transactions_by_account_number.ipynb x: 750 lines of code y: 1 # changes incubator-tools/document-schema-from-form-parser-output/document-schema-from-form-parser-output.ipynb x: 542 lines of code y: 1 # changes incubator-tools/formparser_table_to_entity_converter_tool/formparser_table_to_entity_converter_tool.ipynb x: 590 lines of code y: 1 # changes incubator-tools/paragraph_separation/paragraph_separation.ipynb x: 578 lines of code y: 1 # changes incubator-tools/specific_format_line_items_tagging/specific_format_line_items_tagging.ipynb x: 948 lines of code y: 1 # changes pdf-embedded-text/main.py x: 130 lines of code y: 4 # changes incubator-tools/cmek_docai_processor/cmek_docai_processor.ipynb x: 907 lines of code y: 1 # changes incubator-tools/entity_sorting_csharp/entity_sorting_csharp.cs x: 77 lines of code y: 1 # changes incubator-tools/hitl_line_item_prefix_issue/hitl_line_item_prefix_issue.ipynb x: 361 lines of code y: 1 # changes incubator-tools/identity_document_proofing_evaluation/identity_document_proofing_evaluation.ipynb x: 280 lines of code y: 1 # changes incubator-tools/old_ocr_to_new_ocr_conversion/old_ocr_to_new_ocr_conversion.ipynb x: 484 lines of code y: 1 # changes incubator-tools/pii_synthetic_redaction_tool/pii_synthetic_redaction_tool.ipynb x: 759 lines of code y: 1 # changes uptraining_docai_processor_using_python/docai_uptraining.ipynb x: 1009 lines of code y: 1 # changes incubator-tools/best-practices/utilities/utilities.py x: 439 lines of code y: 5 # changes incubator-tools/amount_rectification_from_words_to_numbers/amount_rectification_from_words_to_numbers.ipynb x: 688 lines of code y: 1 # changes incubator-tools/combine_address_line/combine_address_line.ipynb x: 530 lines of code y: 1 # changes incubator-tools/docai_processor_visual_assessment_tool/docai_processor_visual_assessment.ipynb x: 782 lines of code y: 1 # changes incubator-tools/importing_processor_and_evaluating_with_alternate_test_sets/importing_processor_and_evaluating_with_alternate_test_sets.ipynb x: 766 lines of code y: 1 # changes incubator-tools/labeled_dataset_validation/labeled_dataset_validation.ipynb x: 409 lines of code y: 1 # changes incubator-tools/best-practices/hitl_rejected_documents_tracking/hitl_rejected_documents_tracking.ipynb x: 446 lines of code y: 2 # changes incubator-tools/best-practices/identifying_poor_performing_docs/identifying_poor_performing_docs.ipynb x: 601 lines of code y: 2 # changes incubator-tools/best-practices/key_value_pair_entity_conversion/key_value_pair_entity_conversion.ipynb x: 356 lines of code y: 2 # changes incubator-tools/best-practices/pre_post_bounding_box_mismatch/pre_post_bounding_box_mismatch.ipynb x: 415 lines of code y: 2 # changes incubator-tools/best-practices/pre_post_hitl_visualization/pre_and_post_hitl_visualization.ipynb x: 500 lines of code y: 2 # changes incubator-tools/child_entity_tag_using_header/child_entity_tag_using_header.ipynb x: 872 lines of code y: 1 # changes incubator-tools/date_entities_annotation_tool/date_entities_annotation.ipynb x: 755 lines of code y: 1 # changes incubator-tools/docai_processor_migration/docai_processor_migration.ipynb x: 1089 lines of code y: 1 # changes incubator-tools/line_item_comparision/line_item_comparision.ipynb x: 714 lines of code y: 1 # changes incubator-tools/line_item_improver/line_items_improver_post_processing.ipynb x: 1108 lines of code y: 1 # changes community/identity-form-autofiller-python/src/docai.py x: 207 lines of code y: 12 # changes tax-processing-pipeline-python/docai_pipeline.py x: 59 lines of code y: 9 # changes tax-processing-pipeline-python/docai_utils.py x: 91 lines of code y: 8 # changes tax-processing-pipeline-python/firestore_utils.py x: 16 lines of code y: 8 # changes tax-processing-pipeline-python/general_utils.py x: 9 lines of code y: 7 # changes tax-processing-pipeline-python/main.py x: 83 lines of code y: 10 # changes tax-processing-pipeline-python/setup.py x: 76 lines of code y: 8 # changes tax-processing-pipeline-python/tax_pipeline.py x: 157 lines of code y: 15 # changes tax-processing-pipeline-python/templates/index.html x: 255 lines of code y: 8 # changes incubator-tools/best-practices/parser_result_merger/docai_parser_result_merger.ipynb x: 613 lines of code y: 1 # changes incubator-tools/best-practices/removing_empty_bounding_boxes/removing_empty_bounding_boxes.ipynb x: 338 lines of code y: 1 # changes document-processing-workflows/main.tf x: 363 lines of code y: 2 # changes document_ai_warehouse/common/src/common/utils/document_ai_utils.py x: 223 lines of code y: 2 # changes document_ai_warehouse/common/src/common/utils/document_warehouse_utils.py x: 278 lines of code y: 2 # changes document_ai_warehouse/common/src/common/utils/logging_handler.py x: 29 lines of code y: 2 # changes document_ai_warehouse/common/src/common/utils/storage_utils.py x: 23 lines of code y: 2 # changes document_ai_warehouse/document_ai_warehouse_batch_ingestion/main.py x: 531 lines of code y: 2 # changes document-processing-workflows/src/functions/parse-results/main.py x: 317 lines of code y: 2 # changes document-processing-workflows/src/functions/split-document/main.py x: 50 lines of code y: 2 # changes document_ai_warehouse/common/src/common/utils/helper.py x: 29 lines of code y: 1 # changes document_ai_warehouse/document_ai_warehouse_batch_ingestion/config.py x: 20 lines of code y: 1 # changes document-processing-workflows/src/workflows/batch_process_documents.yaml x: 159 lines of code y: 1 # changes document-processing-workflows/variables.tf x: 21 lines of code y: 1 # changes bq-connector/docai_bq_connector/helper/pdf_util.py x: 7 lines of code y: 4 # changes watermark-remover/watermark-remover.ipynb x: 255 lines of code y: 1 # changes hitl-custom-review/hitl-custom-review.ipynb x: 346 lines of code y: 1 # changes apps-script-google-drive/documentai.gs x: 98 lines of code y: 2 # changes community/codelabs/docai-form-parser/form_parser.py x: 55 lines of code y: 7 # changes community/codelabs/docai-form-parser/table_parsing.py x: 70 lines of code y: 4 # changes community/codelabs/docai-ocr/online_processing.py x: 19 lines of code y: 11 # changes community/codelabs/docai-specialized-processors/classification.py x: 44 lines of code y: 9 # changes community/codelabs/docai-specialized-processors/extraction.py x: 53 lines of code y: 9 # changes community/identity-form-autofiller-python/src/frontend/index.html x: 118 lines of code y: 2 # changes cx-content-moderation/public/index.html x: 42 lines of code y: 3 # changes document-json-explorer/public/index.html x: 31 lines of code y: 3 # changes document-json-explorer/src/About.js x: 41 lines of code y: 3 # changes document-json-explorer/src/App.css x: 29 lines of code y: 3 # changes document-json-explorer/src/App.js x: 9 lines of code y: 3 # changes document-json-explorer/src/Details.js x: 180 lines of code y: 3 # changes document-json-explorer/src/DocAITopLevel.js x: 95 lines of code y: 3 # changes document-json-explorer/src/DrawDocument.js x: 132 lines of code y: 3 # changes document-json-explorer/src/Entity.js x: 71 lines of code y: 3 # changes document-json-explorer/src/JSONPage.js x: 19 lines of code y: 3 # changes document-json-explorer/src/NoData.js x: 12 lines of code y: 3 # changes document-json-explorer/src/setupTests.js x: 1 lines of code y: 3 # changes document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example/CreateDocument.java x: 105 lines of code y: 2 # changes document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example/CreateDocumentDocAi.java x: 133 lines of code y: 2 # changes document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example/CreateSchema.java x: 116 lines of code y: 2 # changes document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example/DeleteDocument.java x: 45 lines of code y: 2 # changes document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example/SearchDocuments.java x: 85 lines of code y: 2 # changes extract-tables/main.py x: 66 lines of code y: 6 # changes fraud-detection-python/cloud-functions/process-invoices/main.py x: 182 lines of code y: 9 # changes tax-processing-pipeline-python/config.yaml x: 6 lines of code y: 5 # changes web-app-demo/Frontend/src/app/app.component.ts x: 92 lines of code y: 9 # changes web-app-demo/Frontend/src/app/components/base-layer/base-layer.component.ts x: 54 lines of code y: 3 # changes web-app-demo/Frontend/src/custom-theme.scss x: 15 lines of code y: 4 # changes web-app-pix2info-python/src/frontend/index.html x: 154 lines of code y: 3 # changes web-app-pix2info-python/src/frontend/styles.css x: 149 lines of code y: 3 # changes bq-connector/docai_bq_connector/connector/BqMetadataMapper.py x: 75 lines of code y: 12 # changes community/codelabs/docai-ocr/batch_processing.py x: 97 lines of code y: 11 # changes community/codelabs/docai-ocr/batch_processing_toolbox.py x: 67 lines of code y: 2 # changes document_ai_warehouse/document_ai_warehouse_processing_python/document_warehouse_utils.py x: 259 lines of code y: 3 # changes owlbot.py x: 20 lines of code y: 4 # changes web-app-pix2info-python/src/backend/render.py x: 634 lines of code y: 3 # changes web-app-pix2info-python/src/main.py x: 111 lines of code y: 3 # changes bq-connector/docai_bq_connector/connector/BqDocumentMapper.py x: 318 lines of code y: 14 # changes bq-connector/docai_bq_connector/connector/DocAIBQConnector.py x: 280 lines of code y: 17 # changes bq-connector/docai_bq_connector/doc_ai_processing/Processor.py x: 199 lines of code y: 16 # changes web-app-pix2info-python/src/app.yaml x: 4 lines of code y: 2 # changes web-app-pix2info-python/src/backend/samples.py x: 35 lines of code y: 2 # changes web-app-pix2info-python/src/frontend/scripts.js x: 666 lines of code y: 2 # changes bq-connector/docai_bq_connector/__init__.py x: 2 lines of code y: 9 # changes bq-connector/main.py x: 239 lines of code y: 12 # changes community/expense-parser-python/cloud-functions/main.py x: 106 lines of code y: 5 # changes community/identity-form-autofiller-python/src/docai_schemas.py x: 162 lines of code y: 5 # changes community/identity-form-autofiller-python/src/main.py x: 104 lines of code y: 8 # changes document_ai_warehouse/document_ai_warehouse_processing_python/dw_processing.ipynb x: 1283 lines of code y: 2 # changes extract-languages/extract_languages.py x: 52 lines of code y: 4 # changes filter-hitl-language/docai_utils.py x: 41 lines of code y: 5 # changes filter-hitl-language/gcs_utils.py x: 22 lines of code y: 5 # changes paper_summarization/paper_summarization.ipynb x: 737 lines of code y: 2 # changes pdf-splitter-python/main.py x: 139 lines of code y: 8 # changes sql-pdf-python/src/cloud-functions/create_docai/main.py x: 46 lines of code y: 4 # changes sql-pdf-python/src/cloud-functions/process_docai/main.py x: 88 lines of code y: 4 # changes tax-processing-pipeline-python/consts.py x: 85 lines of code y: 8 # changes web-app-demo/Backend/api/helper.py x: 56 lines of code y: 12 # changes web-app-demo/Backend/main.py x: 58 lines of code y: 13 # changes document_ai_warehouse/document-ai-warehouse-java-samples/document_ai_warehouse.ipynb x: 503 lines of code y: 1 # changes document_ai_warehouse/document_ai_warehouse_processing_python/storage_utils.py x: 10 lines of code y: 1 # changes bq-connector/docai_bq_connector/bigquery/StorageManager.py x: 84 lines of code y: 5 # changes bq-connector/docai_bq_connector/doc_ai_processing/DocumentState.py x: 15 lines of code y: 6 # changes bq-connector/docai_bq_connector/doc_ai_processing/DocumentField.py x: 47 lines of code y: 3 # changes bq-connector/docai_bq_connector/helper/gcs_util.py x: 21 lines of code y: 3 # changes bq-connector/docai_bq_connector/helper/__init__.py x: 13 lines of code y: 1 # changes web-app-demo/Frontend/src/app/components/processor-selection/processor-selection.component.ts x: 225 lines of code y: 7 # changes web-app-demo/Frontend/src/app/app-routing.module.ts x: 8 lines of code y: 2 # changes web-app-demo/Frontend/src/app/components/canvas/canvas.component.ts x: 15 lines of code y: 2 # changes web-app-demo/Frontend/src/app/data-sharing-service.service.ts x: 57 lines of code y: 2 # changes
17.0
# changes
  min: 1.0
  average: 2.73
  25th percentile: 1.0
  median: 2.0
  75th percentile: 3.0
  max: 17.0
0 2841.0
lines of code
min: 1.0 | average: 262.51 | 25th percentile: 35.25 | median: 162.5 | 75th percentile: 403.25 | max: 2841.0

Number of Contributors vs. Number of Changes: 284 points

incubator-tools/character_box_removal/character_box_removal.ipynb x: 1 # contributors y: 1 # changes noxfile.py x: 6 # contributors y: 7 # changes toolbox-batch-processing/documentai-toolbox-batch-entity-extraction.ipynb x: 1 # contributors y: 3 # changes cx-content-moderation/main.go x: 3 # contributors y: 3 # changes incubator-tools/Reference_architecture_asynchronous/auto_deploy_v8/CFScript/main.py x: 3 # contributors y: 2 # changes incubator-tools/advance_table_line_enhancement/tool_helper_functions.py x: 2 # contributors y: 2 # changes pdf-embedded-text/main.py x: 2 # contributors y: 4 # changes incubator-tools/best-practices/utilities/utilities.py x: 3 # contributors y: 5 # changes incubator-tools/best-practices/hitl_rejected_documents_tracking/hitl_rejected_documents_tracking.ipynb x: 1 # contributors y: 2 # changes community/identity-form-autofiller-python/src/docai.py x: 4 # contributors y: 12 # changes tax-processing-pipeline-python/docai_pipeline.py x: 5 # contributors y: 9 # changes tax-processing-pipeline-python/docai_utils.py x: 5 # contributors y: 8 # changes tax-processing-pipeline-python/general_utils.py x: 4 # contributors y: 7 # changes tax-processing-pipeline-python/main.py x: 5 # contributors y: 10 # changes tax-processing-pipeline-python/setup.py x: 4 # contributors y: 8 # changes tax-processing-pipeline-python/tax_pipeline.py x: 5 # contributors y: 15 # changes bq-connector/docai_bq_connector/helper/pdf_util.py x: 4 # contributors y: 4 # changes hitl-custom-review/hitl-custom-review.ipynb x: 2 # contributors y: 1 # changes community/codelabs/docai-form-parser/form_parser.py x: 5 # contributors y: 7 # changes community/codelabs/docai-ocr/online_processing.py x: 5 # contributors y: 11 # changes extract-tables/main.py x: 4 # contributors y: 6 # changes tax-processing-pipeline-python/config.yaml x: 4 # contributors y: 5 # changes web-app-demo/Frontend/src/app/app.component.ts x: 4 # contributors y: 9 # changes web-app-demo/Frontend/src/custom-theme.scss x: 3 # contributors y: 4 # changes web-app-pix2info-python/src/frontend/index.html x: 2 # contributors y: 3 # changes bq-connector/docai_bq_connector/connector/BqMetadataMapper.py x: 6 # contributors y: 12 # changes owlbot.py x: 5 # contributors y: 4 # changes bq-connector/docai_bq_connector/connector/BqDocumentMapper.py x: 6 # contributors y: 14 # changes bq-connector/docai_bq_connector/connector/DocAIBQConnector.py x: 6 # contributors y: 17 # changes bq-connector/docai_bq_connector/doc_ai_processing/Processor.py x: 6 # contributors y: 16 # changes bq-connector/main.py x: 5 # contributors y: 12 # changes filter-hitl-language/docai_utils.py x: 2 # contributors y: 5 # changes pdf-splitter-python/main.py x: 6 # contributors y: 8 # changes tax-processing-pipeline-python/consts.py x: 3 # contributors y: 8 # changes web-app-demo/Backend/main.py x: 6 # contributors y: 13 # changes bq-connector/docai_bq_connector/doc_ai_processing/DocumentState.py x: 3 # contributors y: 6 # changes web-app-demo/Frontend/src/app/components/processor-selection/processor-selection.component.ts x: 3 # contributors y: 7 # changes
17.0
# changes
  min: 1.0
  average: 2.73
  25th percentile: 1.0
  median: 2.0
  75th percentile: 3.0
  max: 17.0
0 6.0
# contributors
min: 1.0 | average: 1.99 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 3.0 | max: 6.0

Number of Contributors vs. File Size: 284 points

incubator-tools/character_box_removal/character_box_removal.ipynb x: 1 # contributors y: 413 lines of code incubator-tools/divide_pdf_to_high_quality_images/divide_pdf_to_high_quality_images.ipynb x: 1 # contributors y: 263 lines of code incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/__init__.py x: 1 # contributors y: 1 lines of code incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/main.py x: 1 # contributors y: 251 lines of code incubator-tools/docai_document_processing_pipeline/src/process_batch_cf/main.py x: 1 # contributors y: 168 lines of code incubator-tools/image_segmentation/image_segmentation.ipynb x: 1 # contributors y: 366 lines of code incubator-tools/lineitem_improver_crosspage/lineitem_improver_crosspage.ipynb x: 1 # contributors y: 438 lines of code incubator-tools/map_ocr_style_information_to_cde_entities/map_ocr_style_information_to_cde_entities.ipynb x: 1 # contributors y: 562 lines of code incubator-tools/ocr_upgrade_tool_using_enterprise_ocr/ocr_upgrade_tool_using_enterprise_ocr.ipynb x: 1 # contributors y: 1078 lines of code incubator-tools/signature_detection/signature_detection.ipynb x: 1 # contributors y: 523 lines of code incubator-tools/Detecting_language_of_text_within_entities/Detecting_language_of_text_within_entities.ipynb x: 1 # contributors y: 293 lines of code incubator-tools/Label_Section_Headers/Label_Section_Headers.ipynb x: 1 # contributors y: 738 lines of code incubator-tools/PDF_Table_Identification/PDF_Table_Identification.ipynb x: 1 # contributors y: 377 lines of code incubator-tools/Signature_Detection_by_Reading_Pixels/Signature_Detection_by_Reading_Pixels.ipynb x: 1 # contributors y: 460 lines of code incubator-tools/Text_ordering_by_bounding_box_coordinates/Text_ordering_by_bounding_box_coordinates.ipynb x: 1 # contributors y: 312 lines of code incubator-tools/add_vertices_to_entities/add_vertices_to_entities.ipynb x: 1 # contributors y: 389 lines of code incubator-tools/convert_automl_response_to_documentai_format/convert_automl_response_to_documentai_format.ipynb x: 1 # contributors y: 434 lines of code incubator-tools/entity_label_restructuring_tool/entity_label_restructuring_tool.ipynb x: 1 # contributors y: 660 lines of code incubator-tools/export_import_document_schema_gemini/export_import_document_schema_gemini.ipynb x: 1 # contributors y: 693 lines of code incubator-tools/set_field_description_via_api/set_field_description_via_api.ipynb x: 1 # contributors y: 318 lines of code incubator-tools/split_pdf_horizontal_vertical/split_pdf_horizontal_vertical.ipynb x: 1 # contributors y: 270 lines of code incubator-tools/ocr_upgradation_tool/ocr_upgradation_tool.ipynb x: 1 # contributors y: 1055 lines of code incubator-tools/Entity_data_extraction_from_json/Entity_data_extraction_from_json.ipynb x: 1 # contributors y: 357 lines of code incubator-tools/Extracting_Embedded_links_in_PDF/Extracting_Embedded_links_in_PDF.ipynb x: 1 # contributors y: 331 lines of code incubator-tools/cdc_document_type_entity_addition/cdc_document_type_entity_addition.ipynb x: 1 # contributors y: 234 lines of code incubator-tools/cs_decision_matrix_automation/cs_decision_matrix_automation.ipynb x: 1 # contributors y: 794 lines of code incubator-tools/enhance_checkbox/code.ipynb x: 1 # contributors y: 392 lines of code incubator-tools/parse_table_into_chunks/parse_table_into_chunks.ipynb x: 1 # contributors y: 472 lines of code incubator-tools/parsing_documentai_ json_outputs_with_jq/Parsing Document AI JSON Outputs with JQ.ipynb x: 1 # contributors y: 222 lines of code incubator-tools/swap_ocr_confusion_characters/swap_ocr_confusion_characters.ipynb x: 1 # contributors y: 241 lines of code incubator-tools/watermarks_and_line_removal/watermarks_and_line_removal.ipynb x: 1 # contributors y: 511 lines of code classify-split-extract-workflow/classify-job/bq_mlops.py x: 1 # contributors y: 47 lines of code classify-split-extract-workflow/classify-job/config.py x: 1 # contributors y: 210 lines of code classify-split-extract-workflow/classify-job/docai_helper.py x: 1 # contributors y: 32 lines of code classify-split-extract-workflow/classify-job/gcs_helper.py x: 1 # contributors y: 112 lines of code classify-split-extract-workflow/classify-job/main.py x: 1 # contributors y: 42 lines of code classify-split-extract-workflow/classify-job/split_and_classify.py x: 1 # contributors y: 289 lines of code classify-split-extract-workflow/classify-job/utils.py x: 1 # contributors y: 84 lines of code noxfile.py x: 6 # contributors y: 297 lines of code incubator-tools/combine_two_processors_output/combine_two_processor_output.ipynb x: 1 # contributors y: 652 lines of code incubator-tools/reverse_annotation_tool/reverse_annotation_tool.ipynb x: 1 # contributors y: 955 lines of code cx-content-moderation/main.go x: 3 # contributors y: 86 lines of code incubator-tools/Reference_architecture_asynchronous/auto_deploy_v8/CFScript/main.py x: 3 # contributors y: 141 lines of code incubator-tools/advance_table_line_enhancement/tool_helper_functions.py x: 2 # contributors y: 836 lines of code incubator-tools/backmapping_entities_from_parser_output_to_original_language/backmap_utils.py x: 2 # contributors y: 886 lines of code incubator-tools/synonyms_based_splitter_document_labeling/synonyms_based_splitter_document_labeling.ipynb x: 2 # contributors y: 869 lines of code incubator-tools/advance_table_line_enhancement/Table_Spanning_Page_Merge_Script.ipynb x: 1 # contributors y: 631 lines of code incubator-tools/advance_table_line_enhancement/line_enhancement_basic_flow.ipynb x: 1 # contributors y: 422 lines of code incubator-tools/bank_statement_post_processing_tool/bank_statement_post_processing_tool.ipynb x: 1 # contributors y: 2841 lines of code incubator-tools/bank_statements_line_items_improver_and_missing_items_finder/bank_statements_line_items_improver_and_missing_items_finder.ipynb x: 1 # contributors y: 1149 lines of code incubator-tools/categorizing_bank_statement_transactions_by_account_number/categorizing_bank_statement_transactions_by_account_number.ipynb x: 1 # contributors y: 750 lines of code incubator-tools/document-schema-from-form-parser-output/document-schema-from-form-parser-output.ipynb x: 1 # contributors y: 542 lines of code incubator-tools/formparser_table_to_entity_converter_tool/formparser_table_to_entity_converter_tool.ipynb x: 1 # contributors y: 590 lines of code incubator-tools/paragraph_separation/paragraph_separation.ipynb x: 1 # contributors y: 578 lines of code incubator-tools/specific_format_line_items_tagging/specific_format_line_items_tagging.ipynb x: 1 # contributors y: 948 lines of code pdf-embedded-text/main.py x: 2 # contributors y: 130 lines of code incubator-tools/cmek_docai_processor/cmek_docai_processor.ipynb x: 1 # contributors y: 907 lines of code incubator-tools/entity_sorting_csharp/entity_sorting_csharp.cs x: 1 # contributors y: 77 lines of code incubator-tools/old_ocr_to_new_ocr_conversion/old_ocr_to_new_ocr_conversion.ipynb x: 1 # contributors y: 484 lines of code uptraining_docai_processor_using_python/docai_uptraining.ipynb x: 1 # contributors y: 1009 lines of code incubator-tools/best-practices/utilities/utilities.py x: 3 # contributors y: 439 lines of code incubator-tools/combine_address_line/combine_address_line.ipynb x: 1 # contributors y: 530 lines of code incubator-tools/docai_processor_visual_assessment_tool/docai_processor_visual_assessment.ipynb x: 1 # contributors y: 782 lines of code incubator-tools/importing_processor_and_evaluating_with_alternate_test_sets/importing_processor_and_evaluating_with_alternate_test_sets.ipynb x: 1 # contributors y: 766 lines of code incubator-tools/best-practices/identifying_poor_performing_docs/identifying_poor_performing_docs.ipynb x: 1 # contributors y: 601 lines of code incubator-tools/best-practices/pre_post_hitl_visualization/pre_and_post_hitl_visualization.ipynb x: 1 # contributors y: 500 lines of code incubator-tools/child_entity_tag_using_header/child_entity_tag_using_header.ipynb x: 1 # contributors y: 872 lines of code incubator-tools/docai_processor_migration/docai_processor_migration.ipynb x: 1 # contributors y: 1089 lines of code incubator-tools/line_item_comparision/line_item_comparision.ipynb x: 1 # contributors y: 714 lines of code incubator-tools/line_item_improver/line_items_improver_post_processing.ipynb x: 1 # contributors y: 1108 lines of code community/identity-form-autofiller-python/src/docai.py x: 4 # contributors y: 207 lines of code tax-processing-pipeline-python/docai_pipeline.py x: 5 # contributors y: 59 lines of code tax-processing-pipeline-python/docai_utils.py x: 5 # contributors y: 91 lines of code tax-processing-pipeline-python/firestore_utils.py x: 5 # contributors y: 16 lines of code tax-processing-pipeline-python/general_utils.py x: 4 # contributors y: 9 lines of code tax-processing-pipeline-python/main.py x: 5 # contributors y: 83 lines of code tax-processing-pipeline-python/setup.py x: 4 # contributors y: 76 lines of code tax-processing-pipeline-python/tax_pipeline.py x: 5 # contributors y: 157 lines of code tax-processing-pipeline-python/templates/index.html x: 4 # contributors y: 255 lines of code incubator-tools/best-practices/parser_result_merger/docai_parser_result_merger.ipynb x: 1 # contributors y: 613 lines of code incubator-tools/best-practices/removing_empty_bounding_boxes/removing_empty_bounding_boxes.ipynb x: 1 # contributors y: 338 lines of code document-processing-workflows/main.tf x: 2 # contributors y: 363 lines of code document_ai_warehouse/common/src/common/utils/document_ai_utils.py x: 2 # contributors y: 223 lines of code document_ai_warehouse/common/src/common/utils/document_warehouse_utils.py x: 2 # contributors y: 278 lines of code document_ai_warehouse/common/src/common/utils/logging_handler.py x: 2 # contributors y: 29 lines of code document_ai_warehouse/document_ai_warehouse_batch_ingestion/main.py x: 2 # contributors y: 531 lines of code document-processing-workflows/src/functions/parse-results/main.py x: 2 # contributors y: 317 lines of code document-processing-workflows/src/functions/split-document/main.py x: 2 # contributors y: 50 lines of code document_ai_warehouse/common/src/common/utils/docai_warehouse_helper.py x: 1 # contributors y: 165 lines of code document_ai_warehouse/document_ai_warehouse_batch_ingestion/config.py x: 1 # contributors y: 20 lines of code hitl-custom-review/hitl-custom-review.ipynb x: 2 # contributors y: 346 lines of code apps-script-google-drive/documentai.gs x: 2 # contributors y: 98 lines of code community/codelabs/docai-form-parser/form_parser.py x: 5 # contributors y: 55 lines of code community/codelabs/docai-form-parser/table_parsing.py x: 2 # contributors y: 70 lines of code community/codelabs/docai-specialized-processors/classification.py x: 5 # contributors y: 44 lines of code community/identity-form-autofiller-python/src/frontend/index.html x: 2 # contributors y: 118 lines of code community/identity-form-autofiller-python/src/frontend/styles.css x: 2 # contributors y: 102 lines of code cx-content-moderation/public/index.html x: 3 # contributors y: 42 lines of code document-json-explorer/public/index.html x: 3 # contributors y: 31 lines of code document-json-explorer/src/App.js x: 3 # contributors y: 9 lines of code document-json-explorer/src/Details.js x: 3 # contributors y: 180 lines of code document-json-explorer/src/DocAITopLevel.js x: 3 # contributors y: 95 lines of code document-json-explorer/src/DrawDocument.js x: 3 # contributors y: 132 lines of code document-json-explorer/src/Entity.js x: 3 # contributors y: 71 lines of code document-json-explorer/src/EntityHilight.js x: 3 # contributors y: 67 lines of code document-json-explorer/src/JSONPage.js x: 3 # contributors y: 19 lines of code document-json-explorer/src/PageSelector.js x: 3 # contributors y: 46 lines of code document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example/SearchDocuments.java x: 2 # contributors y: 85 lines of code extract-tables/main.py x: 4 # contributors y: 66 lines of code fraud-detection-python/cloud-functions/process-invoices/main.py x: 5 # contributors y: 182 lines of code web-app-demo/Frontend/src/app/app.component.ts x: 4 # contributors y: 92 lines of code web-app-pix2info-python/src/frontend/index.html x: 2 # contributors y: 154 lines of code bq-connector/docai_bq_connector/connector/BqMetadataMapper.py x: 6 # contributors y: 75 lines of code community/codelabs/docai-ocr/batch_processing_toolbox.py x: 2 # contributors y: 67 lines of code document_ai_warehouse/document_ai_warehouse_processing_python/document_warehouse_utils.py x: 2 # contributors y: 259 lines of code web-app-pix2info-python/src/backend/docai.py x: 2 # contributors y: 184 lines of code web-app-pix2info-python/src/backend/etag.py x: 2 # contributors y: 16 lines of code web-app-pix2info-python/src/backend/options.py x: 2 # contributors y: 42 lines of code web-app-pix2info-python/src/backend/render.py x: 2 # contributors y: 634 lines of code bq-connector/docai_bq_connector/connector/BqDocumentMapper.py x: 6 # contributors y: 318 lines of code bq-connector/docai_bq_connector/connector/DocAIBQConnector.py x: 6 # contributors y: 280 lines of code bq-connector/docai_bq_connector/doc_ai_processing/Processor.py x: 6 # contributors y: 199 lines of code bq-connector/docai_bq_connector/__init__.py x: 5 # contributors y: 2 lines of code bq-connector/main.py x: 5 # contributors y: 239 lines of code community/expense-parser-python/cloud-functions/main.py x: 3 # contributors y: 106 lines of code community/identity-form-autofiller-python/src/docai_schemas.py x: 3 # contributors y: 162 lines of code community/identity-form-autofiller-python/src/main.py x: 4 # contributors y: 104 lines of code community/pdf-annotator-python/main.py x: 5 # contributors y: 133 lines of code document_ai_warehouse/document_ai_warehouse_processing_python/document_ai_utils.py x: 1 # contributors y: 67 lines of code document_ai_warehouse/document_ai_warehouse_processing_python/dw_processing.ipynb x: 1 # contributors y: 1283 lines of code filter-hitl-language/main.py x: 2 # contributors y: 8 lines of code pdf-splitter-python/main.py x: 6 # contributors y: 139 lines of code sql-pdf-python/src/cloud-functions/create_docai/main.py x: 4 # contributors y: 46 lines of code sql-pdf-python/src/cloud-functions/process_docai/main.py x: 4 # contributors y: 88 lines of code web-app-demo/Backend/main.py x: 6 # contributors y: 58 lines of code web-app-demo/Frontend/src/app/components/processor-selection/processor-selection.component.ts x: 3 # contributors y: 225 lines of code web-app-demo/Frontend/src/app/components/entity-tab/entity-tab.component.ts x: 2 # contributors y: 224 lines of code
2841.0
lines of code
  min: 1.0
  average: 262.51
  25th percentile: 35.25
  median: 162.5
  75th percentile: 403.25
  max: 2841.0
0 6.0
# contributors
min: 1.0 | average: 1.99 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 3.0 | max: 6.0