GoogleCloudPlatform / document-ai-samples
File Size

The distribution of file sizes (measured in lines of code).

File Size Overall
1001+: 14% | 501-1000: 34% | 201-500: 41% | 101-200: 4% | 1-100: 5%
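A minimal sketch of how a distribution like the one above could be computed. It assumes the percentages are shares of total lines of code per size bucket (not shares of file count); the function names `size_bucket` and `loc_share_by_bucket` are hypothetical and not part of the reporting tool.

```python
# Size buckets as listed in the report, largest first: (label, low, high).
BUCKETS = [
    ("1001+", 1001, None),
    ("501-1000", 501, 1000),
    ("201-500", 201, 500),
    ("101-200", 101, 200),
    ("1-100", 1, 100),
]


def size_bucket(loc):
    """Return the bucket label for a file with `loc` lines of code."""
    for label, low, high in BUCKETS:
        if loc >= low and (high is None or loc <= high):
            return label
    raise ValueError(f"lines of code must be >= 1, got {loc}")


def loc_share_by_bucket(file_sizes):
    """Return the percentage of total lines of code falling in each bucket.

    Assumption: shares are weighted by lines of code, not by file count.
    """
    totals = {label: 0 for label, _, _ in BUCKETS}
    for loc in file_sizes:
        totals[size_bucket(loc)] += loc
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {label: 0.0 for label in totals}
    return {label: 100.0 * t / grand_total for label, t in totals.items()}


# A folder holding a single 1009-line file lands entirely in "1001+".
print(loc_share_by_bucket([1009]))
```

Under this assumption, a folder with one 1009-line notebook reports 100% in the 1001+ bucket, which agrees with the uptraining_docai_processor_using_python row further down.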


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
ipynb | 18% | 37% | 43% | 0% | 0%
py | 0% | 26% | 32% | 19% | 20%
js | 0% | 35% | 19% | 16% | 29%
ts | 0% | 0% | 54% | 0% | 45%
yaml | 0% | 0% | 56% | 41% | 1%
tf | 0% | 0% | 94% | 0% | 5%
html | 0% | 0% | 35% | 38% | 25%
java | 0% | 0% | 0% | 66% | 33%
css | 0% | 0% | 0% | 56% | 43%
gs | 0% | 0% | 0% | 0% | 100%
go | 0% | 0% | 0% | 0% | 100%
cs | 0% | 0% | 0% | 0% | 100%
scss | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
incubator-tools | 14% | 40% | 44% | <1% | <1%
document_ai_warehouse | 32% | 26% | 19% | 13% | 9%
uptraining_docai_processor_using_python | 100% | 0% | 0% | 0% | 0%
web-app-pix2info-python | 0% | 64% | 0% | 29% | 6%
paper_summarization | 0% | 100% | 0% | 0% | 0%
document-processing-workflows | 0% | 0% | 69% | 24% | 5%
bq-connector | 0% | 0% | 61% | 14% | 23%
classify-split-extract-workflow | 0% | 0% | 67% | 10% | 22%
community | 0% | 0% | 32% | 41% | 25%
toolbox-batch-processing | 0% | 0% | 100% | 0% | 0%
web-app-demo | 0% | 0% | 36% | 0% | 63%
hitl-custom-review | 0% | 0% | 100% | 0% | 0%
ROOT | 0% | 0% | 93% | 0% | 6%
tax-processing-pipeline-python | 0% | 0% | 30% | 18% | 50%
watermark-remover | 0% | 0% | 100% | 0% | 0%
document-json-explorer | 0% | 0% | 0% | 33% | 66%
fraud-detection-python | 0% | 0% | 0% | 76% | 23%
pdf-splitter-python | 0% | 0% | 0% | 100% | 0%
pdf-embedded-text | 0% | 0% | 0% | 100% | 0%
sql-pdf-python | 0% | 0% | 0% | 0% | 100%
cx-content-moderation | 0% | 0% | 0% | 0% | 100%
apps-script-google-drive | 0% | 0% | 0% | 0% | 100%
filter-hitl-language | 0% | 0% | 0% | 0% | 100%
extract-tables | 0% | 0% | 0% | 0% | 100%
extract-languages | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
bank_statement_post_processing_tool.ipynb
in incubator-tools/bank_statement_post_processing_tool
2841 -
dw_processing.ipynb
in document_ai_warehouse/document_ai_warehouse_processing_python
1283 -
bank_statements_line_items_improver_and_missing_items_finder.ipynb
in incubator-tools/bank_statements_line_items_improver_and_missing_items_finder
1149 -
line_items_improver_post_processing.ipynb
in incubator-tools/line_item_improver
1108 -
docai_processor_migration.ipynb
in incubator-tools/docai_processor_migration
1089 -
ocr_upgrade_tool_using_enterprise_ocr.ipynb
in incubator-tools/ocr_upgrade_tool_using_enterprise_ocr
1078 -
ocr_upgradation_tool.ipynb
in incubator-tools/ocr_upgradation_tool
1055 -
docai_uptraining.ipynb
in uptraining_docai_processor_using_python
1009 -
reverse_annotation_tool.ipynb
in incubator-tools/reverse_annotation_tool
955 -
specific_format_line_items_tagging.ipynb
in incubator-tools/specific_format_line_items_tagging
948 -
cmek_docai_processor.ipynb
in incubator-tools/cmek_docai_processor
907 -
backmap_utils.py
in incubator-tools/backmapping_entities_from_parser_output_to_original_language
886 21
child_entity_tag_using_header.ipynb
in incubator-tools/child_entity_tag_using_header
872 -
synonyms_based_splitter_document_labeling.ipynb
in incubator-tools/synonyms_based_splitter_document_labeling
869 -
tool_helper_functions.py
in incubator-tools/advance_table_line_enhancement
836 33
cs_decision_matrix_automation.ipynb
in incubator-tools/cs_decision_matrix_automation
794 -
docai_processor_visual_assessment.ipynb
in incubator-tools/docai_processor_visual_assessment_tool
782 -
importing_processor_and_evaluating_with_alternate_test_sets.ipynb
in incubator-tools/importing_processor_and_evaluating_with_alternate_test_sets
766 -
pii_synthetic_redaction_tool.ipynb
in incubator-tools/pii_synthetic_redaction_tool
759 -
date_entities_annotation.ipynb
in incubator-tools/date_entities_annotation_tool
755 -
categorizing_bank_statement_transactions_by_account_number.ipynb
in incubator-tools/categorizing_bank_statement_transactions_by_account_number
750 -
Label_Section_Headers.ipynb
in incubator-tools/Label_Section_Headers
738 -
paper_summarization.ipynb
in paper_summarization
737 -
line_item_comparision.ipynb
in incubator-tools/line_item_comparision
714 -
export_import_document_schema_gemini.ipynb
in incubator-tools/export_import_document_schema_gemini
693 -
amount_rectification_from_words_to_numbers.ipynb
in incubator-tools/amount_rectification_from_words_to_numbers
688 -
scripts.js
in web-app-pix2info-python/src/frontend
666 93
entity_label_restructuring_tool.ipynb
in incubator-tools/entity_label_restructuring_tool
660 -
document_level_accuracy.ipynb
in incubator-tools/document_level_accuracy
655 -
combine_two_processor_output.ipynb
in incubator-tools/combine_two_processors_output
652 -
render.py
in web-app-pix2info-python/src/backend
634 60
Table_Spanning_Page_Merge_Script.ipynb
in incubator-tools/advance_table_line_enhancement
631 -
docai_parser_result_merger.ipynb
in incubator-tools/best-practices/parser_result_merger
613 -
identifying_poor_performing_docs.ipynb
in incubator-tools/best-practices/identifying_poor_performing_docs
601 -
formparser_table_to_entity_converter_tool.ipynb
in incubator-tools/formparser_table_to_entity_converter_tool
590 -
paragraph_separation.ipynb
in incubator-tools/paragraph_separation
578 -
map_ocr_style_information_to_cde_entities.ipynb
in incubator-tools/map_ocr_style_information_to_cde_entities
562 -
document-schema-from-form-parser-output.ipynb
in incubator-tools/document-schema-from-form-parser-output
542 -
language_detection.ipynb
in incubator-tools/language_detection
540 -
main.py
in document_ai_warehouse/document_ai_warehouse_batch_ingestion
531 22
combine_address_line.ipynb
in incubator-tools/combine_address_line
530 -
signature_detection.ipynb
in incubator-tools/signature_detection
523 -
ocr_based_document_section_splitter.ipynb
in incubator-tools/ocr_based_document_section_splitter
512 -
watermarks_and_line_removal.ipynb
in incubator-tools/watermarks_and_line_removal
511 -
document_ai_warehouse.ipynb
in document_ai_warehouse/document-ai-warehouse-java-samples
503 -
pre_and_post_hitl_visualization.ipynb
in incubator-tools/best-practices/pre_post_hitl_visualization
500 -
parsed_json_split_address.ipynb
in incubator-tools/parsed_json_split_address
484 -
old_ocr_to_new_ocr_conversion.ipynb
in incubator-tools/old_ocr_to_new_ocr_conversion
484 -
extending_entity_bounding_box.ipynb
in incubator-tools/extending_entity_bounding_boxes
476 -
parse_table_into_chunks.ipynb
in incubator-tools/parse_table_into_chunks
472 -
Files With Most Units (Top 50)
File | # lines | # units
scripts.js
in web-app-pix2info-python/src/frontend
666 93
render.py
in web-app-pix2info-python/src/backend
634 60
scripts.js
in community/identity-form-autofiller-python/src/frontend
366 43
tool_helper_functions.py
in incubator-tools/advance_table_line_enhancement
836 33
main.py
in document_ai_warehouse/document_ai_warehouse_batch_ingestion
531 22
utilities.py
in incubator-tools/best-practices/utilities
439 22
document_warehouse_utils.py
in document_ai_warehouse/common/src/common/utils
278 21
backmap_utils.py
in incubator-tools/backmapping_entities_from_parser_output_to_original_language
886 21
docai.py
in community/identity-form-autofiller-python/src
207 20
document_warehouse_utils.py
in document_ai_warehouse/document_ai_warehouse_processing_python
259 19
main.py
in web-app-pix2info-python/src
111 13
BqDocumentMapper.py
in bq-connector/docai_bq_connector/connector
318 13
main.py
in incubator-tools/docai_document_processing_pipeline/src/load_queue_cf
251 13
noxfile.py
in root
297 13
config.py
in classify-split-extract-workflow/classify-job
210 12
main.py
in community/identity-form-autofiller-python/src
104 12
docai.py
in web-app-pix2info-python/src/backend
184 11
utils.py
in classify-split-extract-workflow/classify-job
84 10
split_and_classify.py
in classify-split-extract-workflow/classify-job
289 10
main.py
in pdf-embedded-text
130 10
main.py
in incubator-tools/Reference_architecture_asynchronous/auto_deploy_v8/CFScript
141 10
data-sharing-service.service.ts
in web-app-demo/Frontend/src/app
57 10
Processor.py
in bq-connector/docai_bq_connector/doc_ai_processing
199 9
BqMetadataMapper.py
in bq-connector/docai_bq_connector/connector
75 9
document_ai_utils.py
in document_ai_warehouse/common/src/common/utils
223 9
docai_utils.py
in tax-processing-pipeline-python
91 8
StorageManager.py
in bq-connector/docai_bq_connector/bigquery
84 7
DocumentField.py
in bq-connector/docai_bq_connector/doc_ai_processing
47 7
main.py
in tax-processing-pipeline-python
83 7
main.py
in fraud-detection-python/cloud-functions/process-invoices
182 7
main.py
in incubator-tools/docai_document_processing_pipeline/src/process_batch_cf
168 7
main.py
in document-processing-workflows/src/functions/parse-results
317 7
gcs_helper.py
in classify-split-extract-workflow/classify-job
112 6
documentai.gs
in apps-script-google-drive
98 6
document_ai_utils.py
in document_ai_warehouse/document_ai_warehouse_processing_python
67 6
main.py
in community/expense-parser-python/cloud-functions
106 6
processor-selection.component.ts
in web-app-demo/Frontend/src/app/components/processor-selection
225 6
entity-tab.component.ts
in web-app-demo/Frontend/src/app/components/entity-tab
224 6
main.go
in cx-content-moderation
86 5
main.py
in pdf-splitter-python
139 5
CreateDocumentDocAi.java
in document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example
133 5
CreateDocument.java
in document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example
105 5
SearchDocuments.java
in document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example
85 5
DocAIBQConnector.py
in bq-connector/docai_bq_connector/connector
280 4
ConversionError.py
in bq-connector/docai_bq_connector/connector
39 4
CreateSchema.java
in document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example
116 4
ListSchema.java
in document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example
50 4
docai_warehouse_helper.py
in document_ai_warehouse/common/src/common/utils
165 4
helper.py
in document_ai_warehouse/common/src/common/utils
29 4
logging_handler.py
in document_ai_warehouse/common/src/common/utils
29 4
Files With Long Lines (Top 50)

There are 123 files with lines longer than 120 characters. In total, there are 1146 long lines.
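A minimal sketch of how counts like these could be derived, assuming a "long line" is any line with more than 120 characters, as the sentence above states. The helper names `count_long_lines` and `summarize_long_lines` are hypothetical, and the input is a plain mapping of file name to file text rather than anything the reporting tool exposes.

```python
def count_long_lines(text, limit=120):
    """Count lines in `text` that are strictly longer than `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def summarize_long_lines(files, limit=120):
    """Return (number of files with long lines, total number of long lines).

    `files` maps a file name to its full text content.
    """
    per_file = {name: count_long_lines(text, limit) for name, text in files.items()}
    flagged = {name: n for name, n in per_file.items() if n > 0}
    return len(flagged), sum(flagged.values())


# Illustrative input: one file with a single 121-character line, one without.
sample = {
    "a.py": "x" * 121 + "\n" + "y" * 120,
    "b.py": "short line",
}
print(summarize_long_lines(sample))
```

A line of exactly 120 characters is not counted, since the threshold is "longer than 120".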

File | # lines | # units | # long lines
hitl-custom-review.ipynb
in hitl-custom-review
346 - 61
bank_statement_post_processing_tool.ipynb
in incubator-tools/bank_statement_post_processing_tool
2841 - 41
docai_processor_migration.ipynb
in incubator-tools/docai_processor_migration
1089 - 37
466 - 32
Extracting_Embedded_links_in_PDF.ipynb
in incubator-tools/Extracting_Embedded_links_in_PDF
331 - 32
ocr_upgradation_tool.ipynb
in incubator-tools/ocr_upgradation_tool
1055 - 25
docai_parser_result_merger.ipynb
in incubator-tools/best-practices/parser_result_merger
613 - 24
identifying_poor_performing_docs.ipynb
in incubator-tools/best-practices/identifying_poor_performing_docs
601 - 23
child_entity_tag_using_header.ipynb
in incubator-tools/child_entity_tag_using_header
872 - 22
docai_document_processing_pipeline.ipynb
in incubator-tools/docai_document_processing_pipeline
413 - 20
cds_dataset_creator.ipynb
in incubator-tools/cds_dataset_creator
461 - 19
code.ipynb
in incubator-tools/enhance_checkbox
392 - 19
cmek_docai_processor.ipynb
in incubator-tools/cmek_docai_processor
907 - 19
synonyms_based_splitter_document_labeling.ipynb
in incubator-tools/synonyms_based_splitter_document_labeling
869 - 18
ocr_upgrade_tool_using_enterprise_ocr.ipynb
in incubator-tools/ocr_upgrade_tool_using_enterprise_ocr
1078 - 18
importing_processor_and_evaluating_with_alternate_test_sets.ipynb
in incubator-tools/importing_processor_and_evaluating_with_alternate_test_sets
766 - 17
Table_Spanning_Page_Merge_Script.ipynb
in incubator-tools/advance_table_line_enhancement
631 - 17
categorizing_bank_statement_transactions_by_account_number.ipynb
in incubator-tools/categorizing_bank_statement_transactions_by_account_number
750 - 16
combine_address_line.ipynb
in incubator-tools/combine_address_line
530 - 15
docai_processor_visual_assessment.ipynb
in incubator-tools/docai_processor_visual_assessment_tool
782 - 15
line_enhancement_basic_flow.ipynb
in incubator-tools/advance_table_line_enhancement
422 - 14
Table_Parsing_using_CDE_Headers_and_Form_parser.ipynb
in incubator-tools/advance_table_line_enhancement
401 - 14
Table_extraction_with_Line_Enhancement.ipynb
in incubator-tools/advance_table_line_enhancement
439 - 14
docai_uptraining.ipynb
in uptraining_docai_processor_using_python
1009 - 14
Docai_json_to_canonical_json_conversion.ipynb
in incubator-tools/DocAI_Json_to_Canonical_Json_Conversion
405 - 13
currency_normalization.ipynb
in incubator-tools/currency_normalization
330 - 13
Label_Section_Headers.ipynb
in incubator-tools/Label_Section_Headers
738 - 13
ocr_based_document_section_splitter.ipynb
in incubator-tools/ocr_based_document_section_splitter
512 - 12
bank_statements_line_items_improver_and_missing_items_finder.ipynb
in incubator-tools/bank_statements_line_items_improver_and_missing_items_finder
1149 - 12
formparser_table_to_entity_converter_tool.ipynb
in incubator-tools/formparser_table_to_entity_converter_tool
590 - 12
Parsing Document AI JSON Outputs with JQ.ipynb
in incubator-tools/parsing_documentai_ json_outputs_with_jq
222 - 11
labeled_dataset_validation.ipynb
in incubator-tools/labeled_dataset_validation
409 - 11
document_ai_warehouse.ipynb
in document_ai_warehouse/document-ai-warehouse-java-samples
503 - 10
overlapping_split.ipynb
in incubator-tools/overlapping_split
423 - 10
language_detection.ipynb
in incubator-tools/language_detection
540 - 10
post_processing_negative_values.ipynb
in incubator-tools/post_processing_negative_values
309 - 10
parsed_json_split_address.ipynb
in incubator-tools/parsed_json_split_address
484 - 10
export_import_document_schema_gemini.ipynb
in incubator-tools/export_import_document_schema_gemini
693 - 10
special_character_removal.ipynb
in incubator-tools/special_character_removal
394 - 10
load_documents.yaml
in document-processing-workflows/src/workflows
222 - 10
date_entities_annotation.ipynb
in incubator-tools/date_entities_annotation_tool
755 - 9
line_items_improver_post_processing.ipynb
in incubator-tools/line_item_improver
1108 - 9
pii_synthetic_redaction_tool.ipynb
in incubator-tools/pii_synthetic_redaction_tool
759 - 9
pre_post_bounding_box_mismatch.ipynb
in incubator-tools/best-practices/pre_post_bounding_box_mismatch
415 - 9
fp_tables_to_csv.ipynb
in incubator-tools/advance_table_line_enhancement
314 - 9
document-schema-from-form-parser-output.ipynb
in incubator-tools/document-schema-from-form-parser-output
542 - 9
classify-extract.yaml
in classify-split-extract-workflow
215 - 8
dw_processing.ipynb
in document_ai_warehouse/document_ai_warehouse_processing_python
1283 - 8
Text_ordering_by_bounding_box_coordinates.ipynb
in incubator-tools/Text_ordering_by_bounding_box_coordinates
312 - 8
label_migration_child_to_parent.ipynb
in incubator-tools/label_migration_child_to_parent
273 - 8
Correlations

File Size vs. Commits (all time): 284 points

[Scatter plot: one point per file; x = commits (all time), y = lines of code]
y axis, lines of code: min: 1.0 | average: 262.51 | 25th percentile: 35.25 | median: 162.5 | 75th percentile: 403.25 | max: 2841.0
x axis, commits (all time): min: 1.0 | average: 4.64 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 5.0 | max: 40.0

File Size vs. Contributors (all time): 284 points

incubator-tools/character_box_removal/character_box_removal.ipynb x: 1 contributors (all time) y: 413 lines of code
incubator-tools/divide_pdf_to_high_quality_images/divide_pdf_to_high_quality_images.ipynb x: 1 contributors (all time) y: 263 lines of code
incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/__init__.py x: 1 contributors (all time) y: 1 lines of code
incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/main.py x: 1 contributors (all time) y: 251 lines of code
incubator-tools/docai_document_processing_pipeline/src/process_batch_cf/main.py x: 1 contributors (all time) y: 168 lines of code
incubator-tools/image_segmentation/image_segmentation.ipynb x: 1 contributors (all time) y: 366 lines of code
incubator-tools/lineitem_improver_crosspage/lineitem_improver_crosspage.ipynb x: 1 contributors (all time) y: 438 lines of code
incubator-tools/map_ocr_style_information_to_cde_entities/map_ocr_style_information_to_cde_entities.ipynb x: 1 contributors (all time) y: 562 lines of code
incubator-tools/ocr_upgrade_tool_using_enterprise_ocr/ocr_upgrade_tool_using_enterprise_ocr.ipynb x: 1 contributors (all time) y: 1078 lines of code
incubator-tools/signature_detection/signature_detection.ipynb x: 1 contributors (all time) y: 523 lines of code
incubator-tools/Detecting_language_of_text_within_entities/Detecting_language_of_text_within_entities.ipynb x: 1 contributors (all time) y: 293 lines of code
incubator-tools/Label_Section_Headers/Label_Section_Headers.ipynb x: 1 contributors (all time) y: 738 lines of code
incubator-tools/PDF_Table_Identification/PDF_Table_Identification.ipynb x: 1 contributors (all time) y: 377 lines of code
incubator-tools/Signature_Detection_by_Reading_Pixels/Signature_Detection_by_Reading_Pixels.ipynb x: 1 contributors (all time) y: 460 lines of code
incubator-tools/Text_ordering_by_bounding_box_coordinates/Text_ordering_by_bounding_box_coordinates.ipynb x: 1 contributors (all time) y: 312 lines of code
incubator-tools/add_vertices_to_entities/add_vertices_to_entities.ipynb x: 1 contributors (all time) y: 389 lines of code
incubator-tools/convert_automl_response_to_documentai_format/convert_automl_response_to_documentai_format.ipynb x: 1 contributors (all time) y: 434 lines of code
incubator-tools/entity_label_restructuring_tool/entity_label_restructuring_tool.ipynb x: 1 contributors (all time) y: 660 lines of code
incubator-tools/export_import_document_schema_gemini/export_import_document_schema_gemini.ipynb x: 1 contributors (all time) y: 693 lines of code
incubator-tools/set_field_description_via_api/set_field_description_via_api.ipynb x: 1 contributors (all time) y: 318 lines of code
incubator-tools/split_pdf_horizontal_vertical/split_pdf_horizontal_vertical.ipynb x: 1 contributors (all time) y: 270 lines of code
incubator-tools/ocr_upgradation_tool/ocr_upgradation_tool.ipynb x: 1 contributors (all time) y: 1055 lines of code
incubator-tools/Entity_data_extraction_from_json/Entity_data_extraction_from_json.ipynb x: 1 contributors (all time) y: 357 lines of code
incubator-tools/Extracting_Embedded_links_in_PDF/Extracting_Embedded_links_in_PDF.ipynb x: 1 contributors (all time) y: 331 lines of code
incubator-tools/cdc_document_type_entity_addition/cdc_document_type_entity_addition.ipynb x: 1 contributors (all time) y: 234 lines of code
incubator-tools/cs_decision_matrix_automation/cs_decision_matrix_automation.ipynb x: 1 contributors (all time) y: 794 lines of code
incubator-tools/enhance_checkbox/code.ipynb x: 1 contributors (all time) y: 392 lines of code
incubator-tools/parse_table_into_chunks/parse_table_into_chunks.ipynb x: 1 contributors (all time) y: 472 lines of code
incubator-tools/parsing_documentai_ json_outputs_with_jq/Parsing Document AI JSON Outputs with JQ.ipynb x: 1 contributors (all time) y: 222 lines of code
incubator-tools/swap_ocr_confusion_characters/swap_ocr_confusion_characters.ipynb x: 1 contributors (all time) y: 241 lines of code
incubator-tools/watermarks_and_line_removal/watermarks_and_line_removal.ipynb x: 1 contributors (all time) y: 511 lines of code
classify-split-extract-workflow/classify-job/bq_mlops.py x: 1 contributors (all time) y: 47 lines of code
classify-split-extract-workflow/classify-job/config.py x: 1 contributors (all time) y: 210 lines of code
classify-split-extract-workflow/classify-job/docai_helper.py x: 1 contributors (all time) y: 32 lines of code
classify-split-extract-workflow/classify-job/gcs_helper.py x: 1 contributors (all time) y: 112 lines of code
classify-split-extract-workflow/classify-job/main.py x: 1 contributors (all time) y: 42 lines of code
classify-split-extract-workflow/classify-job/split_and_classify.py x: 1 contributors (all time) y: 289 lines of code
classify-split-extract-workflow/classify-job/utils.py x: 1 contributors (all time) y: 84 lines of code
noxfile.py x: 6 contributors (all time) y: 297 lines of code
incubator-tools/combine_two_processors_output/combine_two_processor_output.ipynb x: 1 contributors (all time) y: 652 lines of code
incubator-tools/reverse_annotation_tool/reverse_annotation_tool.ipynb x: 1 contributors (all time) y: 955 lines of code
cx-content-moderation/main.go x: 3 contributors (all time) y: 86 lines of code
incubator-tools/Reference_architecture_asynchronous/auto_deploy_v8/CFScript/main.py x: 3 contributors (all time) y: 141 lines of code
incubator-tools/advance_table_line_enhancement/tool_helper_functions.py x: 2 contributors (all time) y: 836 lines of code
incubator-tools/backmapping_entities_from_parser_output_to_original_language/backmap_utils.py x: 2 contributors (all time) y: 886 lines of code
incubator-tools/synonyms_based_splitter_document_labeling/synonyms_based_splitter_document_labeling.ipynb x: 2 contributors (all time) y: 869 lines of code
incubator-tools/advance_table_line_enhancement/Table_Spanning_Page_Merge_Script.ipynb x: 1 contributors (all time) y: 631 lines of code
incubator-tools/advance_table_line_enhancement/line_enhancement_basic_flow.ipynb x: 1 contributors (all time) y: 422 lines of code
incubator-tools/bank_statement_post_processing_tool/bank_statement_post_processing_tool.ipynb x: 1 contributors (all time) y: 2841 lines of code
incubator-tools/bank_statements_line_items_improver_and_missing_items_finder/bank_statements_line_items_improver_and_missing_items_finder.ipynb x: 1 contributors (all time) y: 1149 lines of code
incubator-tools/categorizing_bank_statement_transactions_by_account_number/categorizing_bank_statement_transactions_by_account_number.ipynb x: 1 contributors (all time) y: 750 lines of code
incubator-tools/document-schema-from-form-parser-output/document-schema-from-form-parser-output.ipynb x: 1 contributors (all time) y: 542 lines of code
incubator-tools/formparser_table_to_entity_converter_tool/formparser_table_to_entity_converter_tool.ipynb x: 1 contributors (all time) y: 590 lines of code
incubator-tools/paragraph_separation/paragraph_separation.ipynb x: 1 contributors (all time) y: 578 lines of code
incubator-tools/specific_format_line_items_tagging/specific_format_line_items_tagging.ipynb x: 1 contributors (all time) y: 948 lines of code
pdf-embedded-text/main.py x: 2 contributors (all time) y: 130 lines of code
incubator-tools/cmek_docai_processor/cmek_docai_processor.ipynb x: 1 contributors (all time) y: 907 lines of code
incubator-tools/entity_sorting_csharp/entity_sorting_csharp.cs x: 1 contributors (all time) y: 77 lines of code
incubator-tools/old_ocr_to_new_ocr_conversion/old_ocr_to_new_ocr_conversion.ipynb x: 1 contributors (all time) y: 484 lines of code
uptraining_docai_processor_using_python/docai_uptraining.ipynb x: 1 contributors (all time) y: 1009 lines of code
incubator-tools/best-practices/utilities/utilities.py x: 3 contributors (all time) y: 439 lines of code
incubator-tools/combine_address_line/combine_address_line.ipynb x: 1 contributors (all time) y: 530 lines of code
incubator-tools/docai_processor_visual_assessment_tool/docai_processor_visual_assessment.ipynb x: 1 contributors (all time) y: 782 lines of code
incubator-tools/importing_processor_and_evaluating_with_alternate_test_sets/importing_processor_and_evaluating_with_alternate_test_sets.ipynb x: 1 contributors (all time) y: 766 lines of code
incubator-tools/best-practices/identifying_poor_performing_docs/identifying_poor_performing_docs.ipynb x: 1 contributors (all time) y: 601 lines of code
incubator-tools/best-practices/pre_post_hitl_visualization/pre_and_post_hitl_visualization.ipynb x: 1 contributors (all time) y: 500 lines of code
incubator-tools/child_entity_tag_using_header/child_entity_tag_using_header.ipynb x: 1 contributors (all time) y: 872 lines of code
incubator-tools/docai_processor_migration/docai_processor_migration.ipynb x: 1 contributors (all time) y: 1089 lines of code
incubator-tools/line_item_comparision/line_item_comparision.ipynb x: 1 contributors (all time) y: 714 lines of code
incubator-tools/line_item_improver/line_items_improver_post_processing.ipynb x: 1 contributors (all time) y: 1108 lines of code
community/identity-form-autofiller-python/src/docai.py x: 4 contributors (all time) y: 207 lines of code
tax-processing-pipeline-python/docai_pipeline.py x: 5 contributors (all time) y: 59 lines of code
tax-processing-pipeline-python/docai_utils.py x: 5 contributors (all time) y: 91 lines of code
tax-processing-pipeline-python/firestore_utils.py x: 5 contributors (all time) y: 16 lines of code
tax-processing-pipeline-python/general_utils.py x: 4 contributors (all time) y: 9 lines of code
tax-processing-pipeline-python/main.py x: 5 contributors (all time) y: 83 lines of code
tax-processing-pipeline-python/setup.py x: 4 contributors (all time) y: 76 lines of code
tax-processing-pipeline-python/tax_pipeline.py x: 5 contributors (all time) y: 157 lines of code
tax-processing-pipeline-python/templates/index.html x: 4 contributors (all time) y: 255 lines of code
incubator-tools/best-practices/parser_result_merger/docai_parser_result_merger.ipynb x: 1 contributors (all time) y: 613 lines of code
incubator-tools/best-practices/removing_empty_bounding_boxes/removing_empty_bounding_boxes.ipynb x: 1 contributors (all time) y: 338 lines of code
document-processing-workflows/main.tf x: 2 contributors (all time) y: 363 lines of code
document_ai_warehouse/common/src/common/utils/document_ai_utils.py x: 2 contributors (all time) y: 223 lines of code
document_ai_warehouse/common/src/common/utils/document_warehouse_utils.py x: 2 contributors (all time) y: 278 lines of code
document_ai_warehouse/common/src/common/utils/logging_handler.py x: 2 contributors (all time) y: 29 lines of code
document_ai_warehouse/document_ai_warehouse_batch_ingestion/main.py x: 2 contributors (all time) y: 531 lines of code
document-processing-workflows/src/functions/parse-results/main.py x: 2 contributors (all time) y: 317 lines of code
document-processing-workflows/src/functions/split-document/main.py x: 2 contributors (all time) y: 50 lines of code
document_ai_warehouse/common/src/common/utils/docai_warehouse_helper.py x: 1 contributors (all time) y: 165 lines of code
document_ai_warehouse/document_ai_warehouse_batch_ingestion/config.py x: 1 contributors (all time) y: 20 lines of code
hitl-custom-review/hitl-custom-review.ipynb x: 2 contributors (all time) y: 346 lines of code
apps-script-google-drive/documentai.gs x: 2 contributors (all time) y: 98 lines of code
community/codelabs/docai-form-parser/form_parser.py x: 5 contributors (all time) y: 55 lines of code
community/codelabs/docai-form-parser/table_parsing.py x: 2 contributors (all time) y: 70 lines of code
community/codelabs/docai-specialized-processors/classification.py x: 5 contributors (all time) y: 44 lines of code
community/identity-form-autofiller-python/src/frontend/index.html x: 2 contributors (all time) y: 118 lines of code
community/identity-form-autofiller-python/src/frontend/styles.css x: 2 contributors (all time) y: 102 lines of code
cx-content-moderation/public/index.html x: 3 contributors (all time) y: 42 lines of code
document-json-explorer/public/index.html x: 3 contributors (all time) y: 31 lines of code
document-json-explorer/src/App.js x: 3 contributors (all time) y: 9 lines of code
document-json-explorer/src/Details.js x: 3 contributors (all time) y: 180 lines of code
document-json-explorer/src/DocAITopLevel.js x: 3 contributors (all time) y: 95 lines of code
document-json-explorer/src/DrawDocument.js x: 3 contributors (all time) y: 132 lines of code
document-json-explorer/src/Entity.js x: 3 contributors (all time) y: 71 lines of code
document-json-explorer/src/EntityHilight.js x: 3 contributors (all time) y: 67 lines of code
document-json-explorer/src/JSONPage.js x: 3 contributors (all time) y: 19 lines of code
document-json-explorer/src/PageSelector.js x: 3 contributors (all time) y: 46 lines of code
document_ai_warehouse/document-ai-warehouse-java-samples/src/main/java/org/example/SearchDocuments.java x: 2 contributors (all time) y: 85 lines of code
extract-tables/main.py x: 4 contributors (all time) y: 66 lines of code
fraud-detection-python/cloud-functions/process-invoices/main.py x: 5 contributors (all time) y: 182 lines of code
web-app-demo/Frontend/src/app/app.component.ts x: 4 contributors (all time) y: 92 lines of code
web-app-pix2info-python/src/frontend/index.html x: 2 contributors (all time) y: 154 lines of code
bq-connector/docai_bq_connector/connector/BqMetadataMapper.py x: 6 contributors (all time) y: 75 lines of code
community/codelabs/docai-ocr/batch_processing_toolbox.py x: 2 contributors (all time) y: 67 lines of code
document_ai_warehouse/document_ai_warehouse_processing_python/document_warehouse_utils.py x: 2 contributors (all time) y: 259 lines of code
web-app-pix2info-python/src/backend/docai.py x: 2 contributors (all time) y: 184 lines of code
web-app-pix2info-python/src/backend/etag.py x: 2 contributors (all time) y: 16 lines of code
web-app-pix2info-python/src/backend/options.py x: 2 contributors (all time) y: 42 lines of code
web-app-pix2info-python/src/backend/render.py x: 2 contributors (all time) y: 634 lines of code
bq-connector/docai_bq_connector/connector/BqDocumentMapper.py x: 6 contributors (all time) y: 318 lines of code
bq-connector/docai_bq_connector/connector/DocAIBQConnector.py x: 6 contributors (all time) y: 280 lines of code
bq-connector/docai_bq_connector/doc_ai_processing/Processor.py x: 6 contributors (all time) y: 199 lines of code
bq-connector/docai_bq_connector/__init__.py x: 5 contributors (all time) y: 2 lines of code
bq-connector/main.py x: 5 contributors (all time) y: 239 lines of code
community/expense-parser-python/cloud-functions/main.py x: 3 contributors (all time) y: 106 lines of code
community/identity-form-autofiller-python/src/docai_schemas.py x: 3 contributors (all time) y: 162 lines of code
community/identity-form-autofiller-python/src/main.py x: 4 contributors (all time) y: 104 lines of code
community/pdf-annotator-python/main.py x: 5 contributors (all time) y: 133 lines of code
document_ai_warehouse/document_ai_warehouse_processing_python/document_ai_utils.py x: 1 contributors (all time) y: 67 lines of code
document_ai_warehouse/document_ai_warehouse_processing_python/dw_processing.ipynb x: 1 contributors (all time) y: 1283 lines of code
filter-hitl-language/main.py x: 2 contributors (all time) y: 8 lines of code
pdf-splitter-python/main.py x: 6 contributors (all time) y: 139 lines of code
sql-pdf-python/src/cloud-functions/create_docai/main.py x: 4 contributors (all time) y: 46 lines of code
sql-pdf-python/src/cloud-functions/process_docai/main.py x: 4 contributors (all time) y: 88 lines of code
web-app-demo/Backend/main.py x: 6 contributors (all time) y: 58 lines of code
web-app-demo/Frontend/src/app/components/processor-selection/processor-selection.component.ts x: 3 contributors (all time) y: 225 lines of code
web-app-demo/Frontend/src/app/components/entity-tab/entity-tab.component.ts x: 2 contributors (all time) y: 224 lines of code
y axis, lines of code: min: 1.0 | average: 262.51 | 25th percentile: 35.25 | median: 162.5 | 75th percentile: 403.25 | max: 2841.0
x axis, contributors (all time): min: 1.0 | average: 1.99 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 3.0 | max: 6.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 13 points

incubator-tools/character_box_removal/character_box_removal.ipynb x: 1 commits (90d) y: 413 lines of code
incubator-tools/divide_pdf_to_high_quality_images/divide_pdf_to_high_quality_images.ipynb x: 1 commits (90d) y: 263 lines of code
incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/__init__.py x: 1 commits (90d) y: 1 lines of code
incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/main.py x: 1 commits (90d) y: 251 lines of code
incubator-tools/docai_document_processing_pipeline/src/process_batch_cf/main.py x: 1 commits (90d) y: 168 lines of code
incubator-tools/image_segmentation/image_segmentation.ipynb x: 1 commits (90d) y: 366 lines of code
incubator-tools/lineitem_improver_crosspage/lineitem_improver_crosspage.ipynb x: 1 commits (90d) y: 438 lines of code
incubator-tools/lineitem_improver_using_column_data/lineitem_improver_using_column_data.ipynb x: 1 commits (90d) y: 442 lines of code
incubator-tools/map_ocr_style_information_to_cde_entities/map_ocr_style_information_to_cde_entities.ipynb x: 1 commits (90d) y: 562 lines of code
incubator-tools/ocr_upgrade_tool_using_enterprise_ocr/ocr_upgrade_tool_using_enterprise_ocr.ipynb x: 1 commits (90d) y: 1078 lines of code
incubator-tools/signature_detection/signature_detection.ipynb x: 1 commits (90d) y: 523 lines of code
y axis, lines of code: min: 1.0 | average: 378.38 | 25th percentile: 209.5 | median: 413.0 | 75th percentile: 482.5 | max: 1078.0
x axis, commits (90d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0
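
The point listings and summary lines in this report can be reproduced from the raw text with a few lines of Python. The sketch below is illustrative only: the regular expression, the function names `parse_points` and `summarize`, and the choice of linearly interpolated ("inclusive") quartiles are assumptions, not the report generator's own code. The sample holds just four of the listed points, so its statistics are not expected to match the figures above.

```python
import re
import statistics

# Assumed shape of one point: "<path> x: <n> commits|contributors (<window>) y: <n> lines of code"
POINT_RE = re.compile(
    r"(?P<path>.+?) x: (?P<x>\d+) (?P<metric>commits|contributors) "
    r"\((?P<window>[^)]+)\) y: (?P<loc>\d+) lines of code\s*"
)


def parse_points(text):
    """Return (path, x value, lines of code) for every point found in text."""
    return [
        (m["path"], int(m["x"]), int(m["loc"]))
        for m in POINT_RE.finditer(text)
    ]


def summarize(values):
    """Compute the six figures shown on each summary line."""
    # method="inclusive" interpolates linearly between data points (an assumption
    # about how the report computes its percentiles).
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "min": min(values),
        "average": round(statistics.mean(values), 2),
        "25th percentile": q1,
        "median": median,
        "75th percentile": q3,
        "max": max(values),
    }


# Four points copied from the 90-day listing, joined the way the page flattens them.
sample = (
    "incubator-tools/character_box_removal/character_box_removal.ipynb "
    "x: 1 commits (90d) y: 413 lines of code "
    "incubator-tools/divide_pdf_to_high_quality_images/divide_pdf_to_high_quality_images.ipynb "
    "x: 1 commits (90d) y: 263 lines of code "
    "incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/__init__.py "
    "x: 1 commits (90d) y: 1 lines of code "
    "incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/main.py "
    "x: 1 commits (90d) y: 251 lines of code"
)

points = parse_points(sample)
stats = summarize([loc for _, _, loc in points])
print(" | ".join(f"{name}: {value}" for name, value in stats.items()))
```

The non-greedy path group lets file names that contain spaces parse correctly, since each path ends only where the ` x: <n> commits` or ` x: <n> contributors` marker begins.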

File Size vs. Contributors (90 days): 13 points

incubator-tools/character_box_removal/character_box_removal.ipynb x: 1 contributors (90d) y: 413 lines of code
incubator-tools/divide_pdf_to_high_quality_images/divide_pdf_to_high_quality_images.ipynb x: 1 contributors (90d) y: 263 lines of code
incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/__init__.py x: 1 contributors (90d) y: 1 lines of code
incubator-tools/docai_document_processing_pipeline/src/load_queue_cf/main.py x: 1 contributors (90d) y: 251 lines of code
incubator-tools/docai_document_processing_pipeline/src/process_batch_cf/main.py x: 1 contributors (90d) y: 168 lines of code
incubator-tools/image_segmentation/image_segmentation.ipynb x: 1 contributors (90d) y: 366 lines of code
incubator-tools/lineitem_improver_crosspage/lineitem_improver_crosspage.ipynb x: 1 contributors (90d) y: 438 lines of code
incubator-tools/lineitem_improver_using_column_data/lineitem_improver_using_column_data.ipynb x: 1 contributors (90d) y: 442 lines of code
incubator-tools/map_ocr_style_information_to_cde_entities/map_ocr_style_information_to_cde_entities.ipynb x: 1 contributors (90d) y: 562 lines of code
incubator-tools/ocr_upgrade_tool_using_enterprise_ocr/ocr_upgrade_tool_using_enterprise_ocr.ipynb x: 1 contributors (90d) y: 1078 lines of code
incubator-tools/signature_detection/signature_detection.ipynb x: 1 contributors (90d) y: 523 lines of code
y axis, lines of code: min: 1.0 | average: 378.38 | 25th percentile: 209.5 | median: 413.0 | 75th percentile: 482.5 | max: 1078.0
x axis, contributors (90d): min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0