aws-samples / amazon-textract-a2i-pdf
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 13 files with 836 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 1 medium size files (289 lines of codeclsfd_ftr_w_mp_ins)
    • 0 small files (0 lines of code)
    • 12 very small files (547 lines of code)
0% | 0% | 34% | 0% | 65%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 38% | 0% | 61%
java0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
multipagepdfa2i0% | 0% | 99% | 0% | <1%
deploy_code/multipagepdfa2i_analyzepdf0% | 0% | 0% | 0% | 100%
deploy_code/multipagepdfa2i_humancomplete0% | 0% | 0% | 0% | 100%
deploy_code/multipagepdfa2i_wrapup0% | 0% | 0% | 0% | 100%
deploy_code/multipagepdfa2i_pngextract/src/main/java0% | 0% | 0% | 0% | 100%
deploy_code/multipagepdfa2i_kickoff0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
Longest Files (Top 13)
File# lines# units
multipagepdfa2i_stack.py
in multipagepdfa2i
289 7
lambda_function.py
in deploy_code/multipagepdfa2i_analyzepdf
82 5
gather_data.py
in deploy_code/multipagepdfa2i_wrapup
80 8
lambda_function.py
in deploy_code/multipagepdfa2i_humancomplete
65 7
clean_data.py
in deploy_code/multipagepdfa2i_analyzepdf
63 6
clean_data.py
in deploy_code/multipagepdfa2i_humancomplete
63 6
PdfFromS3Pdf.java
in deploy_code/multipagepdfa2i_pngextract/src/main/java
52 3
lambda_function.py
in deploy_code/multipagepdfa2i_kickoff
47 3
setup.py
in root
30 -
lambda_function.py
in deploy_code/multipagepdfa2i_wrapup
30 3
Lambda.java
in deploy_code/multipagepdfa2i_pngextract/src/main/java
29 -
app.py
in root
5 -
__init__.py
in multipagepdfa2i
1 -
Files With Most Units (Top 9)
File# lines# units
gather_data.py
in deploy_code/multipagepdfa2i_wrapup
80 8
multipagepdfa2i_stack.py
in multipagepdfa2i
289 7
lambda_function.py
in deploy_code/multipagepdfa2i_humancomplete
65 7
clean_data.py
in deploy_code/multipagepdfa2i_analyzepdf
63 6
clean_data.py
in deploy_code/multipagepdfa2i_humancomplete
63 6
lambda_function.py
in deploy_code/multipagepdfa2i_analyzepdf
82 5
lambda_function.py
in deploy_code/multipagepdfa2i_wrapup
30 3
lambda_function.py
in deploy_code/multipagepdfa2i_kickoff
47 3
PdfFromS3Pdf.java
in deploy_code/multipagepdfa2i_pngextract/src/main/java
52 3
Files With Long Lines (Top 2)

There are 2 files with lines longer than 120 characters. In total, there are 4 long lines.

File# lines# units# long lines
PdfFromS3Pdf.java
in deploy_code/multipagepdfa2i_pngextract/src/main/java
52 3 3
lambda_function.py
in deploy_code/multipagepdfa2i_humancomplete
65 7 1