aws-samples / amazon-textract-serverless-large-scale-document-processing
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 23 files with 3,219 lines of code.
    • 0 very long files (0 lines of code)
    • 2 long files (1,044 lines of code)
    • 1 medium size files (239 lines of codeclsfd_ftr_w_mp_ins)
    • 6 small files (930 lines of code)
    • 14 very small files (1,006 lines of code)
0% | 32% | 7% | 28% | 31%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 35% | 0% | 31% | 33%
ts0% | 0% | 100% | 0% | 0%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
src0% | 32% | 0% | 29% | 37%
textract-pipeline/lambda/textractor0% | 86% | 0% | 0% | 13%
textract-pipeline/lib0% | 0% | 100% | 0% | 0%
textract-pipeline/lambda/helper0% | 0% | 0% | 100% | 0%
textract-pipeline/lambda/asyncprocessor0% | 0% | 0% | 100% | 0%
textract-pipeline/lambda/jobresultprocessor0% | 0% | 0% | 0% | 100%
textract-pipeline/lambda/syncprocessor0% | 0% | 0% | 0% | 100%
textract-pipeline/lambda/documentprocessor0% | 0% | 0% | 0% | 100%
textract-pipeline/lambda/s3batchprocessor0% | 0% | 0% | 0% | 100%
textract-pipeline/lambda/s3processor0% | 0% | 0% | 0% | 100%
Longest Files (Top 23)
File# lines# units
trp.py
in textract-pipeline/lambda/textractor/python
522 103
trp.py
in src
522 103
textract-pipeline-stack.ts
in textract-pipeline/lib
239 1
helper.py
in textract-pipeline/lambda/helper/python
185 21
helper.py
in src
185 21
lambda_function.py
in textract-pipeline/lambda/asyncprocessor
166 7
asyncproc.py
in src
166 7
datastore.py
in textract-pipeline/lambda/helper/python
114 7
datastore.py
in src
114 7
events.py
in src
100 9
test.py
in src
98 9
lambda_function.py
in textract-pipeline/lambda/jobresultprocessor
85 4
jobresultsproc.py
in src
85 4
og.py
in textract-pipeline/lambda/textractor/python
83 6
og.py
in src
83 6
lambda_function.py
in textract-pipeline/lambda/syncprocessor
78 4
syncproc.py
in src
78 4
lambda_function.py
in textract-pipeline/lambda/documentprocessor
72 4
docproc.py
in src
72 4
lambda_function.py
in textract-pipeline/lambda/s3batchprocessor
52 2
s3batchproc.py
in src
52 2
lambda_function.py
in textract-pipeline/lambda/s3processor
34 2
s3proc.py
in src
34 2
Files With Most Units (Top 20)
File# lines# units
trp.py
in textract-pipeline/lambda/textractor/python
522 103
trp.py
in src
522 103
helper.py
in textract-pipeline/lambda/helper/python
185 21
helper.py
in src
185 21
test.py
in src
98 9
events.py
in src
100 9
datastore.py
in textract-pipeline/lambda/helper/python
114 7
lambda_function.py
in textract-pipeline/lambda/asyncprocessor
166 7
datastore.py
in src
114 7
asyncproc.py
in src
166 7
og.py
in textract-pipeline/lambda/textractor/python
83 6
og.py
in src
83 6
lambda_function.py
in textract-pipeline/lambda/documentprocessor
72 4
lambda_function.py
in textract-pipeline/lambda/syncprocessor
78 4
lambda_function.py
in textract-pipeline/lambda/jobresultprocessor
85 4
syncproc.py
in src
78 4
docproc.py
in src
72 4
jobresultsproc.py
in src
85 4
lambda_function.py
in textract-pipeline/lambda/s3processor
34 2
lambda_function.py
in textract-pipeline/lambda/s3batchprocessor
52 2
Files With Long Lines (Top 9)

There are 9 files with lines longer than 120 characters. In total, there are 15 long lines.

File# lines# units# long lines
textract-pipeline-stack.ts
in textract-pipeline/lib
239 1 3
datastore.py
in textract-pipeline/lambda/helper/python
114 7 2
trp.py
in textract-pipeline/lambda/textractor/python
522 103 2
datastore.py
in src
114 7 2
trp.py
in src
522 103 2
lambda_function.py
in textract-pipeline/lambda/documentprocessor
72 4 1
lambda_function.py
in textract-pipeline/lambda/syncprocessor
78 4 1
syncproc.py
in src
78 4 1
docproc.py
in src
72 4 1