aws-samples / amazon-textract-textractor
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how the lines of code are distributed across files of different sizes.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
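The size categories listed above can be sketched as a small lookup. This is a minimal illustration, not the reporting tool's implementation: the names `CATEGORIES` and `classify` are hypothetical, and how the tool counts a "line of code" (for example, whether blank lines or comments are excluded) is not stated in this report.

```python
from collections import Counter

# Inclusive upper bounds for each category, taken from the ranges in this report.
# The names CATEGORIES and classify are hypothetical, chosen for illustration.
CATEGORIES = [
    (100, "very small"),
    (200, "small"),
    (500, "medium"),
    (1000, "long"),
]


def classify(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("a file must have at least one line of code")
    for upper_bound, label in CATEGORIES:
        if lines_of_code <= upper_bound:
            return label
    return "very long"


# Line counts of a few files as listed in this report.
sample = {
    "tgeofinder.py": 773,
    "trp.py": 535,
    "t_call.py": 335,
    "tdp.py": 193,
    "ta.py": 61,
}
counts = Counter(classify(n) for n in sample.values())
print(counts)
```

Keeping the bounds in a single ordered list means the thresholds can be changed in one place without touching the classification logic.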
File Size Overall
  • There are 48 files with 3,847 lines of code.
    • 0 very long files (0 lines of code)
    • 2 long files (1,308 lines of code)
    • 3 medium size files (760 lines of code)
    • 8 small files (1,065 lines of code)
    • 35 very small files (714 lines of code)
Share of lines of code per category:
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 34% | 19% | 27% | 18%
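The percentages above can be reproduced from the per-category line counts. The sketch below assumes the shares are truncated to whole percentages rather than rounded; that is an inference from the displayed figures (which sum to 98%), not something the report states. The variable names are hypothetical.

```python
# Lines of code per size category, as listed in the overall summary.
lines_per_category = {
    "1001+": 0,
    "501-1000": 1308,
    "201-500": 760,
    "101-200": 1065,
    "1-100": 714,
}

total = sum(lines_per_category.values())

# Integer division truncates each share to a whole percentage (assumed behaviour).
shares = {
    category: 100 * lines // total
    for category, lines in lines_per_category.items()
}

print(total)
print(" | ".join(f"{share}%" for share in shares.values()))
```

Because each share is truncated independently, the displayed values need not add up to exactly 100%.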


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 35% | 20% | 28% | 16%
cfg | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
Logical decomposition: primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
tpipelinegeofinder/textractgeofinder | 0% | 61% | 16% | 20% | 1%
src | 0% | 41% | 16% | 32% | 8%
caller/textractcaller | 0% | 0% | 98% | 0% | 1%
prettyprinter/textractprettyprinter | 0% | 0% | 0% | 98% | 1%
overlayer/textractoverlayer | 0% | 0% | 0% | 66% | 33%
helper | 0% | 0% | 0% | 0% | 100%
tpipelinepagedimensions/textractpagedimensions | 0% | 0% | 0% | 0% | 100%
tpipelinepagedimensions | 0% | 0% | 0% | 0% | 100%
tpipelinegeofinder | 0% | 0% | 0% | 0% | 100%
caller | 0% | 0% | 0% | 0% | 100%
overlayer | 0% | 0% | 0% | 0% | 100%
prettyprinter | 0% | 0% | 0% | 0% | 100%
helper/textracthelper | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 48)
File | Location | # lines | # units
tgeofinder.py | tpipelinegeofinder/textractgeofinder | 773 | 34
trp.py | src | 535 | 103
t_call.py | caller/textractcaller | 335 | 19
og.py | src | 216 | 11
tword.py | tpipelinegeofinder/textractgeofinder | 209 | 15
tdp.py | src | 193 | 12
ocrdb.py | tpipelinegeofinder/textractgeofinder | 138 | 14
t_overlay.py | overlayer/textractoverlayer | 135 | 16
t_pretty_print.py | prettyprinter/textractprettyprinter | 132 | 7
textractor.py | src | 127 | 5
t_pretty_print_expense.py | prettyprinter/textractprettyprinter | 125 | 5
sample_patient_intake_form_parser.py | tpipelinegeofinder/textractgeofinder | 112 | 3
helper.py | src | 103 | 13
setup.py | helper | 79 | 3
t_pagedimensions.py | tpipelinepagedimensions/textractpagedimensions | 77 | 5
image_tools.py | overlayer/textractoverlayer | 64 | 6
ta.py | src | 61 | 10
setup.py | tpipelinepagedimensions | 53 | 1
trptest.py | src | 48 | 3
setup.py | tpipelinegeofinder | 45 | 1
setup.py | caller | 45 | 1
setup.py | overlayer | 44 | 1
setup.py | prettyprinter | 42 | 1
setup.cfg | tpipelinegeofinder | 15 | -
setup.cfg | caller | 15 | -
setup.cfg | helper | 15 | -
setup.cfg | prettyprinter | 15 | -
setup.cfg | overlayer | 15 | -
setup.cfg | tpipelinepagedimensions | 15 | -
tinterface.py | tpipelinegeofinder/textractgeofinder | 10 | -
__init__.py | caller/textractcaller | 5 | -
Manifest.in | helper | 5 | -
Manifest.in | tpipelinegeofinder | 4 | -
__init__.py | tpipelinegeofinder/textractgeofinder | 4 | -
Manifest.in | caller | 4 | -
__init__.py | helper/textracthelper | 4 | -
Manifest.in | prettyprinter | 4 | -
__init__.py | prettyprinter/textractprettyprinter | 4 | -
Manifest.in | overlayer | 4 | -
__init__.py | overlayer/textractoverlayer | 4 | -
Manifest.in | tpipelinepagedimensions | 4 | -
__init__.py | tpipelinepagedimensions/textractpagedimensions | 4 | -
_version.py | tpipelinegeofinder/textractgeofinder | 1 | -
_version.py | caller/textractcaller | 1 | -
_version.py | helper/textracthelper | 1 | -
_version.py | prettyprinter/textractprettyprinter | 1 | -
_version.py | overlayer/textractoverlayer | 1 | -
_version.py | tpipelinepagedimensions/textractpagedimensions | 1 | -
Files With Most Units (Top 20)
File | Location | # lines | # units
trp.py | src | 535 | 103
tgeofinder.py | tpipelinegeofinder/textractgeofinder | 773 | 34
t_call.py | caller/textractcaller | 335 | 19
t_overlay.py | overlayer/textractoverlayer | 135 | 16
tword.py | tpipelinegeofinder/textractgeofinder | 209 | 15
ocrdb.py | tpipelinegeofinder/textractgeofinder | 138 | 14
helper.py | src | 103 | 13
tdp.py | src | 193 | 12
og.py | src | 216 | 11
ta.py | src | 61 | 10
t_pretty_print.py | prettyprinter/textractprettyprinter | 132 | 7
image_tools.py | overlayer/textractoverlayer | 64 | 6
textractor.py | src | 127 | 5
t_pretty_print_expense.py | prettyprinter/textractprettyprinter | 125 | 5
t_pagedimensions.py | tpipelinepagedimensions/textractpagedimensions | 77 | 5
sample_patient_intake_form_parser.py | tpipelinegeofinder/textractgeofinder | 112 | 3
trptest.py | src | 48 | 3
setup.py | helper | 79 | 3
setup.py | tpipelinegeofinder | 45 | 1
setup.py | caller | 45 | 1
Files With Long Lines (Top 6)

There are 6 files with lines longer than 120 characters. In total, there are 11 long lines.

File | Location | # lines | # units | # long lines
tgeofinder.py | tpipelinegeofinder/textractgeofinder | 773 | 34 | 3
t_pretty_print_expense.py | prettyprinter/textractprettyprinter | 125 | 5 | 3
t_overlay.py | overlayer/textractoverlayer | 135 | 16 | 2
trp.py | src | 535 | 103 | 1
__init__.py | caller/textractcaller | 5 | - | 1
t_pagedimensions.py | tpipelinepagedimensions/textractpagedimensions | 77 | 5 | 1
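
A long-line count like the one in this section can be approximated with a few lines of Python. This is a minimal sketch under stated assumptions: the function names are hypothetical, the 120-character threshold comes from this report, and the reporting tool may measure line length differently (for example, in how it treats tabs or trailing whitespace).

```python
from pathlib import Path

# Threshold used in this report: lines longer than 120 characters count as long.
LONG_LINE_LIMIT = 120


def count_long_lines(text: str, limit: int = LONG_LINE_LIMIT) -> int:
    """Count lines whose length, excluding the line ending, exceeds the limit."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def count_long_lines_in_file(path: str, limit: int = LONG_LINE_LIMIT) -> int:
    """Read a source file and count its long lines (hypothetical helper)."""
    text = Path(path).read_text(encoding="utf-8", errors="replace")
    return count_long_lines(text, limit)


# A line of exactly 120 characters is not long; 121 or more is.
example = "a" * 121 + "\n" + "b" * 120 + "\n" + "c" * 200
print(count_long_lines(example))
```

Running `count_long_lines_in_file` over each source file and keeping the files with a non-zero result would give a list comparable to the table above.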