aws-samples / amazon-textract-code-samples
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown so large that it is hard to work with.
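The size categories above can be sketched as a small lookup. This is a minimal illustration, not the report tool's own implementation; the names `BUCKETS` and `classify_file_size` are hypothetical, and only the ranges and labels come from the list above.

```python
# Upper bound (inclusive, in lines of code) and label for each category,
# taken from the ranges listed in the intro. Names here are illustrative.
BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium size"),
    (1000, "long"),
]


def classify_file_size(lines_of_code):
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("a file needs at least one line of code to be classified")
    for upper_bound, label in BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    return "very long"  # 1001+ lines


if __name__ == "__main__":
    # Line counts taken from the "Longest Files" table in this report.
    for name, loc in [("TextractTextDetectionService.cs", 108), ("Program.cs", 99)]:
        print(name, "->", classify_file_size(loc))
```

With these bounds, the two files above 100 lines fall into "small" and every other file in the report into "very small".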
File Size Overall
  • There are 46 files with 1,581 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 0 medium size files (0 lines of code)
    • 2 small files (211 lines of code)
    • 44 very small files (1,370 lines of code)
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 0% | 0% | 13% | 86%
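The percentages appear to be shares of lines of code rather than shares of files (2 of 46 files would be about 4%, not 13%). The sketch below reproduces them from the line counts listed above. It assumes percentages are rounded down, which is inferred only from the fact that 13% and 86% sum to 99%; the function name `size_distribution` is hypothetical.

```python
def size_distribution(lines_per_bucket):
    """Return each bucket's share of total lines of code as a whole percentage.

    Assumption: shares are rounded down (integer division), which matches the
    13% / 86% split shown in this report.
    """
    total = sum(lines_per_bucket.values())
    if total == 0:
        return {bucket: 0 for bucket in lines_per_bucket}
    return {bucket: (100 * lines) // total for bucket, lines in lines_per_bucket.items()}


# Lines of code per category, as listed under "File Size Overall".
overall = {
    "1001+": 0,
    "501-1000": 0,
    "201-500": 0,
    "101-200": 211,
    "1-100": 1370,
}

if __name__ == "__main__":
    shares = size_distribution(overall)
    print(" | ".join(f"{percent}%" for percent in shares.values()))
    # prints: 0% | 0% | 0% | 13% | 86%
```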
File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
cs | 0% | 0% | 0% | 16% | 83%
py | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
src-csharp/Services | 0% | 0% | 0% | 36% | 63%
src-csharp/TextractExtensions | 0% | 0% | 0% | 19% | 80%
src-csharp/ArgHandlers | 0% | 0% | 0% | 0% | 100%
python | 0% | 0% | 0% | 0% | 100%
src-csharp | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 46)
File | Location | # lines | # units
TextractTextDetectionService.cs | src-csharp/Services | 108 | 12
Page.cs | src-csharp/TextractExtensions | 103 | 6
Program.cs | src-csharp | 99 | 1
TextractTextAnalysisService.cs | src-csharp/Services | 82 | 8
TextractDocument.cs | src-csharp/TextractExtensions | 58 | 4
ComprehendService.cs | src-csharp/Services | 55 | 4
12-pdf-text.py | python | 54 | 3
Cell.cs | src-csharp/TextractExtensions | 52 | 2
Table.cs | src-csharp/TextractExtensions | 47 | 2
FormsRedactionHandler.cs | src-csharp/ArgHandlers | 47 | 2
ReadingOrderHandler.cs | src-csharp/ArgHandlers | 45 | 2
TablesExpenseHandler.cs | src-csharp/ArgHandlers | 45 | 2
07-search.py | python | 42 | 1
FieldValue.cs | src-csharp/TextractExtensions | 41 | 2
Line.cs | src-csharp/TextractExtensions | 41 | 1
FieldKey.cs | src-csharp/TextractExtensions | 37 | 2
Field.cs | src-csharp/TextractExtensions | 37 | 1
11-tables-expense.py | python | 32 | 1
TablesHandler.cs | src-csharp/ArgHandlers | 31 | 2
03-reading-order.py | python | 29 | -
FormsHandler.cs | src-csharp/ArgHandlers | 29 | 2
TranslateHandler.cs | src-csharp/ArgHandlers | 28 | 2
05-nlp-medical.py | python | 27 | -
09-forms-redaction.py | python | 26 | -
NlpComprehendHandler.cs | src-csharp/ArgHandlers | 26 | 2
NlpComprehendMedicalHandler.cs | src-csharp/ArgHandlers | 26 | 2
08-forms.py | python | 25 | -
Word.cs | src-csharp/TextractExtensions | 23 | 2
04-nlp-comprehend.py | python | 22 | -
NewGeometry.cs | src-csharp/TextractExtensions | 22 | 2
Form.cs | src-csharp/TextractExtensions | 21 | 4
SearchHandler.cs | src-csharp/ArgHandlers | 21 | 2
TranslateService.cs | src-csharp/Services | 19 | 2
06-translate.py | python | 17 | -
10-tables.py | python | 16 | -
SelectionElement.cs | src-csharp/TextractExtensions | 16 | 1
Row.cs | src-csharp/TextractExtensions | 16 | 2
PdfTextHandler.cs | src-csharp/ArgHandlers | 16 | 2
DetectTextHandler.cs | src-csharp/ArgHandlers | 15 | 2
DetectTextS3Handler.cs | src-csharp/ArgHandlers | 15 | 2
ElasticSearchService.cs | src-csharp/Services | 15 | 2
02-detect-text-s3.py | python | 14 | -
NewBoundingBox.cs | src-csharp/TextractExtensions | 14 | 2
01-detect-text-local.py | python | 9 | -
IndexedText.cs | src-csharp/Services | 9 | 1
Column.cs | src-csharp/Services | 9 | 1
Files With Most Units (Top 20)
File | Location | # lines | # units
TextractTextDetectionService.cs | src-csharp/Services | 108 | 12
TextractTextAnalysisService.cs | src-csharp/Services | 82 | 8
Page.cs | src-csharp/TextractExtensions | 103 | 6
TextractDocument.cs | src-csharp/TextractExtensions | 58 | 4
Form.cs | src-csharp/TextractExtensions | 21 | 4
ComprehendService.cs | src-csharp/Services | 55 | 4
12-pdf-text.py | python | 54 | 3
FieldValue.cs | src-csharp/TextractExtensions | 41 | 2
Word.cs | src-csharp/TextractExtensions | 23 | 2
FieldKey.cs | src-csharp/TextractExtensions | 37 | 2
Table.cs | src-csharp/TextractExtensions | 47 | 2
Cell.cs | src-csharp/TextractExtensions | 52 | 2
NewBoundingBox.cs | src-csharp/TextractExtensions | 14 | 2
Row.cs | src-csharp/TextractExtensions | 16 | 2
NewGeometry.cs | src-csharp/TextractExtensions | 22 | 2
FormsHandler.cs | src-csharp/ArgHandlers | 29 | 2
ReadingOrderHandler.cs | src-csharp/ArgHandlers | 45 | 2
SearchHandler.cs | src-csharp/ArgHandlers | 21 | 2
TablesHandler.cs | src-csharp/ArgHandlers | 31 | 2
DetectTextHandler.cs | src-csharp/ArgHandlers | 15 | 2
Files With Long Lines (Top 11)

There are 11 files with lines longer than 120 characters. In total, there are 19 long lines.

File | Location | # lines | # units | # long lines
Program.cs | src-csharp | 99 | 1 | 7
03-reading-order.py | python | 29 | - | 2
NlpComprehendMedicalHandler.cs | src-csharp/ArgHandlers | 26 | 2 | 2
07-search.py | python | 42 | 1 | 1
Page.cs | src-csharp/TextractExtensions | 103 | 6 | 1
NewGeometry.cs | src-csharp/TextractExtensions | 22 | 2 | 1
ReadingOrderHandler.cs | src-csharp/ArgHandlers | 45 | 2 | 1
SearchHandler.cs | src-csharp/ArgHandlers | 21 | 2 | 1
NlpComprehendHandler.cs | src-csharp/ArgHandlers | 26 | 2 | 1
ComprehendService.cs | src-csharp/Services | 55 | 4 | 1
TranslateService.cs | src-csharp/Services | 19 | 2 | 1
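
A long-line count like the one in this section could be computed roughly as follows. This is only a sketch: the 120-character threshold comes from the report, but how the tool measures line length (tabs, trailing whitespace, encoding) is not stated, so the plain `len()` check and the name `count_long_lines` are assumptions.

```python
LONG_LINE_THRESHOLD = 120  # from the report: "lines longer than 120 characters"


def count_long_lines(text, threshold=LONG_LINE_THRESHOLD):
    """Count lines whose length exceeds the threshold.

    Assumption: length is the number of characters on the line, excluding the
    line terminator; tabs count as one character.
    """
    return sum(1 for line in text.splitlines() if len(line) > threshold)


if __name__ == "__main__":
    # Small in-memory sample: one line of 121 characters, one of exactly 120.
    sample = "x" * 121 + "\n" + "y" * 120 + "\nshort line\n"
    print(count_long_lines(sample))  # prints: 1
```

To check a real file, the same function can be applied to its contents, for example `count_long_lines(pathlib.Path("Program.cs").read_text(encoding="utf-8"))`, where the path is whatever file is being inspected.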