aws-samples / amazon-textract-transformer-pipeline
Features of Interest
Aspects in the source code identified through RegEx patterns.
Intro
Features of interest are any aspects of a software system that can be identified through patterns in code.
Features of interest provide you with a way to focus your attention on relevant parts of the codebase.
Typical examples include, security, TODOs, logging.
A feature of interest may be present in multiple files. Any source code file may be in zero or multiple features of interest.
Overview
GENERAL
TODOs
2,167 LOC (30%)
8 files
|
found text
|
found text per file
1 GENERAL Concerns
The "general" group contains
2
concerns.
TODOs
Unclassified
general
1.1 TODOs
2,167 LOC (30%)
8 files
|
found text
|
found text per file
The following criteria are used to filter files:
files with any line of content like "
.*(TODO|FIXME)( |:| ).*
".
8
files match defined criteria (
2,167
lines of code,
30.7%
vs. main code):
6
*.py files (
1,205
lines of code)
2
*.html files (
962
lines of code)
14
lines match the content pattern.
details...
per component - primary logical decomposition
notebooks/src
in 2 files, 405 LOC (27%)
pipeline/ocr
in 1 file 258 LOC (23%)
pipeline/postprocessing
in 1 file 203 LOC (31%)
notebooks/annotation
in 1 file 560 LOC (100%)
notebooks/review
in 1 file 402 LOC (100%)
pipeline/review
in 1 file 113 LOC (32%)
notebooks/preproc
in 1 file 226 LOC (99%)