apache / tika
Features of Interest
Aspects in the source code identified through RegEx patterns.
Intro
Features of interest are any aspects of a software system that can be identified through patterns in code.
Features of interest provide you with a way to focus your attention on relevant parts of the codebase.
Typical examples include, security, TODOs, logging.
A feature of interest may be present in multiple files. Any source code file may be in zero or multiple features of interest.
Overview
GENERAL
TODOs
54,545 LOC (39%)
198 files
|
found text per file
|
found text per file
1 GENERAL Concerns
The "general" group contains
2
concerns.
TODOs
Unclassified
general
1.1 TODOs
54,545 LOC (39%)
198 files
|
found text per file
|
found text per file
The following criteria are used to filter files:
files with any line of content like "
.*(TODO|FIXME)( |:| ).*
".
198
files match defined criteria (
54,545
LOC,
39.5%
vs. main code):
196
*.java files (
46,532
LOC)
1
*.xml files (
8,000
LOC)
1
*.html files (
13
LOC)
400
lines match the content pattern.
details...
per component - primary logical decomposition
tika-parsers
in 117 files, 30,638 LOC (46%)
tika-core
in 31 files, 14,145 LOC (43%)
tika-eval
in 7 files, 1,307 LOC (10%)
tika-pipes
in 13 files, 2,425 LOC (35%)
tika-server
in 5 files, 1,064 LOC (18%)
tika-batch
in 6 files, 971 LOC (27%)
tika-fuzzing
in 6 files, 1,419 LOC (71%)
tika-example
in 2 files, 155 LOC (8%)
tika-langdetect
in 3 files, 832 LOC (48%)
tika-serialization
in 2 files, 555 LOC (48%)
tika-translate
in 2 files, 191 LOC (19%)
tika-xmp
in 2 files, 399 LOC (47%)
tika-detectors
in 2 files, 444 LOC (100%)