apache / tika
Features of Interest
Aspects in the source code identified through RegEx patterns.
Intro
Features of interest are any aspects of a software system that can be identified through patterns in code.
Features of interest provide you with a way to focus your attention on relevant parts of the codebase.
Typical examples include, security, TODOs, logging.
A feature of interest may be present in multiple files. Any source code file may be in zero or multiple features of interest.
Overview
GENERAL
TODOs
51,270 LOC (38%)
187 files
|
found text per file
|
found text per file
1 GENERAL Concerns
The "general" group contains
2
concerns.
TODOs
Unclassified
general
1.1 TODOs
51,270 LOC (38%)
187 files
|
found text per file
|
found text per file
The following criteria are used to filter files:
files with any line of content like "
.*(TODO|FIXME)( |:| ).*
".
187
files match defined criteria (
51,270
LOC,
38.9%
vs. main code):
185
*.java files (
43,750
LOC)
1
*.xml files (
7,507
LOC)
1
*.html files (
13
LOC)
372
lines match the content pattern.
details...
per component - primary logical decomposition
tika-parsers
in 110 files, 28,683 LOC (45%)
tika-core
in 29 files, 13,353 LOC (42%)
tika-eval
in 7 files, 1,315 LOC (11%)
tika-server
in 5 files, 1,105 LOC (19%)
tika-pipes
in 13 files, 2,376 LOC (44%)
tika-batch
in 6 files, 1,039 LOC (28%)
tika-fuzzing
in 6 files, 1,416 LOC (71%)
tika-example
in 2 files, 164 LOC (8%)
tika-langdetect
in 3 files, 834 LOC (48%)
tika-translate
in 2 files, 191 LOC (19%)
tika-xmp
in 2 files, 399 LOC (46%)
tika-serialization
in 1 file 198 LOC (37%)
tika-detectors
in 1 file 197 LOC (100%)