apache / opennlp-sandbox
Features of Interest
Aspects in the source code identified through RegEx patterns.
Intro
Features of interest are any aspects of a software system that can be identified through patterns in code.
Features of interest provide you with a way to focus your attention on relevant parts of the codebase.
Typical examples include, security, TODOs, logging.
A feature of interest may be present in multiple files. Any source code file may be in zero or multiple features of interest.
Overview
GENERAL
TODOs
11,346 LOC (27%)
79 files
|
found text per file
|
found text per file
1 GENERAL Concerns
The "general" group contains
2
concerns.
TODOs
Unclassified
general
1.1 TODOs
11,346 LOC (27%)
79 files
|
found text per file
|
found text per file
The following criteria are used to filter files:
files with any line of content like "
.*(TODO|FIXME)( |:| ).*
".
79
files match defined criteria (
11,346
LOC,
27.2%
vs. main code):
74
*.java files (
10,676
LOC)
3
*.py files (
579
LOC)
2
*.js files (
91
LOC)
157
lines match the content pattern.
details...
per component - primary logical decomposition
opennlp-similarity
in 13 files, 3,711 LOC (19%)
opennlp-coref
in 7 files, 990 LOC (17%)
opennlp-wsd
in 7 files, 1,196 LOC (33%)
corpus-server
in 11 files, 1,142 LOC (41%)
caseditor-opennlp-plugin
in 17 files, 1,716 LOC (71%)
opennlp-dl
in 2 files, 339 LOC (21%)
tf-ner-poc
in 4 files, 603 LOC (42%)
nlp-utils
in 1 file 100 LOC (8%)
tagging-server
in 5 files, 328 LOC (44%)
caseditor-corpus-server-plugin
in 3 files, 533 LOC (77%)
wikinews-importer
in 2 files, 316 LOC (72%)
mallet-addon
in 2 files, 176 LOC (48%)
mahout-addon
in 5 files, 196 LOC (74%)