facebookresearch / CodeGen
Features of Interest
Aspects in the source code identified through RegEx patterns.
Intro
Features of interest are any aspects of a software system that can be identified through patterns in code.
Features of interest provide you with a way to focus your attention on relevant parts of the codebase.
Typical examples include security, TODOs, and logging.
A feature of interest may be present in multiple files. Any source code file may be in zero or multiple features of interest.
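As a minimal sketch of this idea, the snippet below tags a piece of source text with every feature whose pattern it contains. The feature names, the patterns, and the `features_of` helper are hypothetical and chosen only for illustration; they are not the definitions used to produce this report.

```python
import re

# Hypothetical feature definitions, for illustration only.
FEATURES = {
    "todos": re.compile(r"TODO|FIXME"),
    "logging": re.compile(r"\blogg(er|ing)\b"),
}


def features_of(text):
    """Return the sorted names of all features whose pattern occurs in text."""
    return sorted(name for name, pattern in FEATURES.items() if pattern.search(text))
```

A file that mentions both a logger and a TODO lands in two features, while a file that matches no pattern lands in none.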
Overview
GENERAL
TODOs
5,089 LOC (5%)
12 files
1 GENERAL Concerns
The "general" group contains 2 concerns.
TODOs
Unclassified
general
1.1 TODOs
5,089 LOC (5%)
12 files
The following criteria are used to filter files:
files with any line of content like ".*(TODO|FIXME)( |:| ).*".
12 files match defined criteria (5,089 lines of code, 5.4% vs. main code). All matches are in *.py files.
24 lines match the content pattern.
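The filter above can be approximated with a short script. This is a sketch under stated assumptions, not the report tool's actual implementation: it assumes the pattern is matched against each line as a whole, and the `count_todo_matches` helper and its dictionary input are hypothetical.

```python
import re

# Pattern copied from the criteria above: TODO or FIXME followed by a space or a colon.
TODO_PATTERN = re.compile(r".*(TODO|FIXME)( |:| ).*")


def count_todo_matches(files):
    """Count matching files and lines.

    files: dict mapping a file path to that file's text (hypothetical input shape).
    Returns a tuple (matching_files, matching_lines).
    """
    matching_files = 0
    matching_lines = 0
    for text in files.values():
        # Assumption: the pattern must match an entire line.
        hits = sum(1 for line in text.splitlines() if TODO_PATTERN.fullmatch(line))
        if hits:
            matching_files += 1
            matching_lines += hits
    return matching_files, matching_lines
```

Note that the pattern requires a space or colon right after the keyword, so a line ending in a bare `TODO`, or containing a word such as `TODOS`, is not counted.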
per component - primary logical decomposition
codegen_sources/model/src: in 3 files, 2,888 LOC (41%)
codegen_sources/preprocessing/dataset_modes: in 2 files, 762 LOC (64%)
codegen_sources/preprocessing/lang_processors: in 2 files, 403 LOC (47%)
codegen_sources/preprocessing/obfuscation: in 3 files, 722 LOC (90%)
codegen_sources/wrappers: in 1 file, 298 LOC (50%)
codegen_sources/preprocessing/bpe_modes: in 1 file, 16 LOC (9%)