microsoft / presidio
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
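The size categories listed above can be expressed as a small lookup. The following is a minimal Python sketch, not the analysis tool's own implementation: the names `SIZE_BUCKETS` and `classify_file_size` are hypothetical, and rejecting counts below 1 is an assumption based on the smallest category starting at 1 line.

```python
# Upper bound (in lines of code) and label for each bounded category,
# taken from the category list in this report. Names are illustrative.
SIZE_BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium size"),
    (1000, "long"),
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        # Assumption: the smallest category starts at 1 line of code.
        raise ValueError("expected at least one line of code")
    for upper_bound, label in SIZE_BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    return "very long"  # 1001 lines of code or more


print(classify_file_size(217))  # entity_recognizer.py has 217 lines
print(classify_file_size(150))  # recognizer_registry.py has 150 lines
```

Keeping the thresholds in a single list means the boundaries can be changed in one place if a project prefers different limits.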
File Size Overall
  • There are 110 files with 4,420 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 1 medium size file (217 lines of code)
    • 5 small files (712 lines of code)
    • 104 very small files (3,491 lines of code)
Share of lines of code per size category:
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
0% | 0% | 4% | 16% | 78%
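The percentages above add up to 98% rather than 100%. The figures are consistent with each category's share of the 4,420 lines of code being truncated to a whole number instead of rounded. That is an inference from the numbers in this report, not a documented behavior of the tool. A short Python sketch of the arithmetic, with illustrative names:

```python
# Lines of code per size category, as listed in "File Size Overall".
TOTAL_LINES = 4420
LINES_PER_CATEGORY = {
    "1001+": 0,
    "501-1000": 0,
    "201-500": 217,
    "101-200": 712,
    "1-100": 3491,
}


def truncated_share(lines: int, total: int) -> int:
    """Percentage of `total` made up by `lines`, truncated to an integer."""
    return lines * 100 // total


shares = {
    category: truncated_share(lines, TOTAL_LINES)
    for category, lines in LINES_PER_CATEGORY.items()
}
print(" | ".join(f"{share}%" for share in shares.values()))
# prints: 0% | 0% | 4% | 16% | 78%
```

Rounding to the nearest whole number would instead give 5%, 16% and 79% for the three non-empty categories.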


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
py | 0% | 0% | 5% | 14% | 79%
MD | 0% | 0% | 0% | 45% | 54%
yml | 0% | 0% | 0% | 0% | 100%
yaml | 0% | 0% | 0% | 0% | 100%
cfg | 0% | 0% | 0% | 0% | 100%
html | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
presidio-analyzer/presidio_analyzer | 0% | 0% | 31% | 39% | 29%
presidio-anonymizer | 0% | 0% | 0% | 52% | 47%
presidio-analyzer/presidio_analyzer/recognizer_registry | 0% | 0% | 0% | 98% | 1%
presidio-analyzer/presidio_analyzer/predefined_recognizers | 0% | 0% | 0% | 10% | 89%
presidio-anonymizer/presidio_anonymizer/operators | 0% | 0% | 0% | 0% | 100%
ROOT | 0% | 0% | 0% | 0% | 100%
presidio-analyzer | 0% | 0% | 0% | 0% | 100%
presidio-anonymizer/presidio_anonymizer/entities | 0% | 0% | 0% | 0% | 100%
presidio-analyzer/presidio_analyzer/nlp_engine | 0% | 0% | 0% | 0% | 100%
presidio-image-redactor | 0% | 0% | 0% | 0% | 100%
presidio-image-redactor/presidio_image_redactor | 0% | 0% | 0% | 0% | 100%
presidio-anonymizer/presidio_anonymizer/core | 0% | 0% | 0% | 0% | 100%
presidio-anonymizer/presidio_anonymizer/services | 0% | 0% | 0% | 0% | 100%
presidio-anonymizer/presidio_anonymizer | 0% | 0% | 0% | 0% | 100%
presidio-image-redactor/presidio_image_redactor/entities | 0% | 0% | 0% | 0% | 100%
presidio-analyzer/conf | 0% | 0% | 0% | 0% | 100%
overrides | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Location | # lines | # units
entity_recognizer.py | presidio-analyzer/presidio_analyzer | 217 | 17
recognizer_registry.py | presidio-analyzer/presidio_analyzer/recognizer_registry | 150 | 6
README.MD | presidio-anonymizer | 150 | -
pattern_recognizer.py | presidio-analyzer/presidio_analyzer | 147 | 10
iban_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 147 | 8
analyzer_engine.py | presidio-analyzer/presidio_analyzer | 118 | 6
date_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 97 | 2
app.py | presidio-analyzer | 95 | 1
iban_patterns.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 88 | -
app.py | presidio-anonymizer | 85 | 1
engine_base.py | presidio-anonymizer/presidio_anonymizer/core | 80 | 4
nlp_engine_provider.py | presidio-analyzer/presidio_analyzer/nlp_engine | 78 | 4
image_analyzer_engine.py | presidio-image-redactor/presidio_image_redactor | 76 | 3
README.MD | presidio-image-redactor | 74 | -
spacy_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 73 | 5
mkdocs.yml | root | 72 | -
phone_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 66 | 7
azure-pipelines-ci.yml | root | 65 | -
recognizer_result.py | presidio-analyzer/presidio_analyzer | 65 | 14
credit_card_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 62 | 4
README.MD | root | 59 | -
au_abn_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 57 | 3
au_acn_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 57 | 3
anonymizer_engine.py | presidio-anonymizer/presidio_anonymizer | 57 | 6
au_tfn_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 56 | 3
validators.py | presidio-anonymizer/presidio_anonymizer/services | 56 | 6
__init__.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 55 | -
au_medicare_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 55 | 3
medical_license_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 54 | 4
aba_routing_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 53 | 4
us_ssn_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 53 | 2
operators_factory.py | presidio-anonymizer/presidio_anonymizer/operators | 53 | 6
uk_nhs_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 48 | 3
app.py | presidio-image-redactor | 48 | 1
us_driver_license_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 46 | 1
setup.py | presidio-analyzer | 46 | -
install_nlp_models.py | presidio-analyzer | 45 | 2
image_pii_verify_engine.py | presidio-image-redactor/presidio_image_redactor | 45 | 3
pii_entity.py | presidio-anonymizer/presidio_anonymizer/entities/engine | 44 | 5
recognizer_result.py | presidio-anonymizer/presidio_anonymizer/entities/engine | 44 | 9
setup.py | presidio-image-redactor | 44 | -
setup.py | presidio-anonymizer | 43 | -
nlp_artifacts.py | presidio-analyzer/presidio_analyzer/nlp_engine | 42 | 3
operator_result.py | presidio-anonymizer/presidio_anonymizer/entities/engine/result | 42 | 6
mask.py | presidio-anonymizer/presidio_anonymizer/operators | 40 | 6
spacy_nlp_engine.py | presidio-analyzer/presidio_analyzer/nlp_engine | 39 | 6
es_nif_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 39 | 3
crypto_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 36 | 3
us_itin_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 36 | 1
analysis_explanation.py | presidio-analyzer/presidio_analyzer | 34 | 6
Files With Most Units (Top 20)
File | Location | # lines | # units
entity_recognizer.py | presidio-analyzer/presidio_analyzer | 217 | 17
recognizer_result.py | presidio-analyzer/presidio_analyzer | 65 | 14
pattern_recognizer.py | presidio-analyzer/presidio_analyzer | 147 | 10
recognizer_result.py | presidio-anonymizer/presidio_anonymizer/entities/engine | 44 | 9
iban_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 147 | 8
phone_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 66 | 7
engine_result.py | presidio-anonymizer/presidio_anonymizer/entities/engine/result | 29 | 7
analysis_explanation.py | presidio-analyzer/presidio_analyzer | 34 | 6
analyzer_engine.py | presidio-analyzer/presidio_analyzer | 118 | 6
spacy_nlp_engine.py | presidio-analyzer/presidio_analyzer/nlp_engine | 39 | 6
recognizer_registry.py | presidio-analyzer/presidio_analyzer/recognizer_registry | 150 | 6
anonymizer_engine.py | presidio-anonymizer/presidio_anonymizer | 57 | 6
operator_result.py | presidio-anonymizer/presidio_anonymizer/entities/engine/result | 42 | 6
mask.py | presidio-anonymizer/presidio_anonymizer/operators | 40 | 6
operators_factory.py | presidio-anonymizer/presidio_anonymizer/operators | 53 | 6
validators.py | presidio-anonymizer/presidio_anonymizer/services | 56 | 6
pattern.py | presidio-analyzer/presidio_analyzer | 17 | 5
spacy_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 73 | 5
text_replace_builder.py | presidio-anonymizer/presidio_anonymizer/core | 32 | 5
operator_config.py | presidio-anonymizer/presidio_anonymizer/entities/engine | 26 | 5
Files With Long Lines (Top 11)

There are 11 files with lines longer than 120 characters. In total, there are 32 long lines.

File | Location | # lines | # units | # long lines
README.MD | root | 59 | - | 12
SECURITY.MD | root | 19 | - | 5
README.MD | presidio-anonymizer | 150 | - | 4
README.MD | presidio-analyzer | 25 | - | 2
ip_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 31 | 1 | 2
README.MD | presidio-image-redactor | 74 | - | 2
domain_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 30 | 2 | 1
email_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 30 | 2 | 1
iban_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 147 | 8 | 1
us_driver_license_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 46 | 1 | 1
us_itin_recognizer.py | presidio-analyzer/presidio_analyzer/predefined_recognizers | 36 | 1 | 1
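
A count of long lines like the one in this section can be approximated with a few lines of Python. This is a sketch under stated assumptions, not the report tool's implementation: the helper names are hypothetical, line length is measured in characters without the trailing newline, and files are assumed to be UTF-8 text.

```python
from pathlib import Path

# Threshold used in this report: lines longer than 120 characters.
MAX_LINE_LENGTH = 120


def count_long_lines(text: str, limit: int = MAX_LINE_LENGTH) -> int:
    """Count the lines in `text` that are longer than `limit` characters."""
    return sum(1 for line in text.splitlines() if len(line) > limit)


def count_long_lines_in_file(path: str, limit: int = MAX_LINE_LENGTH) -> int:
    """Count long lines in a file, replacing undecodable bytes if needed."""
    text = Path(path).read_text(encoding="utf-8", errors="replace")
    return count_long_lines(text, limit)


# One line of exactly 120 characters (not long) and one of 121 (long).
sample = "short line\n" + "x" * 120 + "\n" + "y" * 121 + "\n"
print(count_long_lines(sample))
```

Note that a line of exactly 120 characters is not counted, since the report speaks of lines longer than 120 characters.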