aws-samples / data-discovery-using-glue-comprehend
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 8 files with 851 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long files (524 lines of code)
    • 0 medium size files (0 lines of codeclsfd_ftr_w_mp_ins)
    • 0 small files (0 lines of code)
    • 7 very small files (327 lines of code)
0% | 61% | 0% | 0% | 38%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
yml0% | 100% | 0% | 0% | 0%
py0% | 0% | 0% | 0% | 100%
MD0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
CloudFormation-template0% | 100% | 0% | 0% | 0%
scripts0% | 0% | 0% | 0% | 100%
QuickSight0% | 0% | 0% | 0% | 100%
Longest Files (Top 8)
File# lines# units
simplify-data-discovery-for-business-users-cf.yml
in CloudFormation-template
524 -
glue_comprehend_workflow_custom.py
in scripts
88 -
Glue_Comprehend_Job.py
in scripts
64 -
trigger_glue_crawler.py
in scripts
58 1
Inference_custom_entity_recognition.py
in scripts
46 1
Using Parameters for Filter Control.MD
in QuickSight
36 -
comprehend_create_custom_entity.py
in scripts
24 1
glue_comprehend_workflow.py
in scripts
11 1
Files With Most Units (Top 4)
File# lines# units
Inference_custom_entity_recognition.py
in scripts
46 1
glue_comprehend_workflow.py
in scripts
11 1
comprehend_create_custom_entity.py
in scripts
24 1
trigger_glue_crawler.py
in scripts
58 1
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 16 long lines.

File# lines# units# long lines
Using Parameters for Filter Control.MD
in QuickSight
36 - 7
simplify-data-discovery-for-business-users-cf.yml
in CloudFormation-template
524 - 7
glue_comprehend_workflow_custom.py
in scripts
88 - 1
Glue_Comprehend_Job.py
in scripts
64 - 1