microsoft / ai4eutils
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 34 files with 3,205 lines of code.
    • 0 very long files (0 lines of code)
    • 0 long files (0 lines of code)
    • 4 medium size files (885 lines of codeclsfd_ftr_w_mp_ins)
    • 11 small files (1,499 lines of code)
    • 19 very small files (821 lines of code)
0% | 0% | 27% | 46% | 25%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 27% | 46% | 25%
yml0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
ROOT0% | 0% | 43% | 34% | 21%
geospatial/visualization0% | 0% | 0% | 100% | 0%
geospatial/data0% | 0% | 0% | 81% | 18%
azure-sdk-calc-storage-size0% | 0% | 0% | 52% | 47%
geospatial/preprocessing0% | 0% | 0% | 100% | 0%
azure-metrics-calc-storage-size0% | 0% | 0% | 0% | 100%
geospatial/model_scoring0% | 0% | 0% | 0% | 100%
geospatial0% | 0% | 0% | 0% | 100%
TF_OD_API0% | 0% | 0% | 0% | 100%
geospatial/models0% | 0% | 0% | 0% | 100%
Longest Files (Top 34)
File# lines# units
parallel_change_blob_access_tier.py
in root
240 14
parallel_enumerate_blobs.py
in root
218 13
gDrive_download.py
in root
215 4
sas_blob_utils.py
in root
212 20
parallel_delete_blobs.py
in root
192 13
imagery_visualizer.py
in geospatial/visualization
164 7
raster_label_visualizer.py
in geospatial/visualization
164 13
parallel_enumerate_containers.py
in root
160 10
directory_listing.py
in root
148 4
azure-data.py
in azure-sdk-calc-storage-size
130 8
StreamingDatasets.py
in geospatial/data
120 4
create_label_masks.py
in geospatial/preprocessing
110 6
ai4e_azure_utils.py
in root
104 7
write_html_image_list.py
in root
104 1
NAIPTileIndex.py
in geospatial/data
103 4
path_utils.py
in root
92 11
metrics-data.py
in azure-metrics-calc-storage-size
89 2
enumerate_blob_folders_at_depth.py
in root
79 1
back_up_blob_containers.py
in root
72 1
url_utils.py
in root
52 4
custom_logging.py
in azure-sdk-calc-storage-size
51 9
run_inference_in_parallel.py
in geospatial/model_scoring
50 1
utils.py
in azure-metrics-calc-storage-size
49 4
utils.py
in azure-sdk-calc-storage-size
49 4
TileDatasets.py
in geospatial/data
49 3
matlab_porting_tools.py
in root
45 5
ai4e_web_utils.py
in root
37 3
ai4e_string_utils.py
in root
33 2
remove_empty_folders.py
in root
30 1
cleanup-logs.py
in azure-sdk-calc-storage-size
18 1
environment.yml
in geospatial
8 -
reformat_tfodapi_config.py
in TF_OD_API
7 -
enums.py
in geospatial
6 -
segmentation_model.py
in geospatial/models/segmentation
5 -
Files With Most Units (Top 20)
File# lines# units
sas_blob_utils.py
in root
212 20
parallel_change_blob_access_tier.py
in root
240 14
raster_label_visualizer.py
in geospatial/visualization
164 13
parallel_delete_blobs.py
in root
192 13
parallel_enumerate_blobs.py
in root
218 13
path_utils.py
in root
92 11
parallel_enumerate_containers.py
in root
160 10
custom_logging.py
in azure-sdk-calc-storage-size
51 9
azure-data.py
in azure-sdk-calc-storage-size
130 8
ai4e_azure_utils.py
in root
104 7
imagery_visualizer.py
in geospatial/visualization
164 7
create_label_masks.py
in geospatial/preprocessing
110 6
matlab_porting_tools.py
in root
45 5
utils.py
in azure-metrics-calc-storage-size
49 4
utils.py
in azure-sdk-calc-storage-size
49 4
directory_listing.py
in root
148 4
gDrive_download.py
in root
215 4
NAIPTileIndex.py
in geospatial/data
103 4
StreamingDatasets.py
in geospatial/data
120 4
url_utils.py
in root
52 4
Files With Long Lines (Top 14)

There are 14 files with lines longer than 120 characters. In total, there are 22 long lines.

File# lines# units# long lines
NAIPTileIndex.py
in geospatial/data
103 4 5
directory_listing.py
in root
148 4 3
TileDatasets.py
in geospatial/data
49 3 3
metrics-data.py
in azure-metrics-calc-storage-size
89 2 1
azure-data.py
in azure-sdk-calc-storage-size
130 8 1
custom_logging.py
in azure-sdk-calc-storage-size
51 9 1
enumerate_blob_folders_at_depth.py
in root
79 1 1
gDrive_download.py
in root
215 4 1
StreamingDatasets.py
in geospatial/data
120 4 1
run_inference_in_parallel.py
in geospatial/model_scoring
50 1 1
create_label_masks.py
in geospatial/preprocessing
110 6 1
parallel_change_blob_access_tier.py
in root
240 14 1
parallel_enumerate_blobs.py
in root
218 13 1
write_html_image_list.py
in root
104 1 1