microsoft / SynapseML
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show the distribution of file sizes.
  • Files are classified into five categories based on their size in lines of code: 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), and 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such proportions that it is hard to work with.
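The size categories listed above can be expressed as a small lookup. The following is a minimal sketch, not the analysis tool's own code; the names `SIZE_BUCKETS` and `classify_file_size` are hypothetical and chosen only for illustration.

```python
# Upper bounds (in lines of code) and labels, taken from the category list above.
# SIZE_BUCKETS and classify_file_size are illustrative names, not part of any tool.
SIZE_BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium size"),
    (1000, "long"),
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the size category for a file with the given number of lines of code."""
    if lines_of_code < 1:
        raise ValueError("lines_of_code must be at least 1")
    for upper_bound, label in SIZE_BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    # Anything above the last upper bound falls into the 1001+ category.
    return "very long"
```

For example, a file of 769 lines would be classified as "long" and a file of 468 lines as "medium size".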
File Size Overall
  • There are 394 files with 34,173 lines of code.
    • 0 very long files (0 lines of code)
    • 4 long files (2,525 lines of code)
    • 37 medium size files (11,838 lines of code)
    • 75 small files (10,007 lines of code)
    • 278 very small files (9,803 lines of code)
Share of lines of code by file size category:
1001+: 0% | 501-1000: 7% | 201-500: 34% | 101-200: 29% | 1-100: 28%
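The percentages appear to be each category's share of the 34,173 total lines of code. The sketch below recomputes them from the line counts listed above. It assumes the shares are truncated to whole percentages rather than rounded. That assumption is inferred from the figures (they sum to 98%), not stated in the report, and the function name `category_shares` is hypothetical.

```python
# Lines of code per size category, copied from the overall figures above.
LINES_PER_CATEGORY = {
    "1001+": 0,
    "501-1000": 2525,
    "201-500": 11838,
    "101-200": 10007,
    "1-100": 9803,
}


def category_shares(lines_per_category: dict) -> dict:
    """Return each category's share of the total lines of code as a whole percentage."""
    total = sum(lines_per_category.values())
    if total == 0:
        return {name: 0 for name in lines_per_category}
    # Assumption: shares are truncated (integer division), not rounded.
    return {name: lines * 100 // total for name, lines in lines_per_category.items()}


shares = category_shares(LINES_PER_CATEGORY)
```

Under the truncation assumption, `shares` reproduces the row of 0%, 7%, 34%, 29%, and 28%.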


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
scala | 0% | 3% | 39% | 30% | 26%
py | 0% | 26% | 8% | 19% | 46%
yaml | 0% | 92% | 0% | 0% | 7%
js | 0% | 0% | 31% | 38% | 29%
css | 0% | 0% | 0% | 40% | 59%
sbt | 0% | 0% | 0% | 91% | 8%
Dockerfile | 0% | 0% | 0% | 48% | 51%
java | 0% | 0% | 0% | 0% | 100%
R | 0% | 0% | 0% | 0% | 100%
yml | 0% | 0% | 0% | 0% | 100%
in | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
core | 0% | 7% | 20% | 37% | 34%
ROOT | 0% | 77% | 0% | 16% | 6%
deep-learning | 0% | 28% | 22% | 34% | 13%
cognitive | 0% | 0% | 64% | 17% | 18%
lightgbm | 0% | 0% | 63% | 20% | 16%
vw | 0% | 0% | 40% | 10% | 49%
opencv | 0% | 0% | 77% | 0% | 22%
website | 0% | 0% | 23% | 38% | 37%
project | 0% | 0% | 45% | 23% | 30%
tools | 0% | 0% | 0% | 33% | 66%
templates | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
collaborative_filtering.py
in core/src/main/python/synapse/ml/cyber/anomaly
769 34
pipeline.yaml
in root
635 -
HTTPSourceV2.scala
in core/src/main/scala/org/apache/spark/sql/execution/streaming/continuous
599 45
ONNXModel.scala
in deep-learning/src/main/scala/com/microsoft/azure/synapse/ml/onnx
522 27
SpeechToTextSDK.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
468 16
TextAnalytics.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
468 24
ImageTransformer.scala
in opencv/src/main/scala/com/microsoft/azure/synapse/ml/opencv
461 33
ComputerVision.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
458 29
Wrappable.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/codegen
450 13
CNTKModel.scala
in deep-learning/src/main/scala/com/microsoft/azure/synapse/ml/cntk
414 18
LightGBMBooster.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm/booster
410 41
index.js
in website/src/pages
409 3
TextTranslator.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
395 20
ComputeModelStatistics.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/train
391 21
DatasetAggregator.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm/dataset
390 34
MultivariateAnomalyDetection.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
380 18
LightGBMBase.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm
367 16
VowpalWabbitBase.scala
in vw/src/main/scala/com/microsoft/azure/synapse/ml/vw
362 11
LightGBMParams.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm/params
338 10
DistributedHTTPSource.scala
in core/src/main/scala/org/apache/spark/sql/execution/streaming
326 27
TextAnalyticsSDK.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
307 14
TrainUtils.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm
299 17
AzureSearch.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
294 21
TextAnalyticsSDKSchemas.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
286 24
HTTPSchema.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/io/http
280 20
VowpalWabbitContextualBandit.scala
in vw/src/main/scala/com/microsoft/azure/synapse/ml/vw
278 16
BingImageSearch.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
272 3
CognitiveServiceBase.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
267 24
Superpixel.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/lime
263 17
Face.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
262 8
RankingTrainValidationSplit.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/recommendation
252 11
FormRecognizer.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
247 17
scalers.py
in core/src/main/python/synapse/ml/cyber/feature
246 26
LIME.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/lime
246 13
TrainClassifier.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/train
245 9
TextFeaturizer.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/featurize/text
234 8
TrainParams.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm/params
227 2
CodegenPlugin.scala
in project
220 1
BallTree.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/nn
212 20
ICEExplainer.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/explainers
210 9
Parsers.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/io/http
204 21
BinaryFileFormat.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/io/binary
200 12
Categoricals.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/core/schema
199 11
AnomalyDetection.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
186 11
Sampler.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/explainers
179 11
ModelDownloader.scala
in deep-learning/src/main/scala/com/microsoft/azure/synapse/ml/downloader
178 14
FeatureBalanceMeasure.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/exploratory
177 5
Featurize.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/featurize
175 4
TuneHyperparameters.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/automl
174 5
UnrollImage.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/image
172 9
Files With Most Units (Top 20)
File | # lines | # units
HTTPSourceV2.scala
in core/src/main/scala/org/apache/spark/sql/execution/streaming/continuous
599 45
LightGBMBooster.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm/booster
410 41
collaborative_filtering.py
in core/src/main/python/synapse/ml/cyber/anomaly
769 34
DatasetAggregator.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm/dataset
390 34
ImageTransformer.scala
in opencv/src/main/scala/com/microsoft/azure/synapse/ml/opencv
461 33
ComputerVision.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
458 29
DistributedHTTPSource.scala
in core/src/main/scala/org/apache/spark/sql/execution/streaming
326 27
ONNXModel.scala
in deep-learning/src/main/scala/com/microsoft/azure/synapse/ml/onnx
522 27
scalers.py
in core/src/main/python/synapse/ml/cyber/feature
246 26
CognitiveServiceBase.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
267 24
TextAnalytics.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
468 24
TextAnalyticsSDKSchemas.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
286 24
CNTKModel.py
in deep-learning/src/main/python/synapse/ml/cntk
85 24
AzureSearch.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
294 21
Parsers.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/io/http
204 21
ComputeModelStatistics.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/train
391 21
ONNXModel.py
in deep-learning/src/main/python/synapse/ml/onnx
101 21
TextTranslator.scala
in cognitive/src/main/scala/com/microsoft/azure/synapse/ml/cognitive
395 20
HTTPSchema.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/io/http
280 20
BallTree.scala
in core/src/main/scala/com/microsoft/azure/synapse/ml/nn
212 20
Files With Long Lines (Top 12)

There are 12 files with lines longer than 120 characters. In total, there are 32 long lines.

File | # lines | # units | # long lines
pipeline.yaml
in root
635 - 7
index.js
in website/src/pages
409 3 5
custom.css
in website/src/css
157 - 4
index.js
in website/src/theme/FeatureCards
101 1 4
HyperparamBuilder.py
in core/src/main/python/synapse/ml/automl
53 11 2
LightGBMBooster.scala
in lightgbm/src/main/scala/com/microsoft/azure/synapse/ml/lightgbm/booster
410 41 2
mini.Dockerfile
in tools/helm/livy
41 - 2
docusaurus.config.js
in website
159 3 2
Utils.py
in core/src/main/python/synapse/ml/core/schema
54 5 1
collaborative_filtering.py
in core/src/main/python/synapse/ml/cyber/anomaly
769 34 1
mini.Dockerfile
in tools/helm/spark
85 - 1
videos.js
in website/src/pages
121 2 1
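The long-line counts in this table can be approximated with a short script. This is a minimal sketch that counts lines longer than the 120-character threshold stated above; how the report itself treats tabs, trailing whitespace, or multi-byte characters is not stated, so the results may differ slightly. The names `MAX_LINE_LENGTH` and `count_long_lines` are hypothetical.

```python
MAX_LINE_LENGTH = 120  # threshold stated in the report


def count_long_lines(text: str, max_length: int = MAX_LINE_LENGTH) -> int:
    """Count the lines in `text` that are longer than `max_length` characters."""
    # Assumption: length is measured in characters, excluding the line terminator.
    return sum(1 for line in text.splitlines() if len(line) > max_length)


# Small in-memory example: one line at the limit, one over it, one short line.
sample = "a" * 120 + "\n" + "b" * 121 + "\nshort line\n"
long_line_count = count_long_lines(sample)
```

To apply it to a real file, read the file's contents into a string first (for example with `pathlib.Path(path).read_text()`) and pass the result to `count_long_lines`.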