microsoft / data-accelerator
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show the distribution of file sizes.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), and 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such gargantuan proportions that it is hard to work with.
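The size categories above can be sketched as a small lookup. This is a minimal illustration, not the report tool's own implementation: the function name `classify_file_size` and the table `SIZE_CATEGORIES` are hypothetical, and how lines of code are counted (for example, whether blank lines and comments are excluded) is left to the caller.

```python
# Upper bound (inclusive, in lines of code) and label for each category
# listed in the report. Anything above the last bound is "very long".
SIZE_CATEGORIES = [
    (100, "very small"),
    (200, "small"),
    (500, "medium size"),
    (1000, "long"),
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the report's size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("lines_of_code must be at least 1")
    for upper_bound, label in SIZE_CATEGORIES:
        if lines_of_code <= upper_bound:
            return label
    return "very long"


if __name__ == "__main__":
    # Line counts taken from the tables in this report.
    samples = {
        "flowDefinitionPanel.jsx": 1026,
        "flowActions.js": 834,
        "db.js": 206,
        "SettingDictionary.scala": 87,
    }
    for name, loc in samples.items():
        print(f"{name}: {loc} lines -> {classify_file_size(loc)}")
```

Keeping the boundaries in one table makes it easy to adjust the thresholds if a team prefers stricter limits.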
File Size Overall
  • There are 709 files with 50,926 lines of code.
    • 1 very long file (1,026 lines of code)
    • 10 long files (6,571 lines of code)
    • 41 medium size files (12,055 lines of code)
    • 85 small files (12,080 lines of code)
    • 572 very small files (19,194 lines of code)
2% | 12% | 23% | 23% | 37%
Legend: 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
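The percentage bar appears to express each category's share of the total lines of code rather than its share of the file count. The sketch below reproduces the figures from the line counts listed above. The truncation to whole percentages is an assumption inferred from the numbers matching, not documented behavior of the report tool, and the names `BUCKET_LOC` and `bucket_shares` are hypothetical.

```python
# Lines of code per size category, as listed under "File Size Overall".
BUCKET_LOC = {
    "1001+": 1026,
    "501-1000": 6571,
    "201-500": 12055,
    "101-200": 12080,
    "1-100": 19194,
}


def bucket_shares(loc_by_bucket: dict) -> dict:
    """Return each bucket's share of total lines of code as a whole percentage.

    Assumption: shares are truncated (floored), not rounded.
    """
    total = sum(loc_by_bucket.values())
    return {bucket: loc * 100 // total for bucket, loc in loc_by_bucket.items()}


if __name__ == "__main__":
    shares = bucket_shares(BUCKET_LOC)
    print(" | ".join(f"{pct}%" for pct in shares.values()))
```

Because the shares are truncated, they need not add up to exactly 100%.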


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
jsx | 9% | 11% | 23% | 35% | 19%
js | 0% | 24% | 37% | 8% | 30%
cs | 0% | 7% | 24% | 22% | 45%
ps1 | 0% | 38% | 23% | 28% | 10%
psm1 | 0% | 72% | 27% | 0% | 0%
scala | 0% | 8% | 8% | 23% | 59%
yaml | 0% | 0% | 0% | 41% | 58%
css | 0% | 0% | 0% | 46% | 53%
sfproj | 0% | 0% | 0% | 0% | 100%
py | 0% | 0% | 0% | 0% | 100%
cmd | 0% | 0% | 0% | 0% | 100%
html | 0% | 0% | 0% | 0% | 100%
yml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
Website/Packages | 6% | 18% | 28% | 25% | 21%
Services/DataX.Flow | 0% | 20% | 27% | 26% | 25%
DeploymentCloud/Deployment.Common | 0% | 94% | 0% | 0% | 5%
DataProcessing/datax-host | 0% | 10% | 10% | 25% | 53%
Services/DataX.Config | 0% | 0% | 15% | 20% | 64%
Services/DataX.Utilities | 0% | 0% | 38% | 25% | 35%
Services/DataX.SimulatedData | 0% | 0% | 49% | 29% | 20%
Website/Website | 0% | 0% | 27% | 24% | 48%
Services/DataX.Gateway | 0% | 0% | 40% | 35% | 24%
DeploymentCloud/Deployment.DataX | 0% | 0% | 71% | 0% | 28%
Services/JobRunner | 0% | 0% | 29% | 12% | 58%
DeploymentCloud/Deployment.JobRunner | 0% | 0% | 70% | 0% | 29%
Services/DataX.Metrics | 0% | 0% | 22% | 49% | 27%
DeploymentCloud/Deployment.Kubernetes | 0% | 0% | 0% | 41% | 58%
DataProcessing/datax-utility | 0% | 0% | 0% | 28% | 71%
Services/DataX.ServiceHost | 0% | 0% | 0% | 16% | 83%
DataProcessing/datax-core | 0% | 0% | 0% | 0% | 100%
Services/DataX.Contract | 0% | 0% | 0% | 0% | 100%
Services/AspnetCore | 0% | 0% | 0% | 0% | 100%
DataProcessing/datax-udf-samples | 0% | 0% | 0% | 0% | 100%
DataProcessing/datax-keyvault | 0% | 0% | 0% | 0% | 100%
DataProcessing | 0% | 0% | 0% | 0% | 100%
Services/JobRunnerWebJob | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
flowDefinitionPanel.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components
1026 -
flowActions.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
834 18
flowModels.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
834 13
deployResources.ps1
in DeploymentCloud/Deployment.Common
806 -
inputSettingsContent.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/input
769 -
KernelService.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery
628 23
UtilityModule.psm1
in DeploymentCloud/Deployment.Common/Helpers
616 -
querySettingsContent.jsx
in Website/Packages/datax-query/src/modules/query/components
543 -
Engine.cs
in Services/DataX.Flow/DataX.Flow.CodegenRules
525 18
HadoopClient.scala
in DataProcessing/datax-host/src/main/scala/datax/fs
508 33
Formatter.cs
in Services/DataX.Flow/DataX.Flow.CodegenRules
508 33
metrics.datasource.js
in Website/Packages/datax-metrics/src/modules/metrics
491 21
CommonProcessorFactory.scala
in DataProcessing/datax-host/src/main/scala/datax/processor
488 1
flowSelectors.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
487 27
FlowManagementController.cs
in Services/DataX.Flow/Flow.ManagementService/Controllers
470 23
flowHelpers.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
445 31
InteractiveQueryManager.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery
377 17
GatewayController.cs
in Services/DataX.Gateway/DataX.Gateway.Api/Controllers
359 12
outputSettingsContent.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/output
343 -
auth.js
in Website/Website
337 18
SqlParser.cs
in Services/DataX.Flow/DataX.Flow.SqlParser
336 12
S500_ResolveOutputs.cs
in Services/DataX.Config/DataX.Config/ConfigGeneration/Processor
325 12
queryBuilder.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/rule
319 -
composition.js
in Website/Packages/datax-home/src/modules/home
313 -
flowReducer.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
306 1
HDInsightKernel.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery/HDInsight
302 15
DataGenService.cs
in Services/DataX.SimulatedData/DataX.SimulatedData.DataGenService
296 8
rulesSettingsContent.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/rule
291 -
scheduleSettingsContent.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/schedule
290 -
JsonConfig.cs
in Services/DataX.Config/DataX.Config/ConfigDataModel
287 19
deploySample.ps1
in DeploymentCloud/Deployment.DataX
285 -
functionSettingsContent.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/function
273 -
sparkJobsList.jsx
in Website/Packages/datax-jobs/src/modules/jobs/components
271 -
S600_GenerateJobConfigBatch.cs
in Services/DataX.Config/DataX.Config/ConfigGeneration/Processor
269 13
DataGen.cs
in Services/DataX.SimulatedData/DataX.SimulatedData.DataGenService
261 9
Helper.cs
in Services/DataX.Flow/DataX.Flow.Common
255 20
Rule.cs
in Services/DataX.Flow/DataX.Flow.CodegenRules
248 7
referenceDataSettingsContent.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/referenceData
244 -
Engine.cs
in Services/DataX.Flow/DataX.Flow.SchemaInference
242 5
ConfigHelper.cs
in Services/DataX.Config/DataX.Config/Utility
240 14
JobRunner.cs
in Services/JobRunner
240 11
StorageCreator.cs
in Services/DataX.Utilities/DataX.Utilities.Blob
236 21
CosmosDBUtility.cs
in Services/DataX.Utilities/DataX.Utility.CosmosDB
236 10
utilities.psm1
in DeploymentCloud/Deployment.JobRunner
230 -
azureFunctionSettings.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/function
215 -
Deploy-FabricApplication.ps1
in Services/DataX.Metrics/DataX.Metrics/Scripts
211 -
udfSettings.jsx
in Website/Packages/datax-pipeline/src/modules/flowDefinition/components/function
209 -
BlobHelper.cs
in Services/DataX.Utilities/DataX.Utilities.Blob
208 10
EventHubUtil.cs
in Services/DataX.Utilities/DataX.Utilities.EventHub
206 8
db.js
in Website/Website/db
206 20
Files With Most Units (Top 20)
File | # lines | # units
HadoopClient.scala
in DataProcessing/datax-host/src/main/scala/datax/fs
508 33
Formatter.cs
in Services/DataX.Flow/DataX.Flow.CodegenRules
508 33
flowHelpers.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
445 31
flowSelectors.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
487 27
KernelService.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery
628 23
FlowManagementController.cs
in Services/DataX.Flow/Flow.ManagementService/Controllers
470 23
StorageCreator.cs
in Services/DataX.Utilities/DataX.Utilities.Blob
236 21
metrics.datasource.js
in Website/Packages/datax-metrics/src/modules/metrics
491 21
Helper.cs
in Services/DataX.Flow/DataX.Flow.Common
255 20
db.js
in Website/Website/db
206 20
JsonConfig.cs
in Services/DataX.Config/DataX.Config/ConfigDataModel
287 19
Engine.cs
in Services/DataX.Flow/DataX.Flow.CodegenRules
525 18
flowActions.js
in Website/Packages/datax-pipeline/src/modules/flowDefinition
834 18
auth.js
in Website/Website
337 18
InteractiveQueryManager.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery
377 17
FlowDataManager.cs
in Services/DataX.Config/DataX.Config/InternalService
143 16
FlowOperation.cs
in Services/DataX.Config/DataX.Config/PublicService
169 16
SettingDictionary.scala
in DataProcessing/datax-core/src/main/scala/datax/config
87 15
SparkJobOperation.cs
in Services/DataX.Config/DataX.Config/InternalService
204 15
HDInsightKernel.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery/HDInsight
302 15
Files With Long Lines (Top 20)

There are 287 files with lines longer than 120 characters. In total, there are 1,916 long lines.
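A count like the one above can be approximated with a few lines of code. This is a minimal sketch under stated assumptions: the helper names `count_long_lines` and `summarize_long_lines` are hypothetical, line length is taken as the number of characters excluding the line terminator, and tabs count as one character. The report tool may measure lines differently.

```python
MAX_LINE_LENGTH = 120  # threshold used in this report


def count_long_lines(text: str, max_length: int = MAX_LINE_LENGTH) -> int:
    """Count the lines in `text` that are longer than `max_length` characters."""
    return sum(1 for line in text.splitlines() if len(line) > max_length)


def summarize_long_lines(sources: dict, max_length: int = MAX_LINE_LENGTH) -> tuple:
    """Return (files with at least one long line, total long lines).

    `sources` maps a file name to its full text content.
    """
    counts = [count_long_lines(text, max_length) for text in sources.values()]
    files_with_long_lines = sum(1 for count in counts if count > 0)
    return files_with_long_lines, sum(counts)


if __name__ == "__main__":
    # Tiny in-memory example; real usage would read file contents from disk.
    example = {
        "short.cs": "int x = 1;\nint y = 2;\n",
        "long.cs": "// " + "x" * 130 + "\nint z = 3;\n",
    }
    files, lines = summarize_long_lines(example)
    print(f"{files} file(s) with long lines, {lines} long line(s) in total")
```

Files such as `GlobalSuppressions.cs`, where nearly every line exceeds the limit, dominate this kind of ranking, so it can be useful to exclude generated or suppression files before acting on the numbers.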

File | # lines | # units | # long lines
GlobalSuppressions.cs
in Services/DataX.Config/DataX.Config
197 - 196
GlobalSuppressions.cs
in Services/DataX.Flow/DataX.Flow.Common
74 - 73
GlobalSuppressions.cs
in Services/DataX.Flow/DataX.Flow.CodegenRules
60 - 59
GlobalSuppressions.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery
55 - 54
deployResources.ps1
in DeploymentCloud/Deployment.Common
806 - 40
ConfigDeleter.cs
in Services/DataX.Flow/DataX.Flow.DeleteHelper
178 2 31
DataGenService.cs
in Services/DataX.SimulatedData/DataX.SimulatedData.DataGenService
296 8 31
Engine.cs
in Services/DataX.Flow/DataX.Flow.CodegenRules
525 18 29
GlobalSuppressions.cs
in Services/DataX.Flow/Flow.ManagementService
29 - 28
DataGen.cs
in Services/DataX.SimulatedData/DataX.SimulatedData.DataGenService
261 9 27
KernelService.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery
628 23 26
GlobalSuppressions.cs
in Services/DataX.SimulatedData/DataX.SimulatedData.DataGenService
27 - 26
deploySample.ps1
in DeploymentCloud/Deployment.DataX
285 - 25
InteractiveQueryManager.cs
in Services/DataX.Flow/DataX.Flow.InteractiveQuery
377 17 25
S500_ResolveOutputs.cs
in Services/DataX.Config/DataX.Config/ConfigGeneration/Processor
325 12 24
GlobalSuppressions.cs
in Services/DataX.Config/DataX.Config.Local
24 - 23
GlobalSuppressions.cs
in Services/DataX.Metrics/DataX.Metrics.Ingestor
24 - 23
SparkJarLoader.scala
in DataProcessing/datax-host/src/main/scala/datax/host
161 7 22
CommonProcessorFactory.scala
in DataProcessing/datax-host/src/main/scala/datax/processor
488 1 22
UtilityModule.psm1
in DeploymentCloud/Deployment.Common/Helpers
616 - 21