microsoft / Mobius
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 232 files with 30,075 lines of code.
    • 1 very long files (1,363 lines of code)
    • 9 long files (5,512 lines of code)
    • 32 medium size files (10,023 lines of codeclsfd_ftr_w_mp_ins)
    • 44 small files (6,401 lines of code)
    • 146 very small files (6,776 lines of code)
4% | 18% | 33% | 21% | 22%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
cs5% | 16% | 34% | 22% | 21%
ps10% | 56% | 0% | 11% | 32%
scala0% | 16% | 48% | 25% | 10%
cpp0% | 100% | 0% | 0% | 0%
cmd0% | 0% | 32% | 0% | 67%
h0% | 0% | 0% | 0% | 100%
py0% | 0% | 0% | 0% | 100%
vcxproj0% | 0% | 0% | 0% | 100%
yml0% | 0% | 0% | 0% | 100%
fsx0% | 0% | 0% | 0% | 100%
java0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
csharp/AdapterTest18% | 7% | 31% | 24% | 17%
csharp/Adapter0% | 19% | 35% | 21% | 23%
csharp/WorkerTest0% | 72% | 26% | 0% | 1%
build/localmode0% | 65% | 0% | 0% | 34%
scala/src0% | 16% | 50% | 26% | 6%
cpp/Riosock0% | 68% | 0% | 0% | 31%
csharp/Worker0% | 0% | 71% | 25% | 3%
build0% | 0% | 83% | 0% | 16%
csharp/Repl0% | 0% | 0% | 84% | 15%
dev/scripts0% | 0% | 0% | 42% | 57%
csharp/ReplTest0% | 0% | 0% | 71% | 28%
csharp/Utils0% | 0% | 0% | 0% | 100%
csharp/Perf0% | 0% | 0% | 0% | 100%
csharp/Tests.Common0% | 0% | 0% | 0% | 100%
scala/perf0% | 0% | 0% | 0% | 100%
python/perf0% | 0% | 0% | 0% | 100%
csharp0% | 0% | 0% | 0% | 100%
scripts0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
cpp0% | 0% | 0% | 0% | 100%
notes0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
DataFrameTest.cs
in csharp/AdapterTest
1363 71
DataFrame.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
816 83
RDD.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Core
782 74
WorkerTest.cs
in csharp/WorkerTest
686 23
FunctionsTest.cs
in csharp/AdapterTest
560 60
Functions.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
556 107
DataFrameIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
550 70
downloadtools.ps1
in build/localmode
539 -
DynamicPartitionKafkaInputDStream.scala
in scala/src/main/org/apache/spark/streaming/api/kafka
513 29
Riosock.cpp
in cpp/Riosock
510 20
SparkContextTest.cs
in csharp/AdapterTest
483 33
RioSocketWrapper.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Network
455 25
Types.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
452 18
PairDStreamFunctions.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Streaming
449 21
SparkCLRSubmitArguments.scala
in scala/src/main/org/apache/spark/launcher
425 20
PairRDDFunctions.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Core
407 21
CSharpDStream.scala
in scala/src/main/org/apache/spark/streaming/api/csharp
397 21
Worker.cs
in csharp/Worker/Microsoft.Spark.CSharp
396 16
DStreamTest.cs
in csharp/AdapterTest
394 10
SparkContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
359 40
StreamingContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
341 24
PayloadHelperTest.cs
in csharp/AdapterTest
329 17
UDFCommand.cs
in csharp/Worker/Microsoft.Spark.CSharp
328 9
SaeaSocketWrapper.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Network
317 20
DStream.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Streaming
315 44
SqlContextTest.cs
in csharp/AdapterTest
315 22
MapWithStateDStream.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Streaming
292 21
SerDe.scala
in scala/src/main/org/apache/spark/api/csharp
266 36
ColumnTest.cs
in csharp/AdapterTest
263 27
SparkContext.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Core
257 30
RioNative.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Network
253 10
RDDTest.cs
in csharp/AdapterTest
250 28
MultiThreadWorkerTest.cs
in csharp/WorkerTest
249 8
JvmBridge.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Interop/Ipc
247 12
DataFrameReaderTest.cs
in csharp/AdapterTest
246 12
ByteBuf.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Network
238 17
SqlContext.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
226 37
CSharpBackendHandler.scala
in scala/src/main/org/apache/spark/api/csharp
225 12
ByteBufChunk.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Network
222 11
PayloadHelper.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Interop/Ipc
212 2
SQLUtils.scala
in scala/src/main/org/apache/spark/sql/api/csharp
208 22
cmd
Build.cmd
in build
207 -
PairRDDTest.cs
in csharp/AdapterTest
199 19
RDDIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
198 27
CSharpRDD.scala
in scala/src/main/org/apache/spark/api/csharp
190 7
CatalogTest.cs
in csharp/AdapterTest
189 18
DataFrameWriterTest.cs
in csharp/AdapterTest
189 15
DataFrameNaFunctionsTest.cs
in csharp/AdapterTest
187 9
ConfigurationService.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Configuration
185 12
DynamicPartitionKafkaRDD.scala
in scala/src/main/org/apache/spark/streaming/api/kafka
184 4
Files With Most Units (Top 20)
File# lines# units
Functions.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
556 107
DataFrame.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
816 83
RDD.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Core
782 74
DataFrameTest.cs
in csharp/AdapterTest
1363 71
DataFrameIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
550 70
FunctionsTest.cs
in csharp/AdapterTest
560 60
DStream.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Streaming
315 44
SparkContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
359 40
SqlContext.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
226 37
SerDe.scala
in scala/src/main/org/apache/spark/api/csharp
266 36
SparkContextTest.cs
in csharp/AdapterTest
483 33
SparkContext.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Core
257 30
DynamicPartitionKafkaInputDStream.scala
in scala/src/main/org/apache/spark/streaming/api/kafka
513 29
RDDTest.cs
in csharp/AdapterTest
250 28
RDDIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
198 27
ColumnTest.cs
in csharp/AdapterTest
263 27
RioSocketWrapper.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Network
455 25
StreamingContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
341 24
SqlContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
171 23
WorkerTest.cs
in csharp/WorkerTest
686 23
Files With Long Lines (Top 20)

There are 137 files with lines longer than 120 characters. In total, there are 1211 long lines.

File# lines# units# long lines
SparkContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
359 40 68
DataFrameTest.cs
in csharp/AdapterTest
1363 71 68
Functions.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
556 107 67
DataFrame.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
816 83 56
DataFrameIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
550 70 54
StreamingContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
341 24 53
FunctionsTest.cs
in csharp/AdapterTest
560 60 38
RDDIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
198 27 36
DStreamTest.cs
in csharp/AdapterTest
394 10 33
PairDStreamFunctions.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Streaming
449 21 28
SqlContext.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
226 37 26
PairRDDFunctions.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Core
407 21 24
UdfRegistration.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Sql
101 13 24
PairRDDTest.cs
in csharp/AdapterTest
199 19 24
SQLUtils.scala
in scala/src/main/org/apache/spark/sql/api/csharp
208 22 23
DynamicPartitionKafkaInputDStream.scala
in scala/src/main/org/apache/spark/streaming/api/kafka
513 29 21
SparkContext.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Core
257 30 19
SparkContextTest.cs
in csharp/AdapterTest
483 33 19
SqlContextIpcProxy.cs
in csharp/Adapter/Microsoft.Spark.CSharp/Proxy/Ipc
171 23 18
TestWithMoqDemo.cs
in csharp/AdapterTest
153 3 17