microsoft / dpu-utils
File Size

The distribution of size of files (measured in lines of code).

Intro
  • File size measurements show the distribution of size of files.
  • Files are classified in four categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+(very long files).
  • It is a good practice to keep files small. Long files may become "bloaters", code that have increased to such gargantuan proportions that they are hard to work with.
Learn more...
File Size Overall
  • There are 63 files with 5,487 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long files (502 lines of code)
    • 6 medium size files (1,837 lines of codeclsfd_ftr_w_mp_ins)
    • 12 small files (1,796 lines of code)
    • 44 very small files (1,352 lines of code)
0% | 9% | 33% | 32% | 24%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: zoomable circles | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 19% | 7% | 37% | 35%
cs0% | 0% | 58% | 29% | 12%
yml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
python/dpu_utils/utils0% | 56% | 23% | 0% | 20%
dotnet/Utils0% | 0% | 68% | 12% | 18%
dotnet/CSharpSourceGraphExtraction/GraphBuilders0% | 0% | 50% | 34% | 15%
dotnet/CSharpSourceGraphExtraction0% | 0% | 72% | 27% | 0%
dotnet/CSharpSourceGraphExtraction/Utils0% | 0% | 41% | 52% | 6%
python/dpu_utils/tfmodels0% | 0% | 0% | 99% | <1%
python/dpu_utils/ptutils0% | 0% | 0% | 99% | <1%
python/dpu_utils/mlutils0% | 0% | 0% | 52% | 47%
python/dpu_utils/codeutils0% | 0% | 0% | 29% | 70%
python/dpu_utils/tfutils0% | 0% | 0% | 0% | 100%
python/dpu_utils/tf2utils0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 100%
python0% | 0% | 0% | 0% | 100%
python/dpu_utils0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
richpath.py
in python/dpu_utils/utils
502 81
VariableUseGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
440 22
RichPath.cs
in dotnet/Utils
401 22
SourceGraph.cs
in dotnet/CSharpSourceGraphExtraction
323 16
DirectedGraph.cs
in dotnet/Utils
243 13
RoslynUtils.cs
in dotnet/CSharpSourceGraphExtraction/Utils
223 16
iterators.py
in python/dpu_utils/utils
207 18
componenttrainer.py
in python/dpu_utils/ptutils
192 11
asyncgnn.py
in python/dpu_utils/tfmodels
189 6
GuardedByGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
184 18
sparsegnn.py
in python/dpu_utils/tfmodels
174 3
basecomponent.py
in python/dpu_utils/ptutils
167 29
TypeHierarchy.cs
in dotnet/CSharpSourceGraphExtraction/Utils
152 8
bpevocabulary.py
in python/dpu_utils/mlutils
134 12
MethodUtils.cs
in dotnet/CSharpSourceGraphExtraction/Utils
130 4
MethodUseInformationCollector.cs
in dotnet/CSharpSourceGraphExtraction
123 9
ChunkedJsonWriter.cs
in dotnet/Utils
120 6
ReturnToGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
116 16
lattice.py
in python/dpu_utils/codeutils/lattice
115 15
DataFlowGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
93 2
Multimap.cs
in dotnet/Utils
92 10
azure-pipelines.yml
in root
84 -
deduplication.py
in python/dpu_utils/codeutils/deduplication
82 8
vocabulary.py
in python/dpu_utils/mlutils
80 16
mlp.py
in python/dpu_utils/tf2utils
69 3
chunkwriter.py
in python/dpu_utils/utils
69 9
BidirectionalMap.cs
in dotnet/Utils
65 11
parser.py
in python/dpu_utils/codeutils/treesitter
54 1
msgpackloading.py
in python/dpu_utils/utils
47 3
ASTGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
46 4
unsorted_segment_ops.py
in python/dpu_utils/tf2utils
46 3
chartensorizer.py
in python/dpu_utils/mlutils
39 6
unsortedsegmentops.py
in python/dpu_utils/tfutils
36 3
tfvariablesaver.py
in python/dpu_utils/tfutils
35 4
gradratiologgingoptimizer.py
in python/dpu_utils/tfutils
34 4
IntVocabulary.cs
in dotnet/CSharpSourceGraphExtraction/Utils
33 3
keywordlist.py
in python/dpu_utils/codeutils/keywords
32 1
activation.py
in python/dpu_utils/tf2utils
30 2
pick_indices.py
in python/dpu_utils/tfutils
27 1
text.py
in python/dpu_utils/codeutils
26 1
activation.py
in python/dpu_utils/tfutils
26 1
filesuffix.py
in python/dpu_utils/codeutils
25 1
dataloading.py
in python/dpu_utils/utils
25 4
setup.py
in python
24 -
ExtensionUtils.cs
in dotnet/Utils
22 2
identifiersplitting.py
in python/dpu_utils/codeutils
22 1
csharplattice.py
in python/dpu_utils/codeutils/lattice
19 3
debughelper.py
in python/dpu_utils/utils
15 1
__init__.py
in python/dpu_utils/utils
14 -
gitlog.py
in python/dpu_utils/utils
10 1
Files With Most Units (Top 20)
File# lines# units
richpath.py
in python/dpu_utils/utils
502 81
basecomponent.py
in python/dpu_utils/ptutils
167 29
VariableUseGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
440 22
RichPath.cs
in dotnet/Utils
401 22
GuardedByGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
184 18
iterators.py
in python/dpu_utils/utils
207 18
ReturnToGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
116 16
SourceGraph.cs
in dotnet/CSharpSourceGraphExtraction
323 16
RoslynUtils.cs
in dotnet/CSharpSourceGraphExtraction/Utils
223 16
vocabulary.py
in python/dpu_utils/mlutils
80 16
lattice.py
in python/dpu_utils/codeutils/lattice
115 15
DirectedGraph.cs
in dotnet/Utils
243 13
bpevocabulary.py
in python/dpu_utils/mlutils
134 12
BidirectionalMap.cs
in dotnet/Utils
65 11
componenttrainer.py
in python/dpu_utils/ptutils
192 11
Multimap.cs
in dotnet/Utils
92 10
MethodUseInformationCollector.cs
in dotnet/CSharpSourceGraphExtraction
123 9
chunkwriter.py
in python/dpu_utils/utils
69 9
TypeHierarchy.cs
in dotnet/CSharpSourceGraphExtraction/Utils
152 8
deduplication.py
in python/dpu_utils/codeutils/deduplication
82 8
Files With Long Lines (Top 17)

There are 17 files with lines longer than 120 characters. In total, there are 147 long lines.

File# lines# units# long lines
VariableUseGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
440 22 56
sparsegnn.py
in python/dpu_utils/tfmodels
174 3 16
DirectedGraph.cs
in dotnet/Utils
243 13 14
SourceGraph.cs
in dotnet/CSharpSourceGraphExtraction
323 16 13
iterators.py
in python/dpu_utils/utils
207 18 9
MethodUseInformationCollector.cs
in dotnet/CSharpSourceGraphExtraction
123 9 6
richpath.py
in python/dpu_utils/utils
502 81 6
RoslynUtils.cs
in dotnet/CSharpSourceGraphExtraction/Utils
223 16 5
componenttrainer.py
in python/dpu_utils/ptutils
192 11 5
DataFlowGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
93 2 4
GuardedByGraphBuilder.cs
in dotnet/CSharpSourceGraphExtraction/GraphBuilders
184 18 4
RichPath.cs
in dotnet/Utils
401 22 4
MethodUtils.cs
in dotnet/CSharpSourceGraphExtraction/Utils
130 4 1
deduplication.py
in python/dpu_utils/codeutils/deduplication
82 8 1
asyncgnn.py
in python/dpu_utils/tfmodels
189 6 1
__init__.py
in python/dpu_utils/utils
14 - 1
setup.py
in python
24 - 1