awslabs / deequ
File Size

The distribution of file sizes (measured in lines of code).

Intro
  • File size measurements show how the lines of code are distributed across files of different sizes.
  • Files are classified into five categories based on their size (lines of code): 1-100 (very small files), 101-200 (small files), 201-500 (medium size files), 501-1000 (long files), 1001+ (very long files).
  • It is good practice to keep files small. Long files may become "bloaters": code that has grown to such gargantuan proportions that it is hard to work with.
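The classification above can be sketched in a few lines of Python. This is a minimal illustration, not the report tool's implementation: the function name `classify_file_size` and the `SIZE_BUCKETS` table are hypothetical, and only the category boundaries and the sample line counts come from this report.

```python
# Category boundaries taken from the intro above (upper bounds are inclusive).
# The names SIZE_BUCKETS and classify_file_size are hypothetical.
SIZE_BUCKETS = [
    (100, "very small"),
    (200, "small"),
    (500, "medium"),
    (1000, "long"),
]


def classify_file_size(lines_of_code: int) -> str:
    """Return the size category for a file with the given line count."""
    if lines_of_code < 1:
        raise ValueError("lines_of_code must be at least 1")
    for upper_bound, label in SIZE_BUCKETS:
        if lines_of_code <= upper_bound:
            return label
    return "very long"


# Line counts for a few files listed later in this report.
sample = {
    "ColumnProfiler.scala": 524,
    "Check.scala": 487,
    "KLLRunner.scala": 127,
    "DfsUtils.scala": 50,
}
categories = {name: classify_file_size(loc) for name, loc in sample.items()}
for name, category in categories.items():
    print(f"{name}: {category}")
```

With these boundaries, ColumnProfiler.scala (524 lines) is the only file in the sample that falls into the "long" category, which matches the single long file reported below.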
File Size Overall
  • There are 95 files with 8,219 lines of code.
    • 0 very long files (0 lines of code)
    • 1 long file (524 lines of code)
    • 7 medium size files (2,458 lines of code)
    • 15 small files (2,295 lines of code)
    • 72 very small files (2,942 lines of code)
Share of lines of code per size category:
1001+: 0% | 501-1000: 6% | 201-500: 29% | 101-200: 27% | 1-100: 35%
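The percentages above can be reproduced from the line counts listed in this section. The sketch below assumes the shares are computed over lines of code (not over the number of files) and rounded down to whole percentages. That assumption is consistent with the figures shown here, but the report does not state how it rounds.

```python
# Lines of code per size category, as listed in "File Size Overall".
lines_per_category = {
    "1001+": 0,
    "501-1000": 524,
    "201-500": 2458,
    "101-200": 2295,
    "1-100": 2942,
}

total_lines = sum(lines_per_category.values())

# Assumption: shares are rounded down (integer division), not to the nearest percent.
shares = {
    category: (100 * lines) // total_lines
    for category, lines in lines_per_category.items()
}

print(total_lines)
print(" | ".join(f"{category}: {share}%" for category, share in shares.items()))
```

Because each share is rounded down, the five percentages add up to 97% rather than 100%.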


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
scala | 0% | 6% | 30% | 28% | 35%
java | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
deequ/profiles | 0% | 60% | 0% | 30% | 9%
deequ/analyzers | 0% | 0% | 25% | 30% | 44%
deequ/checks | 0% | 0% | 92% | 0% | 7%
deequ/repository | 0% | 0% | 56% | 15% | 27%
deequ/constraints | 0% | 0% | 81% | 0% | 18%
deequ/suggestions | 0% | 0% | 31% | 22% | 46%
deequ | 0% | 0% | 0% | 82% | 17%
deequ/schema | 0% | 0% | 0% | 100% | 0%
deequ/anomalydetection | 0% | 0% | 0% | 33% | 66%
deequ/metrics | 0% | 0% | 0% | 0% | 100%
deequ/io | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | # lines | # units
ColumnProfiler.scala
in src/main/scala/com/amazon/deequ/profiles
524 18
Check.scala
in src/main/scala/com/amazon/deequ/checks
487 55
AnalysisResultSerde.scala
in src/main/scala/com/amazon/deequ/repository
463 18
AnalysisRunner.scala
in src/main/scala/com/amazon/deequ/analyzers/runners
349 14
Constraint.scala
in src/main/scala/com/amazon/deequ/constraints
346 28
Analyzer.scala
in src/main/scala/com/amazon/deequ/analyzers
333 39
ConstraintSuggestionRunner.scala
in src/main/scala/com/amazon/deequ/suggestions
262 11
StateProvider.scala
in src/main/scala/com/amazon/deequ/analyzers
218 23
StatefulHyperloglogPlus.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
198 13
QuantileNonSample.scala
in src/main/scala/com/amazon/deequ/analyzers
193 15
Applicability.scala
in src/main/scala/com/amazon/deequ/analyzers/applicability
193 15
VerificationSuite.scala
in src/main/scala/com/amazon/deequ
192 12
ConstraintSuggestionRunBuilder.scala
in src/main/scala/com/amazon/deequ/suggestions
185 21
VerificationRunBuilder.scala
in src/main/scala/com/amazon/deequ
165 17
RowLevelSchemaValidator.scala
in src/main/scala/com/amazon/deequ/schema
154 11
HoltWinters.scala
in src/main/scala/com/amazon/deequ/anomalydetection/seasonal
151 4
DataType.scala
in src/main/scala/com/amazon/deequ/analyzers
137 10
ColumnProfilerRunBuilder.scala
in src/main/scala/com/amazon/deequ/profiles
136 15
ColumnProfile.scala
in src/main/scala/com/amazon/deequ/profiles
129 1
KLLRunner.scala
in src/main/scala/com/amazon/deequ/analyzers/runners
127 12
FileSystemMetricsRepository.scala
in src/main/scala/com/amazon/deequ/repository/fs
125 12
KLLSketch.scala
in src/main/scala/com/amazon/deequ/analyzers
106 7
GroupingAnalyzers.scala
in src/main/scala/com/amazon/deequ/analyzers
104 11
AnalysisRunBuilder.scala
in src/main/scala/com/amazon/deequ/analyzers/runners
93 10
AnalyzerContext.scala
in src/main/scala/com/amazon/deequ/analyzers/runners
86 8
AnalysisResult.scala
in src/main/scala/com/amazon/deequ/repository
86 5
ColumnProfilerRunner.scala
in src/main/scala/com/amazon/deequ/profiles
83 4
FractionalCategoricalRangeRule.scala
in src/main/scala/com/amazon/deequ/suggestions/rules
83 3
KLLSketchSerializer.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
82 5
Histogram.scala
in src/main/scala/com/amazon/deequ/analyzers
81 6
OnlineNormalStrategy.scala
in src/main/scala/com/amazon/deequ/anomalydetection
75 2
KLLMetric.scala
in src/main/scala/com/amazon/deequ/metrics
75 5
VerificationResult.scala
in src/main/scala/com/amazon/deequ
74 5
Correlation.scala
in src/main/scala/com/amazon/deequ/analyzers
72 5
ConstraintSuggestion.scala
in src/main/scala/com/amazon/deequ/suggestions
72 3
AnalysisBasedConstraint.scala
in src/main/scala/com/amazon/deequ/constraints
71 5
StatefulApproxQuantile.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
69 9
MetricsRepositoryMultipleResultsLoader.scala
in src/main/scala/com/amazon/deequ/repository
69 5
ApproxQuantile.scala
in src/main/scala/com/amazon/deequ/analyzers
68 5
StatefulKLLSketch.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
66 7
InMemoryMetricsRepository.scala
in src/main/scala/com/amazon/deequ/repository/memory
65 8
MutualInformation.scala
in src/main/scala/com/amazon/deequ/analyzers
60 3
ApproxQuantiles.scala
in src/main/scala/com/amazon/deequ/analyzers
59 4
Distance.scala
in src/main/scala/com/amazon/deequ/analyzers
58 3
StatefulDataType.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
52 4
AttributeReferenceCreation.java
in src/main/scala/com/amazon/deequ/analyzers/catalyst
52 2
DeequFunctions.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
51 10
CategoricalRangeRule.scala
in src/main/scala/com/amazon/deequ/suggestions/rules
51 2
DfsUtils.scala
in src/main/scala/com/amazon/deequ/io
50 4
StandardDeviation.scala
in src/main/scala/com/amazon/deequ/analyzers
47 5
Files With Most Units (Top 20)
File | # lines | # units
Check.scala
in src/main/scala/com/amazon/deequ/checks
487 55
Analyzer.scala
in src/main/scala/com/amazon/deequ/analyzers
333 39
Constraint.scala
in src/main/scala/com/amazon/deequ/constraints
346 28
StateProvider.scala
in src/main/scala/com/amazon/deequ/analyzers
218 23
ConstraintSuggestionRunBuilder.scala
in src/main/scala/com/amazon/deequ/suggestions
185 21
AnalysisResultSerde.scala
in src/main/scala/com/amazon/deequ/repository
463 18
ColumnProfiler.scala
in src/main/scala/com/amazon/deequ/profiles
524 18
VerificationRunBuilder.scala
in src/main/scala/com/amazon/deequ
165 17
QuantileNonSample.scala
in src/main/scala/com/amazon/deequ/analyzers
193 15
Applicability.scala
in src/main/scala/com/amazon/deequ/analyzers/applicability
193 15
ColumnProfilerRunBuilder.scala
in src/main/scala/com/amazon/deequ/profiles
136 15
AnalysisRunner.scala
in src/main/scala/com/amazon/deequ/analyzers/runners
349 14
StatefulHyperloglogPlus.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
198 13
KLLRunner.scala
in src/main/scala/com/amazon/deequ/analyzers/runners
127 12
FileSystemMetricsRepository.scala
in src/main/scala/com/amazon/deequ/repository/fs
125 12
VerificationSuite.scala
in src/main/scala/com/amazon/deequ
192 12
GroupingAnalyzers.scala
in src/main/scala/com/amazon/deequ/analyzers
104 11
ConstraintSuggestionRunner.scala
in src/main/scala/com/amazon/deequ/suggestions
262 11
RowLevelSchemaValidator.scala
in src/main/scala/com/amazon/deequ/schema
154 11
DeequFunctions.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
51 10
Files With Long Lines (Top 5)

There are 5 files with lines longer than 120 characters. In total, there are 8 long lines.

File | # lines | # units | # long lines
PatternMatch.scala
in src/main/scala/com/amazon/deequ/analyzers
42 5 3
ColumnProfiler.scala
in src/main/scala/com/amazon/deequ/profiles
524 18 2
DeequFunctions.scala
in src/main/scala/com/amazon/deequ/analyzers/catalyst
51 10 1
KLLRunner.scala
in src/main/scala/com/amazon/deequ/analyzers/runners
127 12 1
FileSystemMetricsRepository.scala
in src/main/scala/com/amazon/deequ/repository/fs
125 12 1
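The long-line count in this section can be approximated with a short check. This is a sketch under stated assumptions: the function name `count_long_lines` is hypothetical, and only the 120-character threshold comes from the report. How the report tool treats tabs or trailing whitespace is not stated, so the sketch simply counts characters per line.

```python
# Threshold taken from this report: lines longer than 120 characters count as long.
LONG_LINE_THRESHOLD = 120


def count_long_lines(source: str, threshold: int = LONG_LINE_THRESHOLD) -> int:
    """Count the lines in `source` that are strictly longer than `threshold`."""
    return sum(1 for line in source.splitlines() if len(line) > threshold)


# Small synthetic input: one line of 121 characters, one of exactly 120, one of 10.
example_source = "\n".join(["x" * 121, "y" * 120, "z" * 10])
long_line_count = count_long_lines(example_source)
print(long_line_count)
```

To apply it to a real file, read the file's contents into a string first and pass that string to `count_long_lines`.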