apache / datasketches-experimentation
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
0% | 0% | 26% | 56% | 16%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py0% | 0% | 26% | 56% | 16%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
sketches0% | 0% | 54% | 45% | 0%
ROOT0% | 0% | 22% | 63% | 13%
analysis0% | 0% | 0% | 100% | 0%
experiments0% | 0% | 0% | 0% | 100%
Longest Files (Top 15)
File# lines# units
Sampler.py
in sketches
212 42
210 23
184 36
Sketches.py
in sketches
176 47
Oracle.py
in root
167 32
analyze.py
in analysis
149 15
117 19
115 18
93 2
61 -
topk_experiment.py
in experiments
44 -
38 -
31 1
2 1
main.py
in root
1 -
Files With Most Units (Top 11)
File# lines# units
Sketches.py
in sketches
176 47
Sampler.py
in sketches
212 42
184 36
Oracle.py
in root
167 32
210 23
117 19
115 18
analyze.py
in analysis
149 15
93 2
2 1
31 1
Files With Long Lines (Top 4)

There are 4 files with lines longer than 120 characters. In total, there are 10 long lines.

File# lines# units# long lines
analyze.py
in analysis
149 15 4
93 2 3
210 23 2
topk_experiment.py
in experiments
44 - 1
Correlations

File Size vs. Commits (all time): 15 points

DataGenerator.py x: 2 commits (all time) y: 184 lines of code Oracle.py x: 2 commits (all time) y: 167 lines of code QueryGenerator.py x: 2 commits (all time) y: 117 lines of code SketchExperiment.py x: 2 commits (all time) y: 210 lines of code StreamMaker.py x: 2 commits (all time) y: 93 lines of code SyntheticDataGenerators.py x: 2 commits (all time) y: 115 lines of code TopKExperiments.py x: 2 commits (all time) y: 31 lines of code analysis/analyze.py x: 2 commits (all time) y: 149 lines of code experiment_utils.py x: 2 commits (all time) y: 2 lines of code experiments/distinctcount_experiment.py x: 2 commits (all time) y: 38 lines of code experiments/quantile_experiment.py x: 2 commits (all time) y: 61 lines of code experiments/topk_experiment.py x: 2 commits (all time) y: 44 lines of code main.py x: 2 commits (all time) y: 1 lines of code sketches/Sampler.py x: 2 commits (all time) y: 212 lines of code sketches/Sketches.py x: 2 commits (all time) y: 176 lines of code
212.0
lines of code
  min: 1.0
  average: 106.67
  25th percentile: 38.0
  median: 115.0
  75th percentile: 176.0
  max: 212.0
0 2.0
commits (all time)
min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0

File Size vs. Contributors (all time): 15 points

DataGenerator.py x: 2 contributors (all time) y: 184 lines of code Oracle.py x: 2 contributors (all time) y: 167 lines of code QueryGenerator.py x: 2 contributors (all time) y: 117 lines of code SketchExperiment.py x: 2 contributors (all time) y: 210 lines of code StreamMaker.py x: 2 contributors (all time) y: 93 lines of code SyntheticDataGenerators.py x: 2 contributors (all time) y: 115 lines of code TopKExperiments.py x: 2 contributors (all time) y: 31 lines of code analysis/analyze.py x: 2 contributors (all time) y: 149 lines of code experiment_utils.py x: 2 contributors (all time) y: 2 lines of code experiments/distinctcount_experiment.py x: 2 contributors (all time) y: 38 lines of code experiments/quantile_experiment.py x: 2 contributors (all time) y: 61 lines of code experiments/topk_experiment.py x: 2 contributors (all time) y: 44 lines of code main.py x: 2 contributors (all time) y: 1 lines of code sketches/Sampler.py x: 2 contributors (all time) y: 212 lines of code sketches/Sketches.py x: 2 contributors (all time) y: 176 lines of code
212.0
lines of code
  min: 1.0
  average: 106.67
  25th percentile: 38.0
  median: 115.0
  75th percentile: 176.0
  max: 212.0
0 2.0
contributors (all time)
min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0

File Size vs. Commits (30 days): 0 points

No data for "commits (30d)" vs. "lines of code".

File Size vs. Contributors (30 days): 0 points

No data for "contributors (30d)" vs. "lines of code".


File Size vs. Commits (90 days): 0 points

No data for "commits (90d)" vs. "lines of code".

File Size vs. Contributors (90 days): 0 points

No data for "contributors (90d)" vs. "lines of code".