apache / datasketches-experimentation
File Change Frequency

File change frequency (churn) shows how often files are updated. A file's change count is the number of days with at least one commit that touched it.
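The definition above can be sketched in a few lines of Python. This is a minimal illustration, not the reporting tool's implementation: the function names and the sample commit records are hypothetical, and only the bucket labels are taken from the legend in this report.

```python
from collections import defaultdict
from datetime import date

# Bucket ranges copied from the report legend: (label, lower, upper).
BUCKETS = [
    ("101+", 101, None),
    ("51-100", 51, 100),
    ("21-50", 21, 50),
    ("6-20", 6, 20),
    ("1-5", 1, 5),
]


def change_days(commits):
    """Count, per file, the distinct days with at least one commit."""
    days = defaultdict(set)
    for path, day in commits:
        days[path].add(day)
    return {path: len(d) for path, d in days.items()}


def bucket_for(count):
    """Return the legend label whose range contains `count`."""
    for label, lower, upper in BUCKETS:
        if count >= lower and (upper is None or count <= upper):
            return label
    raise ValueError(f"no bucket for count {count}")


def distribution(commits):
    """Number of files that fall into each change-frequency bucket."""
    result = {label: 0 for label, _, _ in BUCKETS}
    for count in change_days(commits).values():
        result[bucket_for(count)] += 1
    return result


# Hypothetical (path, commit date) records, made up for illustration.
# Two commits to main.py on the same day count as a single change day.
commits = [
    ("main.py", date(2021, 11, 3)),
    ("main.py", date(2021, 11, 3)),
    ("main.py", date(2021, 11, 4)),
    ("Oracle.py", date(2021, 11, 3)),
    ("Oracle.py", date(2021, 11, 4)),
]

print(change_days(commits))
print(distribution(commits))
```

Counting days rather than commits means a burst of commits on one day registers as a single update, which is why the report labels the column "# changes (days)".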

Overview
File Change Frequency Overall
  • There are 15 files with 1,600 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 0 files changed 21-50 times (0 lines of code)
    • 0 files changed 6-20 times (0 lines of code)
    • 15 files changed 1-5 times (1,600 lines of code)
101+: 0% | 51-100: 0% | 21-50: 0% | 6-20: 0% | 1-5: 100%

Contributors Count Frequency Overall
  • There are 15 files with 1,600 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 0 files changed by 11-25 contributors (0 lines of code)
    • 0 files changed by 6-10 contributors (0 lines of code)
    • 15 files changed by 2-5 contributors (1,600 lines of code)
    • 0 files changed by 1 contributor (0 lines of code)
26+: 0% | 11-25: 0% | 6-10: 0% | 2-5: 100% | 1: 0%

File Change Frequency per File Extension
py, ipynb, txt, gitignore, md, json
File Change Frequency per Extension (the number of recorded file updates)
py: 101+: 0% | 51-100: 0% | 21-50: 0% | 6-20: 0% | 1-5: 100%
File Change Frequency per Logical Decomposition
primary (file change frequency)
The number of recorded file updates
ROOT: 101+: 0% | 51-100: 0% | 21-50: 0% | 6-20: 0% | 1-5: 100%
sketches: 101+: 0% | 51-100: 0% | 21-50: 0% | 6-20: 0% | 1-5: 100%
analysis: 101+: 0% | 51-100: 0% | 21-50: 0% | 6-20: 0% | 1-5: 100%
experiments: 101+: 0% | 51-100: 0% | 21-50: 0% | 6-20: 0% | 1-5: 100%
Most Frequently Changed Files (Top 15)

File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
main.py (in root) | 1 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
experiment_utils.py (in root) | 2 | 1 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
TopKExperiments.py (in root) | 31 | 1 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
distinctcount_experiment.py (in experiments) | 38 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
topk_experiment.py (in experiments) | 44 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
quantile_experiment.py (in experiments) | 61 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
StreamMaker.py (in root) | 93 | 2 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
SyntheticDataGenerators.py (in root) | 115 | 18 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
QueryGenerator.py (in root) | 117 | 19 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
analyze.py (in analysis) | 149 | 15 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Oracle.py (in root) | 167 | 32 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Sketches.py (in sketches) | 176 | 47 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
DataGenerator.py (in root) | 184 | 36 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
SketchExperiment.py (in root) | 210 | 23 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Sampler.py (in sketches) | 212 | 42 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Files With Most Contributors (Top 15)
Based on the number of unique email addresses found in commits.

File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
experiment_utils.py (in root) | 2 | 1 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
DataGenerator.py (in root) | 184 | 36 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
analyze.py (in analysis) | 149 | 15 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Sketches.py (in sketches) | 176 | 47 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Sampler.py (in sketches) | 212 | 42 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
topk_experiment.py (in experiments) | 44 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
distinctcount_experiment.py (in experiments) | 38 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
quantile_experiment.py (in experiments) | 61 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
StreamMaker.py (in root) | 93 | 2 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
QueryGenerator.py (in root) | 117 | 19 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
SketchExperiment.py (in root) | 210 | 23 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Oracle.py (in root) | 167 | 32 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
main.py (in root) | 1 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
TopKExperiments.py (in root) | 31 | 1 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
SyntheticDataGenerators.py (in root) | 115 | 18 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
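
The contributor counts in this section are based on unique email addresses found in commits. A minimal sketch of that counting rule follows; the function names and the example.com addresses are hypothetical, the band labels come from the contributor legend in the overview, and the tool's own implementation may differ.

```python
from collections import defaultdict

# Contributor bands copied from the report legend: (label, lower, upper).
BANDS = [
    ("26+", 26, None),
    ("11-25", 11, 25),
    ("6-10", 6, 10),
    ("2-5", 2, 5),
    ("1", 1, 1),
]


def contributors_per_file(commits):
    """Count, per file, the unique author emails seen in its commits."""
    emails = defaultdict(set)
    for path, email in commits:
        emails[path].add(email)
    return {path: len(e) for path, e in emails.items()}


def band_for(count):
    """Return the legend label whose range contains `count`."""
    for label, lower, upper in BANDS:
        if count >= lower and (upper is None or count <= upper):
            return label
    raise ValueError(f"no band for count {count}")


# Hypothetical (path, author email) records, made up for illustration.
commits = [
    ("main.py", "alice@example.com"),
    ("main.py", "bob@example.com"),
    ("main.py", "alice@example.com"),  # repeat author, not counted again
    ("Oracle.py", "alice@example.com"),
]

counts = contributors_per_file(commits)
print(counts)
print({path: band_for(n) for path, n in counts.items()})
```

Because the key is the email address, one person committing under two addresses would be counted as two contributors under this rule.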
Files With Fewest Contributors (Top 15)
Based on the number of unique email addresses found in commits.

File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
Sampler.py (in sketches) | 212 | 42 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
SketchExperiment.py (in root) | 210 | 23 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
DataGenerator.py (in root) | 184 | 36 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Sketches.py (in sketches) | 176 | 47 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Oracle.py (in root) | 167 | 32 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
analyze.py (in analysis) | 149 | 15 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
QueryGenerator.py (in root) | 117 | 19 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
SyntheticDataGenerators.py (in root) | 115 | 18 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
StreamMaker.py (in root) | 93 | 2 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
quantile_experiment.py (in experiments) | 61 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
topk_experiment.py (in experiments) | 44 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
distinctcount_experiment.py (in experiments) | 38 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
TopKExperiments.py (in root) | 31 | 1 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
experiment_utils.py (in root) | 2 | 1 | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
main.py (in root) | 1 | - | 2021-11-03 | 2021-11-04 | 2 | 2 | dting@dting-ltm.internal.sa... | 786705+jmalkin@users.norepl...
Correlations

File Size vs. Number of Changes: 15 points

File | lines of code | # changes
DataGenerator.py | 184 | 2
Oracle.py | 167 | 2
QueryGenerator.py | 117 | 2
SketchExperiment.py | 210 | 2
StreamMaker.py | 93 | 2
SyntheticDataGenerators.py | 115 | 2
TopKExperiments.py | 31 | 2
analysis/analyze.py | 149 | 2
experiment_utils.py | 2 | 2
experiments/distinctcount_experiment.py | 38 | 2
experiments/quantile_experiment.py | 61 | 2
experiments/topk_experiment.py | 44 | 2
main.py | 1 | 2
sketches/Sampler.py | 212 | 2
sketches/Sketches.py | 176 | 2

# changes: min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0
lines of code: min: 1.0 | average: 106.67 | 25th percentile: 38.0 | median: 115.0 | 75th percentile: 176.0 | max: 212.0
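
The lines-of-code summary statistics can be checked against the 15 data points listed for this plot. The sketch below uses Python's standard `statistics` module. The report does not say which percentile definition the tool applies, so the choice of the "exclusive" quantile method is an assumption; it happens to reproduce the reported figures for this data set.

```python
import statistics

# Lines of code for the 15 files, in the order the data points are listed.
loc = [184, 167, 117, 210, 93, 115, 31, 149, 2, 38, 61, 44, 1, 212, 176]

# Quartile cut points. method="exclusive" is an assumed definition,
# not one confirmed by the report.
q1, median, q3 = statistics.quantiles(loc, n=4, method="exclusive")

summary = {
    "min": min(loc),
    "average": round(statistics.mean(loc), 2),
    "25th percentile": q1,
    "median": median,
    "75th percentile": q3,
    "max": max(loc),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

With 15 values the exclusive method lands exactly on the 4th, 8th, and 12th sorted values, so no interpolation takes place. An interpolating definition such as the "inclusive" method would give a different 25th percentile for the same data.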

Number of Contributors vs. Number of Changes: 15 points

File | # contributors | # changes
DataGenerator.py | 2 | 2

# changes: min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0
# contributors: min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0

Number of Contributors vs. File Size: 15 points

File | # contributors | lines of code
DataGenerator.py | 2 | 184
Oracle.py | 2 | 167
QueryGenerator.py | 2 | 117
SketchExperiment.py | 2 | 210
StreamMaker.py | 2 | 93
SyntheticDataGenerators.py | 2 | 115
TopKExperiments.py | 2 | 31
analysis/analyze.py | 2 | 149
experiment_utils.py | 2 | 2
experiments/distinctcount_experiment.py | 2 | 38
experiments/quantile_experiment.py | 2 | 61
experiments/topk_experiment.py | 2 | 44
main.py | 2 | 1
sketches/Sampler.py | 2 | 212
sketches/Sketches.py | 2 | 176

lines of code: min: 1.0 | average: 106.67 | 25th percentile: 38.0 | median: 115.0 | 75th percentile: 176.0 | max: 212.0
# contributors: min: 2.0 | average: 2.0 | 25th percentile: 2.0 | median: 2.0 | 75th percentile: 2.0 | max: 2.0