apache / datasketches-spark
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 28 files with 1,516 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 0 files changed 21-50 times (0 lines of code)
    • 3 files changed 6-20 times (235 lines of code)
    • 25 files changed 1-5 times (1,281 lines of code)
0% | 0% | 0% | 15% | 84%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 28 files with 1,516 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 0 files changed by 11-25 contributors (0 lines of code)
    • 0 files changed by 6-10 contributors (0 lines of code)
    • 16 files changed by 2-5 contributors (742 lines of code)
    • 12 files changed by 1 contributor (774 lines of code)
0% | 0% | 0% | 48% | 51%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
scala, py, md, yaml, sbt, cfg, in, ini, gitignore, toml, txt, sh
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
scala0% | 0% | 0% | 8% | 91%
sbt0% | 0% | 0% | 100% | 0%
py0% | 0% | 0% | 34% | 65%
toml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
cfg0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
src0% | 0% | 0% | 8% | 91%
ROOT0% | 0% | 0% | 98% | 1%
python0% | 0% | 0% | 25% | 75%
project0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 28)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
build.sbt
in root
77 - 2025-01-07 2025-02-18 9 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
common.py
in python/src/datasketches_spark
57 10 2025-02-06 2025-03-28 8 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
ThetaExpressions.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/expressions
101 8 2025-01-29 2025-04-02 6 4 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
setup.py
in python
30 1 2025-02-06 2025-02-18 5 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
69 20 2025-01-29 2025-04-02 5 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
ThetaSketchAggBuild.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate
122 6 2025-01-29 2025-04-02 5 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
in
MANIFEST.in
in python
19 - 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
DatasketchesFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
29 3 2025-01-29 2025-03-28 4 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
42 - 2025-02-12 2025-02-18 4 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
57 3 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
kll.py
in python/src/datasketches_spark
68 12 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchExpressions.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/expressions
237 17 2025-01-29 2025-02-20 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
cfg
1 - 2025-02-12 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
__init__.py
in python/src/datasketches_spark
4 - 2025-02-06 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
_version.py
in python/src/datasketches_spark
8 1 2025-02-12 2025-02-18 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
10 2 2025-03-25 2025-03-28 3 2 jmalkin.noreply@apache.org jmalkin.noreply@apache.org
ThetaFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
15 - 2025-01-29 2025-02-27 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchType.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/types
36 3 2025-01-29 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaSketchAggUnion.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate
104 6 2025-01-29 2025-02-27 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaSketchConstants.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
4 - 2025-02-25 2025-02-27 2 2 alexandersaydakov@users.nor... 786705+jmalkin@users.norepl...
DatasketchesScalaFunctionsBase.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
10 1 2025-01-29 2025-04-02 2 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
ThetaSketchWrapper.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/types
28 4 2025-01-29 2025-04-02 2 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
ThetaSketchType.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/types
15 2 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll
17 - 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
CastAsBinary.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
36 4 2025-03-28 2025-03-28 1 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll
100 30 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchAggBuild.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate
107 10 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchAggMerge.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate
113 10 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
Files With Most Contributors (Top 28)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
ThetaExpressions.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/expressions
101 8 2025-01-29 2025-04-02 6 4 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
69 20 2025-01-29 2025-04-02 5 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
ThetaSketchAggBuild.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate
122 6 2025-01-29 2025-04-02 5 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
DatasketchesScalaFunctionsBase.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
10 1 2025-01-29 2025-04-02 2 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
ThetaSketchWrapper.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/types
28 4 2025-01-29 2025-04-02 2 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
build.sbt
in root
77 - 2025-01-07 2025-02-18 9 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
common.py
in python/src/datasketches_spark
57 10 2025-02-06 2025-03-28 8 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
setup.py
in python
30 1 2025-02-06 2025-02-18 5 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
DatasketchesFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
29 3 2025-01-29 2025-03-28 4 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
42 - 2025-02-12 2025-02-18 4 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
10 2 2025-03-25 2025-03-28 3 2 jmalkin.noreply@apache.org jmalkin.noreply@apache.org
ThetaSketchAggUnion.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate
104 6 2025-01-29 2025-02-27 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
15 - 2025-01-29 2025-02-27 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
_version.py
in python/src/datasketches_spark
8 1 2025-02-12 2025-02-18 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaSketchConstants.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
4 - 2025-02-25 2025-02-27 2 2 alexandersaydakov@users.nor... 786705+jmalkin@users.norepl...
CastAsBinary.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
36 4 2025-03-28 2025-03-28 1 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
KllDoublesSketchExpressions.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/expressions
237 17 2025-01-29 2025-02-20 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
kll.py
in python/src/datasketches_spark
68 12 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
in
MANIFEST.in
in python
19 - 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
57 3 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
cfg
1 - 2025-02-12 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchType.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/types
36 3 2025-01-29 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
__init__.py
in python/src/datasketches_spark
4 - 2025-02-06 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll
100 30 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchAggBuild.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate
107 10 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchAggMerge.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate
113 10 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll
17 - 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaSketchType.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/types
15 2 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
Files With Least Contributors (Top 28)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
KllDoublesSketchExpressions.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/expressions
237 17 2025-01-29 2025-02-20 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchAggMerge.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate
113 10 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchAggBuild.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate
107 10 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll
100 30 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
kll.py
in python/src/datasketches_spark
68 12 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
57 3 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllDoublesSketchType.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll/types
36 3 2025-01-29 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
in
MANIFEST.in
in python
19 - 2025-02-06 2025-02-18 4 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
KllFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/kll
17 - 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaSketchType.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/types
15 2 2025-01-29 2025-01-29 1 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
__init__.py
in python/src/datasketches_spark
4 - 2025-02-06 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
cfg
1 - 2025-02-12 2025-02-18 3 1 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaSketchAggUnion.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate
104 6 2025-01-29 2025-02-27 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
build.sbt
in root
77 - 2025-01-07 2025-02-18 9 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
common.py
in python/src/datasketches_spark
57 10 2025-02-06 2025-03-28 8 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
42 - 2025-02-12 2025-02-18 4 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
CastAsBinary.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
36 4 2025-03-28 2025-03-28 1 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
setup.py
in python
30 1 2025-02-06 2025-02-18 5 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
DatasketchesFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
29 3 2025-01-29 2025-03-28 4 2 786705+jmalkin@users.norepl... jmalkin.noreply@apache.org
ThetaFunctionRegistry.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
15 - 2025-01-29 2025-02-27 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
10 2 2025-03-25 2025-03-28 3 2 jmalkin.noreply@apache.org jmalkin.noreply@apache.org
_version.py
in python/src/datasketches_spark
8 1 2025-02-12 2025-02-18 3 2 786705+jmalkin@users.norepl... 786705+jmalkin@users.norepl...
ThetaSketchConstants.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
4 - 2025-02-25 2025-02-27 2 2 alexandersaydakov@users.nor... 786705+jmalkin@users.norepl...
ThetaSketchAggBuild.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate
122 6 2025-01-29 2025-04-02 5 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
functions.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta
69 20 2025-01-29 2025-04-02 5 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
ThetaSketchWrapper.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/types
28 4 2025-01-29 2025-04-02 2 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
DatasketchesScalaFunctionsBase.scala
in src/main/scala/org/apache/spark/sql/datasketches/common
10 1 2025-01-29 2025-04-02 2 3 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
ThetaExpressions.scala
in src/main/scala/org/apache/spark/sql/datasketches/theta/expressions
101 8 2025-01-29 2025-04-02 6 4 786705+jmalkin@users.norepl... alexandersaydakov@users.nor...
Correlations

File Size vs. Number of Changes: 28 points

src/main/scala/org/apache/spark/sql/datasketches/common/DatasketchesScalaFunctionsBase.scala x: 10 lines of code y: 2 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate/ThetaSketchAggBuild.scala x: 122 lines of code y: 5 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/expressions/ThetaExpressions.scala x: 101 lines of code y: 6 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/functions.scala x: 69 lines of code y: 5 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/types/ThetaSketchWrapper.scala x: 28 lines of code y: 2 # changes python/src/datasketches_spark/common.py x: 57 lines of code y: 8 # changes src/main/scala/org/apache/spark/sql/datasketches/common/CastAsBinary.scala x: 36 lines of code y: 1 # changes src/main/scala/org/apache/spark/sql/datasketches/common/DatasketchesFunctionRegistry.scala x: 29 lines of code y: 4 # changes src/main/scala/org/apache/spark/sql/datasketches/common/functions.scala x: 10 lines of code y: 3 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/ThetaFunctionRegistry.scala x: 15 lines of code y: 3 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/ThetaSketchConstants.scala x: 4 lines of code y: 2 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate/ThetaSketchAggUnion.scala x: 104 lines of code y: 3 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/expressions/KllDoublesSketchExpressions.scala x: 237 lines of code y: 4 # changes build.sbt x: 77 lines of code y: 9 # changes project/BuildUtils.scala x: 57 lines of code y: 4 # changes python/MANIFEST.in x: 19 lines of code y: 4 # changes python/pyproject.toml x: 42 lines of code y: 4 # changes python/setup.py x: 30 lines of code y: 5 # changes python/src/datasketches_spark/__init__.py x: 4 lines of code y: 3 # changes python/src/datasketches_spark/_version.py x: 8 lines of code y: 3 # changes python/src/datasketches_spark/kll.py x: 68 lines of code y: 4 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/types/KllDoublesSketchType.scala x: 36 lines of code y: 3 # changes version.cfg x: 1 lines of code y: 3 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/KllFunctionRegistry.scala x: 17 lines of code y: 1 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate/KllDoublesSketchAggBuild.scala x: 107 lines of code y: 1 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate/KllDoublesSketchAggMerge.scala x: 113 lines of code y: 1 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/functions.scala x: 100 lines of code y: 1 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/types/ThetaSketchType.scala x: 15 lines of code y: 1 # changes
9.0
# changes
  min: 1.0
  average: 3.39
  25th percentile: 2.0
  median: 3.0
  75th percentile: 4.0
  max: 9.0
0 237.0
lines of code
min: 1.0 | average: 54.14 | 25th percentile: 15.0 | median: 36.0 | 75th percentile: 94.25 | max: 237.0

Number of Contributors vs. Number of Changes: 28 points

src/main/scala/org/apache/spark/sql/datasketches/common/DatasketchesScalaFunctionsBase.scala x: 3 # contributors y: 2 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate/ThetaSketchAggBuild.scala x: 3 # contributors y: 5 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/expressions/ThetaExpressions.scala x: 4 # contributors y: 6 # changes python/src/datasketches_spark/common.py x: 2 # contributors y: 8 # changes src/main/scala/org/apache/spark/sql/datasketches/common/CastAsBinary.scala x: 2 # contributors y: 1 # changes src/main/scala/org/apache/spark/sql/datasketches/common/DatasketchesFunctionRegistry.scala x: 2 # contributors y: 4 # changes src/main/scala/org/apache/spark/sql/datasketches/common/functions.scala x: 2 # contributors y: 3 # changes src/main/scala/org/apache/spark/sql/datasketches/theta/ThetaSketchConstants.scala x: 2 # contributors y: 2 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/expressions/KllDoublesSketchExpressions.scala x: 1 # contributors y: 4 # changes build.sbt x: 2 # contributors y: 9 # changes python/setup.py x: 2 # contributors y: 5 # changes python/src/datasketches_spark/__init__.py x: 1 # contributors y: 3 # changes src/main/scala/org/apache/spark/sql/datasketches/kll/KllFunctionRegistry.scala x: 1 # contributors y: 1 # changes
9.0
# changes
  min: 1.0
  average: 3.39
  25th percentile: 2.0
  median: 3.0
  75th percentile: 4.0
  max: 9.0
0 4.0
# contributors
min: 1.0 | average: 1.79 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 2.0 | max: 4.0

Number of Contributors vs. File Size: 28 points

src/main/scala/org/apache/spark/sql/datasketches/common/DatasketchesScalaFunctionsBase.scala x: 3 # contributors y: 10 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate/ThetaSketchAggBuild.scala x: 3 # contributors y: 122 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/expressions/ThetaExpressions.scala x: 4 # contributors y: 101 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/functions.scala x: 3 # contributors y: 69 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/types/ThetaSketchWrapper.scala x: 3 # contributors y: 28 lines of code python/src/datasketches_spark/common.py x: 2 # contributors y: 57 lines of code src/main/scala/org/apache/spark/sql/datasketches/common/CastAsBinary.scala x: 2 # contributors y: 36 lines of code src/main/scala/org/apache/spark/sql/datasketches/common/DatasketchesFunctionRegistry.scala x: 2 # contributors y: 29 lines of code src/main/scala/org/apache/spark/sql/datasketches/common/functions.scala x: 2 # contributors y: 10 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/ThetaFunctionRegistry.scala x: 2 # contributors y: 15 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/ThetaSketchConstants.scala x: 2 # contributors y: 4 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/aggregate/ThetaSketchAggUnion.scala x: 2 # contributors y: 104 lines of code src/main/scala/org/apache/spark/sql/datasketches/kll/expressions/KllDoublesSketchExpressions.scala x: 1 # contributors y: 237 lines of code build.sbt x: 2 # contributors y: 77 lines of code project/BuildUtils.scala x: 1 # contributors y: 57 lines of code python/MANIFEST.in x: 1 # contributors y: 19 lines of code python/pyproject.toml x: 2 # contributors y: 42 lines of code python/setup.py x: 2 # contributors y: 30 lines of code python/src/datasketches_spark/__init__.py x: 1 # contributors y: 4 lines of code python/src/datasketches_spark/_version.py x: 2 # contributors y: 8 lines of code python/src/datasketches_spark/kll.py x: 1 # contributors y: 68 lines of code src/main/scala/org/apache/spark/sql/datasketches/kll/types/KllDoublesSketchType.scala x: 1 # contributors y: 36 lines of code version.cfg x: 1 # contributors y: 1 lines of code src/main/scala/org/apache/spark/sql/datasketches/kll/KllFunctionRegistry.scala x: 1 # contributors y: 17 lines of code src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate/KllDoublesSketchAggBuild.scala x: 1 # contributors y: 107 lines of code src/main/scala/org/apache/spark/sql/datasketches/kll/aggregate/KllDoublesSketchAggMerge.scala x: 1 # contributors y: 113 lines of code src/main/scala/org/apache/spark/sql/datasketches/kll/functions.scala x: 1 # contributors y: 100 lines of code src/main/scala/org/apache/spark/sql/datasketches/theta/types/ThetaSketchType.scala x: 1 # contributors y: 15 lines of code
237.0
lines of code
  min: 1.0
  average: 54.14
  25th percentile: 15.0
  median: 36.0
  75th percentile: 94.25
  max: 237.0
0 4.0
# contributors
min: 1.0 | average: 1.79 | 25th percentile: 1.0 | median: 2.0 | 75th percentile: 2.0 | max: 4.0