deepseek-ai / smallpond
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 25 files with 8,845 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 0 files changed 21-50 times (0 lines of code)
    • 0 files changed 6-20 times (0 lines of code)
    • 25 files changed 1-5 times (8,845 lines of code)
0% | 0% | 0% | 0% | 100%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 25 files with 8,845 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 0 files changed by 11-25 contributors (0 lines of code)
    • 0 files changed by 6-10 contributors (0 lines of code)
    • 0 files changed by 2-5 contributors (0 lines of code)
    • 25 files changed by 1 contributor (8,845 lines of code)
0% | 0% | 0% | 0% | 100%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
py, rst, txt, bat, in, gitignore, md, json, toml, yaml
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
py0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 0%
toml0% | 0% | 0% | 0% | 0%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
smallpond0% | 0% | 0% | 0% | 100%
benchmarks0% | 0% | 0% | 0% | 100%
ROOT0% | 0% | 0% | 0% | 0%
Most Frequently Changed Files (Top 35)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
copy_table.py
in smallpond/contrib
14 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
log_dataset.py
in smallpond/contrib
26 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
optimizer.py
in smallpond/logical
33 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
worker.py
in smallpond
63 - 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
common.py
in smallpond
66 10 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
72 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
79 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
filesystem.py
in smallpond/io
92 6 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
93 3 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
warc.py
in smallpond/contrib
97 5 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
utility.py
in smallpond
139 17 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
udf.py
in smallpond/logical
179 21 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
manager.py
in smallpond/execution
217 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
planner.py
in smallpond/logical
224 16 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
executor.py
in smallpond/execution
225 25 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
session.py
in smallpond
263 13 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
arrow.py
in smallpond/io
293 17 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
dataframe.py
in smallpond
296 38 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
322 8 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
workqueue.py
in smallpond/execution
342 43 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
driver.py
in smallpond/execution
366 13 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
dataset.py
in smallpond/logical
700 65 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
scheduler.py
in smallpond/execution
971 93 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
node.py
in smallpond/logical
1198 106 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
task.py
in smallpond/execution
2475 241 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
in
1 -
__init__.py
in smallpond/execution
1 -
__init__.py
in smallpond/contrib
1 -
__init__.py
in smallpond/logical
1 -
__init__.py
in smallpond/io
1 -
__init__.py
in smallpond/platform
21 1
mpi.py
in smallpond/platform
29 2
__init__.py
in smallpond
34 1
base.py
in smallpond/platform
60 11
75 -
Files With Most Contributors (Top 35)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
79 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
72 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
322 8 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
93 3 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
arrow.py
in smallpond/io
293 17 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
filesystem.py
in smallpond/io
92 6 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
node.py
in smallpond/logical
1198 106 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
optimizer.py
in smallpond/logical
33 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
udf.py
in smallpond/logical
179 21 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
planner.py
in smallpond/logical
224 16 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
dataset.py
in smallpond/logical
700 65 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
utility.py
in smallpond
139 17 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
session.py
in smallpond
263 13 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
copy_table.py
in smallpond/contrib
14 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
warc.py
in smallpond/contrib
97 5 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
log_dataset.py
in smallpond/contrib
26 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
manager.py
in smallpond/execution
217 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
executor.py
in smallpond/execution
225 25 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
scheduler.py
in smallpond/execution
971 93 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
driver.py
in smallpond/execution
366 13 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
workqueue.py
in smallpond/execution
342 43 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
task.py
in smallpond/execution
2475 241 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
dataframe.py
in smallpond
296 38 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
common.py
in smallpond
66 10 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
worker.py
in smallpond
63 - 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
__init__.py
in smallpond/io
1 -
__init__.py
in smallpond/logical
1 -
__init__.py
in smallpond
34 1
base.py
in smallpond/platform
60 11
mpi.py
in smallpond/platform
29 2
__init__.py
in smallpond/platform
21 1
__init__.py
in smallpond/contrib
1 -
__init__.py
in smallpond/execution
1 -
in
1 -
75 -
Files With Least Contributors (Top 35)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
75 -
base.py
in smallpond/platform
60 11
__init__.py
in smallpond
34 1
mpi.py
in smallpond/platform
29 2
__init__.py
in smallpond/platform
21 1
__init__.py
in smallpond/io
1 -
__init__.py
in smallpond/logical
1 -
__init__.py
in smallpond/contrib
1 -
__init__.py
in smallpond/execution
1 -
in
1 -
task.py
in smallpond/execution
2475 241 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
node.py
in smallpond/logical
1198 106 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
scheduler.py
in smallpond/execution
971 93 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
dataset.py
in smallpond/logical
700 65 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
driver.py
in smallpond/execution
366 13 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
workqueue.py
in smallpond/execution
342 43 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
322 8 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
dataframe.py
in smallpond
296 38 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
arrow.py
in smallpond/io
293 17 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
session.py
in smallpond
263 13 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
executor.py
in smallpond/execution
225 25 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
planner.py
in smallpond/logical
224 16 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
manager.py
in smallpond/execution
217 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
udf.py
in smallpond/logical
179 21 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
utility.py
in smallpond
139 17 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
warc.py
in smallpond/contrib
97 5 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
93 3 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
filesystem.py
in smallpond/io
92 6 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
79 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
72 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
common.py
in smallpond
66 10 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
worker.py
in smallpond
63 - 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
optimizer.py
in smallpond/logical
33 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
log_dataset.py
in smallpond/contrib
26 4 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
copy_table.py
in smallpond/contrib
14 2 2025-03-05 2025-03-05 1 1 runji@deepseek.com runji@deepseek.com
Correlations

File Size vs. Number of Changes: 25 points

benchmarks/file_io_benchmark.py x: 72 lines of code y: 1 # changes benchmarks/gray_sort_benchmark.py x: 322 lines of code y: 1 # changes benchmarks/hash_partition_benchmark.py x: 79 lines of code y: 1 # changes benchmarks/urls_sort_benchmark.py x: 93 lines of code y: 1 # changes smallpond/common.py x: 66 lines of code y: 1 # changes smallpond/contrib/copy_table.py x: 14 lines of code y: 1 # changes smallpond/contrib/log_dataset.py x: 26 lines of code y: 1 # changes smallpond/contrib/warc.py x: 97 lines of code y: 1 # changes smallpond/dataframe.py x: 296 lines of code y: 1 # changes smallpond/execution/driver.py x: 366 lines of code y: 1 # changes smallpond/execution/executor.py x: 225 lines of code y: 1 # changes smallpond/execution/manager.py x: 217 lines of code y: 1 # changes smallpond/execution/scheduler.py x: 971 lines of code y: 1 # changes smallpond/execution/task.py x: 2475 lines of code y: 1 # changes smallpond/execution/workqueue.py x: 342 lines of code y: 1 # changes smallpond/io/arrow.py x: 293 lines of code y: 1 # changes smallpond/logical/dataset.py x: 700 lines of code y: 1 # changes smallpond/logical/node.py x: 1198 lines of code y: 1 # changes smallpond/logical/optimizer.py x: 33 lines of code y: 1 # changes smallpond/logical/udf.py x: 179 lines of code y: 1 # changes smallpond/session.py x: 263 lines of code y: 1 # changes smallpond/utility.py x: 139 lines of code y: 1 # changes
1.0
# changes
  min: 1.0
  average: 1.0
  25th percentile: 1.0
  median: 1.0
  75th percentile: 1.0
  max: 1.0
0 2475.0
lines of code
min: 14.0 | average: 353.8 | 25th percentile: 75.5 | median: 217.0 | 75th percentile: 332.0 | max: 2475.0

Number of Contributors vs. Number of Changes: 25 points

benchmarks/file_io_benchmark.py x: 1 # contributors y: 1 # changes
1.0
# changes
  min: 1.0
  average: 1.0
  25th percentile: 1.0
  median: 1.0
  75th percentile: 1.0
  max: 1.0
0 1.0
# contributors
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0

Number of Contributors vs. File Size: 25 points

benchmarks/file_io_benchmark.py x: 1 # contributors y: 72 lines of code benchmarks/gray_sort_benchmark.py x: 1 # contributors y: 322 lines of code benchmarks/hash_partition_benchmark.py x: 1 # contributors y: 79 lines of code benchmarks/urls_sort_benchmark.py x: 1 # contributors y: 93 lines of code smallpond/common.py x: 1 # contributors y: 66 lines of code smallpond/contrib/copy_table.py x: 1 # contributors y: 14 lines of code smallpond/contrib/log_dataset.py x: 1 # contributors y: 26 lines of code smallpond/dataframe.py x: 1 # contributors y: 296 lines of code smallpond/execution/driver.py x: 1 # contributors y: 366 lines of code smallpond/execution/executor.py x: 1 # contributors y: 225 lines of code smallpond/execution/manager.py x: 1 # contributors y: 217 lines of code smallpond/execution/scheduler.py x: 1 # contributors y: 971 lines of code smallpond/execution/task.py x: 1 # contributors y: 2475 lines of code smallpond/execution/workqueue.py x: 1 # contributors y: 342 lines of code smallpond/logical/dataset.py x: 1 # contributors y: 700 lines of code smallpond/logical/node.py x: 1 # contributors y: 1198 lines of code smallpond/logical/optimizer.py x: 1 # contributors y: 33 lines of code smallpond/logical/udf.py x: 1 # contributors y: 179 lines of code smallpond/session.py x: 1 # contributors y: 263 lines of code smallpond/utility.py x: 1 # contributors y: 139 lines of code
2475.0
lines of code
  min: 14.0
  average: 353.8
  25th percentile: 75.5
  median: 217.0
  75th percentile: 332.0
  max: 2475.0
0 1.0
# contributors
min: 1.0 | average: 1.0 | 25th percentile: 1.0 | median: 1.0 | 75th percentile: 1.0 | max: 1.0