openai / whisper
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 16 files with 2,985 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 1 file changed 21-50 times (465 lines of code)
    • 8 files changed 6-20 times (1,852 lines of code)
    • 7 files changed 1-5 times (668 lines of code)
0% | 0% | 15% | 62% | 22%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 16 files with 2,985 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 2 files changed by 11-25 contributors (707 lines of code)
    • 4 files changed by 6-10 contributors (1,039 lines of code)
    • 6 files changed by 2-5 contributors (683 lines of code)
    • 4 files changed by 1 contributor (556 lines of code)
0% | 23% | 34% | 22% | 18%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
py, md, json, yaml, txt, ipynb, in, svg, gitignore, gitattributes, toml
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
py0% | 0% | 15% | 63% | 21%
toml0% | 0% | 0% | 0% | 100%
in0% | 0% | 0% | 0% | 100%
ipynb0% | 0% | 0% | 0% | 0%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
whisper0% | 0% | 15% | 63% | 21%
ROOT0% | 0% | 0% | 0% | 100%
notebooks0% | 0% | 0% | 0% | 0%
Most Frequently Changed Files (Top 17)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
transcribe.py
in whisper
465 7 2022-09-21 2024-12-01 24 21 jongwook@nyu.edu 69023953+purfview@users.nor...
utils.py
in whisper
242 23 2022-09-26 2025-01-04 13 13 subanimehd@gmail.com cclauss@me.com
decoding.py
in whisper
545 37 2022-09-23 2023-11-06 13 8 jongwook@nyu.edu jongwook@openai.com
timing.py
in whisper
299 8 2023-03-06 2024-09-30 11 6 jongwook@openai.com jongwook@openai.com
version.py
in whisper
1 - 2023-01-17 2024-09-30 10 2 romain.rom1@gmail.com jongwook@nyu.edu
audio.py
in whisper
81 4 2022-09-23 2024-11-26 9 8 ram@rachum.com lowell@vaughnresearch.com
__init__.py
in whisper
114 3 2022-09-23 2024-09-30 9 7 drdaxxy@googlemail.com jongwook@openai.com
tokenizer.py
in whisper
317 24 2022-09-23 2023-11-06 9 5 jongwook@nyu.edu jongwook@openai.com
model.py
in whisper
253 23 2022-09-26 2024-09-30 8 5 michael@neuralmagic.com jongwook@openai.com
in
5 - 2023-01-17 2023-03-13 2 2 romain.rom1@gmail.com jongwook@openai.com
47 - 2023-03-06 2025-01-04 2 2 jongwook@openai.com cclauss@me.com
basic.py
in whisper/normalizers
60 4 2023-03-06 2025-01-04 2 2 jongwook@openai.com cclauss@me.com
__init__.py
in whisper/normalizers
2 - 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
__main__.py
in whisper
2 - 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
triton_ops.py
in whisper
85 3 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
english.py
in whisper/normalizers
467 9 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
LibriSpeech.ipynb
in notebooks
958 -
Files With Most Contributors (Top 17)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
transcribe.py
in whisper
465 7 2022-09-21 2024-12-01 24 21 jongwook@nyu.edu 69023953+purfview@users.nor...
utils.py
in whisper
242 23 2022-09-26 2025-01-04 13 13 subanimehd@gmail.com cclauss@me.com
decoding.py
in whisper
545 37 2022-09-23 2023-11-06 13 8 jongwook@nyu.edu jongwook@openai.com
audio.py
in whisper
81 4 2022-09-23 2024-11-26 9 8 ram@rachum.com lowell@vaughnresearch.com
__init__.py
in whisper
114 3 2022-09-23 2024-09-30 9 7 drdaxxy@googlemail.com jongwook@openai.com
timing.py
in whisper
299 8 2023-03-06 2024-09-30 11 6 jongwook@openai.com jongwook@openai.com
tokenizer.py
in whisper
317 24 2022-09-23 2023-11-06 9 5 jongwook@nyu.edu jongwook@openai.com
model.py
in whisper
253 23 2022-09-26 2024-09-30 8 5 michael@neuralmagic.com jongwook@openai.com
version.py
in whisper
1 - 2023-01-17 2024-09-30 10 2 romain.rom1@gmail.com jongwook@nyu.edu
basic.py
in whisper/normalizers
60 4 2023-03-06 2025-01-04 2 2 jongwook@openai.com cclauss@me.com
in
5 - 2023-01-17 2023-03-13 2 2 romain.rom1@gmail.com jongwook@openai.com
47 - 2023-03-06 2025-01-04 2 2 jongwook@openai.com cclauss@me.com
__main__.py
in whisper
2 - 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
triton_ops.py
in whisper
85 3 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
__init__.py
in whisper/normalizers
2 - 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
english.py
in whisper/normalizers
467 9 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
LibriSpeech.ipynb
in notebooks
958 -
Files With Least Contributors (Top 17)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
LibriSpeech.ipynb
in notebooks
958 -
english.py
in whisper/normalizers
467 9 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
triton_ops.py
in whisper
85 3 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
__main__.py
in whisper
2 - 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
__init__.py
in whisper/normalizers
2 - 2023-03-06 2023-03-06 1 1 jongwook@openai.com jongwook@openai.com
basic.py
in whisper/normalizers
60 4 2023-03-06 2025-01-04 2 2 jongwook@openai.com cclauss@me.com
47 - 2023-03-06 2025-01-04 2 2 jongwook@openai.com cclauss@me.com
in
5 - 2023-01-17 2023-03-13 2 2 romain.rom1@gmail.com jongwook@openai.com
version.py
in whisper
1 - 2023-01-17 2024-09-30 10 2 romain.rom1@gmail.com jongwook@nyu.edu
tokenizer.py
in whisper
317 24 2022-09-23 2023-11-06 9 5 jongwook@nyu.edu jongwook@openai.com
model.py
in whisper
253 23 2022-09-26 2024-09-30 8 5 michael@neuralmagic.com jongwook@openai.com
timing.py
in whisper
299 8 2023-03-06 2024-09-30 11 6 jongwook@openai.com jongwook@openai.com
__init__.py
in whisper
114 3 2022-09-23 2024-09-30 9 7 drdaxxy@googlemail.com jongwook@openai.com
decoding.py
in whisper
545 37 2022-09-23 2023-11-06 13 8 jongwook@nyu.edu jongwook@openai.com
audio.py
in whisper
81 4 2022-09-23 2024-11-26 9 8 ram@rachum.com lowell@vaughnresearch.com
utils.py
in whisper
242 23 2022-09-26 2025-01-04 13 13 subanimehd@gmail.com cclauss@me.com
transcribe.py
in whisper
465 7 2022-09-21 2024-12-01 24 21 jongwook@nyu.edu 69023953+purfview@users.nor...
Correlations

File Size vs. Number of Changes: 16 points

pyproject.toml x: 47 lines of code y: 2 # changes whisper/normalizers/basic.py x: 60 lines of code y: 2 # changes whisper/utils.py x: 242 lines of code y: 13 # changes whisper/transcribe.py x: 465 lines of code y: 24 # changes whisper/audio.py x: 81 lines of code y: 9 # changes whisper/version.py x: 1 lines of code y: 10 # changes whisper/__init__.py x: 114 lines of code y: 9 # changes whisper/model.py x: 253 lines of code y: 8 # changes whisper/timing.py x: 299 lines of code y: 11 # changes whisper/decoding.py x: 545 lines of code y: 13 # changes whisper/tokenizer.py x: 317 lines of code y: 9 # changes MANIFEST.in x: 5 lines of code y: 2 # changes whisper/__main__.py x: 2 lines of code y: 1 # changes whisper/normalizers/english.py x: 467 lines of code y: 1 # changes whisper/triton_ops.py x: 85 lines of code y: 1 # changes
24.0
# changes
  min: 1.0
  average: 7.25
  25th percentile: 1.25
  median: 8.5
  75th percentile: 10.75
  max: 24.0
0 545.0
lines of code
min: 1.0 | average: 186.56 | 25th percentile: 15.5 | median: 99.5 | 75th percentile: 312.5 | max: 545.0

Number of Contributors vs. Number of Changes: 16 points

pyproject.toml x: 2 # contributors y: 2 # changes whisper/utils.py x: 13 # contributors y: 13 # changes whisper/transcribe.py x: 21 # contributors y: 24 # changes whisper/audio.py x: 8 # contributors y: 9 # changes whisper/version.py x: 2 # contributors y: 10 # changes whisper/__init__.py x: 7 # contributors y: 9 # changes whisper/model.py x: 5 # contributors y: 8 # changes whisper/timing.py x: 6 # contributors y: 11 # changes whisper/decoding.py x: 8 # contributors y: 13 # changes whisper/tokenizer.py x: 5 # contributors y: 9 # changes whisper/__main__.py x: 1 # contributors y: 1 # changes
24.0
# changes
  min: 1.0
  average: 7.25
  25th percentile: 1.25
  median: 8.5
  75th percentile: 10.75
  max: 24.0
0 21.0
# contributors
min: 1.0 | average: 5.31 | 25th percentile: 1.25 | median: 3.5 | 75th percentile: 7.75 | max: 21.0

Number of Contributors vs. File Size: 16 points

pyproject.toml x: 2 # contributors y: 47 lines of code whisper/normalizers/basic.py x: 2 # contributors y: 60 lines of code whisper/utils.py x: 13 # contributors y: 242 lines of code whisper/transcribe.py x: 21 # contributors y: 465 lines of code whisper/audio.py x: 8 # contributors y: 81 lines of code whisper/version.py x: 2 # contributors y: 1 lines of code whisper/__init__.py x: 7 # contributors y: 114 lines of code whisper/model.py x: 5 # contributors y: 253 lines of code whisper/timing.py x: 6 # contributors y: 299 lines of code whisper/decoding.py x: 8 # contributors y: 545 lines of code whisper/tokenizer.py x: 5 # contributors y: 317 lines of code MANIFEST.in x: 2 # contributors y: 5 lines of code whisper/__main__.py x: 1 # contributors y: 2 lines of code whisper/normalizers/english.py x: 1 # contributors y: 467 lines of code whisper/triton_ops.py x: 1 # contributors y: 85 lines of code
545.0
lines of code
  min: 1.0
  average: 186.56
  25th percentile: 15.5
  median: 99.5
  75th percentile: 312.5
  max: 545.0
0 21.0
# contributors
min: 1.0 | average: 5.31 | 25th percentile: 1.25 | median: 3.5 | 75th percentile: 7.75 | max: 21.0