facebookresearch / cc_net
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

File Change Frequency Overall
File Change Frequency Overall
The number of recorded file updates
  • There are 20 files with 3,780 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 0 files changed 21-50 times (0 lines of code)
    • 8 files changed 6-20 times (2,644 lines of code)
    • 12 files changed 1-5 times (1,136 lines of code)
0% | 0% | 0% | 69% | 30%
Legend:
101+
51-100
21-50
6-20
1-5

Detailed data...

File Change Frequency per File Extension
py, json, md, txt, gitignore, yml, toml
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
py0% | 0% | 0% | 70% | 29%
toml0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
cc_net0% | 0% | 0% | 71% | 28%
cc_net/tools0% | 0% | 0% | 55% | 44%
ROOT0% | 0% | 0% | 68% | 31%
Most Frequently Changed Files (Top 20)

See data for all files...

File# lines# unitslast modified
(days ago)
created
(days ago)
# changes
jsonql.py
in cc_net
948 97 456 819 15
mine.py
in cc_net
464 20 456 819 13
setup.py
in root
43 - 450 819 12
minify.py
in cc_net
230 22 456 819 12
process_wet_file.py
in cc_net
197 18 450 819 11
execution.py
in cc_net
172 9 456 672 8
dedup.py
in cc_net
360 24 456 819 7
expand_corpus.py
in cc_net/tools
230 15 456 680 6
toml
pyproject.toml
in root
20 - 456 680 5
__main__.py
in cc_net
6 1 456 819 4
flat_hash_set.py
in cc_net
160 28 456 819 4
make_dmoz_corpus.py
in cc_net/tools
55 4 456 672 3
get_wiki_cirrus.py
in cc_net
74 6 456 680 3
regroup.py
in cc_net
81 5 456 680 3
split_by_lang.py
in cc_net
117 10 456 672 3
perplexity.py
in cc_net
284 26 456 672 3
__init__.py
in cc_net
1 - 456 819 2
tokenizer.py
in cc_net
55 6 456 672 2
text_normalizer.py
in cc_net
150 8 456 629 2
dl_cc_100.py
in cc_net/tools
133 6 456 456 1