facebookresearch / cc_net
File Age

File age measurements show the distribution of file ages (days since the first commit) and the recency of file updates (days since the latest commit).

Summary
  • Number of files: 21
  • Daily file updates (only one update per file and date counted): 49
  • First update: 2019-10-30
  • Latest update: 2020-12-09
  • Days between first and latest update: 407 (58 weeks, estimated 290 working days)
  • Active days (at least one file change): 27
  • Data:
File Change History Overall
File Age Distribution Overall
Days since first update
  • There are 20 files with 3,780 lines of code in files.
    • 20 files that are 366+ days old (3,780 lines of code)
    • 0 files that are 181-365 days old (0 lines of code)
    • 0 files that are 91-180 days old (0 lines of code)
    • 0 files that are 31-90 days old (0 lines of code)
    • 0 files that are 1-30 days old (0 lines of code)
100% | 0% | 0% | 0% | 0%
Legend:
366+
181-365
91-180
31-90
1-30
Latest Change Distribution Overall
Days since last update
  • There are 20 files with 3,780 lines of code in files.
    • 20 files have been last changed 366+ days ago (3,780 lines of code)
    • 0 files have been last changed 181-365 days ago (0 lines of code)
    • 0 files have been last changed 91-180 days ago (0 lines of code)
    • 0 files have been last changed 31-90 days ago (0 lines of code)
    • 0 files have been last changed 1-30 days ago (0 lines of code)
100% | 0% | 0% | 0% | 0%
Legend:
366+
181-365
91-180
31-90
1-30
File Change History per File Extension
py, json, md, txt, gitignore, yml, toml
File Age Distribution per Extension
Days since first update
366+
181-365
91-180
31-90
1-30
py100% | 0% | 0% | 0% | 0%
toml100% | 0% | 0% | 0% | 0%
Latest Change Distribution per Extension
Days since last update
366+
181-365
91-180
31-90
1-30
py100% | 0% | 0% | 0% | 0%
toml100% | 0% | 0% | 0% | 0%
File Change History per Logical Decomposition
primary
primary (file age distribution)
Days since first update
366+
181-365
91-180
31-90
1-30
cc_net100% | 0% | 0% | 0% | 0%
cc_net/tools100% | 0% | 0% | 0% | 0%
ROOT100% | 0% | 0% | 0% | 0%
primary (latest change distribution)
Days since last update
366+
181-365
91-180
31-90
1-30
cc_net100% | 0% | 0% | 0% | 0%
cc_net/tools100% | 0% | 0% | 0% | 0%
ROOT100% | 0% | 0% | 0% | 0%
Oldest Files (Top 20)
File# lines# unitslast modified
(days ago)
created
(days ago)
# changes
jsonql.py
in cc_net
948 97 456 819 15
mine.py
in cc_net
464 20 456 819 13
dedup.py
in cc_net
360 24 456 819 7
minify.py
in cc_net
230 22 456 819 12
process_wet_file.py
in cc_net
197 18 450 819 11
flat_hash_set.py
in cc_net
160 28 456 819 4
setup.py
in root
43 - 450 819 12
__main__.py
in cc_net
6 1 456 819 4
__init__.py
in cc_net
1 - 456 819 2
expand_corpus.py
in cc_net/tools
230 15 456 680 6
regroup.py
in cc_net
81 5 456 680 3
get_wiki_cirrus.py
in cc_net
74 6 456 680 3
toml
pyproject.toml
in root
20 - 456 680 5
perplexity.py
in cc_net
284 26 456 672 3
execution.py
in cc_net
172 9 456 672 8
split_by_lang.py
in cc_net
117 10 456 672 3
make_dmoz_corpus.py
in cc_net/tools
55 4 456 672 3
tokenizer.py
in cc_net
55 6 456 672 2
text_normalizer.py
in cc_net
150 8 456 629 2
dl_cc_100.py
in cc_net/tools
133 6 456 456 1
Files Not Recently Changed (Top 20)
File# lines# unitslast modified
(days ago)
created
(days ago)
# changes
__init__.py
in cc_net
1 - 456 819 2
__main__.py
in cc_net
6 1 456 819 4
toml
pyproject.toml
in root
20 - 456 680 5
tokenizer.py
in cc_net
55 6 456 672 2
make_dmoz_corpus.py
in cc_net/tools
55 4 456 672 3
get_wiki_cirrus.py
in cc_net
74 6 456 680 3
regroup.py
in cc_net
81 5 456 680 3
split_by_lang.py
in cc_net
117 10 456 672 3
dl_cc_100.py
in cc_net/tools
133 6 456 456 1
text_normalizer.py
in cc_net
150 8 456 629 2
flat_hash_set.py
in cc_net
160 28 456 819 4
execution.py
in cc_net
172 9 456 672 8
minify.py
in cc_net
230 22 456 819 12
expand_corpus.py
in cc_net/tools
230 15 456 680 6
perplexity.py
in cc_net
284 26 456 672 3
dedup.py
in cc_net
360 24 456 819 7
mine.py
in cc_net
464 20 456 819 13
jsonql.py
in cc_net
948 97 456 819 15
setup.py
in root
43 - 450 819 12
process_wet_file.py
in cc_net
197 18 450 819 11
Most Recently Created Files (Top 20)
File# lines# unitslast modified
(days ago)
created
(days ago)
# changes
__init__.py
in cc_net/tools
1 -
dl_cc_100.py
in cc_net/tools
133 6 456 456 1
text_normalizer.py
in cc_net
150 8 456 629 2
perplexity.py
in cc_net
284 26 456 672 3
execution.py
in cc_net
172 9 456 672 8
split_by_lang.py
in cc_net
117 10 456 672 3
make_dmoz_corpus.py
in cc_net/tools
55 4 456 672 3
tokenizer.py
in cc_net
55 6 456 672 2
expand_corpus.py
in cc_net/tools
230 15 456 680 6
regroup.py
in cc_net
81 5 456 680 3
get_wiki_cirrus.py
in cc_net
74 6 456 680 3
toml
pyproject.toml
in root
20 - 456 680 5
jsonql.py
in cc_net
948 97 456 819 15
mine.py
in cc_net
464 20 456 819 13
dedup.py
in cc_net
360 24 456 819 7
minify.py
in cc_net
230 22 456 819 12
process_wet_file.py
in cc_net
197 18 450 819 11
flat_hash_set.py
in cc_net
160 28 456 819 4
setup.py
in root
43 - 450 819 12
__main__.py
in cc_net
6 1 456 819 4
Most Recently Changed Files (Top 20)
File# lines# unitslast modified
(days ago)
created
(days ago)
# changes
__init__.py
in cc_net/tools
1 -
process_wet_file.py
in cc_net
197 18 450 819 11
setup.py
in root
43 - 450 819 12
jsonql.py
in cc_net
948 97 456 819 15
mine.py
in cc_net
464 20 456 819 13
dedup.py
in cc_net
360 24 456 819 7
perplexity.py
in cc_net
284 26 456 672 3
expand_corpus.py
in cc_net/tools
230 15 456 680 6
minify.py
in cc_net
230 22 456 819 12
execution.py
in cc_net
172 9 456 672 8
flat_hash_set.py
in cc_net
160 28 456 819 4
text_normalizer.py
in cc_net
150 8 456 629 2
dl_cc_100.py
in cc_net/tools
133 6 456 456 1
split_by_lang.py
in cc_net
117 10 456 672 3
regroup.py
in cc_net
81 5 456 680 3
get_wiki_cirrus.py
in cc_net
74 6 456 680 3
make_dmoz_corpus.py
in cc_net/tools
55 4 456 672 3
tokenizer.py
in cc_net
55 6 456 672 2
toml
pyproject.toml
in root
20 - 456 680 5
__main__.py
in cc_net
6 1 456 819 4