huggingface / datablations
File Age & Freshness

File age measurements show two distributions: file age (days since a file's first commit) and file freshness (days since its latest commit).
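The two measures defined above can be sketched in a few lines of Python. This is a minimal illustration, not the report generator's actual code: the function names and the reference date are assumptions, and only the two commit dates are taken from the hub_sync.py row in the tables of this report.

```python
from datetime import date


def file_age_days(first_commit: date, today: date) -> int:
    """Age: days elapsed since the file's first commit."""
    return (today - first_commit).days


def file_freshness_days(latest_commit: date, today: date) -> int:
    """Freshness: days elapsed since the file's latest commit."""
    return (today - latest_commit).days


# Reference date is an assumption chosen for illustration only.
today = date(2024, 7, 1)

# Dates for hub_sync.py as listed in this report.
age = file_age_days(date(2023, 5, 23), today)
freshness = file_freshness_days(date(2023, 6, 4), today)

print(age, freshness)
```

Freshness can never exceed age, because the latest commit is never earlier than the first one.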

Summary
File Change History Overall
File Age Distribution Overall
Days since first update
  • There are 22 files with 5,758 lines of code.
    • 22 files that are 366+ days old (5,758 lines of code)
    • 0 files that are 181-365 days old (0 lines of code)
    • 0 files that are 91-180 days old (0 lines of code)
    • 0 files that are 31-90 days old (0 lines of code)
    • 0 files that are 1-30 days old (0 lines of code)
Distribution by age bucket (366+ | 181-365 | 91-180 | 31-90 | 1-30 days): 100% | 0% | 0% | 0% | 0%
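The bucketing used in the list above can be sketched as follows. The bucket boundaries come from this report; weighting the percentages by lines of code (rather than by file count) and treating a 0-day-old file as part of the 1-30 bucket are assumptions, as are all function names.

```python
LABELS = ["366+", "181-365", "91-180", "31-90", "1-30"]


def bucket(days: int) -> str:
    """Map a day count to one of the report's five buckets."""
    if days >= 366:
        return "366+"
    if days >= 181:
        return "181-365"
    if days >= 91:
        return "91-180"
    if days >= 31:
        return "31-90"
    return "1-30"  # assumption: 0 days is also counted here


def distribution(files):
    """files: iterable of (days, lines_of_code) pairs.

    Returns the percentage of lines of code in each bucket
    (weighting by lines of code is an assumption).
    """
    totals = {label: 0 for label in LABELS}
    for days, lines in files:
        totals[bucket(days)] += lines
    total = sum(totals.values())
    return {
        label: (100.0 * count / total if total else 0.0)
        for label, count in totals.items()
    }


# Two hypothetical (days, lines) pairs, both older than a year.
result = distribution([(405, 153), (400, 278)])
print(" | ".join(f"{result[label]:.0f}%" for label in LABELS))
```

When every file is older than a year, as in this repository, all of the weight falls in the first bucket.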

File Freshness Distribution Overall
Days since last update
  • There are 22 files with 5,758 lines of code.
    • 22 files have been last changed 366+ days ago (5,758 lines of code)
    • 0 files have been last changed 181-365 days ago (0 lines of code)
    • 0 files have been last changed 91-180 days ago (0 lines of code)
    • 0 files have been last changed 31-90 days ago (0 lines of code)
    • 0 files have been last changed 1-30 days ago (0 lines of code)
Distribution by freshness bucket (366+ | 181-365 | 91-180 | 31-90 | 1-30 days): 100% | 0% | 0% | 0% | 0%

File Change History per File Extension
txt, sh, ipynb, py, md, gitignore, json
File Age Distribution per Extension
Days since first update
Extension | 366+ | 181-365 | 91-180 | 31-90 | 1-30
ipynb | 100% | 0% | 0% | 0% | 0%
py | 100% | 0% | 0% | 0% | 0%
File Freshness Distribution per Extension
Days since last update
Extension | 366+ | 181-365 | 91-180 | 31-90 | 1-30
ipynb | 100% | 0% | 0% | 0% | 0%
py | 100% | 0% | 0% | 0% | 0%
File Change History per Logical Decomposition
primary
primary (file age distribution)
Days since first update
Component | 366+ | 181-365 | 91-180 | 31-90 | 1-30
filtering_notebooks | 100% | 0% | 0% | 0% | 0%
plotstables | 100% | 0% | 0% | 0% | 0%
utils | 100% | 0% | 0% | 0% | 0%
filtering | 100% | 0% | 0% | 0% | 0%
training | 100% | 0% | 0% | 0% | 0%
primary (file freshness distribution)
Days since last update
Component | 366+ | 181-365 | 91-180 | 31-90 | 1-30
filtering_notebooks | 100% | 0% | 0% | 0% | 0%
plotstables | 100% | 0% | 0% | 0% | 0%
utils | 100% | 0% | 0% | 0% | 0%
filtering | 100% | 0% | 0% | 0% | 0%
training | 100% | 0% | 0% | 0% | 0%
Oldest Files (Top 22)
File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
(name missing) | 71 | 4 | 2022-12-19 | 2023-01-24 | 3 | 2 | teven.lescao@gmail.com | ola.piktus@gmail.com
blindspots.ipynb (in filtering_notebooks) | 2462 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
mup.py (in training) | 278 | 4 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hub_sync.py (in utils) | 153 | 9 | 2023-05-23 | 2023-06-04 | 5 | 3 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
add_dedup_info.py (in filtering/deduplication) | 123 | 5 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 79 | 3 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hf_dataset_to_file.py (in filtering/deduplication) | 71 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 46 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_roots_sample.py (in filtering/deduplication) | 46 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_dataset_sample.py (in filtering/deduplication) | 43 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 35 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
dedup_oscar.py (in filtering/deduplication) | 26 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
filter_oscar_jsonl.py (in filtering/deduplication) | 23 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_rust_format.py (in filtering/deduplication) | 22 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
suffix_dedup.py (in filtering/deduplication) | 22 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
download_oscar.py (in filtering/deduplication) | 15 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hub_auth.py (in utils) | 12 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
cleandirs.py (in utils) | 10 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_dataset.py (in filtering/deduplication) | 9 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
repetition.ipynb (in plotstables) | 1376 | - | 2023-05-24 | 2023-05-31 | 2 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
filtering.ipynb (in plotstables) | 800 | - | 2023-05-24 | 2023-05-24 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
(name missing) | 36 | 4 | 2023-06-18 | 2023-06-18 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
Files Not Recently Changed (Top 22)
File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
(name missing) | 71 | 4 | 2022-12-19 | 2023-01-24 | 3 | 2 | teven.lescao@gmail.com | ola.piktus@gmail.com
save_dataset.py (in filtering/deduplication) | 9 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
cleandirs.py (in utils) | 10 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hub_auth.py (in utils) | 12 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
download_oscar.py (in filtering/deduplication) | 15 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
suffix_dedup.py (in filtering/deduplication) | 22 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_rust_format.py (in filtering/deduplication) | 22 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
filter_oscar_jsonl.py (in filtering/deduplication) | 23 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
dedup_oscar.py (in filtering/deduplication) | 26 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 35 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_dataset_sample.py (in filtering/deduplication) | 43 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_roots_sample.py (in filtering/deduplication) | 46 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 46 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hf_dataset_to_file.py (in filtering/deduplication) | 71 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 79 | 3 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
add_dedup_info.py (in filtering/deduplication) | 123 | 5 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
mup.py (in training) | 278 | 4 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
filtering.ipynb (in plotstables) | 800 | - | 2023-05-24 | 2023-05-24 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
blindspots.ipynb (in filtering_notebooks) | 2462 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
repetition.ipynb (in plotstables) | 1376 | - | 2023-05-24 | 2023-05-31 | 2 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
hub_sync.py (in utils) | 153 | 9 | 2023-05-23 | 2023-06-04 | 5 | 3 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
(name missing) | 36 | 4 | 2023-06-18 | 2023-06-18 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
Most Recently Created Files (Top 22)
File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
(name missing) | 36 | 4 | 2023-06-18 | 2023-06-18 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
repetition.ipynb (in plotstables) | 1376 | - | 2023-05-24 | 2023-05-31 | 2 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
filtering.ipynb (in plotstables) | 800 | - | 2023-05-24 | 2023-05-24 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
blindspots.ipynb (in filtering_notebooks) | 2462 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
mup.py (in training) | 278 | 4 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hub_sync.py (in utils) | 153 | 9 | 2023-05-23 | 2023-06-04 | 5 | 3 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
add_dedup_info.py (in filtering/deduplication) | 123 | 5 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 79 | 3 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hf_dataset_to_file.py (in filtering/deduplication) | 71 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 46 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_roots_sample.py (in filtering/deduplication) | 46 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_dataset_sample.py (in filtering/deduplication) | 43 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 35 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
dedup_oscar.py (in filtering/deduplication) | 26 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
filter_oscar_jsonl.py (in filtering/deduplication) | 23 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_rust_format.py (in filtering/deduplication) | 22 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
suffix_dedup.py (in filtering/deduplication) | 22 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
download_oscar.py (in filtering/deduplication) | 15 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hub_auth.py (in utils) | 12 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
cleandirs.py (in utils) | 10 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_dataset.py (in filtering/deduplication) | 9 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 71 | 4 | 2022-12-19 | 2023-01-24 | 3 | 2 | teven.lescao@gmail.com | ola.piktus@gmail.com
Most Recently Changed Files (Top 22)
File | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
(name missing) | 36 | 4 | 2023-06-18 | 2023-06-18 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
hub_sync.py (in utils) | 153 | 9 | 2023-05-23 | 2023-06-04 | 5 | 3 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
repetition.ipynb (in plotstables) | 1376 | - | 2023-05-24 | 2023-05-31 | 2 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
blindspots.ipynb (in filtering_notebooks) | 2462 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
filtering.ipynb (in plotstables) | 800 | - | 2023-05-24 | 2023-05-24 | 1 | 1 | n.muennighoff@gmail.com | n.muennighoff@gmail.com
mup.py (in training) | 278 | 4 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
add_dedup_info.py (in filtering/deduplication) | 123 | 5 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 79 | 3 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hf_dataset_to_file.py (in filtering/deduplication) | 71 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 46 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_roots_sample.py (in filtering/deduplication) | 46 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_dataset_sample.py (in filtering/deduplication) | 43 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 35 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
dedup_oscar.py (in filtering/deduplication) | 26 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
filter_oscar_jsonl.py (in filtering/deduplication) | 23 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_rust_format.py (in filtering/deduplication) | 22 | 1 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
suffix_dedup.py (in filtering/deduplication) | 22 | 2 | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
download_oscar.py (in filtering/deduplication) | 15 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
hub_auth.py (in utils) | 12 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
cleandirs.py (in utils) | 10 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
save_dataset.py (in filtering/deduplication) | 9 | - | 2023-05-23 | 2023-05-24 | 2 | 2 | n.muennighoff@gmail.com | teven.lescao@gmail.com
(name missing) | 71 | 4 | 2022-12-19 | 2023-01-24 | 3 | 2 | teven.lescao@gmail.com | ola.piktus@gmail.com