huggingface / datasets
File Change Frequency

File change frequency (churn) shows the distribution of file updates (days with at least one commit).

Overview
File Change Frequency Overall
  • There are 127 files with 20,388 lines of code.
    • 6 files changed more than 100 times (4,182 lines of code)
    • 7 files changed 51-100 times (6,563 lines of code)
    • 19 files changed 21-50 times (3,680 lines of code)
    • 42 files changed 6-20 times (4,264 lines of code)
    • 53 files changed 1-5 times (1,699 lines of code)
20% | 32% | 18% | 20% | 8%
Legend:
101+
51-100
21-50
6-20
1-5

explore: grouped by folders | grouped by update frequency | data
Contributors Count Frequency Overall
  • There are 127 files with 20,388 lines of code.
    • 8 files changed by more than 25 contributors (8,723 lines of code)
    • 19 files changed by 11-25 contributors (4,904 lines of code)
    • 24 files changed by 6-10 contributors (2,880 lines of code)
    • 49 files changed by 2-5 contributors (3,404 lines of code)
    • 27 files changed by 1 contributor (477 lines of code)
42% | 24% | 14% | 16% | 2%
Legend:
26+
11-25
6-10
2-5
1

explore: grouped by folders | grouped by contributors count | data
File Change Frequency per File Extension
py, mdx, yaml, json, md, gitignore, toml, txt, sh
File Change Frequency per Extension
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
py20% | 32% | 18% | 20% | 7%
toml0% | 0% | 0% | 100% | 0%
yaml0% | 0% | 0% | 0% | 100%
File Change Frequency per Logical Decomposition
primary
primary (file change frequency)
The number of recorded file updates
101+
51-100
21-50
6-20
1-5
src20% | 33% | 18% | 21% | 6%
ROOT85% | 0% | 0% | 14% | 0%
benchmarks0% | 0% | 0% | 8% | 91%
utils0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 50)

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
setup.py
in root
131 - 2020-04-14 2025-06-19 278 57 thomwolf@users.noreply.gith... 49127578+tytodd@users.norep...
load.py
in src/datasets
952 26 2020-09-10 2025-06-25 181 46 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
builder.py
in src/datasets
1185 50 2020-09-10 2025-06-25 144 33 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
dataset_dict.py
in src/datasets
1075 58 2020-09-10 2025-06-25 134 42 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets
32 - 2020-09-10 2025-06-17 128 13 lhoest.q@gmail.com 42851186+lhoestq@users.nore...
file_utils.py
in src/datasets/utils
807 69 2020-09-10 2025-06-09 105 26 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
iterable_dataset.py
in src/datasets
2750 234 2021-06-23 2025-06-25 96 37 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
features.py
in src/datasets/features
1354 96 2021-10-13 2025-06-25 91 32 8515462+albertvillanova@use... 42851186+lhoestq@users.nore...
py_utils.py
in src/datasets/utils
402 26 2020-09-10 2025-06-09 75 19 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
arrow_writer.py
in src/datasets
469 22 2020-09-10 2025-04-28 72 26 thomwolf@users.noreply.gith... 35225576+afuetterer@users.n...
config.py
in src/datasets
176 - 2021-02-10 2025-06-25 69 20 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
table.py
in src/datasets
942 131 2021-03-26 2025-06-25 63 21 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
data_files.py
in src/datasets
470 27 2021-10-11 2025-05-12 58 17 42851186+lhoestq@users.nore... matthew@protopia.ai
inspect.py
in src/datasets
149 5 2020-09-10 2025-06-09 45 14 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
audio.py
in src/datasets/features
163 7 2021-10-13 2025-06-19 43 12 8515462+albertvillanova@use... 49127578+tytodd@users.norep...
info.py
in src/datasets
254 17 2020-09-10 2025-06-25 42 16 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
search.py
in src/datasets
393 33 2020-09-10 2025-06-25 41 25 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
metadata.py
in src/datasets/utils
180 8 2021-04-26 2025-03-05 40 16 42851186+lhoestq@users.nore... cyyever@outlook.com
formatting.py
in src/datasets/formatting
464 74 2021-02-05 2025-06-09 39 14 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
streaming.py
in src/datasets
83 2 2021-06-23 2025-06-09 37 7 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
streaming_download_manager.py
in src/datasets/download
106 11 2022-05-25 2025-03-05 36 9 8515462+albertvillanova@use... cyyever@outlook.com
arrow_reader.py
in src/datasets
309 26 2020-09-10 2025-03-28 34 16 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
image.py
in src/datasets/features
250 11 2021-12-06 2025-06-19 33 10 mario@huggingface.co 49127578+tytodd@users.norep...
json.py
in src/datasets/packaged_modules/json
141 8 2021-01-19 2024-06-19 31 7 42851186+lhoestq@users.nore... 8515462+albertvillanova@use...
fingerprint.py
in src/datasets
258 22 2020-09-10 2025-03-05 31 15 thomwolf@users.noreply.gith... cyyever@outlook.com
splits.py
in src/datasets
260 42 2020-09-10 2025-03-05 30 13 thomwolf@users.noreply.gith... cyyever@outlook.com
__init__.py
in src/datasets/packaged_modules
85 1 2021-01-19 2025-03-18 28 13 42851186+lhoestq@users.nore... yabran.muvdi@gmail.com
csv.py
in src/datasets/packaged_modules/csv
164 6 2021-01-19 2025-03-05 27 10 42851186+lhoestq@users.nore... cyyever@outlook.com
json.py
in src/datasets/io
148 6 2021-03-18 2024-11-18 25 12 8515462+albertvillanova@use... varadhbhatnagar@rediffmail.com
combine.py
in src/datasets
90 2 2021-06-23 2025-03-05 23 12 42851186+lhoestq@users.nore... cyyever@outlook.com
download_manager.py
in src/datasets/download
172 14 2022-05-25 2025-03-05 23 7 8515462+albertvillanova@use... cyyever@outlook.com
__init__.py
in src/datasets/utils
11 - 2020-09-10 2024-06-04 21 7 thomwolf@users.noreply.gith... 8515462+albertvillanova@use...
logging.py
in src/datasets/utils
69 14 2020-09-10 2025-06-09 20 7 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
text.py
in src/datasets/packaged_modules/text
83 4 2021-01-19 2024-08-21 19 5 42851186+lhoestq@users.nore... 8515462+albertvillanova@use...
parquet.py
in src/datasets/packaged_modules/parquet
86 5 2021-06-30 2025-03-05 19 8 42851186+lhoestq@users.nore... cyyever@outlook.com
parquet.py
in src/datasets/io
103 5 2021-06-30 2024-10-28 19 11 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
csv.py
in src/datasets/io
122 6 2021-03-12 2024-03-12 19 8 8515462+albertvillanova@use... mariosasko777@gmail.com
extract.py
in src/datasets/utils
253 25 2021-07-08 2025-03-05 19 8 8515462+albertvillanova@use... cyyever@outlook.com
folder_based_builder.py
in src/datasets/packaged_modules/folder_based_builder
347 8 2022-08-22 2025-06-25 19 8 polina@huggingface.co 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/filesystems
26 2 2021-01-26 2025-03-05 18 12 32632186+philschmid@users.n... cyyever@outlook.com
filelock.py
in src/datasets/utils
8 - 2020-11-16 2023-11-28 17 8 42851186+lhoestq@users.nore... mariosasko777@gmail.com
__init__.py
in src/datasets/formatting
84 4 2021-02-05 2025-04-28 17 8 42851186+lhoestq@users.nore... 35225576+afuetterer@users.n...
download_config.py
in src/datasets/download
33 2 2022-05-25 2025-04-28 15 9 8515462+albertvillanova@use... 35225576+afuetterer@users.n...
translation.py
in src/datasets/features
52 6 2021-10-13 2025-06-25 14 9 8515462+albertvillanova@use... 42851186+lhoestq@users.nore...
torch_formatter.py
in src/datasets/formatting
82 8 2021-02-05 2025-06-19 14 8 42851186+lhoestq@users.nore... 49127578+tytodd@users.norep...
tf_formatter.py
in src/datasets/formatting
83 8 2021-02-05 2025-06-19 14 7 42851186+lhoestq@users.nore... 49127578+tytodd@users.norep...
imagefolder.py
in src/datasets/packaged_modules/imagefolder
77 1 2022-03-01 2025-03-05 13 6 nxr9266@g.rit.edu cyyever@outlook.com
jax_formatter.py
in src/datasets/formatting
116 9 2021-06-21 2025-06-19 13 8 42851186+lhoestq@users.nore... 49127578+tytodd@users.norep...
webdataset.py
in src/datasets/packaged_modules/webdataset
263 12 2023-11-28 2025-03-05 12 4 42851186+lhoestq@users.nore... cyyever@outlook.com
naming.py
in src/datasets
47 6 2020-09-10 2024-03-01 11 8 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
Files With Most Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
setup.py
in root
131 - 2020-04-14 2025-06-19 278 57 thomwolf@users.noreply.gith... 49127578+tytodd@users.norep...
load.py
in src/datasets
952 26 2020-09-10 2025-06-25 181 46 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
dataset_dict.py
in src/datasets
1075 58 2020-09-10 2025-06-25 134 42 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
iterable_dataset.py
in src/datasets
2750 234 2021-06-23 2025-06-25 96 37 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
builder.py
in src/datasets
1185 50 2020-09-10 2025-06-25 144 33 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
features.py
in src/datasets/features
1354 96 2021-10-13 2025-06-25 91 32 8515462+albertvillanova@use... 42851186+lhoestq@users.nore...
file_utils.py
in src/datasets/utils
807 69 2020-09-10 2025-06-09 105 26 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
arrow_writer.py
in src/datasets
469 22 2020-09-10 2025-04-28 72 26 thomwolf@users.noreply.gith... 35225576+afuetterer@users.n...
search.py
in src/datasets
393 33 2020-09-10 2025-06-25 41 25 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
table.py
in src/datasets
942 131 2021-03-26 2025-06-25 63 21 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
config.py
in src/datasets
176 - 2021-02-10 2025-06-25 69 20 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
py_utils.py
in src/datasets/utils
402 26 2020-09-10 2025-06-09 75 19 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
data_files.py
in src/datasets
470 27 2021-10-11 2025-05-12 58 17 42851186+lhoestq@users.nore... matthew@protopia.ai
info.py
in src/datasets
254 17 2020-09-10 2025-06-25 42 16 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
metadata.py
in src/datasets/utils
180 8 2021-04-26 2025-03-05 40 16 42851186+lhoestq@users.nore... cyyever@outlook.com
arrow_reader.py
in src/datasets
309 26 2020-09-10 2025-03-28 34 16 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
fingerprint.py
in src/datasets
258 22 2020-09-10 2025-03-05 31 15 thomwolf@users.noreply.gith... cyyever@outlook.com
inspect.py
in src/datasets
149 5 2020-09-10 2025-06-09 45 14 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
formatting.py
in src/datasets/formatting
464 74 2021-02-05 2025-06-09 39 14 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets
32 - 2020-09-10 2025-06-17 128 13 lhoest.q@gmail.com 42851186+lhoestq@users.nore...
splits.py
in src/datasets
260 42 2020-09-10 2025-03-05 30 13 thomwolf@users.noreply.gith... cyyever@outlook.com
__init__.py
in src/datasets/packaged_modules
85 1 2021-01-19 2025-03-18 28 13 42851186+lhoestq@users.nore... yabran.muvdi@gmail.com
audio.py
in src/datasets/features
163 7 2021-10-13 2025-06-19 43 12 8515462+albertvillanova@use... 49127578+tytodd@users.norep...
json.py
in src/datasets/io
148 6 2021-03-18 2024-11-18 25 12 8515462+albertvillanova@use... varadhbhatnagar@rediffmail.com
combine.py
in src/datasets
90 2 2021-06-23 2025-03-05 23 12 42851186+lhoestq@users.nore... cyyever@outlook.com
__init__.py
in src/datasets/filesystems
26 2 2021-01-26 2025-03-05 18 12 32632186+philschmid@users.n... cyyever@outlook.com
parquet.py
in src/datasets/io
103 5 2021-06-30 2024-10-28 19 11 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
image.py
in src/datasets/features
250 11 2021-12-06 2025-06-19 33 10 mario@huggingface.co 49127578+tytodd@users.norep...
csv.py
in src/datasets/packaged_modules/csv
164 6 2021-01-19 2025-03-05 27 10 42851186+lhoestq@users.nore... cyyever@outlook.com
streaming_download_manager.py
in src/datasets/download
106 11 2022-05-25 2025-03-05 36 9 8515462+albertvillanova@use... cyyever@outlook.com
download_config.py
in src/datasets/download
33 2 2022-05-25 2025-04-28 15 9 8515462+albertvillanova@use... 35225576+afuetterer@users.n...
translation.py
in src/datasets/features
52 6 2021-10-13 2025-06-25 14 9 8515462+albertvillanova@use... 42851186+lhoestq@users.nore...
csv.py
in src/datasets/io
122 6 2021-03-12 2024-03-12 19 8 8515462+albertvillanova@use... mariosasko777@gmail.com
extract.py
in src/datasets/utils
253 25 2021-07-08 2025-03-05 19 8 8515462+albertvillanova@use... cyyever@outlook.com
parquet.py
in src/datasets/packaged_modules/parquet
86 5 2021-06-30 2025-03-05 19 8 42851186+lhoestq@users.nore... cyyever@outlook.com
folder_based_builder.py
in src/datasets/packaged_modules/folder_based_builder
347 8 2022-08-22 2025-06-25 19 8 polina@huggingface.co 42851186+lhoestq@users.nore...
filelock.py
in src/datasets/utils
8 - 2020-11-16 2023-11-28 17 8 42851186+lhoestq@users.nore... mariosasko777@gmail.com
__init__.py
in src/datasets/formatting
84 4 2021-02-05 2025-04-28 17 8 42851186+lhoestq@users.nore... 35225576+afuetterer@users.n...
torch_formatter.py
in src/datasets/formatting
82 8 2021-02-05 2025-06-19 14 8 42851186+lhoestq@users.nore... 49127578+tytodd@users.norep...
jax_formatter.py
in src/datasets/formatting
116 9 2021-06-21 2025-06-19 13 8 42851186+lhoestq@users.nore... 49127578+tytodd@users.norep...
version.py
in src/datasets/utils
52 11 2020-09-10 2022-12-09 11 8 thomwolf@users.noreply.gith... 59462357+stevhliu@users.nor...
naming.py
in src/datasets
47 6 2020-09-10 2024-03-01 11 8 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
streaming.py
in src/datasets
83 2 2021-06-23 2025-06-09 37 7 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
json.py
in src/datasets/packaged_modules/json
141 8 2021-01-19 2024-06-19 31 7 42851186+lhoestq@users.nore... 8515462+albertvillanova@use...
download_manager.py
in src/datasets/download
172 14 2022-05-25 2025-03-05 23 7 8515462+albertvillanova@use... cyyever@outlook.com
__init__.py
in src/datasets/utils
11 - 2020-09-10 2024-06-04 21 7 thomwolf@users.noreply.gith... 8515462+albertvillanova@use...
logging.py
in src/datasets/utils
69 14 2020-09-10 2025-06-09 20 7 thomwolf@users.noreply.gith... 42851186+lhoestq@users.nore...
tf_formatter.py
in src/datasets/formatting
83 8 2021-02-05 2025-06-19 14 7 42851186+lhoestq@users.nore... 49127578+tytodd@users.norep...
imagefolder.py
in src/datasets/packaged_modules/imagefolder
77 1 2022-03-01 2025-03-05 13 6 nxr9266@g.rit.edu cyyever@outlook.com
tf_utils.py
in src/datasets/utils
390 21 2022-06-06 2025-03-05 9 6 rocketknight1@users.noreply... cyyever@outlook.com
Files With Least Contributors (Top 50)
Based on the number of unique email addresses found in commits.

See data for all files...

File# lines# unitscreatedlast modified# changes
(days)
# contributorsfirst
contributor
latest
contributor
_tenbin.py
in src/datasets/packaged_modules/webdataset
167 21 2023-11-28 2023-11-28 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
readme_structure.yaml
in src/datasets/utils/resources
116 - 2021-05-10 2021-05-14 2 1 chhablani.gunjan@gmail.com chhablani.gunjan@gmail.com
spark.py
in src/datasets/io
46 2 2023-04-26 2023-05-25 3 1 maddie.dawson@databricks.com maddie.dawson@databricks.com
xml.py
in src/datasets/packaged_modules/xml
46 4 2024-10-24 2024-10-24 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
delete_from_hub.py
in src/datasets/commands
35 4 2024-04-30 2024-04-30 1 1 8515462+albertvillanova@use... 8515462+albertvillanova@use...
_torchcodec.py
in src/datasets/features
13 1 2025-06-19 2025-06-19 1 1 49127578+tytodd@users.norep... 49127578+tytodd@users.norep...
pdffolder.py
in src/datasets/packaged_modules/pdffolder
13 1 2025-03-18 2025-03-18 1 1 yabran.muvdi@gmail.com yabran.muvdi@gmail.com
distributed.py
in src/datasets
9 1 2023-01-16 2024-10-25 2 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/download
9 - 2022-05-25 2022-05-25 1 1 8515462+albertvillanova@use... 8515462+albertvillanova@use...
doc_utils.py
in src/datasets/utils
6 1 2021-05-28 2021-05-28 1 1 lewis.c.tunstall@gmail.com lewis.c.tunstall@gmail.com
__init__.py
in src/datasets/io
1 - 2021-03-12 2021-03-12 1 1 8515462+albertvillanova@use... 8515462+albertvillanova@use...
__init__.py
in src/datasets/utils/resources
1 - 2021-04-26 2021-04-26 1 1 theo-m@users.noreply.github... theo-m@users.noreply.github...
__init__.py
in src/datasets/packaged_modules/csv
1 - 2021-01-19 2021-01-19 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/packaged_modules/arrow
1 - 2023-06-13 2023-06-13 1 1 mariusz.jachimowicz.83@gmai... mariusz.jachimowicz.83@gmai...
__init__.py
in src/datasets/packaged_modules/sql
1 - 2022-10-03 2022-10-03 1 1 frederic.branchaud.charron@... frederic.branchaud.charron@...
__init__.py
in src/datasets/packaged_modules/generator
1 - 2022-09-16 2022-09-16 1 1 mariosasko777@gmail.com mariosasko777@gmail.com
__init__.py
in src/datasets/packaged_modules/pandas
1 - 2021-01-19 2021-01-19 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/packaged_modules/text
1 - 2021-01-19 2021-01-19 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/packaged_modules/webdataset
1 - 2023-11-28 2023-11-28 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/packaged_modules/spark
1 - 2023-04-26 2023-04-26 1 1 maddie.dawson@databricks.com maddie.dawson@databricks.com
__init__.py
in src/datasets/packaged_modules/json
1 - 2021-01-19 2021-01-19 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/packaged_modules/videofolder
1 - 2024-10-24 2024-10-24 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/packaged_modules/audiofolder
1 - 2022-08-22 2022-08-22 1 1 polina@huggingface.co polina@huggingface.co
__init__.py
in src/datasets/packaged_modules/pdffolder
1 - 2025-03-18 2025-03-18 1 1 yabran.muvdi@gmail.com yabran.muvdi@gmail.com
__init__.py
in src/datasets/packaged_modules/imagefolder
1 - 2022-03-01 2022-03-01 1 1 nxr9266@g.rit.edu nxr9266@g.rit.edu
__init__.py
in src/datasets/packaged_modules/xml
1 - 2024-10-24 2024-10-24 1 1 42851186+lhoestq@users.nore... 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/packaged_modules/folder_based_builder
1 - 2022-08-22 2022-08-22 1 1 polina@huggingface.co polina@huggingface.co
108 8 2020-08-27 2021-06-14 4 2 thomwolf@users.noreply.gith... 8515462+albertvillanova@use...
hub.py
in src/datasets
100 2 2024-04-30 2025-06-09 4 2 8515462+albertvillanova@use... 42851186+lhoestq@users.nore...
polars_formatter.py
in src/datasets/formatting
88 11 2024-03-08 2025-01-30 2 2 psmyth1994@gmail.com 42851186+lhoestq@users.nore...
79 5 2020-08-27 2021-06-14 3 2 thomwolf@users.noreply.gith... 8515462+albertvillanova@use...
parallel.py
in src/datasets/parallel
65 4 2023-06-14 2024-04-15 2 2 ying.chen@databricks.com 42851186+lhoestq@users.nore...
53 9 2021-04-06 2021-06-14 2 2 42851186+lhoestq@users.nore... 8515462+albertvillanova@use...
stratify.py
in src/datasets/utils
46 2 2022-05-25 2025-01-09 3 2 48522685+nandwalritik@users... 42851186+lhoestq@users.nore...
track.py
in src/datasets/utils
45 10 2023-12-19 2025-03-05 3 2 42851186+lhoestq@users.nore... cyyever@outlook.com
42 3 2020-08-28 2021-06-14 3 2 thomwolf@users.noreply.gith... 8515462+albertvillanova@use...
41 6 2020-08-27 2021-06-14 4 2 thomwolf@users.noreply.gith... 8515462+albertvillanova@use...
tqdm.py
in src/datasets/utils
40 6 2023-11-22 2024-03-01 2 2 mariosasko777@gmail.com 42851186+lhoestq@users.nore...
videofolder.py
in src/datasets/packaged_modules/videofolder
21 1 2024-10-24 2025-03-05 2 2 42851186+lhoestq@users.nore... cyyever@outlook.com
experimental.py
in src/datasets/utils
12 1 2023-06-14 2024-03-01 2 2 ying.chen@databricks.com 42851186+lhoestq@users.nore...
__init__.py
in src/datasets/commands
10 2 2020-09-10 2021-05-10 2 2 thomwolf@users.noreply.gith... mariosasko777@gmail.com
typing.py
in src/datasets/utils
6 - 2021-03-12 2025-03-05 3 2 8515462+albertvillanova@use... cyyever@outlook.com
__init__.py
in src/datasets/parallel
1 - 2023-06-14 2024-06-04 2 2 ying.chen@databricks.com 8515462+albertvillanova@use...
__init__.py
in src/datasets/packaged_modules/parquet
1 - 2021-06-30 2021-07-16 2 2 42851186+lhoestq@users.nore... stevhliu@gmail.com
sql.py
in src/datasets/io
101 6 2022-10-03 2024-03-12 8 3 frederic.branchaud.charron@... mariosasko777@gmail.com
_dataset_viewer.py
in src/datasets/utils
70 2 2024-04-08 2025-03-05 4 3 sylvain.lesage@huggingface.co cyyever@outlook.com
release.py
in utils
62 5 2021-06-14 2023-07-06 3 3 8515462+albertvillanova@use... mariosasko777@gmail.com
exceptions.py
in src/datasets
61 1 2023-10-10 2025-03-05 7 3 8515462+albertvillanova@use... cyyever@outlook.com
_filelock.py
in src/datasets/utils
30 2 2023-11-23 2025-03-05 4 3 mariosasko777@gmail.com cyyever@outlook.com
datasets_cli.py
in src/datasets/commands
25 2 2021-02-26 2025-06-09 8 3 mariosasko777@gmail.com 42851186+lhoestq@users.nore...