Path Lines of Code README.md 67 text/README.md 55 text/data/decontamination/README.md 3 text/data/finemath/README.md 20 text/data/fineweb-edu/README.md 31 text/data/smoltalk/README.md 8 text/data/smoltalk/constraints/README.md 2 text/data/smoltalk/magpie_ultra_v1/README.md 7 text/data/smoltalk/rewrite/README.md 2 text/data/smoltalk/summarization/README.md 1 text/evaluation/README.md 34 text/evaluation/requirements.txt 3 text/evaluation/smollm2_base.txt 9 text/evaluation/smollm2_instruct.txt 7 text/finetuning/README.md 41 text/finetuning/requirements.txt 7 text/pretraining/README.md 22 text/pretraining/continual-pretraining/README.md 32 text/pretraining/continual-pretraining/finemath/tokenization_InfiMM-WebMath-40B.patch 51 text/pretraining/continual-pretraining/finemath/tokenization_finemath.patch 38 tools/README.md 2 tools/smol_tools/README.md 78 tools/smol_tools/requirements.txt 6 tools/smollm_local_inference/README.md 22 tools/smolvlm_local_inference/README.md 60 vision/README.md 79 vision/data/README.md 2 vision/data/datasets_processing_scripts/clean_m4_prelimenary_experiments/README.md 6 vision/data/datasets_processing_scripts/clean_m4_prelimenary_experiments/python_scripts/01_shard_names.txt 2757 vision/data/datasets_processing_scripts/enwiki/REAME.md 2 vision/evaluation/README.md 4 vision/experiments/evaluation/vloom/README.md 80 vision/experiments/pretraining/vloom/README.md 136 vision/experiments/pretraining/vloom/slurm_scripts_templates/ds_config.json 30 vision/experiments/pretraining/vloom/slurm_scripts_templates/ds_config_bf16.json 17 vision/experiments/pretraining/vloom/slurm_scripts_templates/with_launcher/ds_config.json 17 vision/experiments/pretraining/vloom/tr_cron_template/README.md 1 vision/finetuning/README.md 2 vision/m4/evaluation/README.md 55 vision/m4/evaluation/generation/README.md 7 vision/m4/evaluation/scripts/README.md 1 vision/m4/scripts/README.md 23 vision/m4/sourcing/data_collection/README.md 163 vision/m4/sourcing/data_collection/outputs/README.md 13 vision/m4/sourcing/get_html_files/common_crawl.md 93 vision/m4/sourcing/get_modelling_metadata_dataset/shard_names.txt 2757 vision/m4/sourcing/pmd/scripts/README.md 44 vision/m4/sourcing/processing/README.md 11 vision/m4/sourcing/processing/extracting_ngrams/README.md 9 vision/m4/training/DATA_DOCUMENTATION.md 72 vision/smolvlm2/README.md 57 vision/smolvlm2/scripts/zero1.json 12 vision/smolvlm2/scripts/zero2.json 41 vision/smolvlm2/scripts/zero3.json 39 vision/smolvlm2/scripts/zero3_gradient_clipping.json 29 vision/smolvlm2/scripts/zero3_mics_mini_fixed.json 30 vision/smolvlm2/scripts/zero3_offload_inference.json 21 vision/smolvlm2/scripts/zero3pp.json 51