alibaba / Pai-Megatron-Patch
File Change Frequency

File change frequency (churn) measures how often each file is updated. One update is counted for each day on which the file had at least one commit.
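The counting rule above (one update per day with at least one commit) can be sketched as follows. This is a minimal illustration, not the report generator's actual code; the function name `change_days_per_file` and the sample commit log are hypothetical.

```python
from collections import defaultdict
from datetime import date


def change_days_per_file(commits):
    """Return {path: number of distinct days with at least one commit}.

    `commits` is an iterable of (path, commit_date) pairs. Several commits
    to the same file on the same day count as a single change.
    """
    days = defaultdict(set)
    for path, day in commits:
        days[path].add(day)
    return {path: len(seen) for path, seen in days.items()}


# Hypothetical commit log, for illustration only.
commits = [
    ("megatron_patch/arguments.py", date(2023, 9, 4)),
    ("megatron_patch/arguments.py", date(2023, 9, 4)),  # same day: counted once
    ("megatron_patch/arguments.py", date(2023, 9, 5)),
    ("megatron_patch/training.py", date(2023, 9, 4)),
]
churn = change_days_per_file(commits)
print(churn)
```

Counting days rather than commits keeps a burst of small commits on one day from inflating a file's churn.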

Overview
File Change Frequency Overall
  • There are 280 files with 72,165 lines of code.
    • 0 files changed more than 100 times (0 lines of code)
    • 0 files changed 51-100 times (0 lines of code)
    • 1 file changed 21-50 times (449 lines of code)
    • 18 files changed 6-20 times (8,166 lines of code)
    • 261 files changed 1-5 times (63,550 lines of code)
Share of lines of code per bucket: 0% | 0% | <1% | 11% | 88%
Legend (left to right, number of changes): 101+ | 51-100 | 21-50 | 6-20 | 1-5
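The bucket shares can be reproduced from the line counts listed above. The sketch below assumes the percentages are shares of lines of code, truncated to whole numbers, with any nonzero share below 1% shown as "<1%". That rendering rule is inferred from the figures in this report, not taken from the tool's documentation, and the names `bucket_for` and `loc_share` are hypothetical.

```python
# Lower bound of each change-frequency bucket, highest bucket first.
BUCKETS = [("101+", 101), ("51-100", 51), ("21-50", 21), ("6-20", 6), ("1-5", 1)]


def bucket_for(changes):
    """Map a file's number of change days to its bucket label."""
    for label, lower in BUCKETS:
        if changes >= lower:
            return label
    raise ValueError("a tracked file has at least one change day")


def loc_share(loc_by_bucket):
    """Format each bucket's share of total lines of code.

    Assumption: shares are truncated, and a nonzero share under 1% is
    rendered as '<1%'.
    """
    total = sum(loc_by_bucket.values())
    shares = {}
    for label, loc in loc_by_bucket.items():
        pct = 100 * loc // total  # integer floor
        shares[label] = "<1%" if loc > 0 and pct == 0 else f"{pct}%"
    return shares


# Lines of code per bucket, as listed in the overview.
loc_by_bucket = {"101+": 0, "51-100": 0, "21-50": 449, "6-20": 8166, "1-5": 63550}
shares = loc_share(loc_by_bucket)
print(" | ".join(shares.values()))
```

With these inputs, `bucket_for(35)` places `arguments.py` (35 change days) in the 21-50 bucket.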

Contributors Count Frequency Overall
  • There are 280 files with 72,165 lines of code.
    • 0 files changed by more than 25 contributors (0 lines of code)
    • 0 files changed by 11-25 contributors (0 lines of code)
    • 0 files changed by 6-10 contributors (0 lines of code)
    • 67 files changed by 2-5 contributors (27,038 lines of code)
    • 213 files changed by 1 contributor (45,127 lines of code)
Share of lines of code per bucket: 0% | 0% | 0% | 37% | 62%
Legend (left to right, number of contributors): 26+ | 11-25 | 6-10 | 2-5 | 1

File Change Frequency per File Extension
py, sh, md, json, patch, txt, gitignore, gitmodules
File Change Frequency per Extension (the number of recorded file updates)
Legend (left to right, number of changes): 101+ | 51-100 | 21-50 | 6-20 | 1-5
py: 0% | 0% | <1% | 11% | 88%
File Change Frequency per Logical Decomposition
primary (file change frequency; the number of recorded file updates)
Legend (left to right, number of changes): 101+ | 51-100 | 21-50 | 6-20 | 1-5
megatron_patch: 0% | 0% | <1% | 7% | 91%
toolkits: 0% | 0% | 0% | 19% | 80%
rlhf: 0% | 0% | 0% | 0% | 100%
Most Frequently Changed Files (Top 50)

File (path) | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
megatron_patch/arguments.py | 449 | 2 | 2023-09-04 | 2025-04-28 | 35 | 5 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/data/utils.py | 318 | 5 | 2024-01-28 | 2025-03-31 | 20 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/deepseek/hf2mcore_deepseek_v2_moe.py | 454 | 8 | 2024-05-27 | 2025-03-05 | 13 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/deepseek/hf2mcore_deepseek_v3_moe.py | 578 | 12 | 2025-02-21 | 2025-04-03 | 13 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama2/transformer.py | 1296 | 35 | 2023-09-04 | 2024-02-28 | 13 | 4 | jerryli1981@users.noreply.g... | 38210876+lwmlyy@users.norep...
megatron_patch/data/__init__.py | 88 | 7 | 2023-11-10 | 2025-02-26 | 12 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen2_dense_and_moe_gqa.py | 821 | 10 | 2024-06-19 | 2025-02-21 | 12 | 5 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/baichuan2/checkpoint_reshaping_and_interoperability.py | 638 | 13 | 2023-09-19 | 2024-04-15 | 11 | 4 | 38210876+lwmlyy@users.norep... | jerryli1981@users.noreply.g...
toolkits/pretrain_data_preprocessing/preprocess_data_megatron.py | 360 | 13 | 2024-04-15 | 2025-04-29 | 10 | 4 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen2_vl.py | 616 | 10 | 2024-11-27 | 2025-03-19 | 9 | 2 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
megatron_patch/model/llama2/language_model.py | 454 | 15 | 2023-09-04 | 2024-03-05 | 8 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/training.py | 612 | 8 | 2023-09-04 | 2024-02-02 | 8 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/baichuan/checkpoint_reshaping_and_interoperability.py | 649 | 14 | 2023-09-04 | 2024-02-02 | 8 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen2/moe/moe_layer.py | 114 | 5 | 2024-06-12 | 2025-02-08 | 7 | 4 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/llava/language_model.py | 507 | 16 | 2023-11-02 | 2023-12-27 | 7 | 4 | jerry.lp@alibaba-inc.com | jerryli1981@users.noreply.g...
megatron_patch/template/helper.py | 115 | 3 | 2025-02-21 | 2025-05-09 | 6 | 2 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/utils/__init__.py | 146 | 5 | 2025-01-17 | 2025-04-03 | 6 | 2 | 46404040+lostkevin@users.no... | jerryli1981@users.noreply.g...
toolkits/pretrain_data_preprocessing/preprocess_data.py | 198 | 6 | 2023-09-04 | 2024-12-05 | 6 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/finetune_utils.py | 202 | 8 | 2023-09-04 | 2024-06-13 | 6 | 4 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/deepseek_v2/transformer_config.py | 42 | 1 | 2024-05-27 | 2025-04-28 | 5 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/llava/clip_encoder.py | 75 | 10 | 2023-11-02 | 2023-12-29 | 5 | 3 | jerry.lp@alibaba-inc.com | jerryli1981@users.noreply.g...
toolkits/sft_data_preprocessing/build_idxmap_sft_dataset.py | 319 | 11 | 2024-09-12 | 2025-02-12 | 5 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/baichuan2/hf2te.py | 360 | 7 | 2023-10-19 | 2024-02-02 | 5 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/baichuan/hf2te.py | 378 | 8 | 2023-09-04 | 2024-02-02 | 5 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/mistral/hf2mcore.py | 468 | 14 | 2024-04-21 | 2025-02-21 | 5 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen1.5_moe.py | 479 | 12 | 2024-05-13 | 2024-10-21 | 5 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/qwen/transformer.py | 1243 | 35 | 2023-09-04 | 2024-02-02 | 5 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/baichuan2/transformer.py | 1292 | 36 | 2023-09-19 | 2023-10-19 | 5 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama2/rotary_pos_embedding.py | 56 | 4 | 2023-10-11 | 2023-11-27 | 4 | 3 | jerry.lp@alibaba-inc.com | lwmlyy@163.com
megatron_patch/model/qwen1_5/moe/moe_layer.py | 78 | 5 | 2024-03-21 | 2024-06-23 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/mixtral/moe/moe_layer.py | 113 | 5 | 2024-01-28 | 2024-12-18 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/mixtral/layer_specs.py | 129 | 3 | 2024-01-28 | 2024-12-18 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen1_5/moe/router.py | 139 | 13 | 2024-03-21 | 2024-05-31 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen1_5/transformer/mlp.py | 164 | 6 | 2024-03-21 | 2024-05-29 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/deepseek_v2/transformer_layer.py | 226 | 6 | 2024-05-27 | 2025-01-16 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/mixtral/transformer_config.py | 285 | 1 | 2024-01-28 | 2024-12-18 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/deepseek_v2/transformer_block.py | 377 | 12 | 2024-05-27 | 2025-01-16 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/baichuan2/language_model.py | 450 | 15 | 2023-09-19 | 2023-10-12 | 4 | 3 | jerryli1981@users.noreply.g... | 38210876+lwmlyy@users.norep...
toolkits/model_checkpoints_convertor/galactica/checkpoint_reshaping_and_interoperability.py | 454 | 13 | 2023-09-04 | 2024-02-02 | 4 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llama/hf2mcore_70b.py | 577 | 11 | 2024-04-22 | 2024-10-21 | 4 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/mistral/hf2mcore_mixtral.py | 672 | 10 | 2024-04-22 | 2025-02-21 | 4 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/mixtral/moe/experts.py | 676 | 11 | 2024-01-28 | 2024-12-18 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llama/hf2megatron.py | 808 | 15 | 2024-04-21 | 2024-10-21 | 4 | 2 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/qwen/hf2megatron_qwen1.5.py | 810 | 15 | 2024-04-21 | 2024-10-21 | 4 | 4 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/qwen2/transformer_config.py | 14 | - | 2024-06-12 | 2024-06-21 | 3 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llava/mm_projector_builder.py | 37 | 6 | 2023-11-02 | 2023-12-29 | 3 | 3 | jerry.lp@alibaba-inc.com | jerryli1981@users.noreply.g...
megatron_patch/model/qwen2_vl/transformer_config.py | 53 | 2 | 2024-11-27 | 2025-01-17 | 3 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
toolkits/distributed_checkpoints_convertor/impl/general/synchronizer.py | 119 | 9 | 2025-04-15 | 2025-05-12 | 3 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
megatron_patch/model/deepseek_v2/layer_specs.py | 120 | 2 | 2024-05-27 | 2025-01-16 | 3 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/lm_evaluate.py | 139 | 7 | 2024-02-02 | 2024-04-15 | 3 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
Files With Most Contributors (Top 50)
Based on the number of unique email addresses found in commits.
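As a rough illustration of this counting rule, the sketch below counts distinct commit email addresses per file. The function name `contributors_per_file`, the sample data, and the lowercasing of addresses are assumptions made for the example; the report does not say how the tool normalizes emails.

```python
from collections import defaultdict


def contributors_per_file(commits):
    """Return {path: number of unique commit email addresses}.

    `commits` is an iterable of (path, author_email) pairs. Addresses are
    compared after trimming whitespace and lowercasing (an assumption).
    """
    emails = defaultdict(set)
    for path, email in commits:
        emails[path].add(email.strip().lower())
    return {path: len(seen) for path, seen in emails.items()}


# Hypothetical commit log, for illustration only.
commits = [
    ("megatron_patch/arguments.py", "alice@example.com"),
    ("megatron_patch/arguments.py", "Alice@example.com"),  # same address after lowercasing
    ("megatron_patch/arguments.py", "bob@example.com"),
    ("megatron_patch/training.py", "alice@example.com"),
]
counts = contributors_per_file(commits)
print(counts)
```

Because the key is the email address, one person committing under two different addresses is counted as two contributors.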

File (path) | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
megatron_patch/arguments.py | 449 | 2 | 2023-09-04 | 2025-04-28 | 35 | 5 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen2_dense_and_moe_gqa.py | 821 | 10 | 2024-06-19 | 2025-02-21 | 12 | 5 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama2/transformer.py | 1296 | 35 | 2023-09-04 | 2024-02-28 | 13 | 4 | jerryli1981@users.noreply.g... | 38210876+lwmlyy@users.norep...
toolkits/model_checkpoints_convertor/baichuan2/checkpoint_reshaping_and_interoperability.py | 638 | 13 | 2023-09-19 | 2024-04-15 | 11 | 4 | 38210876+lwmlyy@users.norep... | jerryli1981@users.noreply.g...
toolkits/pretrain_data_preprocessing/preprocess_data_megatron.py | 360 | 13 | 2024-04-15 | 2025-04-29 | 10 | 4 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/llava/language_model.py | 507 | 16 | 2023-11-02 | 2023-12-27 | 7 | 4 | jerry.lp@alibaba-inc.com | jerryli1981@users.noreply.g...
megatron_patch/model/qwen2/moe/moe_layer.py | 114 | 5 | 2024-06-12 | 2025-02-08 | 7 | 4 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/finetune_utils.py | 202 | 8 | 2023-09-04 | 2024-06-13 | 6 | 4 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/qwen/hf2megatron_qwen1.5.py | 810 | 15 | 2024-04-21 | 2024-10-21 | 4 | 4 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/data/utils.py | 318 | 5 | 2024-01-28 | 2025-03-31 | 20 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/deepseek/hf2mcore_deepseek_v2_moe.py | 454 | 8 | 2024-05-27 | 2025-03-05 | 13 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/data/__init__.py | 88 | 7 | 2023-11-10 | 2025-02-26 | 12 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/llama2/language_model.py | 454 | 15 | 2023-09-04 | 2024-03-05 | 8 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/training.py | 612 | 8 | 2023-09-04 | 2024-02-02 | 8 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/pretrain_data_preprocessing/preprocess_data.py | 198 | 6 | 2023-09-04 | 2024-12-05 | 6 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen1.5_moe.py | 479 | 12 | 2024-05-13 | 2024-10-21 | 5 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/deepseek_v2/transformer_config.py | 42 | 1 | 2024-05-27 | 2025-04-28 | 5 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/llava/clip_encoder.py | 75 | 10 | 2023-11-02 | 2023-12-29 | 5 | 3 | jerry.lp@alibaba-inc.com | jerryli1981@users.noreply.g...
megatron_patch/model/baichuan2/transformer.py | 1292 | 36 | 2023-09-19 | 2023-10-19 | 5 | 3 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llama/hf2mcore_70b.py | 577 | 11 | 2024-04-22 | 2024-10-21 | 4 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/baichuan2/language_model.py | 450 | 15 | 2023-09-19 | 2023-10-12 | 4 | 3 | jerryli1981@users.noreply.g... | 38210876+lwmlyy@users.norep...
megatron_patch/model/llama2/rotary_pos_embedding.py | 56 | 4 | 2023-10-11 | 2023-11-27 | 4 | 3 | jerry.lp@alibaba-inc.com | lwmlyy@163.com
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen1.5_dense_mha.py | 280 | 10 | 2024-05-13 | 2024-10-21 | 3 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen2_moe.py | 555 | 11 | 2025-03-25 | 2025-04-29 | 3 | 3 | wanqian5@tal.com | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen1.5_dense_mha_to_moe.py | 227 | 7 | 2024-05-13 | 2024-10-21 | 3 | 3 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
megatron_patch/model/llava/transformer.py | 1292 | 35 | 2023-11-02 | 2023-11-28 | 3 | 3 | jerry.lp@alibaba-inc.com | jerryli1981@users.noreply.g...
megatron_patch/model/llava/mm_projector_builder.py | 37 | 6 | 2023-11-02 | 2023-12-29 | 3 | 3 | jerry.lp@alibaba-inc.com | jerryli1981@users.noreply.g...
megatron_patch/model/mistral/language_model.py | 466 | 15 | 2023-11-07 | 2024-04-17 | 3 | 3 | 38210876+lwmlyy@users.norep... | jerryli1981@users.noreply.g...
megatron_patch/model/mistral/transformer.py | 1292 | 35 | 2023-11-07 | 2023-11-10 | 2 | 3 | 38210876+lwmlyy@users.norep... | jerryli1981@users.noreply.g...
megatron_patch/model/mistral/rotary_pos_embedding.py | 36 | 5 | 2023-11-07 | 2023-11-10 | 2 | 3 | 38210876+lwmlyy@users.norep... | jerryli1981@users.noreply.g...
megatron_patch/model/llama2/gpt_model.py | 88 | 6 | 2023-09-04 | 2023-10-10 | 2 | 3 | jerryli1981@users.noreply.g... | jerry.lp@alibaba-inc.com
toolkits/model_checkpoints_convertor/deepseek/hf2mcore_deepseek_v3_moe.py | 578 | 12 | 2025-02-21 | 2025-04-03 | 13 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen2_vl.py | 616 | 10 | 2024-11-27 | 2025-03-19 | 9 | 2 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/baichuan/checkpoint_reshaping_and_interoperability.py | 649 | 14 | 2023-09-04 | 2024-02-02 | 8 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/utils/__init__.py | 146 | 5 | 2025-01-17 | 2025-04-03 | 6 | 2 | 46404040+lostkevin@users.no... | jerryli1981@users.noreply.g...
megatron_patch/template/helper.py | 115 | 3 | 2025-02-21 | 2025-05-09 | 6 | 2 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/mistral/hf2mcore.py | 468 | 14 | 2024-04-21 | 2025-02-21 | 5 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/baichuan/hf2te.py | 378 | 8 | 2023-09-04 | 2024-02-02 | 5 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/baichuan2/hf2te.py | 360 | 7 | 2023-10-19 | 2024-02-02 | 5 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/mistral/hf2mcore_mixtral.py | 672 | 10 | 2024-04-22 | 2025-02-21 | 4 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/galactica/checkpoint_reshaping_and_interoperability.py | 454 | 13 | 2023-09-04 | 2024-02-02 | 4 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llama/hf2megatron.py | 808 | 15 | 2024-04-21 | 2024-10-21 | 4 | 2 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/bloom/checkpoint_reshaping_and_interoperability.py | 572 | 12 | 2023-09-04 | 2024-02-02 | 3 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llama/hf2mcore_llama3_1.py | 710 | 11 | 2024-08-23 | 2025-02-21 | 3 | 2 | 46404040+lostkevin@users.no... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llama/hf2mcore.py | 674 | 20 | 2024-04-21 | 2024-10-21 | 3 | 2 | jerryli1981@users.noreply.g... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/starcoder/checkpoint_reshaping_and_interoperability.py | 583 | 12 | 2023-09-04 | 2024-02-02 | 3 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen2/transformer/mlp.py | 258 | 7 | 2024-06-12 | 2024-11-26 | 3 | 2 | jerryli1981@users.noreply.g... | 676857171@qq.com
megatron_patch/model/qwen2/moe/experts.py | 316 | 10 | 2024-06-12 | 2024-11-26 | 3 | 2 | jerryli1981@users.noreply.g... | 676857171@qq.com
megatron_patch/lm_evaluate.py | 139 | 7 | 2024-02-02 | 2024-04-15 | 3 | 2 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llama/hf_llama_moe/llama_moe.py | 19 | 3 | 2024-02-26 | 2024-04-27 | 2 | 2 | 1208266117@qq.com | jerryli1981@users.noreply.g...
Files With Fewest Contributors (Top 50)
Based on the number of unique email addresses found in commits.

File (path) | # lines | # units | created | last modified | # changes (days) | # contributors | first contributor | latest contributor
megatron_patch/model/qwen_vl/transformer.py | 1292 | 35 | 2023-12-27 | 2023-12-27 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama3/transformer_legacy.py | 1252 | 36 | 2024-05-23 | 2024-05-29 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen/transformer.py | 1243 | 35 | 2023-09-04 | 2024-02-02 | 5 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen1_5_megablocks/transformer.py | 1184 | 35 | 2024-04-15 | 2024-04-18 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/baichuan/transformer.py | 1179 | 32 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/glm130b/transformer.py | 875 | 25 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/starcoder/transformer.py | 848 | 31 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/falcon/transformer.py | 845 | 31 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/bloom/transformer.py | 811 | 27 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama/transformer.py | 715 | 26 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/falcon40b/transformer.py | 683 | 28 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/mixtral/moe/experts.py | 676 | 11 | 2024-01-28 | 2024-12-18 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/deepseek_v2/moe/experts.py | 676 | 11 | 2024-05-27 | 2025-01-16 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/llava/hf2mcore_llava.py | 669 | 13 | 2024-11-21 | 2024-11-21 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/falcon/checkpoint_reshaping_and_interoperability.py | 612 | 12 | 2023-09-04 | 2024-02-02 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen2.5_vl.py | 607 | 10 | 2025-03-21 | 2025-03-21 | 1 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
megatron_patch/model/chatglm/transformer.py | 604 | 20 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/falcon40b/checkpoint_reshaping_and_interoperability.py | 583 | 12 | 2023-09-04 | 2024-02-02 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/bloom/reward_model_to_megatron.py | 573 | 12 | 2023-09-04 | 2024-02-02 | 3 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/galactica/transformer.py | 570 | 21 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/mixtral/transformer/attention.py | 517 | 13 | 2024-02-02 | 2024-12-18 | 3 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/baichuan/language_model.py | 515 | 20 | 2023-09-04 | 2023-10-12 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/galactica/language_model.py | 501 | 19 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama/language_model.py | 501 | 17 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/distributed_checkpoints_convertor/impl/general/m2h_synchronizer.py | 499 | 17 | 2025-04-15 | 2025-05-12 | 2 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
megatron_patch/model/falcon40b/language_model.py | 491 | 17 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/falcon/language_model.py | 491 | 17 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen_vl/language_model.py | 481 | 16 | 2023-12-27 | 2023-12-27 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/chatglm/language_model.py | 473 | 17 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/yi/checkpoint_reshaping_and_interoperability.py | 468 | 13 | 2023-11-18 | 2024-02-02 | 3 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen1_5_megablocks/language_model.py | 453 | 15 | 2024-04-15 | 2024-04-15 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/tokenizer/tokenization_qwen_vl.py | 441 | 32 | 2023-12-27 | 2023-12-29 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen/language_model.py | 440 | 15 | 2023-09-04 | 2023-10-12 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama3/language_model.py | 438 | 15 | 2024-04-21 | 2024-05-29 | 3 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen2_5_vl/transformer_block.py | 434 | 12 | 2025-03-21 | 2025-03-21 | 1 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
megatron_patch/model/glm130b/language_model.py | 434 | 15 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llava_mcore/llava_model.py | 424 | 7 | 2024-11-21 | 2024-11-21 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/bloom/language_model.py | 411 | 15 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen2/transformer/attention.py | 407 | 12 | 2024-06-12 | 2024-06-12 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen1_5/transformer/attention.py | 402 | 12 | 2024-03-21 | 2024-05-29 | 3 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/llama3/transformer/attention.py | 402 | 12 | 2024-05-23 | 2024-05-29 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/chatglm/checkpoint_reshaping_and_interoperability.py | 396 | 11 | 2023-09-04 | 2024-02-02 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/starcoder/language_model.py | 387 | 15 | 2023-09-04 | 2023-09-04 | 1 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/data/dataset_helpers.py | 381 | 9 | 2025-03-21 | 2025-03-21 | 1 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
toolkits/model_checkpoints_convertor/glm/checkpoint_reshaping_and_interoperability.py | 378 | 10 | 2023-09-04 | 2024-02-02 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/deepseek_v2/transformer_block.py | 377 | 12 | 2024-05-27 | 2025-01-16 | 4 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
toolkits/model_checkpoints_convertor/glm130b/checkpoint_reshaping_and_interoperability.py | 363 | 10 | 2023-09-04 | 2024-02-02 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen3_moe/gpt_layer_specs.py | 347 | 6 | 2025-04-29 | 2025-04-29 | 1 | 1 | 46404040+lostkevin@users.no... | 46404040+lostkevin@users.no...
megatron_patch/model/qwen2/moe/token_dispatcher.py | 327 | 10 | 2024-06-12 | 2024-06-19 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...
megatron_patch/model/qwen2/transformer_block.py | 323 | 11 | 2024-06-12 | 2024-06-21 | 2 | 1 | jerryli1981@users.noreply.g... | jerryli1981@users.noreply.g...