alibaba / Pai-Megatron-Patch
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
15% | 35% | 30% | 11% | 7%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
py15% | 35% | 30% | 11% | 7%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
megatron_patch23% | 23% | 31% | 13% | 8%
toolkits0% | 62% | 27% | 5% | 4%
rlhf0% | 0% | 56% | 35% | 8%
Longest Files (Top 50)
File# lines# units
transformer.py
in megatron_patch/model/llama2
1296 35
transformer.py
in megatron_patch/model/llava
1292 35
transformer.py
in megatron_patch/model/mistral
1292 35
transformer.py
in megatron_patch/model/baichuan2
1292 36
transformer.py
in megatron_patch/model/qwen_vl
1292 35
transformer_legacy.py
in megatron_patch/model/llama3
1252 36
transformer.py
in megatron_patch/model/qwen
1243 35
transformer.py
in megatron_patch/model/qwen1_5_megablocks
1184 35
transformer.py
in megatron_patch/model/baichuan
1179 32
transformer.py
in megatron_patch/model/glm130b
875 25
transformer.py
in megatron_patch/model/starcoder
848 31
transformer.py
in megatron_patch/model/falcon
845 31
hf2mcore_qwen2_dense_and_moe_gqa.py
in toolkits/model_checkpoints_convertor/qwen
821 10
transformer.py
in megatron_patch/model/bloom
811 27
hf2megatron_qwen1.5.py
in toolkits/model_checkpoints_convertor/qwen
810 15
hf2megatron.py
in toolkits/model_checkpoints_convertor/llama
808 15
hf2megatron_qwen1.0.py
in toolkits/model_checkpoints_convertor/qwen
725 15
transformer.py
in megatron_patch/model/llama
715 26
hf2mcore_llama3_1.py
in toolkits/model_checkpoints_convertor/llama
710 11
transformer.py
in megatron_patch/model/falcon40b
683 28
experts.py
in megatron_patch/model/mixtral/moe
676 11
experts.py
in megatron_patch/model/deepseek_v2/moe
676 11
hf2mcore.py
in toolkits/model_checkpoints_convertor/llama
674 20
hf2mcore_mixtral.py
in toolkits/model_checkpoints_convertor/mistral
672 10
hf2mcore_llava.py
in toolkits/model_checkpoints_convertor/llava
669 13
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/baichuan
649 14
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/baichuan2
638 13
hf2mcore_qwen2_vl.py
in toolkits/model_checkpoints_convertor/qwen
616 10
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/falcon
612 12
training.py
in megatron_patch
612 8
hf2mcore_qwen2.5_vl.py
in toolkits/model_checkpoints_convertor/qwen
607 10
transformer.py
in megatron_patch/model/chatglm
604 20
hf2mcore_qwen1.5_dense_gqa.py
in toolkits/model_checkpoints_convertor/qwen
593 11
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/falcon40b
583 12
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/starcoder
583 12
hf2mcore_deepseek_v3_moe.py
in toolkits/model_checkpoints_convertor/deepseek
578 12
hf2mcore_70b.py
in toolkits/model_checkpoints_convertor/llama
577 11
reward_model_to_megatron.py
in toolkits/model_checkpoints_convertor/bloom
573 12
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/bloom
572 12
transformer.py
in megatron_patch/model/galactica
570 21
hf2mcore_qwen2_moe.py
in toolkits/model_checkpoints_convertor/qwen
555 11
hf2megablocks_qwen1.5.py
in toolkits/model_checkpoints_convertor/qwen
546 19
attention.py
in megatron_patch/model/qwen2_vl
530 13
attention_vision.py
in megatron_patch/model/qwen2_vl
529 13
attention.py
in megatron_patch/model/mixtral/transformer
517 13
language_model.py
in megatron_patch/model/baichuan
515 20
language_model.py
in megatron_patch/model/llava
507 16
language_model.py
in megatron_patch/model/galactica
501 19
language_model.py
in megatron_patch/model/llama
501 17
m2h_synchronizer.py
in toolkits/distributed_checkpoints_convertor/impl/general
499 17
Files With Most Units (Top 50)
File# lines# units
icetk_glm130b_tokenizer.py
in megatron_patch/tokenizer
273 39
transformer.py
in megatron_patch/model/baichuan2
1292 36
transformer_legacy.py
in megatron_patch/model/llama3
1252 36
transformer.py
in megatron_patch/model/llava
1292 35
transformer.py
in megatron_patch/model/qwen1_5_megablocks
1184 35
transformer.py
in megatron_patch/model/mistral
1292 35
transformer.py
in megatron_patch/model/qwen_vl
1292 35
transformer.py
in megatron_patch/model/llama2
1296 35
transformer.py
in megatron_patch/model/qwen
1243 35
transformer.py
in megatron_patch/model/baichuan
1179 32
tokenization_qwen_vl.py
in megatron_patch/tokenizer
441 32
transformer.py
in megatron_patch/model/falcon
845 31
transformer.py
in megatron_patch/model/starcoder
848 31
transformer.py
in megatron_patch/model/falcon40b
683 28
transformer.py
in megatron_patch/model/bloom
811 27
transformer.py
in megatron_patch/model/llama
715 26
transformer.py
in megatron_patch/model/glm130b
875 25
transformer.py
in megatron_patch/model/galactica
570 21
hf2mcore.py
in toolkits/model_checkpoints_convertor/llama
674 20
transformer.py
in megatron_patch/model/chatglm
604 20
language_model.py
in megatron_patch/model/baichuan
515 20
hf2megablocks_qwen1.5.py
in toolkits/model_checkpoints_convertor/qwen
546 19
language_model.py
in megatron_patch/model/galactica
501 19
visual.py
in megatron_patch/model/qwen_vl
296 19
m2h_synchronizer.py
in toolkits/distributed_checkpoints_convertor/impl/general
499 17
language_model.py
in megatron_patch/model/falcon40b
491 17
language_model.py
in megatron_patch/model/chatglm
473 17
language_model.py
in megatron_patch/model/llama
501 17
language_model.py
in megatron_patch/model/falcon
491 17
language_model.py
in megatron_patch/model/llava
507 16
language_model.py
in megatron_patch/model/qwen_vl
481 16
hf2megatron.py
in toolkits/model_checkpoints_convertor/llama
808 15
hf2megatron_qwen1.0.py
in toolkits/model_checkpoints_convertor/qwen
725 15
hf2megatron_qwen1.5.py
in toolkits/model_checkpoints_convertor/qwen
810 15
language_model.py
in megatron_patch/model/glm130b
434 15
language_model.py
in megatron_patch/model/qwen1_5_megablocks
453 15
language_model.py
in megatron_patch/model/mistral
466 15
language_model.py
in megatron_patch/model/bloom
411 15
language_model.py
in megatron_patch/model/baichuan2
450 15
language_model.py
in megatron_patch/model/llama2
454 15
language_model.py
in megatron_patch/model/llama3
438 15
language_model.py
in megatron_patch/model/qwen
440 15
language_model.py
in megatron_patch/model/starcoder
387 15
hf2mcore.py
in toolkits/model_checkpoints_convertor/mistral
468 14
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/baichuan
649 14
utils.py
in rlhf/deepspeed-chat
204 14
router.py
in megatron_patch/model/qwen2/moe
206 14
preprocess_data_megatron.py
in toolkits/pretrain_data_preprocessing
360 13
h2m_synchronizer.py
in toolkits/distributed_checkpoints_convertor/impl/general
259 13
hf2mcore_llava.py
in toolkits/model_checkpoints_convertor/llava
669 13
Files With Long Lines (Top 50)

There are 80 files with lines longer than 120 characters. In total, there are 305 long lines.

File# lines# units# long lines
hf2mcore_deepseek_v3_moe.py
in toolkits/model_checkpoints_convertor/deepseek
578 12 34
hf2mcore_qwen2_moe.py
in toolkits/model_checkpoints_convertor/qwen
555 11 20
hf2mcore_deepseek_v2_moe.py
in toolkits/model_checkpoints_convertor/deepseek
454 8 16
hf2mcore_qwen2_dense_and_moe_gqa.py
in toolkits/model_checkpoints_convertor/qwen
821 10 15
m2h_synchronizer.py
in toolkits/distributed_checkpoints_convertor/impl/general
499 17 13
hf2megatron_qwen1.5.py
in toolkits/model_checkpoints_convertor/qwen
810 15 12
hf2megatron.py
in toolkits/model_checkpoints_convertor/llama
808 15 10
hf2mcore_qwen2_vl.py
in toolkits/model_checkpoints_convertor/qwen
616 10 9
hf2mcore_qwen2.5_vl.py
in toolkits/model_checkpoints_convertor/qwen
607 10 8
dataset_helpers.py
in megatron_patch/data
381 9 8
convert.py
in toolkits/distributed_checkpoints_convertor/impl
80 2 7
hf2mcore_mixtral.py
in toolkits/model_checkpoints_convertor/mistral
672 10 5
hf2mcore_qwen1.5_moe.py
in toolkits/model_checkpoints_convertor/qwen
479 12 5
hf2megablocks_qwen1.5.py
in toolkits/model_checkpoints_convertor/qwen
546 19 5
transformer_config.py
in megatron_patch/model/mixtral_bak
142 1 5
transformer.py
in megatron_patch/model/baichuan2
1292 36 5
__init__.py
in toolkits/model_checkpoints_convertor/utils
146 5 4
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/falcon
612 12 4
transformer.py
in megatron_patch/model/llava
1292 35 4
transformer.py
in megatron_patch/model/mistral
1292 35 4
transformer.py
in megatron_patch/model/qwen_vl
1292 35 4
transformer.py
in megatron_patch/model/llama2
1296 35 4
transformer_legacy.py
in megatron_patch/model/llama3
1252 36 4
transformer.py
in megatron_patch/model/qwen
1243 35 4
build_llava_frame_dataset.py
in toolkits/multimodal_data_preprocessing
123 5 3
preprocess_data_megatron.py
in toolkits/pretrain_data_preprocessing
360 13 3
synchronizer.py
in toolkits/distributed_checkpoints_convertor/impl/general
119 9 3
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/falcon40b
583 12 3
hf2te.py
in toolkits/model_checkpoints_convertor/baichuan
378 8 3
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/yi
468 13 3
__init__.py
in megatron_patch/data
88 7 3
language_model.py
in megatron_patch/model/llava
507 16 3
transformer.py
in megatron_patch/model/qwen1_5_megablocks
1184 35 3
language_model.py
in megatron_patch/model/baichuan
515 20 3
transformer.py
in megatron_patch/model/baichuan
1179 32 3
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/baichuan
649 14 2
checkpoint_reshaping_and_interoperability.py
in toolkits/model_checkpoints_convertor/galactica
454 13 2
hf2mcore_qwen1.5_dense_mha.py
in toolkits/model_checkpoints_convertor/qwen
280 10 2
hf2megatron_qwen1.0.py
in toolkits/model_checkpoints_convertor/qwen
725 15 2
gpt_model.py
in megatron_patch/model/llava
89 6 2
clip_encoder.py
in megatron_patch/model/llava
75 10 2
gpt_model.py
in megatron_patch/model/qwen1_5_megablocks
88 6 2
gpt_model.py
in megatron_patch/model/mistral
88 6 2
modeling_attn_mask_utils.py
in megatron_patch/model/mistral
121 8 2
gpt_model.py
in megatron_patch/model/baichuan
106 7 2
transformer.py
in megatron_patch/model/bloom
811 27 2
gpt_model.py
in megatron_patch/model/baichuan2
97 6 2
gpt_model.py
in megatron_patch/model/qwen_vl
89 6 2
transformer.py
in megatron_patch/model/falcon
845 31 2
gpt_model.py
in megatron_patch/model/llama2
88 6 2