alibaba / rtp-llm
File Size

The distribution of size of files (measured in lines of code).

Intro
Learn more...
File Size Overall
14% | 17% | 30% | 18% | 18%
Legend:
1001+
501-1000
201-500
101-200
1-100


explore: grouped by folders | grouped by size | sunburst | 3D view
File Size per Extension
1001+
501-1000
201-500
101-200
1-100
cu41% | 34% | 12% | 8% | 3%
h13% | 8% | 27% | 18% | 32%
py8% | 12% | 34% | 23% | 20%
cc6% | 24% | 39% | 17% | 12%
cuh24% | 30% | 27% | 11% | 5%
inl76% | 0% | 18% | 0% | 5%
bzl0% | 19% | 53% | 21% | 4%
hpp0% | 13% | 57% | 14% | 14%
cpp0% | 0% | 14% | 51% | 34%
pyi0% | 0% | 86% | 0% | 13%
proto0% | 0% | 0% | 74% | 25%
File Size per Logical Decomposition
primary
1001+
501-1000
201-500
101-200
1-100
maga_transformer14% | 17% | 30% | 18% | 18%
bazel0% | 29% | 51% | 14% | 5%
benchmark0% | 0% | 100% | 0% | 0%
ROOT0% | 0% | 0% | 94% | 5%
example0% | 0% | 0% | 100% | 0%
open_source0% | 0% | 0% | 100% | 0%
rtpllm_master_py0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File# lines# units
decoder_masked_multihead_attention_utils.h
in maga_transformer/cpp/kernels
2737 330
unfused_attention_kernels.cu
in maga_transformer/cpp/kernels
2425 -
decoder_masked_multihead_attention_template.h
in maga_transformer/cpp/kernels/decoder_masked_multihead_attention
1663 53
gpt_kernels.cu
in maga_transformer/cpp/kernels
1633 -
sampling_topp_kernels.cu
in maga_transformer/cpp/kernels
1458 -
ArmLayerNormOp.cc
in maga_transformer/cpp/devices/arm_impl
1337 16
moe_kernels.cu
in maga_transformer/cpp/cutlass/cutlass_kernels/moe_gemm
1245 -
vec_dtypes.cuh
in maga_transformer/cpp/kernels
1229 -
conversation.py
in maga_transformer/openai/renderers
1228 10
GptModel.cc
in maga_transformer/cpp/models
1224 21
gemm_microkernel_macro_m8_bf16.h
in maga_transformer/cpp/devices/arm_impl/gemm_opt
1173 -
layernorm_kernels.cu
in maga_transformer/cpp/kernels/rocm
1124 -
gen.py
in maga_transformer/cpp/cutlass
1104 29
unfused_attention_fp8_kernels.cu
in maga_transformer/cpp/kernels
1012 -
inl
moe_kernels.inl
in maga_transformer/cpp/cutlass/cutlass_kernels/moe_gemm
1005 20
model_weight.py
in maga_transformer/utils
1001 80
layernorm_fp8_kernels.cu
in maga_transformer/cpp/kernels
998 -
cublasMMWrapper.cc
in maga_transformer/cpp/cuda/cublas
912 21
cublasFP8MMWrapper.cc
in maga_transformer/cpp/cuda/cublas
882 8
moe_gemm_kernels_template.h
in maga_transformer/cpp/cutlass/cutlass_kernels/moe_gemm
878 20
ArmGemmThreadblock.cc
in maga_transformer/cpp/devices/arm_impl/gemm_opt
875 8
custom_renderer.py
in maga_transformer/openai/renderers
851 46
OpData.h
in maga_transformer/cpp/devices
838 4
mma_utils.cuh
in maga_transformer/cpp/deep_gemm/include
809 -
moe_topKSoftmax_kernels.cu
in maga_transformer/cpp/kernels
798 -
ArmGemmPacking.cc
in maga_transformer/cpp/devices/arm_impl/gemm_opt
767 12
alpha_layernorm_kernels.cu
in maga_transformer/cpp/kernels
766 -
memory_utils.cu
in maga_transformer/cpp/cuda
753 -
llava_vit.py
in maga_transformer/models
750 60
llama_template.py
in maga_transformer/openai/renderers
733 9
rotary_position_embedding.h
in maga_transformer/cpp/kernels
720 42
fused_moe_kernel_routine.cuh
in maga_transformer/cpp/cutlass/cutlass_extensions/include/cutlass_extensions/gemm/kernel
717 -
activation_kernels.cu
in maga_transformer/cpp/kernels
711 -
GenerateStream.cc
in maga_transformer/cpp/stream
688 108
cuda_utils.cc
in maga_transformer/cpp/cuda
683 26
layernorm_kernels.cu
in maga_transformer/cpp/kernels
652 -
no_aux_tc_kernels.cu
in maga_transformer/cpp/kernels
646 -
gpt_init_model_parameters.py
in maga_transformer/config
641 22
DecodeRpcServer.cc
in maga_transformer/cpp/model_rpc
638 16
CudaDevice.cc
in maga_transformer/cpp/devices/cuda_impl
619 31
ArmAttentionOp.cc
in maga_transformer/cpp/devices/arm_impl
610 15
sampling_penalty_kernels.cu
in maga_transformer/cpp/kernels
607 -
DeepEPBuffer.cc
in maga_transformer/cpp/devices/cuda_impl
603 19
SpeculativeEngine.cc
in maga_transformer/cpp/speculative_engine
576 24
sampling_topk_kernels.cu
in maga_transformer/cpp/kernels
575 -
CudaOps.cc
in maga_transformer/cpp/devices/cuda_impl
571 21
modeling_navit_siglip.py
in maga_transformer/models/minicpmv
566 26
resampler.py
in maga_transformer/models/minicpmv
565 17
_mul.h
in maga_transformer/cpp/kernels
562 66
internvl_vit.py
in maga_transformer/models
555 34
Files With Most Units (Top 50)
File# lines# units
decoder_masked_multihead_attention_utils.h
in maga_transformer/cpp/kernels
2737 330
GenerateStream.cc
in maga_transformer/cpp/stream
688 108
pyi
libth_transformer.pyi
in maga_transformer/ops
407 93
model_weight.py
in maga_transformer/utils
1001 80
_fma.h
in maga_transformer/cpp/kernels
525 72
_mul.h
in maga_transformer/cpp/kernels
562 66
DeviceOps.cc
in maga_transformer/cpp/devices
204 66
llava_vit.py
in maga_transformer/models
750 60
hie_bfloat16.hpp
in maga_transformer/cpp/devices/arm_impl/type_bf16
330 56
decoder_masked_multihead_attention_template.h
in maga_transformer/cpp/kernels/decoder_masked_multihead_attention
1663 53
weight_module.py
in maga_transformer/model_loader
430 49
custom_renderer.py
in maga_transformer/openai/renderers
851 46
base_model.py
in maga_transformer/models
467 45
rotary_position_embedding.h
in maga_transformer/cpp/kernels
720 42
ThriftSourceProtocol.py
in maga_transformer/aios/kmonitor/python_client/flume
314 41
CacheStoreMetricsCollector.cpp
in maga_transformer/cpp/disaggregate/cache_store/metrics
211 38
tokenization_chatglm.py
in maga_transformer/tokenizer
292 37
internvl_vit.py
in maga_transformer/models
555 34
DeviceBase.cc
in maga_transformer/cpp/devices
406 34
worker_info.py
in maga_transformer/distribute
253 32
tokenization_chatglm3.py
in maga_transformer/tokenizer
229 31
CudaDevice.cc
in maga_transformer/cpp/devices/cuda_impl
619 31
quantization.h
in maga_transformer/cpp/utils
215 30
RtpLLMMetrics.cc
in maga_transformer/cpp/metrics
360 29
gen.py
in maga_transformer/cpp/cutlass
1104 29
model_weight_info.py
in maga_transformer/model_loader
394 28
device_impl.py
in maga_transformer/device
305 27
torch_hip_allocator.h
in maga_transformer/cpp/devices/rocm_impl
99 27
modeling_navit_siglip.py
in maga_transformer/models/minicpmv
566 26
cuda_utils.cc
in maga_transformer/cpp/cuda
683 26
modeling_qwen2_vl.py
in maga_transformer/models/qwen2_vl
271 25
minicpmv_embedding.py
in maga_transformer/models/minicpmv_embedding
289 25
SpeculativeEngine.cc
in maga_transformer/cpp/speculative_engine
576 24
utils.py
in maga_transformer/openai/renderers/qwen_agent/utils
278 23
GenerateStream.h
in maga_transformer/cpp/stream
289 23
TestBase.h
in maga_transformer/cpp/devices/testing
354 23
database.py
in maga_transformer/utils
126 22
qwen_vl_vit.py
in maga_transformer/models
291 22
gpt_init_model_parameters.py
in maga_transformer/config
641 22
ffn_weight.py
in maga_transformer/model_loader
230 22
tokenization_chatglm2.py
in maga_transformer/tokenizer
134 21
per_block_fp8_quant_weight.py
in maga_transformer/model_loader
234 21
qwen_renderer.py
in maga_transformer/openai/renderers
453 21
cuda_fp8_utils.h
in maga_transformer/cpp/cuda
189 21
cublasMMWrapper.cc
in maga_transformer/cpp/cuda/cublas
912 21
GptModel.cc
in maga_transformer/cpp/models
1224 21
_add.h
in maga_transformer/cpp/kernels
135 21
CudaOps.cc
in maga_transformer/cpp/devices/cuda_impl
571 21
NormalEngine.cc
in maga_transformer/cpp/normal_engine
277 21
gang_server.py
in maga_transformer/distribute
267 20
Files With Long Lines (Top 50)

There are 430 files with lines longer than 120 characters. In total, there are 2300 long lines.

File# lines# units# long lines
moe.bzl
in maga_transformer/cpp/cutlass
261 - 240
mma_utils.cuh
in maga_transformer/cpp/deep_gemm/include
809 - 77
GptModel.cc
in maga_transformer/cpp/models
1224 21 71
ArmLayerNormOp.cc
in maga_transformer/cpp/devices/arm_impl
1337 16 51
NormalBatchStreamProcessor.cc
in maga_transformer/cpp/normal_engine
415 5 47
layernorm_kernels.cu
in maga_transformer/cpp/kernels
652 - 35
deepseek_v2.py
in maga_transformer/models
310 13 34
decoder_masked_multihead_attention_launch.h
in maga_transformer/cpp/kernels/decoder_masked_multihead_attention
228 4 34
int8_gemm_template.h
in maga_transformer/cpp/cutlass/cutlass_kernels/int8_gemm
406 13 33
rmsnormKernels.cu
in maga_transformer/cpp/kernels
443 - 26
weight_module.py
in maga_transformer/model_loader
430 49 23
per_block_fp8_quant_weight.py
in maga_transformer/model_loader
234 21 23
FfnLayer.cc
in maga_transformer/cpp/devices/base_impl
308 7 23
gpt_neox_weight.py
in maga_transformer/models
106 3 22
whisper_weight.py
in maga_transformer/models
64 3 22
conversation.py
in maga_transformer/openai/renderers
1228 10 22
llava_vit.py
in maga_transformer/models
750 60 20
fp8_gemm.cuh
in maga_transformer/cpp/deep_gemm/include
387 - 20
gpt_init_model_parameters.py
in maga_transformer/config
641 22 18
ffn_weight.py
in maga_transformer/model_loader
230 22 17
ArmAttentionOp.cc
in maga_transformer/cpp/devices/arm_impl
610 15 17
gpt_weight.py
in maga_transformer/models
40 1 16
custom_renderer.py
in maga_transformer/openai/renderers
851 46 16
CudaMlaAttentionOp.cc
in maga_transformer/cpp/devices/cuda_impl
261 5 16
llama_weight.py
in maga_transformer/models
281 13 15
frontend_worker.py
in maga_transformer/server
250 9 15
CudaLayernorm.cc
in maga_transformer/cpp/devices/cuda_impl
287 2 14
model_weight.py
in maga_transformer/utils
1001 80 13
glm_v2_weight.py
in maga_transformer/models
59 2 13
EmbeddingExecutor.cc
in maga_transformer/cpp/embedding_engine
232 11 13
gpt_kernels.cu
in maga_transformer/cpp/kernels
1633 - 13
gen.py
in maga_transformer/cpp/cutlass
1104 29 13
MultimodalProcessor.cc
in maga_transformer/cpp/multimodal_processor
164 5 13
CudaDevice.cc
in maga_transformer/cpp/devices/cuda_impl
619 31 13
qwen_v2.py
in maga_transformer/models
194 14 12
weights_quant.py
in maga_transformer/tools/quant
316 16 12
smooth_quant_weight.py
in maga_transformer/model_loader
342 20 12
moe_gemm_kernels_template.h
in maga_transformer/cpp/cutlass/cutlass_kernels/moe_gemm
878 20 12
ArmGemmOptOp.cc
in maga_transformer/cpp/devices/arm_impl
145 3 12
ArmGemmPacking.cc
in maga_transformer/cpp/devices/arm_impl/gemm_opt
767 12 12
CudaLoraLinear.cc
in maga_transformer/cpp/devices/cuda_impl
243 2 12
SpeculativeEngine.cc
in maga_transformer/cpp/speculative_engine
576 24 12
ScoreBatchStreamProcessor.cc
in maga_transformer/cpp/speculative_engine/score_executor
143 2 12
bge_m3_embedding_module.py
in maga_transformer/models/downstream_modules/embedding
73 9 11
omni_quant_weight.py
in maga_transformer/model_loader
207 5 11
per_tensor_int8_quant_weight.py
in maga_transformer/model_loader
221 10 11
unfused_attention_kernels.cu
in maga_transformer/cpp/kernels
2425 - 11
cutlass_preprocessors.cc
in maga_transformer/cpp/cutlass/cutlass_kernels
538 17 11
FfnLayerTest.hpp
in maga_transformer/cpp/devices/base_tests
506 20 11
qwen2_vl.py
in maga_transformer/models/qwen2_vl
152 15 10