huggingface / optimum-quanto
File Size

The distribution of file sizes (measured in lines of code).

File Size Overall
1001+ | 501-1000 | 201-500 | 101-200 | 1-100
13% | 20% | 13% | 22% | 29%
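The bucket shares above can be reproduced with a short script. The following is a minimal sketch, not the report tool's own implementation: the function names `bucket_for` and `size_distribution` are hypothetical, and it assumes that each percentage is the share of total lines of code (not the share of file count) held by files in that bucket, truncated to a whole percent. That assumption is an inference from the numbers in this report, which do not sum to 100% in any row.

```python
import math
from collections import Counter

# Bucket labels and lower bounds (in lines), taken from the report's legend.
BUCKETS = [
    ("1001+", 1001),
    ("501-1000", 501),
    ("201-500", 201),
    ("101-200", 101),
    ("1-100", 1),
]


def bucket_for(line_count):
    """Return the legend label for a file with the given number of lines."""
    for label, lower_bound in BUCKETS:
        if line_count >= lower_bound:
            return label
    return None  # empty files fall outside every bucket


def size_distribution(line_counts):
    """Percentage of all lines that sits in each bucket, truncated to an int.

    Assumption: shares are weighted by lines of code and floored.
    """
    lines_per_bucket = Counter()
    for count in line_counts:
        label = bucket_for(count)
        if label is not None:
            lines_per_bucket[label] += count
    total = sum(lines_per_bucket.values())
    return {
        label: math.floor(100 * lines_per_bucket[label] / total) if total else 0
        for label, _ in BUCKETS
    }


# Line counts of the .cu files listed under "Longest Files" in this report.
cu_files = [1150, 968, 776, 280, 271, 77, 77]
print(size_distribution(cu_files))
```

With the seven `.cu` line counts listed under Longest Files, this yields 31, 48, 15, 0 and 4 percent, which matches the `cu` row under File Size per Extension, provided no further `.cu` files exist below the cutoff of that list.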


File Size per Extension
Extension | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
cu | 31% | 48% | 15% | 0% | 4%
py | 0% | 0% | 14% | 44% | 40%
cuh | 0% | 0% | 0% | 0% | 100%
cpp | 0% | 0% | 0% | 0% | 100%
mm | 0% | 0% | 0% | 0% | 100%
h | 0% | 0% | 0% | 0% | 100%
toml | 0% | 0% | 0% | 0% | 100%
File Size per Logical Decomposition
primary
Component | 1001+ | 501-1000 | 201-500 | 101-200 | 1-100
optimum | 15% | 22% | 15% | 21% | 25%
bench | 0% | 0% | 0% | 38% | 61%
ROOT | 0% | 0% | 0% | 0% | 100%
Longest Files (Top 50)
File | Folder | # lines | # units
fp8_marlin.cu | optimum/quanto/library/extensions/cuda/marlin | 1150 | -
gemm_cuda.cu | optimum/quanto/library/extensions/cuda/awq/v2 | 968 | -
marlin_cuda_kernel.cu | optimum/quanto/library/extensions/cuda/marlin | 776 | -
gemv_cuda.cu | optimum/quanto/library/extensions/cuda/awq/v2 | 280 | -
gptq_marlin_repack.cu | optimum/quanto/library/extensions/cuda/marlin | 271 | -
qbytes.py | optimum/quanto/tensor/weights | 226 | 15
qmodule.py | optimum/quanto/nn | 217 | 14
qbits.py | optimum/quanto/tensor/weights | 204 | 14
qbytes_ops.py | optimum/quanto/tensor/activations | 183 | 24
packed.py | optimum/quanto/tensor/weights/awq | 154 | 15
__init__.py | optimum/quanto/library/extensions/cuda | 152 | 6
packed.py | optimum/quanto/tensor/weights/marlin/fp8 | 141 | 14
benchmark_w4a16.py | bench/kernels | 123 | 7
benchmark_marlin_fp8.py | bench/kernels | 121 | 3
qbits.py | optimum/quanto/tensor/weights/tinygemm | 119 | 10
packed.py | optimum/quanto/tensor/weights/marlin/int4 | 113 | 13
qbits.py | optimum/quanto/tensor/weights/marlin/int4 | 111 | 10
evaluate_model.py | bench/generation | 110 | 3
diffusers_models.py | optimum/quanto/models | 109 | 8
qbits.py | optimum/quanto/tensor/weights/awq | 108 | 10
transformers_models.py | optimum/quanto/models | 105 | 9
calibrate.py | optimum/quanto | 102 | 9
qbits.py | optimum/quanto/tensor/weights/marlin/fp8 | 102 | 9
quantize.py | optimum/quanto/subpackage/commands | 101 | 3
packed.py | optimum/quanto/tensor | 95 | 13
quantize.py | optimum/quanto | 91 | 7
unpack.mm | optimum/quanto/library/extensions/mps | 90 | 5
evaluate_configurations.py | bench/generation | 88 | 3
latency.py | bench/generation/metrics | 86 | 2
benchmark.py | bench/kernels | 86 | 3
packed.py | optimum/quanto/tensor/weights/tinygemm | 83 | 10
qbytes_mm.py | optimum/quanto/library | 82 | 7
unpack.cu | optimum/quanto/library/extensions/cuda | 77 | -
unpack.cu | optimum/quanto/library/extensions/hip | 77 | -
perplexity.py | bench/generation/metrics | 71 | 7
dequantize.cuh | optimum/quanto/library/extensions/cuda/awq | 67 | -
(file name missing) | (folder missing) | 60 | -
gptq_marlin_dtypes.cuh | optimum/quanto/library/extensions/cuda/marlin | 60 | -
gen_barchart.py | bench/generation | 60 | 3
gptq_marlin.cuh | optimum/quanto/library/extensions/cuda/marlin | 59 | -
qbytes.py | optimum/quanto/tensor/activations | 57 | 8
marlin_cuda.cpp | optimum/quanto/library/extensions/cuda/marlin | 56 | 1
quanto.py | bench/generation/setup | 55 | 3
extension.py | optimum/quanto/library/extensions | 54 | 5
hqq_optimizer.py | optimum/quanto/tensor/optimizers | 54 | 3
awq.py | bench/generation/setup | 52 | 2
quantize.py | optimum/quanto/library | 50 | 2
semaphore.h | optimum/quanto/library/extensions/cuda/awq/v2 | 49 | 5
qtensor.py | optimum/quanto/tensor | 49 | 8
qbits.py | optimum/quanto/tensor | 39 | 5
Files With Most Units (Top 50)
File | Folder | # lines | # units
qbytes_ops.py | optimum/quanto/tensor/activations | 183 | 24
qbytes.py | optimum/quanto/tensor/weights | 226 | 15
packed.py | optimum/quanto/tensor/weights/awq | 154 | 15
packed.py | optimum/quanto/tensor/weights/marlin/fp8 | 141 | 14
qbits.py | optimum/quanto/tensor/weights | 204 | 14
qmodule.py | optimum/quanto/nn | 217 | 14
packed.py | optimum/quanto/tensor | 95 | 13
packed.py | optimum/quanto/tensor/weights/marlin/int4 | 113 | 13
qbits.py | optimum/quanto/tensor/weights/marlin/int4 | 111 | 10
qbits.py | optimum/quanto/tensor/weights/tinygemm | 119 | 10
packed.py | optimum/quanto/tensor/weights/tinygemm | 83 | 10
qbits.py | optimum/quanto/tensor/weights/awq | 108 | 10
transformers_models.py | optimum/quanto/models | 105 | 9
calibrate.py | optimum/quanto | 102 | 9
qbits.py | optimum/quanto/tensor/weights/marlin/fp8 | 102 | 9
diffusers_models.py | optimum/quanto/models | 109 | 8
qbytes.py | optimum/quanto/tensor/activations | 57 | 8
qtensor.py | optimum/quanto/tensor | 49 | 8
quantize.py | optimum/quanto | 91 | 7
qbytes_mm.py | optimum/quanto/library | 82 | 7
perplexity.py | bench/generation/metrics | 71 | 7
benchmark_w4a16.py | bench/kernels | 123 | 7
shared_dict.py | optimum/quanto/models | 24 | 6
__init__.py | optimum/quanto/library/extensions/cuda | 152 | 6
semaphore.h | optimum/quanto/library/extensions/cuda/awq/v2 | 49 | 5
extension.py | optimum/quanto/library/extensions | 54 | 5
unpack.mm | optimum/quanto/library/extensions/mps | 90 | 5
qbytes.py | optimum/quanto/tensor | 23 | 5
qbits.py | optimum/quanto/tensor | 39 | 5
qtype.py | optimum/quanto/tensor | 32 | 4
quantize.py | optimum/quanto/subpackage/commands | 101 | 3
unpack.cpp | optimum/quanto/library/extensions/cpp | 29 | 3
grouped.py | optimum/quanto/tensor | 33 | 3
hqq_optimizer.py | optimum/quanto/tensor/optimizers | 54 | 3
permutations.py | optimum/quanto/tensor/weights/marlin | 26 | 3
gen_barchart.py | bench/generation | 60 | 3
quanto.py | bench/generation/setup | 55 | 3
evaluate_configurations.py | bench/generation | 88 | 3
evaluate_model.py | bench/generation | 110 | 3
benchmark.py | bench/kernels | 86 | 3
benchmark_marlin_fp8.py | bench/kernels | 121 | 3
__init__.py | optimum/quanto/models | 12 | 2
quantize.py | optimum/quanto/library | 50 | 2
function.py | optimum/quanto/tensor | 22 | 2
affine_optimizer.py | optimum/quanto/tensor/optimizers | 29 | 2
symmetric_optimizer.py | optimum/quanto/tensor/optimizers | 16 | 2
core.py | optimum/quanto/tensor | 12 | 2
reordering.py | optimum/quanto/tensor/weights | 12 | 2
qlayernorm.py | optimum/quanto/nn | 32 | 2
qconv2d.py | optimum/quanto/nn | 34 | 2
Files With Long Lines (Top 14)

There are 14 files with lines longer than 120 characters. In total, there are 99 long lines.

File | Folder | # lines | # units | # long lines
gemm_cuda.cu | optimum/quanto/library/extensions/cuda/awq/v2 | 968 | - | 79
marlin_cuda_kernel.cu | optimum/quanto/library/extensions/cuda/marlin | 776 | - | 4
unpack.mm | optimum/quanto/library/extensions/mps | 90 | 5 | 3
fp8_marlin.cu | optimum/quanto/library/extensions/cuda/marlin | 1150 | - | 2
gemv_cuda.cu | optimum/quanto/library/extensions/cuda/awq/v2 | 280 | - | 2
transformers_models.py | optimum/quanto/models | 105 | 9 | 1
diffusers_models.py | optimum/quanto/models | 109 | 8 | 1
quantize.py | optimum/quanto/subpackage/commands | 101 | 3 | 1
quantize.py | optimum/quanto/library | 50 | 2 | 1
gptq_marlin_dtypes.cuh | optimum/quanto/library/extensions/cuda/marlin | 60 | - | 1
gptq_marlin_repack.cu | optimum/quanto/library/extensions/cuda/marlin | 271 | - | 1
gemm_cuda.h | optimum/quanto/library/extensions/cuda/awq/v2 | 2 | - | 1
dequantize.cuh | optimum/quanto/library/extensions/cuda/awq | 67 | - | 1
extension.py | optimum/quanto/library/extensions | 54 | 5 | 1
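
The long-line counts in this section can be approximated locally with a few lines of Python. This is a sketch under stated assumptions rather than the report tool's own method: the helper names `count_long_lines` and `long_line_report` are hypothetical, the 120-character threshold comes from the sentence above, and the suffix list simply mirrors the extensions that appear in this report. Differences in how tabs, trailing whitespace, or non-source lines are treated may lead to slightly different totals.

```python
from pathlib import Path

LONG_LINE_THRESHOLD = 120  # characters, as stated in this report

# Assumption: the extensions listed under "File Size per Extension".
SOURCE_SUFFIXES = {".cu", ".py", ".cuh", ".cpp", ".mm", ".h", ".toml"}


def count_long_lines(text, threshold=LONG_LINE_THRESHOLD):
    """Count the lines in `text` that are strictly longer than `threshold`."""
    return sum(1 for line in text.splitlines() if len(line) > threshold)


def long_line_report(root, suffixes=SOURCE_SUFFIXES):
    """Map each file under `root` that has long lines to its long-line count."""
    report = {}
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix in suffixes:
            text = path.read_text(encoding="utf-8", errors="ignore")
            long_lines = count_long_lines(text)
            if long_lines:
                report[str(path)] = long_lines
    return report


if __name__ == "__main__":
    # Run from the root of a local checkout of the repository.
    result = long_line_report(".")
    for name, count in sorted(result.items(), key=lambda item: -item[1]):
        print(f"{name}: {count}")
    print(f"{len(result)} files, {sum(result.values())} long lines in total")
```

Sorting by descending count gives the same ordering as the table above, which makes the two easy to compare side by side.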